Company Name: Ashby
Location: Remote – North to South America (Specifically mentions New York, Toronto, Vancouver, Colombia, San Francisco, among other locations in the Americas)
Job Type: Full-time
Salary Range: USA: $200K – $260K yearly (plus Equity) CAN (Canada): CA$205K – CA$335K yearly (plus Equity)
Industry: B2B SaaS (Software as a Service) / Recruiting Technology / Site Reliability Engineering / Platform Engineering
Job Overview
Ashby is a leading B2B SaaS company that’s revolutionizing Recruiting Technology, empowering businesses across the globe to optimize their hiring processes. We’re seeking a highly curious, rigorous, and problem-hungry Principal Site Reliability Engineer (also referred to as Platform Engineer and SRE) to join our fully remote team, spanning from North to South America. This full-time, Principal-Level role offers a unique opportunity to build the “paved road” for our engineering teams, ensuring safety and scalability while directly impacting the core developer and user experience.
As a Principal Site Reliability Engineer, you’ll be a “Swiss army knife” for hard problems, capable of tackling infrastructure updates, security enforcements, database optimization, and Kubernetes debugging. You’ll leverage your deep understanding of cloud infrastructure, automation, and robust design principles to help engineers ship features fast and maintain a high bar for quality. If you’re passionate about building resilient systems, thrive in ambiguity, and are eager to work with others who share a high bar for quality and impact, Ashby invites you to join our groundbreaking team.
Duties and Responsibilities
- Act as a curious, rigorous, problem-hungry Site Reliability Engineer who codes to solve complex challenges.
- Build a “paved road” for excellent engineering teams to ensure safety and scalability in our B2B SaaS environment.
- Apply experience building infrastructure at a slightly later stage than Ashby’s current phase, including dealing with millions of data points and understanding infrastructure’s impact on customer experience.
- Lead efforts in automating processes from provisioning to monitoring and release.
- Possess “Swiss army knife” capability: expertly tackling hard problems like infrastructure updates, security enforcements, database optimization, Kubernetes debugging, and digging through Typescript traces.
- Be comfortable evaluating and taking risks, making principled decisions for system resilience.
- Advocate for the future user, edge cases, and robust design; seeing product engineers as allies in shared goals.
- Care deeply about the work and the team, and actively seek to collaborate with others who share this passion.
- Demonstrate proficiency in coding: reviewing and submitting code changes daily.
- Be comfortable making independent decisions on building the best platform for Ashby.
- Possess the ability to dive into gnarly SQL reports and advise engineers on performant data models.
- Utilize written communication as the primary mode for best practices and fostering an asynchronous culture.
- Demonstrate proven ability to deliver projects independently without constant prodding; capable of self-project management, seeking help when stuck, and cutting scope when necessary.
- Leverage experience with TypeScript (frontend & backend), Node.js, React, Apollo GraphQL, Postgres, Redis (prior experience not required, but helpful).
- Be familiar with Datadog and Sentry for comprehensive system monitoring.
- Possess experience with AWS cloud infrastructure.
- Participate in an on-call rotation as all engineers are on call in a follow-the-sun model.
- Contribute to developer tooling, as everyone on the team is expected to contribute.
- Apply the ability to optimize custom DSL-to-SQL compilers and create developer tools.
- Create automated guardrails for security and privacy of customer data.
- Help developers ship features fast via canary deploys, gradual rollouts, and feature flags.
- Define SLOs (Service Level Objectives) and implement SLIs (Service Level Indicators) to ensure system reliability.
- Ensure communication with external services supports retries and circuit-breakers for fault tolerance.
- Implement robust infrastructure for event-driven architecture and data warehouses.
- Exercise strong judgment in deciding the “best paved road” for Ashby, balancing innovation with stability.
Qualifications
- Experience Level: Principal-Level (indicated by the job title and the depth of technical leadership and problem-solving required).
- Education Requirement: Relevant experience or a Bachelor’s degree in a related field is typically preferred.
- Required Skills:
- Curious, rigorous, problem-hungry Site Reliability Engineer who codes.
- Ability to build a “paved road” for excellent engineering teams to ensure safety and scalability.
- Experience building infrastructure at a slightly later stage than Ashby’s current phase, including dealing with millions of data points and understanding infrastructure’s impact on customer experience.
- Experience automating processes from provisioning to monitoring and release.
- “Swiss army knife” capability: tackling hard problems like infrastructure updates, security enforcements, database optimization, Kubernetes debugging, and digging through Typescript traces.
- Comfortable evaluating and taking risks.
- Advocates for the future user, edge cases, and robust design; sees product engineers as allies.
- Cares about the work and the team, and wants to work with others who share this passion.
- Proficiency in coding: reviewing and submitting code changes daily.
- Comfortable making independent decisions on building the best platform.
- Ability to dive into gnarly SQL reports and advise engineers on performant data models.
- Primary mode of communication for best practices is written communication (async culture).
- Proven ability to deliver projects independently without constant prodding; capable of self-project management, seeking help when stuck, and cutting scope when necessary.
- Experience with TypeScript (frontend & backend), Node.js, React, Apollo GraphQL, Postgres, Redis (prior experience not required, but helpful).
- Familiarity with Datadog and Sentry for monitoring.
- Experience with AWS cloud infrastructure.
- All engineers are on call in a follow-the-sun model.
- Everyone contributes to developer tooling.
- Ability to optimize custom DSL-to-SQL compilers and create developer tools.
- Creating automated guardrails for security and privacy of customer data.
- Helping developers ship features fast via canary deploys, gradual rollouts, and feature flags.
- Defining SLOs and implementing SLIs.
- Ensuring communication with external services supports retries and circuit-breakers.
- Implementing infrastructure for event-driven architecture and data warehouses.
Salary and Benefits
Ashby offers a highly competitive annual salary for this Full-time Principal Site Reliability Engineer – Americas position. The salary ranges vary by region: USA: $200K – $260K yearly (plus Equity) CAN (Canada): CA$205K – CA$335K yearly (plus Equity)
We believe in investing in our employees and offer a comprehensive benefits package designed to support your overall well-being and professional growth. While specific benefits vary by region, they typically include robust health insurance options, generous paid time off, and opportunities for continuous learning and career development in a high-growth SaaS environment, complemented by company equity.
Working Conditions
This is a Full-time, Remote position, spanning across North to South America. You’ll work from your home office, utilizing various development, cloud management, and collaboration tools. The role demands exceptional technical proficiency, strong problem-solving skills, and the ability to operate autonomously while contributing to a globally distributed team. You’ll be expected to own projects end-to-end, advocate for robust design, and continuously seek ways to improve the developer and user experience. Some flexibility in work hours may be required to accommodate team meetings across diverse time zones. This role involves participation in an on-call rotation following a follow-the-sun model.
Why Work with Us
At Ashby, we’re building the future of Recruiting Technology, and we’re looking for visionary Principal Site Reliability Engineers to join us on this exciting journey. We’re a rapidly growing B2B SaaS company with a culture that values rigor, curiosity, and a deep commitment to solving hard problems.
As a Principal SRE, you’ll have the unique opportunity to define and build the foundational infrastructure that empowers our entire engineering team and enhances our customer experience. You’ll work in a highly autonomous remote environment where your technical depth, strategic thinking, and “Swiss army knife” problem-solving approach will be celebrated. If you’re ready to make a significant impact on core systems, drive innovation in Site Reliability Engineering and Platform Engineering, and thrive in a fast-paced, mission-driven culture, Ashby offers an unparalleled opportunity for your career.