Senior Site Reliability Engineer

Company: ServiceTitan
Location: Little Ferry
Closing Date: 08/11/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Ready to be a Titan?

The Senior Site Reliability Engineer will be a key player in managing, optimizing, and ensuring the reliability and scalability of our SQL Server and PostgreSQL databases both in the cloud and on-premises. The ideal candidate will have extensive experience with Azure and AWS platforms, with a strong preference for Azure expertise. You will work closely with our development and operations teams to drive improvements in database performance, automate processes, and implement robust backup and recovery procedures.

What you'll do:

  1. Design, implement, and manage SQL Server and PostgreSQL database systems in both cloud (Azure and AWS) and on-premises environments.
  2. Develop and enforce database administration and security standards.
  3. Monitor database performance, implement changes, and apply new patches and versions when required.
  4. Automate repetitive DBA tasks using scripting and automation tools.
  5. Ensure high availability and acceptable levels of performance of mission-critical database resources.
  6. Develop strategies for database disaster recovery, including defining RTO (Recovery Time Objective) and RPO (Recovery Point Objective) targets.
  7. Work with cloud infrastructure, focusing on Infrastructure as Code (IaC) with Terraform, containerization with Docker, and container orchestration with Kubernetes.
  8. Implement and manage continuous integration and deployment (CI/CD) systems using tools such as TeamCity, GitHub Actions, and Azure DevOps.
  9. Utilize observability tools (Datadog, ELK stack, Grafana, Prometheus) to monitor systems and databases effectively.
  10. Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
  11. Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
  12. Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.

What you'll need:

  1. Bachelor’s degree in Computer Science, Information Technology, or a related field.
  2. 5+ years of experience as a DBA or SRE, with a focus on SQL Server and PostgreSQL.
  3. Strong experience with Azure and AWS cloud platforms, with a preference for Azure expertise.
  4. Solid understanding of SLIs/SLOs and general SRE practices.
  5. Experience with Infrastructure as Code (IaC), preferably Terraform, and with Kubernetes and Docker.
  6. Familiarity with GitOps; experience with Flux or Argo CD is a big plus.
  7. Proficiency in CI/CD tools such as TeamCity, GitHub Actions, and Azure DevOps.
  8. Knowledge of observability tools like Datadog, ELK stack, Grafana, and Prometheus.
  9. Strong problem-solving skills and the ability to work under pressure.

Be Human With Us:

Being human isn’t about checking every box on a list. It’s about the experiences we have, people we meet, and the perspectives we share. So, if you have the skills but are hesitant to apply because of your background, apply anyway. We need amazing people like you to help us challenge the conventional and think differently about the problems that we’re solving. We’re in this together. Come be human, with us.

What We Offer:

When you join our team, you’re not just accepting a job. You’re making a career move. Here’s how we’ll support you in doing some of the most impactful work of your career:

  1. Flextime, recognition, and support for autonomous work: Flexible time off with ample learning and development opportunities to continue growing your career. We offer a comprehensive onboarding program, leadership training for Titans at all levels, and other programs and events. Great work is rewarded through Bonusly, peer-nominated awards, and more.
  2. Holistic health and wellness benefits: Company-paid medical, dental, and vision (with 100% employer-paid options and 90% coverage for dependents), FSA and HSA, 401k match, and telehealth options including memberships to Headspace, Galileo, One Medical, Ginger, and more.
  3. Support for Titans at all stages of life: Parental leave and support, up to $20k in adoption reimbursement, on-demand maternity support through Maven Maternity, free breast milk shipping through Maven Milk, pet insurance, legal advisory services, financial planning tools, and more.

At ServiceTitan, we celebrate individuality and uniqueness. We believe that the convergence of fresh perspectives and experiences from all walks of life is what makes our product and culture so great. We strongly encourage people from underrepresented groups to apply. We do not discriminate against employees based on race, color, religion, sex, national origin, gender identity or expression, age, disability, pregnancy (including childbirth, breastfeeding, or related medical condition), genetic information, protected military or veteran status, sexual orientation, or any other characteristic protected by applicable federal, state or local laws.

ServiceTitan is committed to fair and equitable compensation for all of our employees. We thoughtfully consider a wide range of factors when determining individual compensation. The expected salary range for this role for candidates residing in the United States is between $126,000 USD and $182,000 USD. Compensation for candidates residing outside the United States will vary by location, and the specific salary range will be discussed during the hiring process. Actual compensation for an individual may vary depending on skills, performance over time, qualifications, experience, and location. In addition to the base salary, the total compensation package also includes an annual bonus, equity, and a holistic suite of benefits.
