Sr Site Reliability Engineer

Company:  IDeaS
Location: BLOOMINGTON
Closing Date: 20/10/2024
Salary: £100 - £125 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

We are seeking a Senior Site Reliability Engineer that will be at the forefront of establishing and driving best practices in system reliability, performance optimization, and observability. With over five years of experience, you bring deep expertise in software development and infrastructure operations, particularly in building and maintaining scalable, data-intensive systems. Your key focus will be on defining and implementing Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to ensure our solutions meet rigorous performance standards. You will work closely with cross-functional teams to build observability frameworks that empower teams to monitor, diagnose, and improve system performance proactively. Your leadership and persistence will be vital in identifying and resolving performance bottlenecks, ensuring long-term scalability and efficiency across our systems.

What you’ll be doing...

  • Collaborate with development and operations teams to design, implement, and maintain observability frameworks that provide deep insights into system performance, particularly for data and ML pipelines.
  • Lead the establishment of Service Level Objectives (SLOs) and Service Level Indicators (SLIs), ensuring they align with business goals and drive continuous performance improvements.
  • Partner with stakeholders to understand system performance requirements and translate them into actionable performance engineering strategies.
  • Proactively identify performance bottlenecks and collaborate with teams to implement solutions that enhance system scalability and reliability.
  • Design and execute performance regression test suites, focusing on data-intensive and ML workloads, to ensure continuous performance optimization.
  • Own the reliability and performance metrics of our systems, driving a culture of performance excellence and proactive issue resolution.
  • Collaborate with subject matter experts to gain a deep understanding of domain-specific performance challenges, particularly in data and ML pipelines.
  • Utilize tools like Datadog, Jira, and GitHub to monitor system performance, manage projects, and track issues, with a strong emphasis on performance-related metrics.
  • Define and monitor success metrics, ensuring our systems consistently meet or exceed performance and reliability targets.
  • Actively contribute to the continuous improvement of performance engineering practices across the team, fostering a culture of excellence in observability and system performance.
  • Perform other duties as assigned.

What you’ll bring to us…

  • Bachelor’s degree in Computer Science, Engineering, or a related field.
  • Five years of experience in a site-reliability-focused role responsible for establishing reliability standards in a cloud-native environment.
  • Strong expertise in establishing SLOs/SLIs and building observability frameworks for complex systems.
  • Proficiency with cloud services, particularly AWS, and experience in designing scalable and reliable architectures.
  • Hands-on experience with performance monitoring and observability tools like Datadog.
  • Proficiency in version control systems like Git/GitHub and infrastructure as code tools like Terraform.
  • Strong interpersonal skills and excellent communication abilities, with a focus on driving performance improvements across teams.

Preferred:

  • Proficiency in Java programming and hands-on experience with REST, Spring and microservices development.
  • Proficiency in RDBMS schema design and index utilization.

We Support Who You Are…

As a global company, we strive to create an inclusive environment where diverse perspectives spark innovation and meet the challenges of an evolving world. Whether you’re launching a new career or expanding your current one, IDeaS is a company where you can balance great work with all other aspects of your life.

At IDeaS, we also aspire to live our values each day by being Accountable, Curious, Passionate and Authentic. And we continue our quest to build a more inclusive environment that attracts, represents and provides a place for diverse ideas, unique perspectives, and authentic voices.

Additional Information:

To qualify, applicants must be legally authorized to work in the United States , and should not require, now or in the future, sponsorship for employment visa status.

SAS is an equal opportunity/Affirmative Action employer. All qualified applicants are considered for employment without regard to race, color, religion, gender, sexual orientation, gender identity, age, national origin, disability status, protected veteran status or any other characteristic protected by law.

Equivalent combination of education, training, and relevant experience may be considered in place of the education requirement stated above.

Resumes may be considered in the order they are received.

IDeaS/SAS employees performing certain job functions may require access to technology or software subject to export or import regulations. To comply with these regulations, IDeaS/SAS may obtain nationality or citizenship information from applicants for employment.

IDeaS/SAS collects this information solely for trade law compliance purposes and does not use it to discriminate unfairly in the hiring process.

#J-18808-Ljbffr
Apply Now
Share this job
IDeaS
  • Similar Jobs

  • Sr Site Reliability Engineer

    Bloomington
    View Job
  • Sr. RF Engineer

    Bloomington
    View Job
  • Sr. Manufacturing Engineer

    Bloomington
    View Job
  • Sr. Controls Engineer

    Bloomington
    View Job
  • Sr. RF Engineer

    Bloomington
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙