Site Reliability Engineer

Company:  Open Systems Technologies
Location: Portland
Closing Date: 05/11/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description

A software company is looking for a Site Reliability Engineer to join their team in Portland, OR.


Compensation: $140-160k


Responsibilities:

  • Help define technology choices, best practices and process for the team.
  • Develop and maintain documentation standard for the team.
  • Develop new tools and libraries for broader use by SaaS Operations and Engineering teams.
  • Enable engineering teams and understand problems quicker.
  • Work with product architects and make suggestions for architectural changes and design platform component roadmaps.
  • Act as a subject matter expert (SME) for components and functions desired. Develop the skill as required, to become SME for components in need.
  • Assist engineering teams in deep troubleshooting and application code review to find opportunities to improve performance and scalability.
  • Work with Engineering and peer SRE teams to design and use firm coding standards and best practices.
  • Respond to incidents coordinated by SRE and Incident Response teams. Act as a Incident Commander during incidents.
  • Participate in escalation and off-hours on-call schedule.
  • Mentor and train junior members of the team. Design training curriculum for the team.


Qualifications:

  • 7+ years industry experience
  • BS in CS or equivalent combination of education and experience
  • Strong experience operating Kubernetes in production environments – EKS Anywhere is preferred
  • Experience with middleware systems (Kafka, AMQ, Redis, Memcache, etc.)
  • Experience managing CI/CD systems (Flux, Concourse)
  • Experience deploying and/or operating Observability stack (Splunk, Datadog, Grafana)
  • Experience with large scale systems
  • Familiarity with working with PostgreSQL and MongoDB
  • Background working in a multi-platform environment (Linux, Windows)
  • Familiarity of programming/scripting languages (ie. Python, Bash, PowerShell, Go, etc.)
  • Familiarity with Agile/Scrum/Kanban methodologies
  • Strong interpersonal skills
Apply Now
Share this job
Open Systems Technologies
  • Similar Jobs

  • Site Reliability Engineer

    Portland
    View Job
  • Site Reliability Engineer

    Portland
    View Job
  • Site Reliability Engineer

    Portland
    View Job
  • DevOps Engineer - Site Reliability

    Portland
    View Job
  • Site Reliability Engineer (*)REMOTE

    Portland
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙