(Full Time) Site Reliability Engineer at PicnicHealth (United States)
Site Reliability Engineer
PicnicHealth United States
Date Posted: 10 Aug, 2023
Work Location: San Francisco, United States
Salary Offered: $160 — $190 yearly
Job Type: Full Time
Experience Required: 6+ years
Remote Work: Yes
Stock Options: No
Vacancies: 1 available
Healthcare needs good data. At PicnicHealth, we are building deep real-world datasets to fuel cutting-edge research, while giving patients control of their own medical data. These complete, clinically-rich datasets produce unique insights across dozens of diseases to ultimately get the right treatments into patients’ hands faster. We make this happen by working directly with patients and leveraging state of the art machine learning to transform messy medical records into structured, research-ready datasets.
About the Role:
As a Senior Site Reliability Engineer at PicnicHealth, you will be responsible for the reliability, efficiency, and architecture of our cloud, developer, and security operations. Day to day, you will take the lead to identify and resolve infrastructure issues, while supporting requests from our team. PicnicHealth’s engineering team is highly engaged and motivated, and as senior SRE, you will be a thought partner that helps our team level up the code we write.
Our Tech Stack:
- Cloud Vendors: GCP (primary), AWS, Azure
- Cloud-Native Services: Kubernetes, Cloud Functions/Run, Postgres, Redis, Pub/Sub, BigQuery, Airflow, Cloudflare
- Self-Hosted Services: GitHub Enterprise Server, GitHub Actions, Jenkins, Hasura, Elasticsearch, Metabase, Grafana, Retool, LogRocket
- Languages/Frameworks: Python, Shell, TypeScript/Node.js, Terraform/Terragrunt, Helm, React, Apollo, GraphQL
As SRE, your responsibilities will include:
- Partner with developers throughout PicnicHealth. Work across our teams to ensure that our developers produce code that is efficient and secure at scale.
- Own our incident response processes. Continuously maintain and provide front-line support for cloud infrastructure, DevOps, and security.
- Troubleshoot, escalate, or resolve issues in our developer workflow. Take the lead on sporadic issues arising from areas in our tech stack.
- Take ownership of our IaC. Lead the design, set up, and support of our cloud architecture.
- Monitor, maintain, and upgrade all components of our infrastructure hosted on Google Cloud Platform, AWS, and Azure.
- Participate in our security and compliance team. Lead the engineering-related efforts to implement and enforce policy and technical controls.
- Improve the efficiency and experience of cloud, developer, and security operations. Write clean, effective, and reliable code to automate processes and improve operational efficiency.
You are a great fit if you have:
- Demonstrated skill in operating and troubleshooting Kubernetes, Linux, and VPC networking
- Demonstrated skill in troubleshooting PostgreSQL and queries
- Strong understanding of cloud-native concepts and products from at least one of GCP, AWS, Azure
- Demonstrated fluency in Python or TypeScript/Node.js
- Experience with Infrastructure as Code and CI/CD systems
- Excellent problem-solving skills, with a focus on automating processes to solve complex issues
Nice to Haves:
- Experience operating GPU-enabled workloads in Kubernetes
- Familiarity with modern SDLC practices and monorepos
- Recent experience with our tech stack
Perks & Benefits @PicnicHealth
At PicnicHealth you get to solve real problems with real solutions, great tech, and great people.
You also get:
- Competitive salary
- Comprehensive benefits including above market Health, Dental, Vision
- Family friendly environment
- Flexible time off
- 401k plan
- Free PicnicHealth account
- Equipment and internet funds for home office set up
Equal Opportunity Statement
PicnicHealth is committed to promoting an inclusive work environment free of discrimination and harassment. We value a diverse and balanced team where everyone can belong.
#J-18808-Ljbffr