Site Reliability Engineering Manager

Company: Plume Design, Inc

Location: Palo Alto

Closing Date: 02/11/2024

Hours: Full Time

Type: Permanent

Apply Now

Job Requirements / Description

Life at Plume

At Plume, we believe that technology isn't about moving faster, it's about making life’s moments better. Which is why we’ve built the world's first, and only, open and hardware-independent service delivery platform for smart homes, small businesses, enterprises, and beyond. Our SaaS platform uses WiFi, advanced AI, and machine learning to create the future of connected spaces—and human experiences—at massive scale.

We now deliver services to over 50 million locations globally and have managed over 2.5 billion devices on our platform. We’re expanding rapidly, pioneering a new category, and we achieved our Series F funding in just four years. Our customers include many of the world's largest Communications Service Providers (CSPs) who look to Plume to help them evolve their smart home offerings while gleaning insights from their own data.

With a bias for action and a love for being trailblazers, the team at Plume embodies a combination of relentless curiosity and imaginative innovation. We challenge ourselves to think in ways that other companies don't, work to do what should be done (rather than what can), and if we can’t do it exceptionally well, we don’t do it. It’s how we've assembled a team of world-class builders, thinkers, and doers. And it’s how we’re reinventing what’s possible every day.

We’re looking for a seasoned Technical Manager, experienced with Customer Facing environments, to Captain our Site Reliability Engineering Team. This team is focused on deployments, fixes, and sustainability. The right candidate needs to have strong technical knowledge in key areas while focusing on customer satisfaction.

What You’ll Do

Supervise a team of Site Reliability Engineers who provide first-line support to Customer Clouds. Deployments, On-call, Application Provisioning are some of the routine tasks.
Attend and conduct customer Meetings for Project and Roadmap specification.
Manage growth and performance of SRE team members.
Be able to step in and execute or triage issues as much as the Engineers. Hands-on past experience is beneficial. Some examples are as follows:
Provision and scale multi-datacenter Kubernetes Infrastructure and Applications (EKS)
Deploy Software in multiple Production Environments
Own monitoring and alerting to production systems, improvements and changes
Contribute improvements to the current automation
Contribute improvements to our on-call process and alerting
Play a key role in the recruitment and retention of top talent.

What You’ll Bring

Availability to be in on-call rotation for Production issues
Availability to work with a distributed team in different timezones
Advanced communication skills
Experience managing people

Desired Skill Set

10+ Years of experience with Production Troubleshooting
Minimum 5+ Years of experience leading or managing teams
Bachelor’s degree in related field or equivalent experience, Advanced degree preferred.
This is a leadership role, but you must have Technical knowledge and working experience with:
Kubernetes (operate)
Basic Terraform Knowledge
Experience Programming/Scripting - one of the following (eg. Perl, Python, PHP, GoLang, Java, etc)
Experience with modern cloud infrastructure, preferably AWS
Experience with modern Linux Operating systems (Enterprise Linux or Debian based)
Experience both setting up and utilizing self-managed Monitoring and observability tools (e.g. Nagios/Icinga, Grafana, Prometheus)

Differentiators

Troubleshooting production performance/service degradation or outage issues at scale
Experience with Infrastructure Troubleshooting in VMs and/or Bare Metal (ssh/Linux)
Advanced Kubernetes knowledge
Advanced Terraform knowledge
Customer Facing experience in previous roles
Experience operating Kafka in Production
Experience operating NoSQL Databases in Production
Experience operating Relational Databases in Production
Configuration Management experience

HYBRID - This position requires someone to come into our Palo Alto, CA office 4 days a week. Candidates must be in commutable distance. We are not offering relocation at this time.

Total Compensation package would include: anticipated compensation range of $181,000 - $213,000 + bonus + equity + benefits. Benefits include: a 401k plan and a company match, basic life insurance plus unparalleled health, dental, vision and other benefits and perks. Please see here for more details.

An employee’s base salary and its position within the range may depend on a number of factors including job related knowledge, education, skills, experience and other business related considerations. Published ranges are provided in good faith at the time of posting.

About Plume

As the creator of the only open, hardware-independent, cloud-controlled experience platform for Communication service providers and their subscribers, Plume partners with over 350 Communication service provider customers, including some of the world’s largest such as Comcast, Charter, Liberty Global, and J:COM.

Using OpenSync, the most widely supported open-source, silicon-to-cloud framework for smart spaces, Plume’s software-defined network allows CSPs to decouple their service offerings from hardware and rapidly curate and deliver new services over a multi-vendor, open-platform architecture.

Backed by investors such as Insight Partners and SoftBank Vision Fund 2, Plume is now valued at $2.6B, having added over $500M in funding in 2021 alone.

Plume is an equal opportunity workplace that maintains a continuing policy of nondiscrimination in all employment practices and decisions, ensuring equal employment opportunities for all qualified individuals without regard to race, color, creed, religion, sex, national origin, age, physical or mental disability, sexual orientation, gender identity, marital status, pregnancy, childbirth or related individual conditions, medical conditions (as defined by state law), military or veteran status, or any other characteristic protected by federal, state or local law.

Apply Now

Share this job

Plume Design, Inc

Useful Links

More Jobs in Palo Alto
Full Time Jobs in Palo Alto
Part Time Jobs in Palo Alto
Management Jobs
Engineering Jobs

Similar Jobs
Site Reliability Engineering Manager
Palo Alto
View Job
Site Reliability Engineering Manager
Mountain View
View Job
Site Reliability Engineer, Data Engineering - USDS
Mountain View
View Job
Site Reliability Engineer
Mountain View
View Job
Site Reliability Engineer
Sunnyvale
View Job

Site Reliability Engineering Manager

Similar Jobs