Role
As a Senior DevOps Engineer you will manage, design, and develop our large-scale cloud infrastructure where low latency and reliability are both critical, forming the foundation of Alpaca and empower millions of users who are trading billions of dollars worth of assets.
The team is responsible for maintaining the cloud infrastructure and supporting infrastructure layer applications such as databases, message brokers, observability tools, etc. while working closely with software engineering teams.
For this role, we are seeking an experienced devops engineer who can perform well in a distributed working environment. In this role, you will have the autonomy to design and implement solutions to achieve defined goals while actively participating in defining those goals in cooperation with the team.
Things You Get To Do:
- Management of our postgreSQL database clusters - scale, fine-tune, improve stability and observability as we grow
- Manage our Kubernetes clusters, and infrastructure service components that are used by our applications - such as message brokers, monitoring, continuous delivery and others
- Manage our underlying infrastructure in Google Cloud, including external and internal network connectivity
- Take part of the rotation to watch alerts and dispatch other engineering team’s requests
Who you are (must-haves):
- at least 5 years of experience in a DevOps role, or in a similar field.
- Strong ability to work independently , lead and deliver on large tasks, and collaborate with other members of the organization or external partners
- Strong experience with PostgreSQL replication, troubleshooting, performance tuning, and production database maintenance
- Significant production experience with Kubernetes
- Hands-on experience with Google Cloud Platform (GCP) by maintaining production applications on it
- Strong ability to troubleshoot issues in PostgreSQL, K8s, cloud and finding the root cause
Who You Might Be (Nice-to-Haves):
You can successfully start working in this position without having some of the following skills, but it will be necessary to acquire them in the early stages:
- Always think with IT security in mind
- Good communicator in both sync and async channels, and value good documentation
- Hands-on experience deploying and maintaining software you'll be responsible for
- Strong understanding and comfort with Linux (Debian/Ubuntu) and shell usage
- Deep understanding of Docker/containerd
- Experience building and scaling continuous integration and continuous delivery pipelines
- Familiarity with Git for distributed source control and branching strategies
- Experienced in multi-regional high availability
- Experience building a development environment with Tilt
- Experience with Golang
- Experience with L2/L3 networking