A fast-growing Series B company in our portfolio is looking for a hands-on leader to head their infrastructure team. This team plays a critical role in their machine learning platform, designing, developing, and optimizing core systems. This position is perfect for an infrastructure engineering expert with a solid technical background who thrives in mentoring and leading teams.
About the Company
:
They're developing the most efficient, scalable, and reliable solution for running machine learning workloads—whether in their cloud or the customer
s.
In this role, you will have the opportunity
- to:
Lead, manage, and mentor the infrastructure engineering team responsible for building the backbone of the ML platf - orm.Define and execute the technical strategy for infrastructure, ensuring performance, security, and scalability of key syst
- ems.Collaborate with ML teams and cross-functional stakeholders to ensure seamless integration of models into production environme
- nts.Design and implement scalable solutions, including CI/CD pipelines, container orchestration, and cloud infrastructure (AWS, GCP, et
- c.).Optimize system performance by identifying and addressing infrastructure bottlene
- cks.Own end-to-end project management for infrastructure initiatives, from planning to execution and ongoing maintena
- nce.Foster engineering best practices and a culture of continuous improvement within the t
eam.
Qualificat
- ions:
Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related - field.5+ years of professional experience in infrastructure or software engineering, with at least 2 years in a technical leadership
- role.Expertise in infrastructure design, including containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS,
- GCP).Strong experience with CI/CD pipelines, infrastructure as code (Terraform, Ansible), and monitoring sy
- stems.Solid understanding of networking, security, and high-availability infrastructure d
- esign.Experience managing and scaling infrastructure for machine learning or similar high-performance work
- loads.Proven track record of leading teams and delivering large-scale, production-level infrastructure solu
- tions.Excellent problem-solving skills and the ability to drive technical projects from idea to compl
etion.
BONUS
- POINTS:
Experience optimizing infrastructure for machine learning workloads, including GPU utilization and distributed co - mputing.Familiarity with multi-cloud strategies and hybrid cloud depl
- oyments.Deep understanding of security best practices in cloud-native envir
- onments.Previous experience in a fast-paced startup environment, particularly in M
L or AI.
Logistical
Questi ons:
Sta
ge: Seri es BLocation: New
York/Hybri dReports to:
CTO/Found erTeam Si
Similar Jobs
- View Job
Manager, Software Engineering - Machine Learning Infrastructure
Little Ferry - View Job
Software Engineering Manager (Machine Learning Infrastructure)
New York - View Job
Engineering Manager - Machine Learning
Little Ferry - View Job
Senior Software Engineering Manager, Machine Learning, Labs
New York - View Job
Senior Software Engineering Manager, Machine Learning, Labs
New York