Software Engineering Manager (Machine Learning Infrastructure)

Company:  Greylock
Location: New York
Closing Date: 15/10/2024
Hours: Full Time
Type: Permanent
Job Requirements / Description

A fast-growing Series B company in our portfolio is looking for a hands-on leader to head their infrastructure team. This team plays a critical role in their machine learning platform, designing, developing, and optimizing core systems. This position is perfect for an infrastructure engineering expert with a solid technical background who thrives in mentoring and leading teams.



About the Company


:
They're developing the most efficient, scalable, and reliable solution for running machine learning workloads—whether in their cloud or the customer


s.
In this role, you will have the opportunity


  • to:
    Lead, manage, and mentor the infrastructure engineering team responsible for building the backbone of the ML platf
  • orm.Define and execute the technical strategy for infrastructure, ensuring performance, security, and scalability of key syst
  • ems.Collaborate with ML teams and cross-functional stakeholders to ensure seamless integration of models into production environme
  • nts.Design and implement scalable solutions, including CI/CD pipelines, container orchestration, and cloud infrastructure (AWS, GCP, et
  • c.).Optimize system performance by identifying and addressing infrastructure bottlene
  • cks.Own end-to-end project management for infrastructure initiatives, from planning to execution and ongoing maintena
  • nce.Foster engineering best practices and a culture of continuous improvement within the t


eam.
Qualificat


  • ions:
    Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related
  • field.5+ years of professional experience in infrastructure or software engineering, with at least 2 years in a technical leadership
  • role.Expertise in infrastructure design, including containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS,
  • GCP).Strong experience with CI/CD pipelines, infrastructure as code (Terraform, Ansible), and monitoring sy
  • stems.Solid understanding of networking, security, and high-availability infrastructure d
  • esign.Experience managing and scaling infrastructure for machine learning or similar high-performance work
  • loads.Proven track record of leading teams and delivering large-scale, production-level infrastructure solu
  • tions.Excellent problem-solving skills and the ability to drive technical projects from idea to compl


etion.
BONUS


  • POINTS:
    Experience optimizing infrastructure for machine learning workloads, including GPU utilization and distributed co
  • mputing.Familiarity with multi-cloud strategies and hybrid cloud depl
  • oyments.Deep understanding of security best practices in cloud-native envir
  • onments.Previous experience in a fast-paced startup environment, particularly in M




L or AI.

Logistical


Questi ons:
Sta

ge: Seri es BLocation: New

York/Hybri dReports to:

CTO/Found erTeam Si



ze: 8 People
Apply Now
Share this job
Greylock
  • Similar Jobs

  • Manager, Software Engineering - Machine Learning Infrastructure

    Little Ferry
    View Job
  • Software Engineering Manager (Machine Learning Infrastructure)

    New York
    View Job
  • Engineering Manager - Machine Learning

    Little Ferry
    View Job
  • Senior Software Engineering Manager, Machine Learning, Labs

    New York
    View Job
  • Manager, Software Engineering- Infrastructure

    Little Ferry
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙