Senior Distributed Systems Engineer, Technical Lead - Cloud

Company:  NVIDIA
Location: Santa Clara
Closing Date: 29/10/2024
Salary: £125 - £150 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

NVIDIA is hiring a Senior Distributed Systems Engineer to architect, lead, and develop scalable AI infrastructure and deep learning platforms! You will need to have strong programming skills, a deep understanding of distributed systems, distributed storage & compute systems, and distributed systems architecture. You will need excellent communication and planning skills. We also welcome out-of-the-box thinkers who can provide new ideas while being strong at executing tasks. Expect to be constantly challenged, improving and evolving for the better. You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science.

What You'll Be Doing

  • Great opportunity to join the core group at NVIDIA working on AI infrastructure.
  • Architect and build scalable and distributed services that will help power the AI infrastructure for deep learning platforms.
  • Collaborate with multiple AI teams to understand their requirements and build a platform infrastructure that improves productivity.
  • Be a technical leader on various projects across the platform, and be a major contributor to the entire platform’s architecture.

What We Need To See

  • Masters in Computer Science, Electrical Engineering, or related field, or equivalent experience.
  • 12+ years of experience in distributed systems design and development.
  • Solid technical foundation in distributed computing and storage, including significant experience with server systems, storage, I/O, networking, and systems software.
  • Advanced programming skills to build compute systems, scalable backend services, and microservices architecture.
  • Specialist programmer in Java, Go, and/or C/C++.
  • Ability to switch effectively between long-term strategic management and near-term tactical management.
  • Highly motivated with strong interpersonal skills, with the ability to work successfully with multi-functional teams and coordinate effectively across organizational boundaries and geographies.
  • A track record of successful technical leadership and large-scale architecture that impacted critical projects.

Ways To Stand Out From The Crowd

  • Background with Large-scale Distributed Systems.
  • Experience with cloud multi-tenancy and complex resource management & sharing problems within distributed systems. Prior experience with YARN, IBM LSF, OPA is a plus.
  • Experience with Dask, Apache Spark, Apache Beam.
  • Strong hands-on knowledge of K8s, K8s networking, and K8s federation is a plus.
  • A go-getter attitude to investigate and understand technical requirements for open-ended problems.
  • Operational experience in AI Infrastructure.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is $216,000 - $333,500. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

#J-18808-Ljbffr
Apply Now
Share this job
NVIDIA
  • Similar Jobs

  • Senior Distributed Systems Engineer, Technical Lead - Cloud

    Santa Clara
    View Job
  • Senior Software Engineer, Distributed Systems - DGX Cloud

    Santa Clara
    View Job
  • Senior System Software Engineer, Distributed Systems - DGX Cloud

    Santa Clara
    View Job
  • Senior Software Engineer (Distributed Systems)

    Los Altos
    View Job
  • Senior Software Engineer - Distributed Data Systems

    Mountain View
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙