Senior Cloud Backend Engineer, NeMo LLM Service

Company:  NVIDIA
Location: Santa Clara
Closing Date: 18/10/2024
Salary: £125 - £150 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

We are looking for a Senior Cloud Backend Engineer to work on a groundbreaking machine learning project to make customized, world-class Large Language Models available and easy to use. Our team is building next-generation services and interfaces for training and deploying AI at scale. We are dedicated to developing NLP and multi-modal technologies that tackle real problems. We contribute to all steps of the machine learning lifecycle: from conceptualization, to applied research, engineering for optimized inference, and deployment.


What You'll Be Doing

  1. Creating customer-hosted microservices for training generative AI models
  2. Development of distributed cloud applications, microservices and SaaS platform able to scale up to huge models
  3. Implementing core infrastructure for cloud-native AI training and inference
  4. Creating flexible systems that can integrate across the ML ecosystem
  5. Relentlessly pursue speed of light performance under high load

What We Need To See

  1. BS, Masters, or equivalent experience in computer science, computer architecture, or related field
  2. 5+ years of experience
  3. Experience with the full software development lifecycle, particularly deploying and monitoring services in Cloud environments
  4. Understanding of performance, security and reliability in complex distributed infrastructure
  5. Excellent Golang, Rust, or C/C++ programming and software design skills, including debugging, performance and service health analysis, and test design.
  6. Ability to work independently, define project goals and scope, interact directly with open source community, and manage your own development effort

Ways to stand out from the crowd

  1. Experience deploying machine learning or statistical models into production environments, especially experience with frameworks such as PyTorch, Tensorflow, ONNX Runtime, and TensorRT
  2. Knowledge of or experience with developing production NLP or generative AI systems
  3. Experience working with high availability environments
  4. Kubernetes cluster administration experience
  5. Experience providing software solutions for multiple customer environments with minimal engineering overhead

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits.

NVIDIA accepts applications on an ongoing basis. NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#J-18808-Ljbffr
Apply Now
Share this job
NVIDIA
  • Similar Jobs

  • Fullstack Software Engineer, NeMo LLM Services

    Santa Clara
    View Job
  • Full Stack Software Engineer, NeMo LLM Services

    Santa Clara
    View Job
  • Full Stack Software Engineer, NeMo LLM Services

    Santa Clara
    View Job
  • Full Stack Software Engineer, NeMo LLM Services

    Santa Clara
    View Job
  • Senior Cloud Backend Engineer

    San Jose
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙