ML Compiler Backend Engineer

Company:  Etched
Location: Cupertino
Closing Date: 27/10/2024
Salary: £125 - £150 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

Job Title: ML Compiler Backend Engineer

Etched is building the hardware for superintelligence...

GPUs and TPUs are flexible AI chips that can run many kinds of models: CNNs, RNNs, LSTMs, and more. But today, almost all AI workloads, from ChatGPT to self-driving cars, are done on one model architecture: transformers. Using flexible AI chips for transformers is very inefficient: <5% of the transistors on an H100 are used for matrix multiplication!

Etched is building a single-purpose chip exclusively for transformer inference. We only support transformers, but in exchange our chips have an order of magnitude more throughput and lower latency than an H100. With Etched, you can build products that would be impossible with GPUs, like tree-of-thought agents and ultra-low-latency audio chat bot.

Responsibilities

  1. Design, develop, and maintain our compiler stack.
  2. Collaborate closely with hardware teams to comprehend architecture, specifications, memory controllers, boot loader, external IP, requirements, and boot sequence.
  3. Conduct debugging, testing, and validation of our compiler on target platforms, boot loader reliability, and verification on hardware simulation environment.

Requirements

  1. 5+ years of experience writing production-grade software.
  2. Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals.
  3. Able to write production-grade code in C++ and in Python.
  4. Experience with modern compiler IRs, including at least one of (LLVM, MLIR, Relay).
  5. Experience with PyTorch.

Desired Qualifications

  1. Work experience at a cloud provider, AI company, or LLM startup.
  2. Experience implementing SIMD algorithms on vector processors.
  3. Experience with open source ML compiler frameworks such as MLIR.
  4. Experience with inference servers/model serving frameworks (such as Triton, TFServ).
  5. Experience with optimized libraries for transformer inference (TensorRT-LLM, vLLM, DeepSpeed, HF Inference Endpoints).

Benefits

  1. Competitive salary and equity package.
  2. Full medical, dental, and vision packages, with 100% of premium covered.
  3. Work with world-class people and state-of-the-art AIs every day.

Etched is committed to fair and equitable compensation practices. Compensation is determined based on your qualifications and experience. Compensation packages also include generous equity in Etched.

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Company information: By burning the transformer architecture into our chips, we’re creating the world’s most powerful servers for transformer inference.

Manufacturing, Aerospace, Automotive, Electronics, Defense

#J-18808-Ljbffr
Apply Now
Share this job
Etched
  • Similar Jobs

  • ML Compiler Backend Engineer

    Cupertino
    View Job
  • Compiler CPU Backend Engineer

    Cupertino
    View Job
  • Senior Backend Compiler Engineer

    Santa Clara
    View Job
  • Apple GPU Compiler Backend Engineer

    Cupertino
    View Job
  • Sr. GPU Compiler Backend Engineer

    Cupertino
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙