Research Engineer, Applied Finetuning

Company:  Karkidi
Location: San Francisco
Closing Date: 17/10/2024
Salary: £150 - £200 Per Annum
Hours: Full Time
Type: Permanent
Job Requirements / Description

As a Research Engineer or Research Scientist in Applied Finetuning, you will directly train the models we launch to the public via Claude.AI and our API. In this role, you will design and iterate on state-of-the-art finetuning techniques, such as Constitutional AI and RLHF, to train our production Claude models. You will implement new algorithms, run experiments on data mixes, design evaluations, and improve our production model training pipeline. This role offers the opportunity to contribute to cutting-edge research while also having a direct and measurable impact on the company’s success.

Responsibilities:

  1. Implement and optimize finetuning pipelines to efficiently train production-scale language models with techniques like Constitutional AI.
  2. Develop novel prompts and prompting strategies to improve and test model behaviors.
  3. Collaborate with other research teams to translate novel finetuning techniques into our production model training process, ensuring models are helpful, honest, and harmless.
  4. Design and run a new evaluation that tests Claude’s reasoning capabilities.
  5. Collaborate with a research team to develop a robust evaluation for a new model capability they are developing.
  6. Stay current with state-of-the-art research in AI and machine learning, and propose ways to apply these advancements to production systems.

You may be a good fit if you:

  1. Have significant Python programming experience and machine learning experience.
  2. Are results-oriented, with a bias towards flexibility and impact.
  3. Pick up slack, even if it goes outside your job description.
  4. Enjoy pair programming (we love to pair!).
  5. Want to learn more about machine learning research.
  6. Care about the societal impacts of your work.
  7. Have clear written and verbal communication.

Strong candidates may also have experience with:

  1. Fine-tuning large language models with supervised learning or reinforcement learning.
  2. Developing evaluations for language models.
  3. Complex shared codebases and RL infrastructure.
  4. Authoring research papers in machine learning, NLP, or AI alignment or similar industry experience.

Deadline to apply: None. Applications will be reviewed on a rolling basis.

The expected salary range for this position is:

Annual Salary: $315,000—$510,000 USD

Logistics

Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.

US visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate; operations roles are especially difficult to support. But if we make you an offer, we will make every effort to get you into the United States, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Compensation and Benefits

Anthropic’s compensation package consists of three elements: salary, equity, and benefits. We are committed to pay fairness and aim for these three elements collectively to be highly competitive with market rates.

Equity - For eligible roles, equity will be a major component of the total compensation. We aim to offer higher-than-average equity compensation for a company of our size, and communicate equity amounts at the time of offer issuance.

US Benefits - The following benefits are for our US-based employees:

  1. Optional equity donation matching.
  2. Comprehensive health, dental, and vision insurance for you and all your dependents.
  3. 401(k) plan with 4% matching.
  4. 22 weeks of paid parental leave.
  5. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
  6. Stipends for education, home office improvements, commuting, and wellness.
  7. Fertility benefits via Carrot.
  8. Daily lunches and snacks in our office.
  9. Relocation support for those moving to the Bay Area.

UK Benefits - The following benefits are for our UK-based employees:

  1. Optional equity donation matching.
  2. Private health, dental, and vision insurance for you and your dependents.
  3. Pension contribution (matching 4% of your salary).
  4. 21 weeks of paid parental leave.
  5. Unlimited PTO – most staff take between 4-6 weeks each year, sometimes more!
  6. Health cash plan.
  7. Life insurance and income protection.
  8. Daily lunches and snacks in our office.
#J-18808-Ljbffr
Apply Now
Share this job
Karkidi
  • Similar Jobs

  • Applied AI Finetuning Engineer

    San Francisco
    View Job
  • Applied AI Finetuning Engineer

    San Francisco
    View Job
  • Research Engineer / Research Scientist, Finetuning

    San Francisco
    View Job
  • Research Engineer / Research Scientist, Finetuning

    San Francisco
    View Job
  • Research Engineer / Research Scientist, Finetuning

    San Francisco
    View Job
An error has occurred. This application may no longer respond until reloaded. Reload 🗙