Full-Time

Machine Learning Engineer

Fine Tuning

Updated on 4/22/2025

Baseten

51-200 employees

Platform for deploying and managing ML models

Compensation Overview

$150k - $225k/yr

Mid, Senior

San Francisco, CA, USA

More locations: New York, NY, USA

Category
Applied Machine Learning
Natural Language Processing (NLP)
AI & Machine Learning
Required Skills
LLM
Machine Learning
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or related field
  • 3+ years of experience in ML engineering with focus on model training and fine-tuning
  • Experience with fine-tuning frameworks such as Axolotl, Unsloth, Transformers, TRL, PyTorch Lightning, or torchtune for efficient model adaptation and optimization
  • Hands-on experience fine-tuning or pre-training LLMs or other foundation models
  • Excellent communication skills for explaining complex concepts to varied audiences
Responsibilities
  • Design comprehensive fine-tuning strategies that translate customer requirements into effective technical approaches—finding the optimal combination of data preparation, training techniques, and evaluation methods to deliver solutions that precisely address customer needs
  • Develop tools to enable non-ML experts to fine-tune models effectively
  • Design and implement scalable fine-tuning pipelines for large language models and other AI modalities
  • Work directly with customers to understand requirements and guide technical implementation
  • Serve as the technical point of contact for customers throughout their fine-tuning journey
  • Utilize state-of-the-art parameter-efficient fine-tuning methods (LoRA, QLoRA)
  • Build systems for efficient data preparation, evaluation, and deployment of fine-tuned models
  • Research and apply cutting-edge techniques in instruction tuning and model customization
  • Create frameworks to evaluate fine-tuned model performance against base models
  • Implement best-in-class distributed training techniques like FSDP and DDP across various hardware configurations
Desired Qualifications
  • Experience working with customers to deliver technical solutions
  • Track record of delivering ML projects to enterprise customers
  • Knowledge of distributed training systems and efficiency optimization techniques
  • Experience with advanced alignment and adaptation techniques including RLHF, DPO, constitutional AI, prompt tuning, reinforcement learning with execution feedback, PPO, or other emerging alignment methods
  • Knowledge of prompt engineering and domain adaptation methods
  • Contributions to open-source fine-tuning projects or tools
  • Experience building user-friendly interfaces for fine-tuning workflows
  • Experience with cloud platforms (AWS, GCP, Azure) and containerization technologies

Baseten provides a platform for deploying and managing machine learning (ML) models, aimed at simplifying the process for businesses. Users can select from a library of open-source foundation models and deploy them with just two clicks, making it easier to implement ML solutions without complex setup. The platform features autoscaling, which adjusts resources based on demand, and monitoring tools for tracking performance and troubleshooting. A key differentiator is Baseten's open-source model packaging framework, Truss, which allows users to package and deploy custom models easily. The company operates on a usage-based pricing model, where clients pay only for the time their models are in use, helping them manage costs effectively.

Company Size

51-200

Company Stage

Series C

Total Funding

$135M

Headquarters

San Francisco, California

Founded

2019


Simplify's Take

What believers are saying

  • Raised $75M in Series C funding, boosting growth and innovation.
  • Launch on Google Cloud Marketplace expands reach and hybrid cloud capabilities.
  • Growing demand for serverless solutions aligns with Baseten's core offerings.

What critics are saying

  • Competition from specialized AI models like Writer's Palmyra series.
  • Potential challenges in hybrid mode adoption on Google Cloud Marketplace.
  • Rapid AI development may render current offerings obsolete without innovation.

What makes Baseten unique

  • Baseten offers a serverless backend for fast ML application development.
  • Truss framework allows seamless packaging and deployment of custom models.
  • Usage-based pricing ensures cost-effective management of ML infrastructure.


Benefits

💰 Competitive compensation: We aim to provide 90th percentile (or better) salaries and equity grants for every team member commensurate with their experience.

🌎 Remote-first work environment: The Baseten team is welcome to work from wherever they want: fully remote, in our San Francisco office, or a mix of both. We provide a $1,000 stipend for you to make your home office comfortable and productive.

🏓 Regular in-person team summits: We get together as a team three times a year to plan, workshop, and most importantly, get to know each other better.

🌴 Unlimited PTO: We ask that everyone take at least 4 weeks of vacation. And we have a company-wide break between Christmas and New Year's Day.

🏥 Full healthcare coverage: Medical, dental and vision insurance for you and your family.

🍼 Paid parental leave: 16 weeks of fully paid parental leave (adoptive and non-birth parents included) and schedule flexibility while returning to work.

📈 401(k): Company-sponsored 401(k) for you to contribute to.

🧠 Learning and development budget: We encourage you to take classes, attend conferences, and invest in your craft, and we’ll cover expenses to make it happen.

Growth & Insights and Company News

Headcount

6 month growth

1%

1 year growth

11%

2 year growth

4%
Business Wire
Feb 20th, 2025
Baseten Lands $75M from IVP and Spark to Solve AI’s Biggest Bottleneck to Ubiquitous Adoption: Inference

Baseten, the leading inference platform for AI-native products, announced the closing of a $75 million Series C round of funding co-led by IVP and Spa

CNBC
Feb 19th, 2025
AI startup Baseten raises $75 million following DeepSeek's emergence

Baseten, a startup that runs artificial intelligence models for clients on their cloud infrastructure, has raised $75 million in funding, the company said Wednesday.

Latest Nigerian News
Feb 19th, 2025
Baseten, which helps companies launch open-source or customized AI models, raised $75M led by IVP and Spark Capital at a $825M valuation and claims 100+ clients (Jordan Novet/CNBC)

Baseten, a startup that runs artificial intelligence models for clients on their cloud infrastructure, has raised $75 million in funding, the company said Wednesday.

AiThority
Sep 26th, 2024
Baseten Launches on Google Cloud Marketplace With Early Access to Hybrid Mode for Flexible, Scalable AI Workloads.

Baseten, a leading provider of AI infrastructure solutions, today announced the early access (EA) of its new Hybrid Mode offering on Google Cloud Marketplace.

VentureBeat
Jul 31st, 2024
Writer’s New AI Models Are Scary Good at Healthcare and Finance Tasks

San Francisco-based AI company Writer launched two specialized large language models (LLMs) tailored specifically for the healthcare and financial services industries on Wednesday, potentially reshaping how these highly regulated sectors adopt artificial intelligence. The new models, Palmyra-Med-70b and Palmyra-Fin-70b, are now available as open-source offerings on major AI platforms including Nvidia, Baseten, and Hugging Face. Writer claims these specialized models significantly outperform larger, generalized AI models like GPT-4 in domain-specific tasks.

“In general, there are not enough domain-specific models because it’s simply [too] hard to build those models,” Waseem Alshikh, CTO and Co-Founder of Writer, told VentureBeat in an interview. “Those models require not just engineering, it requires actually a special type of data, a special type of expertise. You will actually have experts help you build those models.”

Specialized AI models outperform general-purpose counterparts in accuracy tests

The launch comes as industries grapple with how to leverage AI’s potential while navigating complex regulatory environments