Full-Time

Technical Data Delivery Lead

Pareto

Pareto

501-1,000 employees

Data collection, web research, lead generation

Compensation Overview

$110k - $150k/yr

Remote in USA

Remote

Category
Software Engineering (1)
Required Skills
LLM
Python
SQL
Machine Learning
LangChain
Requirements
  • Proficiency in Python and SQL for data manipulation, pipeline monitoring, and quality analysis — you should be comfortable writing light scripts to parse formats, run statistical checks, and build lightweight tooling
  • Working knowledge of LLM internals: RLHF/SFT training loops, how prompt structure affects output distribution, RL environment setup qualities (tool use) for agentic data collection / eval projects.
  • Hands-on experience with at least one agentic or LLM workflow framework (LangChain, DSPy, AutoGen, direct tool-use via API, or equivalent)
  • Demonstrated ownership of a data or ML pipeline from scoping through delivery — including quality design, not just throughput tracking
  • Strong written communication: you'll write technical guidelines and rubrics that distributed expert workers follow accurately, and you'll brief senior researchers on pipeline performance
  • Comfort operating with ambiguity in a fast-moving environment where model requirements shift and client priorities evolve
Responsibilities
  • Pipeline architecture: Design end-to-end data collection and evaluation pipelines for RLVR, RLHF, SFT, red-teaming, and model evaluation workflows. This includes expert sampling strategy, annotation schema, rubric structure, inter-rater calibration, and QA system design. You'll be expected to prototype novel workflows quickly, identify architectural risks before launch, and make tradeoff decisions with confidence. You’ll also need to understand how agents interact with tools to solve expert-driven tasks, and you’ll need to communicate with engineering to ensure the environment is built accordingly to enable such tasks.
  • Agentic system deployment: Build, test, and iterate on AI agents that automate pipeline tasks — quality gate review, expert matching, output flagging, throughput anomaly detection. You'll work closely with our engineering team to scope agent capabilities, write the prompts and evaluation logic that make them reliable, and monitor their performance in production. This is a growing part of the role; comfort with agentic tooling (LangChain, DSPy, custom tool-use frameworks, or equivalent) is a meaningful differentiator.
  • Quality systems: Define data quality standards across annotation, evaluation, and expert output review. Design and run audits using inter-rater reliability metrics, calibration sets, and statistical sampling. You'll be responsible not just for catching quality issues but for building systems that prevent them — automated checks, structured output validation, and model-assisted review layers where appropriate. Aside from programmatic quality testing, you’ll be responsible for spot-checking tasks & understanding what makes a datapoint meaningful & high quality. The ability to do so, and translate your findings into clear feedback and expert guidelines is particularly helpful for this role.
  • Client interface: Engage directly with AI researchers, TPMs, and PMs at our client organizations. Translate research-driven requirements — evaluation rubrics, domain coverage targets, latency constraints, benchmark specifications — into operational workflows. Communicate pipeline performance clearly, escalate technical risks early, and contribute to project scoping and pricing decisions.
  • Research integration: Stay current with developments in LLM post-training, evaluation methodology, and data tooling. Evaluate new approaches — model-assisted annotation, structured output formats, automated calibration methods — and integrate them into active pipelines where they improve quality or efficiency. Understand what method applies to what domain and project, and work towards implementing it accordingly.
Desired Qualifications
  • Direct experience with RL environment data pipelines, evaluation framework design, and red-teaming workflows
  • Background in data engineering, ML research support or equivalent
  • Experience designing or operating agentic systems in a production or near-production context
  • Familiarity with inter-rater reliability methods, calibration set design, and annotation quality frameworks
  • Prior client-facing or technical program management experience in an AI/ML-adjacent context
  • Prior experience on scoping or driving projects with fuzzy upfront specs or evolving requirement. This is a high-ownership position where you are expected to take charge and lead, not simply follow project guidelines.

Pareto provides data collection, web research, and lead generation for growing businesses. Its service combines an expert research team with powerful software to gather high-quality, vetted data quickly and accurately. Hundreds of businesses, including Modern Fertility, ClearCo, and AuntFlow, use Pareto to boost sales, marketing, and operations. The product works by blending manual data gathering with software-driven analysis to deliver reliable lists and insights for outreach. Pareto differentiates itself through its combination of human expertise and fast data processing, delivering vetted leads faster than traditional research firms. The goal is to help customers scale their growth by providing trustworthy data to fuel sales and marketing activities.

Company Size

501-1,000

Company Stage

Seed

Total Funding

$5.1M

Headquarters

Palo Alto, California

Founded

2020

Simplify Jobs

Simplify's Take

What believers are saying

  • Hundreds of businesses including Modern Fertility use Pareto for sales amplification.
  • Expert team with software delivers high-quality vetted data rapidly.
  • Serves AI labs building safer models via nondeterministic judgment transformation.

What critics are saying

  • Upwork lawsuit alleges worker misclassification, demands $15M backpay in 6-12 months.
  • Anthropic's workforce expansion cuts 40% of Pareto's AI lab contracts in 3-6 months.
  • OpenAI o1 reduces human preference data needs by 90%, obsoletes RLHF in 12-24 months.

What makes Pareto unique

  • Pareto verifies reinforcement learning with expert judgment as durable reward signals.
  • Thiel Fellow Phoebe Yao pivoted Pareto to AI model knowledge capture platform.
  • Stackmatix partners with Pareto to enhance outbound sales processes.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Pareto who can refer or advise you

Benefits

Health Insurance

Remote Work Options

Home Office Stipend

Company Equity

Growth & Insights and Company News

Headcount

6 month growth

8%

1 year growth

10%

2 year growth

13%
Tech Funding News
Feb 12th, 2026
Can GenUX fix AI’s UX problem? Heywa Labs raises $5M seed from Cherry — TFN

London-based Heywa Labs has secured a $5 million seed round led by Cherry Ventures, with participation from others.

Tech Funding News
Sep 11th, 2025
Hive Robotics raises €2M for robot swarms

Hive Robotics, a Munich-based startup, has raised €2M in a pre-seed funding round led by b2venture, with participation from VCs and business angels like Firedrop and Pareto. The company aims to enhance multi-domain autonomous robot swarms with its AI-driven control layer, enabling real-time coordination of drones, rovers, and unmanned vessels. Founded by aviation veterans Sebastian Mores and Dr. Burak Yüksel, Hive Robotics seeks to democratize swarming AI for various industries.

Hello Pareto
Sep 9th, 2022
Stackmatix partners with Pareto Inc.

Stackmatix started working with Pareto to build up their outbound process for sales.

Business Insider
Apr 27th, 2022
The memo a 23-year-old used instead of a pitch deck to raise $4.5 million for her startup

For her pre-seed round, Phoebe Yao used a pitch deck, but when she went into her seed round, she decided to switch to a memo — and it was successful.

Medium
Oct 18th, 2021
Founder’s Lessons: Phoebe Yao, CEO of Pareto

Phoebe shares her journey of dropping out of Stanford mid-pandemic, conquering self-doubt as a solo female founder, and building trust and…