Full-Time

Evaluation Engineer

Elicit

Elicit

51-200 employees

AI-powered literature review and data extraction

Compensation Overview

$140k - $170k/yr

+ Equity

Remote in USA + 1 more

More locations: Oakland, CA, USA

Remote

In-person quarterly retreats on the West Coast; time zone overlap required.

Category
Software Engineering (1)
Required Skills
Kubernetes
Python
Git
Apache Spark
SQL
Machine Learning
Postgres
Docker
AWS
REST APIs
Requirements
  • At least 3 years of experience as a professional software engineer, with demonstrated experience building complex backend systems (e.g., backend for a complex website, data pipelines, etc.)
  • Aptitude and interest in evaluating how Elicit helps with pharma decision-making
Responsibilities
  • Build a comprehensive auto-eval platform that runs fast, is easy to use, and supports quickly building new evals
  • Build a lightning-fast basic evals infrastructure that schedules tasks to introduce practically no latency, and develop strategies to solve fundamental sources of latency (building a version of Elicit, running it on a query, and evaluating it using language models)
  • Create interfaces so ML engineers can kick off evals automatically on relevant commits, and so product managers have dashboards showing performance over time and what is going wrong in production
  • Architect the codebase so other team members can understand and build on it, enabling quick addition of examples and running an eval for new features
  • Ensure evaluations are accurate and reliable by encoding real knowledge about how pharma customers make decisions (for example, choosing appropriate gold standards) and providing appropriate statistical tests and confidence intervals
Desired Qualifications
  • Knowledge of statistics (e.g., calculating power and credence intervals for evals)
  • Experience with advanced Python (asyncio/trio and parallel processing strategies)
  • Front-end experience and strong UX sensibility (dashboard development); TypeScript experience is a plus
  • Experience building developer tools (ML engineers are one of your most important clients)
  • Previous experience as a data engineer or working on AI infrastructure
  • Knowledge of pharma/biomed
  • Experience evaluating machine learning systems
  • Experience building language-model-based systems (helps with understanding Elicit and how to evaluate it)

Elicit is an AI-powered research assistant that speeds up complex scholarly workflows by using large language models to search a database of over 125 million papers through natural-language queries. It goes beyond keyword searches by finding semantically relevant studies, extracts specific data from papers, and compiles results into organized tables and summaries. Users can upload their own PDFs for analysis, with outputs grounded in verifiable sources to reduce AI hallucinations. The platform operates as a Public Benefit Corporation, serving individuals and large organizations while offering tiered freemium pricing to improve research efficiency and evidence-based decision making.

Company Size

51-200

Company Stage

Series A

Total Funding

$31M

Headquarters

Oakland, California

Founded

2023

Simplify Jobs

Simplify's Take

What believers are saying

  • Series A $22M at $100M valuation from Spark Capital fuels enterprise expansion.
  • 400,000 monthly users convert via freemium to millions in recurring revenue.
  • Genentech, Novartis partnerships drive life sciences R&D adoption rapidly.

What critics are saying

  • Perplexity Deep Research replicates features, steals 60-80% users in 6-12 months.
  • OpenAI ChatGPT Scholar undercuts freemium, erodes subscriptions in 3-6 months.
  • Semantic Scholar free tools commoditize extraction, collapse moat in 12-18 months.

What makes Elicit unique

  • Elicit automates systematic reviews, screening titles and extracting data 80% faster.
  • Clinical Trials feature launched 2026 targets pharma trial monitoring uniquely.
  • Public Benefit Corporation prioritizes societal reasoning over pure profit.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Elicit who can refer or advise you

Benefits

Health Insurance

Dental Insurance

Vision Insurance

Life Insurance

Flexible Work Hours

Remote Work Options

Paid Vacation

401(k) Retirement Plan

401(k) Company Match

Home Office Stipend

Growth & Insights and Company News

Headcount

6 month growth

-1%

1 year growth

4%

2 year growth

0%
Elicit
Jul 23rd, 2025
Introducing Clinical Trials

Today Elicit is launching Clinical Trials in Elicit.

Elicit
Mar 18th, 2025
How we evaluated Elicit Systematic Review

Last month, Elicit introduced Elicit Systematic Review, a new AI workflow that allows researchers to find papers, screen titles and abstracts, and extract data from full-text papers in 80% less time, without compromising accuracy.

Elicit
Feb 28th, 2025
Elicit Raises $22M to Build the Most Trusted AI Platform for Evidence-Backed Decisions

Elicit has raised $22M in Series A funding at a $100M valuation led by Spark Capital and Footwork. Existing investors Fifty Years, Basis Set, and Mythos also participated, reinforcing their conviction in our mission to deploy AI to radically increase good reasoning in the world. Today, Elicit is used by

Elicit
Feb 26th, 2025
Elicit Secures $22M for AI Platform

Elicit has raised $22 million in Series A funding at a $100 million valuation, led by Spark Capital and Footwork, with participation from existing investors. Elicit, used by over 400,000 researchers monthly, aims to expand its AI platform beyond academic research to support evidence-based decision-making across industries. The funding will help Elicit build infrastructure for good reasoning as AI reshapes the global economy by 2027.

Elicit
Sep 25th, 2023
Elicit raises $9 million and becomes a public benefit corporation

We're excited to share that Elicit has raised $9 million in seed funding. Elicit was born at Ought, a non-profit research organization, and is now an independent public benefit corporation. Like Ought, our mission is to scale up good reasoning using machine learning, starting with researchers. Elicit is designed to