Full-Time

Senior Research Engineer

Model Evaluation

Cohere

Cohere

501-1,000 employees

API-based NLP tools and LLMs

No salary listed

London, UK + 4 more

More locations: Seattle, WA, USA | Toronto, ON, Canada | San Francisco, CA, USA | New York, NY, USA

Hybrid

Remote-flexible; offices in Toronto, New York, San Francisco, London and Paris.

Category
AI & Machine Learning (1)
Required Skills
Kubernetes
MLOps
Python
Tensorflow
Git
Pytorch
Docker
C/C++
Linux/Unix
Data Analysis
Requirements
  • You have deep experience building with and around large language models, and you have built tools for analyzing and understanding their performance
  • You have strong software engineering skills
Responsibilities
  • Develop evaluation benchmarks, datasets, and environments for measuring the bleeding edge of model capabilities
  • Conduct research to push the state-of-the-art in LLM evaluation methods, including training LLM judges; improving evaluation efficiency; and scalably building high-quality datasets
  • Build scalable tools for investigating and understanding evaluation results that are used by all members of technical staff at Cohere, as well as leadership and our CEO
  • Learn from and work with the best researchers and engineers in the field
Desired Qualifications
  • You enjoy pushing the limits of what LLMs are capable of, and you have built high-quality evaluation resources to measure those capabilities (datasets, simulators, environments, etc.)
  • You have a track record of developing new methods and/or data to evaluate LLMs, e.g. publications at top-tier conferences, popular benchmarks, etc.

Cohere provides access to advanced Natural Language Processing (NLP) tools and Large Language Models (LLMs) through a simple API. It serves businesses that want to improve content generation, summarization, and semantic search across multiple languages. The product works by offering API access to pre-trained models that perform tasks like text classification, sentiment analysis, and semantic search; users can customize and integrate these models into their applications, enabling scalable and affordable AI-powered solutions. Cohere differentiates itself with a developer-friendly API, multilingual support, and easy customization to help organizations build smarter and faster AI solutions. The company’s goal is to make powerful generative AI tools accessible to a wide range of customers and use cases, letting them deploy AI features quickly without managing complex models themselves.

Company Size

501-1,000

Company Stage

Series E

Total Funding

$2.1B

Headquarters

Toronto, Canada

Founded

2019

Simplify Jobs

Simplify's Take

What believers are saying

  • Cohere tripled revenue past $150M in 2025, adding RBC, BCE, Dell, Thales, SAP clients.
  • Cohere raised $500M at $6.8B valuation, plus $600M Series E for merger expansion.
  • Cohere built RCM-native LLM with Ensemble using HIPAA-compliant synthetic data.

What critics are saying

  • Nvidia open-sourced Nemotron-4 340B in March 2026, commoditizing Cohere's premium LLMs.
  • OpenAI o3 outperforms Aya by 25% on multilingual benchmarks since April 2026.
  • US export controls tightened May 8, 2026, restrict Cohere's Nvidia GPU access.

What makes Cohere unique

  • Cohere merged with Aleph Alpha in 2026, backed by Schwarz Group's $600M investment.
  • Cohere launched Tiny Aya in 2026, open multilingual models supporting 70+ languages offline.
  • Cohere partnered with SAP for sovereign AI in Canadian ERP cloud for public sector.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

100% Parental Leave top-up

Weekly lunch stipend

Remote Work Options

6 weeks of vacation

Growth & Insights and Company News

Headcount

6 month growth

-2%

1 year growth

-3%

2 year growth

0%
The Associated Press
Mar 31st, 2026
Ensemble and Cohere build first RCM-native LLM for healthcare revenue cycle management

Ensemble, a US revenue cycle management services provider, has partnered with enterprise AI company Cohere to build the healthcare industry's first revenue cycle management-native large language model. The companies are creating a custom model informed by Ensemble's operational expertise and data, designed to handle complex healthcare financial operations more accurately than general-purpose LLMs. The model will be embedded into AI agents managing processes from patient intake to account resolution. Unlike standard approaches that rely on prompt engineering, this system is fine-tuned on real RCM tasks and trained using synthetic datasets in a HIPAA-compliant environment, without using identifiable patient data. The solution aims to enhance existing electronic health record systems by providing better context and guidance for navigating payer requirements whilst reducing administrative burden for healthcare providers.

TechCrunch
Feb 17th, 2026
Cohere launches Tiny Aya, open multilingual AI models supporting 70+ languages on laptops

Cohere has launched Tiny Aya, a family of open-weight multilingual AI models supporting over 70 languages that can run on everyday devices without internet connectivity. The models were unveiled at the India AI Summit by the company's research arm, Cohere Labs. The base model contains 3.35 billion parameters and includes regional variants: TinyAya-Global for broad language support, TinyAya-Earth for African languages, TinyAya-Fire for South Asian languages, and TinyAya-Water for Asia Pacific, West Asia and Europe. South Asian language support includes Bengali, Hindi, Punjabi, Tamil and Telugu. Trained on 64 Nvidia H100 GPUs using modest computing resources, the models enable offline applications like translation, particularly useful in linguistically diverse countries like India. The models are available on HuggingFace, Kaggle and the Cohere Platform.

The Associated Press
Feb 10th, 2026
SAP and Cohere launch sovereign AI solutions in Canada for public sector and regulated industries

SAP and Cohere are expanding their partnership to deliver sovereign AI solutions globally, beginning in Canada. SAP Canada plans to integrate Cohere's agentic platform, North, into its Enterprise Resource Planning Sovereign Cloud environment, creating a complete Sovereign AI Layer for public sector and regulated industries. The integration embeds Cohere's large language models into SAP's Canadian-operated sovereign cloud infrastructure, allowing organisations to deploy advanced AI whilst maintaining data residency and operational control. This addresses the challenge of innovating with AI without compromising security or data sovereignty. A recent SAP AI report found that whilst 71% of organisations rely on data for investment decisions, 75% report incomplete data as a significant challenge. The partnership aims to overcome data fragmentation by embedding AI directly into core SAP applications.

Stockwatch
Dec 30th, 2025
Cohere triples revenue past $150M, lands RBC and BCE as clients

Toronto-based Cohere raised $600 million in 2025, achieving a $7 billion valuation, as the generative AI company secured contracts with major clients including RBC, Bell, Dell, Thales, SAP and LG for its office automation software. The company, which hired researcher Joëlle Pineau as chief AI officer, entered 2025 with approximately $50 million in annualised revenues and exited the year at more than triple that level. Chief executive Aidan Gomez expects dramatic growth to continue in 2026. Cohere has joined an elite group of 77 Canadian technology companies surpassing $100 million in annual revenue, a key threshold for sector maturity. The company also expanded internationally, opening offices worldwide during its breakthrough year.

Microsoft
Oct 6th, 2025
Cohere Raises $500M, Valued at $6.8B

AI startup Cohere Inc. has secured $500 million in new funding, valuing the company at $6.8 billion. This funding round is part of Cohere's strategy to compete with larger tech firms.