Simplify Logo

Full-Time

Member of Technical Staff

Inference & Model Serving

Posted on 2/29/2024

Cohere

Cohere

501-1,000 employees

Provides AI-powered natural language processing

Hardware
Enterprise Software
Crypto & Web3
AI & Machine Learning
Financial Services
Education

Senior

San Francisco, CA, USA

Category
Backend Engineering
Web Development
Software QA & Testing
Software Engineering
Required Skills
AWS
Go
Natural Language Processing (NLP)
Google Cloud Platform
Requirements
  • Experience with serving ML models
  • Experience designing, implementing, and maintaining a production service at scale
  • Familiarity with inference characteristics of deep learning models, specifically, Transformer based architectures
  • Familiarity with computational characteristics of accelerators (GPUs, TPUs, and/or Inferentia), especially how they influence latency and throughput of inference
  • Strong understanding or working experience with distributed systems
  • Experience in performance benchmarking, profiling, and optimization
  • Experience with cloud infrastructure (e.g. AWS, GCP)
  • Experience in Golang (or, other languages designed for high-performance scalable servers)
Responsibilities
  • Developing, deploying, and operating the AI platform delivering large language models through easy to use API endpoints
  • Working closely with many teams to deploy optimized NLP models to production in low latency, high throughput, and high availability environments
  • Interfacing with customers and creating customized deployments to meet their specific needs

Cohere specializes in language AI, offering powerful embeddings models for understanding text and a Command model for generating and summarizing text. The company emphasizes secure deployment options and customizable models, enabling businesses to leverage world-leading natural language processing (NLP) technology while keeping their data private and secure.

Company Stage

Series C

Total Funding

$440M

Headquarters

Toronto, Canada

Founded

2019

Growth & Insights
Headcount

6 month growth

21%

1 year growth

67%

2 year growth

408%
INACTIVE