Full-Time

AI Inference Engineer

Confirmed live in the last 24 hours

Perplexity AI

Perplexity AI

201-500 employees

Advanced answer engine providing reliable information

Data & Analytics
Consumer Software

Compensation Overview

$190k - $240kAnnually

+ Equity

Mid, Senior

Hammersmith, London, UK

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Tensorflow
CUDA
Pytorch
Requirements
  • Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
  • Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization, etc.)
  • Experience with deploying reliable, distributed, real-time model serving at scale
  • (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA
Responsibilities
  • Develop APIs for AI inference that will be used by both internal and external customers
  • Benchmark and address bottlenecks throughout our inference stack
  • Improve the reliability and observability of our systems and respond to system outages
  • Explore novel research and implement LLM inference optimizations

Perplexity AI provides an answer engine that delivers accurate and reliable responses to user queries. The platform uses current sources to ensure the information is both precise and relevant. It caters to a wide audience, including individuals looking for quick answers and businesses needing detailed information. Unlike many competitors, Perplexity AI emphasizes high-quality, source-backed answers, making it a valuable resource for users seeking trustworthy data. The company's goal is to meet the increasing demand for immediate access to reliable information, generating revenue through subscription fees, advertising, and partnerships.

Company Stage

N/A

Total Funding

$403.7M

Headquarters

San Francisco, California

Founded

2022

Growth & Insights
Headcount

6 month growth

216%

1 year growth

872%

2 year growth

5540%
Simplify Jobs

Simplify's Take

What believers are saying

  • Perplexity AI's recent $250 million funding round, led by Bessemer Venture Partners and including SoftBank, significantly boosts its financial stability and growth potential.
  • The planned revenue-sharing program with web publishers could create new revenue streams and foster positive relationships with content creators.
  • High-profile endorsements from industry leaders like Nvidia's CEO Jensen Huang and Shopify's CEO Tobi Lütke enhance the company's credibility and market presence.

What critics are saying

  • Accusations of plagiarism and unethical web scraping could damage Perplexity AI's reputation and lead to legal challenges.
  • The competitive landscape of AI search engines, dominated by giants like Google, poses a significant threat to Perplexity AI's market share.

What makes Perplexity AI unique

  • Perplexity AI uniquely focuses on helping individuals and organizations achieve a healthier work-life balance, unlike many AI companies that concentrate on technical or enterprise solutions.
  • Their tailored services for managing work-life balance set them apart from competitors who offer more generalized productivity tools.
  • Perplexity AI's emphasis on setting boundaries and promoting self-care is a distinctive approach in the AI-driven productivity market.

Help us improve and share your feedback! Did you find this helpful?