Full-Time

Member of Technical Staff

Research Engineer, Inference

Inflection

51-200 employees

Personalized AI assistant for self-improvement

AI & Machine Learning
Education

Compensation Overview

$175k - $325k Annually

Mid, Senior

Palo Alto, CA, USA

Category
Backend Engineering
Software Engineering
Required Skills
LLM
Kubernetes
PyTorch
Docker

Requirements
  • Have experience with deploying and optimizing LLMs for inference, both in cloud and on-prem environments.
  • Are adept at using tools and frameworks for model optimization and acceleration, such as ONNX, TensorRT, or TVM.
  • Enjoy troubleshooting and solving complex problems related to model performance and scaling.
  • Have a deep understanding of the trade-offs involved in model inference, including hardware constraints and real-time processing requirements.
  • Are proficient with PyTorch and familiar with infrastructure management tools like Docker and Kubernetes for deploying inference pipelines.
Responsibilities
  • Optimize model inference processes.
  • Reduce latency and improve throughput without compromising model performance.
  • Ensure robust deployment in enterprise environments.

Inflection AI operates in the artificial intelligence market, focusing on personal assistance and self-improvement. Its main product, Pi, is an AI assistant available on iOS and other platforms that interacts with users in a personalized way. Pi supports journaling, planning, and learning, making it a useful companion for people looking to improve their personal or professional lives. Unlike many task-oriented AI assistants, Pi emphasizes emotional intelligence and personalization, providing users with straightforward explanations and emotional support. Inflection AI likely uses a freemium business model, offering basic services for free while charging for advanced features. The company's goal is to help users organize their lives, learn new things, and reflect on their experiences.

Company Size

51-200

Company Stage

Acquired

Total Funding

$1.5B

Headquarters

Palo Alto, California

Founded

2022

Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for AI-driven personal assistants with emotional support and personalization.
  • Increased interest in AI platforms that enhance enterprise productivity and communication.
  • Expansion of AI capabilities in the freemium model attracts and converts users.

What critics are saying

  • Shift from consumer to enterprise AI may alienate existing Pi users.
  • Acquisition of multiple startups may lead to integration challenges and cultural clashes.
  • Dependency on partners like Intel and NVIDIA could affect independent innovation.

What makes Inflection unique

  • Inflection AI offers a personalized AI assistant named Pi for self-improvement.
  • The company focuses on emotional intelligence and personalization in AI interactions.
  • Inflection AI uses a freemium model to attract and monetize a large user base.

Benefits

Health Insurance

401(k) Company Match

Unlimited Paid Time Off

Parental Leave

Growth & Insights and Company News

Headcount

6 month growth

-5%

1 year growth

-5%

2 year growth

-10%

DataPhoenix
Dec 1st, 2024
Inflection AI recently acquired three AI-focused startups to build its enterprise platform

On Tuesday, Inflection announced it has acquired two more startups, BoostKPI and Jelled.ai, to strengthen two key aspects in its enterprise platform: data and communications.

Business Wire
Nov 27th, 2024
Inflection AI Deepens Commitment to Enterprise AI With Acquisition of BoostKPI and Jelled.ai

Inflection AI today announced two new acquisitions to deepen its best-in-class AI capabilities for enterprises. Inflection for Enterprise launched ear

SWI Pipeline
Nov 26th, 2024
Inflection AI Acquires Two AI Startups

Inflection AI, under new CEO Sean White, shifted its focus to enterprise AI, acquiring startups Jelled.ai and BoostKPI. Originally founded in 2022, Inflection AI gained attention with its AI chatbot "Pi." In July, it announced $1.3 billion in funding. Co-founders, including Mustafa Suleyman, joined Microsoft's AI efforts in a $650 million deal. White stated the company will no longer compete in developing new AI models but will focus on providing AI services to corporate clients.

AI News
Oct 24th, 2024
Inflection's Agentic Workflows Bring Trust and Action to Enterprise AI

In line with this vision, Inflection has introduced Agentic Workflows into its enterprise offering.

SDxCentral
Oct 22nd, 2024
UiPath partners with Inflection AI to enhance security-focused automation solutions

Inflection AI develops one of the world's leading LLMs and has recently introduced an enterprise-grade AI system to support the largest enterprises.

Business Wire
Oct 22nd, 2024
Inflection AI Acquires Boundaryless to Accelerate Deployments of Trusted Enterprise AI Agents

Inflection AI today announced its acquisition of Boundaryless, a leading Robotic Process Automation (RPA) solution provider in Europe, to support the

Stock Titan
Oct 22nd, 2024
UiPath and Inflection AI Announce Partnership to Bring Agentic AI to Security-Focused Industries

UiPath (NYSE: PATH) has announced a strategic partnership with Inflection AI to integrate the UiPath Platform with Inflection for Enterprise solution.

AiThority
Oct 21st, 2024
Inflection AI Acquires Boundaryless to Accelerate Deployments of Trusted Enterprise AI Agents

Inflection AI today announced its acquisition of Boundaryless, a leading Robotic Process Automation (RPA) solution provider in Europe, to support the rapid deployment of its AI agent capabilities within the enterprise.

ToolHunt
Oct 8th, 2024
Inflection AI Teams Up with Intel to Launch New LLM Appliance

Inflection AI has announced an exciting partnership with Intel to develop a cutting-edge large language model (LLM) appliance.

IndianWeb2
Oct 8th, 2024
Microsoft-owned Inflection AI and Intel Launch Enterprise AI System

Inflection AI has also collaborated with NVIDIA to develop hardware for generative artificial intelligence.