Full-Time

Solutions Engineer

AI Cloud Infrastructure

Novita AI

Novita AI

No salary listed

San Francisco, CA, USA

In Person

Category
Sales & Solution Engineering (2)
,
Required Skills
Microsoft Azure
Python
Tensorflow
Pytorch
AWS
Go
Serverless
Google Cloud Platform
Requirements
  • 5+ years of experience in a customer-facing technical role such as Solutions Engineer, Sales Engineer, or Customer Engineer, preferably within a public cloud, SaaS, or AI/ML company
  • Strong hands-on experience with public cloud platforms (Amazon Web Services, Google Cloud Platform, Microsoft Azure)
  • Solid understanding of the AI/ML landscape, including concepts like large language models, deep learning frameworks (TensorFlow, PyTorch), and the GPU computing stack
  • Proficiency in a programming language like Python or Go is highly desirable
  • Exceptional communication skills with the ability to translate complex technical concepts into clear, concise, and compelling value propositions for diverse audiences
  • Customer-obsessed with a strong sense of empathy and understanding of the sales process
  • Self-starter who thrives in a fast-paced, dynamic startup environment
  • Experience in a role that bridges technical and sales functions
Responsibilities
  • Partner with Account Executives to deeply understand customer needs, technical requirements, and business objectives and architect solutions leveraging the AI infrastructure stack (Model APIs, GPU Instances, Serverless)
  • Lead compelling, customized product demonstrations and hands-on workshops and manage and execute successful proofs of concept to prove value and performance of the platform in the customer environment
  • Articulate the value proposition of the platform to technical and non-technical audiences ranging from engineers to chief executive officers
  • Create and maintain technical sales collateral including whitepapers, best practice guides, and demo scripts and channel feedback from the field to Product and Engineering to influence the product roadmap
  • Provide initial onboarding support and architectural guidance to ensure smooth post-sales transition and long-term customer success
Desired Qualifications
  • Experience in a public cloud, SaaS, or AI/ML company is highly desirable
  • Experience with realizing enterprise AI deployments and customer-facing AI projects
  • Familiarity with GPU infrastructure deployments and inference serving
  • Experience building end-to-end AI pipelines and MLOps workflows

Company Size

N/A

Company Stage

N/A

Total Funding

N/A

Headquarters

N/A

Founded

N/A

Simplify Jobs

Simplify's Take

What believers are saying

  • Hugging Face partnership reaches 5M developers with instant Deploy on Novita feature.
  • LangGraph framework integration demonstrates practical builds for real AI applications at scale.
  • Real-time browser access and computer use capabilities support diverse autonomous agent workloads.

What critics are saying

  • OpenAI's May 1 secure agent execution in GPT-5 API eliminates Novita's niche.
  • Anthropic's April 15 Claude Agents with microVM isolation draws OpenClaw and Hermes workloads.
  • E2B's enhanced sandbox with superior browser automation directly undercuts Novita's core features.

What makes Novita AI unique

  • Sandbox Clone enables parallel computing by duplicating instances with preserved filesystem and memory states.
  • Sub-200ms startup with per-second billing scales thousands of parallel microVMs for autonomous agents.
  • Firecracker microVM isolation prevents credential leakage and cross-agent interference with dedicated kernels.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Retirement Plan

Performance Bonus

Company News

PR Newswire
Apr 14th, 2026
Novita AI partners with Hugging Face to enable instant AI model deployment for 5M developers

Novita AI has partnered with Hugging Face to provide inference services for over five million developers on the platform. The collaboration introduces a "Deploy on Novita" feature, enabling developers to instantly deploy models as production-ready APIs without managing infrastructure or configuration. The partnership launched with day-zero support for Google's Gemma 4 model. Novita AI claims to offer time-to-first-token as low as 50 milliseconds and cost savings up to 50% compared to most inference endpoints. The platform supports over 120 large language models and multimodal models through a single API. According to COO Junyu Huang, the service eliminates complex deployment steps including downloading model weights, configuring environments and provisioning GPU infrastructure, allowing developers to focus on building products rather than managing infrastructure.

BlockchainReporter
Apr 7th, 2026
NanoVita and TermiX partner to architect the "settlement layer" for the emerging AI agent economy.

NanoVita and TermiX partner to architect the "settlement layer" for the emerging AI agent economy. April 7, 2026 1:00 AM Table of contents NanoVita's strategic alliance with TermiX marks an important step towards bringing the theoretical convergence of AI and blockchain technologies into an actual working system. The partnership indicates a transition into maturity of the Agentic Web, where TermiX will serve as a single clearinghouse and settlement facility for the entire ecosystem. This partnership helps eliminate one of the largest bottlenecks in the development of decentralized AI. It provides the means for autonomous agents to engage in transactions, verification, and settlement of their obligations within a high-speed and composable environment. Infrastructure for an autonomous economy. TermiX's operational framework is the core of this collaboration; a framework designed for facilitating AI agents' various interactions. Agentic transactions, agent collaborative commerce, benefit from fast settlement of payments as well as being composable enough that you can take a component from different systems and create a mesh as one. Furthermore, TermiX supports agents in terms of clearing and routing transactions through back-office support with no need for managing transactions themselves. To accomplish this level of service, Blockchain Reporter rely on aligning two ERC standards: ERC-8183 and ERC-8004. These two key ERCs support the vision of a "unified" economy where various agents, potentially built using disparate frameworks, can interpret and fulfil their financial commitments to one another. This move to a unified economy is in line with other movements in the blockchain industry where AI agents are being seen less as tools and more as core participants in the on-chain economy. The role of YZi Labs and standardized frameworks. Backed by YZi Labs, TermiX possesses the institutional and technical strength necessary to set the benchmarks for their industry. The two are trying to create a future where agent infrastructure will have just as much uniformity with the ERC-20 tokens that were popularized last summer in the DeFi revolution. To create a chance for seamless integration, allowing all software components to connect effortlessly like Lego blocks. This plays a crucial role in the NanoVita and TermiX ecosystem, particularly in enabling efficient coordination between autonomous systems. Specifically, an AI agent capable of collecting data can seamlessly pay another agent that verifies that data using cryptographic functions, while TermiX ensures the clearing and settlement process occurs automatically behind the scenes. This level of automation is expected to give rise to Agentic DAOs and provide the infrastructure for fully autonomously decentralized marketplaces. A growing trend in Web3 integration. The rise of Web3 ecosystem collaborations is part of a much larger trend towards embracing real-world utility and complex automation in blockchain applications. Similar collaborations have been observed in sectors such as gaming and sports, as projects increasingly recognize that a standalone ecosystem does not have the capacity to scale. As the AI agent economy continues to expand, there will be greater demand for adequate reward systems and settlement solutions. This has occurred throughout the fitness industry and dance industry; each has evolved toward providing real-world rewards via Web3. TermiX and NanoVita are both creating financial "rails" for these types of reward systems in the AI space. Conclusion. This collaboration signifies a huge advance in achieving a "Post-Human" economy using blockchain technology. With the introduction of this clearing and settlement service layer there is now less friction historically preventing AI agents from functioning in the real world. Once these infrastructure components start functioning, the focus will change from what AI can do, to how they are able to function within an enormous, open and compositional economic system.

The AI Journal Ltd
Mar 24th, 2026
Novita AI named a top AI infrastructure vendor on Ramp.

Novita AI named a top AI infrastructure vendor on Ramp. Novita AI earned its spot as a leading AI infrastructure provider on Ramp's February 2026 trending software list, a distinction based on expenditure from over 50,000 businesses. SAN FRANCISCO, March 24, 2026 /PRNewswire/ - Novita AI, the platform unlocking the power of affordable and reliable AI inference for developers, has been recognized as one of the top AI infrastructure vendors on Ramp's February 2026 Top Software Vendors list. This monthly ranking is based on actual spend from more than 50,000 businesses on its corporate card and bill pay platform. Novita was recognized alongside Cerebras, Runware, Clarifai, Crusoe, and Modal as a trending AI infrastructure company. Previously, AI infrastructure was defined by Ramp as a company that hosts models and serves providers. They have renamed that category "Agent Hosting and Serving" to align with the popularity of agents and releases such as OpenClaw. However, many developers have concerns about agents due to the lack of security measures in place. With Novita AI's Agent Sandbox, users can safely run agents in a fully isolated environment with system level separation. For any other agent infrastructure needs, please visit their website to learn more. About Novita AI Novita AI is an AI agent cloud platform helping developers and startups build, deploy, and scale models and agentic applications with high performance, reliability, and cost efficiency. They offer model APIs, LLM dedicated endpoints, and GPU rentals for both developers and AI-native companies. Novita serves more than 350K developers today, who utilize more than 1T tokens per day to build and run agents. SOURCE Novita AI 15 minutes ago

Novita AI
Apr 29th, 2025
Novita AI Launches Top THUDM Models: GLM-4 Series Model

Novita AI launches top THUDM models: GLM-4 series model.

Novita AI
Mar 19th, 2025
Using DocsGPT with Novita AI: A Step-by-Step Guide

By combining Novita AI's conversational AI capabilities with DocsGPT's streamlined documentation retrieval, this partnership enhances productivity and simplifies workflows.