Full-Time
Posted on 3/14/2025
Develops and deploys large language models
No salary listed
Senior, Expert
H1B Sponsorship Available
Palo Alto, CA, USA
Mistral AI develops and deploys Large Language Models (LLMs) that can understand and generate human-like text. Their main products include an API that allows clients to access these models on a pay-as-you-go basis, making it easy for businesses of all sizes to integrate advanced AI capabilities without large upfront costs. Additionally, Mistral AI offers open-source models that can be accessed through platforms like Hugging Face, encouraging community engagement and contributions to improve the models. This combination of API access and open-source offerings sets Mistral AI apart from competitors by providing flexible and scalable solutions. The company's goal is to ensure accessibility to sophisticated AI tools while fostering continuous innovation through community involvement.
Company Size
201-500
Company Stage
Series B
Total Funding
$1.2B
Headquarters
Paris, France
Founded
2023
Health Insurance
Company Equity
Parental Leave
401(k) Retirement Plan
Paid Vacation
Well-funded French AI model maker Mistral has consistently punched above its weight since debuting its own powerful open source foundation model in fall 2023, but it recently drew criticism from developers on X for its latest release, a proprietary large language model (LLM) called Medium 3, which some viewed as a betrayal of its open source roots and commitment. (Recall that open source models can be taken and adapted freely by anyone, while proprietary models must be paid for, and their customization options are more limited and controlled by the model maker.) But today, Mistral is back and recommitting to the open source AI community, and to AI-powered software development in particular, in a big way. The company has teamed up with open source startup All Hands AI, creators of OpenDevin, to release Devstral, a new open-source language model with 24 billion parameters, purpose-built for agentic AI development. That is much smaller than many rival models, so it requires far less computing power and can be run on a laptop. Unlike traditional LLMs designed for short-form code completions or isolated function generation, Devstral is optimized to act as a full software engineering agent, capable of understanding context across files, navigating large codebases, and resolving real-world issues. The model is now freely available under the permissive Apache 2.0 license, allowing developers and organizations to deploy, modify, and commercialize it without restriction.
G42 and Mistral AI collaborate on AI infrastructure development.
In brief: Mistral Medium 3 rivals Claude 3.7 and Gemini 2.0 at one-eighth the cost, targeting enterprise AI at scale. The model excels in coding and business applications, outperforming Llama 4 Maverick and Cohere Command A in benchmarks. Now live on Mistral La Plateforme and Amazon SageMaker, with Google Cloud and Azure integrations coming soon. Mistral Medium 3 dropped yesterday, positioning the model as a direct challenge to the economics of enterprise AI deployment. The Paris-based startup, founded in 2023 by former Google DeepMind and Meta AI researchers, released what it claims delivers frontier performance at one-eighth the operational cost of comparable models. "Mistral Medium 3 delivers frontier performance while being an order of magnitude less expensive," the company said. The model represents Mistral AI's most powerful proprietary offering to date, distinguishing itself from an open-source portfolio that includes Mistral 7B, Mixtral, Codestral, and Pixtral. At $0.40 per million input tokens and $2 per million output tokens, Medium 3 significantly undercuts competitors while maintaining performance parity.
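To make the quoted per-token pricing concrete, here is a minimal sketch of the arithmetic. The two prices come from the article above; the function name `estimate_cost` and the example workload are hypothetical and chosen only for illustration. Actual billing may differ.

```python
# Prices quoted in the article, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = 0.40
OUTPUT_PRICE_PER_MILLION = 2.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the charge in US dollars for a given token volume.

    Hypothetical helper: it only applies the two list prices above and
    ignores any discounts, minimums, or taxes.
    """
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Assumed workload: 50 million input tokens and 10 million output tokens.
    cost = estimate_cost(50_000_000, 10_000_000)
    print(f"Estimated cost: ${cost:.2f}")
```

Under these assumptions, the input and output sides each contribute $20, so output tokens dominate the bill whenever a workload generates more than one-fifth as many tokens as it reads.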
Independent evaluations by Artificial Analysis, posted on X on May 8, 2025, positioned the model "amongst the leading non-reasoning models with Medium 3 rivalling Llama 4 Maverick, Gemini 2.0 Flash and Claude 3.7 Sonnet," and reported substantial intelligence gains across all seven of the firm's evals. The model excels particularly in professional domains. Human evaluations demonstrated superior performance in coding tasks, with Sophia Yang, a Mistral AI representative, noting that "Mistral Medium 3 shines in the coding domain and delivers much better performance, across the board, than some of its much larger competitors." Benchmark results indicate Medium 3 performs at or above Anthropic's Claude Sonnet 3.7 across diverse test categories, while substantially outperforming Meta's Llama 4 Maverick and Cohere's Command A in specialized areas like coding and reasoning. The model's 128,000-token context window is standard, and its multimodality lets it process documents and visual inputs across 40 languages. But unlike the models that made Mistral famous, users will not be able to modify it or run it locally. Right now, the best source for open source enthusiasts is Mixtral-8x22B-v0.3, a mixture of experts model that runs 8 experts of 22 billion parameters each. Besides Mixtral, the company has over a dozen different open-source models available. Medium 3 is also initially available only for enterprise deployment, not for personal use via Le Chat, Mistral's chatbot interface.
Mistral AI emphasized the model's enterprise adaptation capabilities, supporting continuous pretraining, full fine-tuning, and integration into corporate knowledge bases for domain-specific applications. Beta customers across financial services, energy, and healthcare sectors are testing the model for customer service enhancement, business process personalization, and complex dataset analysis. The API will launch immediately on Mistral La Plateforme and Amazon SageMaker, with a forthcoming integration planned for IBM WatsonX, NVIDIA NIM, Azure AI Foundry, and Google Cloud Vertex. The announcement sparked considerable discussion across social media platforms, with AI researchers praising the cost-efficiency breakthrough while noting the proprietary nature as a potential limitation. The model's closed-source status marks a departure from Mistral's open-weight offerings, though the company hinted at future releases. "With the launches of Mistral Small in March and Mistral Medium today, it's no secret that we're working on something 'large' over the next few weeks," Mistral's Head of Developer Relations Sophia Yang teased in the announcement. "With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we're excited to 'open' up what's to come." Mistral tends to hallucinate less than the average model, which is excellent news considering its size. It's better than Meta Llama-4 Maverick, DeepSeek V3 and Amazon Nova Pro, to name a few.
French startup Mistral AI introduced a new chatbot for companies amid an uptick in revenue. Co-founder and CEO Arthur Mensch addressed the revenue growth in comments to reporters, Reuters reported Wednesday (May 7). “In the last 100 days, we have tripled our business, in particular in Europe and outside of the U.S.,” Mensch said, per the report. “We’ve been… growing in the U.S. quite fast as well.”
French AI startup Mistral has raised boatloads of private funding but has yet to crack the top AI usage charts globally, especially when it comes to enterprise and developer adoption. But that may change starting today: The company just unveiled Le Chat Enterprise, a unified AI assistant platform designed for enterprise-scale productivity and privacy, powered by its new Medium 3 model that outperforms larger ones at a fraction of the cost (here, "larger" refers to the number of parameters, or internal model settings, which typically denote more complexity and more powerful capabilities, but also take more compute resources such as GPUs to run).