Full-Time
De-identification and synthetic data platform
No salary listed
San Francisco, CA, USA
In Person
Tonic.ai provides data management tools that automate data pipelines and protect privacy by de-identifying, subsetting, and synthesizing data for testing and development. It integrates with SQL, NoSQL databases, and data warehouses, allowing users to manage realistic demo data across their data stack without exposing sensitive information. The product works by connecting to an organization’s data sources, applying privacy-preserving transformations and data generation rules, and delivering usable test data through an API and tooling that fit into existing development workflows. Compared with competitors, Tonic.ai emphasizes end-to-end privacy and data realism within automated pipelines, offering tiered subscription plans that scale for startups up to large enterprises and align with developers, data scientists, and QA teams. Its goal is to help companies accelerate software releases while lowering privacy risk and reducing development time spent on data preparation.
Company Size
51-200
Company Stage
Series B
Total Funding
$46.9M
Headquarters
San Francisco, California
Founded
2018
Help us improve and share your feedback! Did you find this helpful?
Competitive salary and equity
Unlimited paid time off
401k plan with employer contribution
Medical, dental, and vision insurance
One Medical membership
Generous parental leave policy
Remote-friendly work environment
Tonic.ai announces General Availability of Tonic Textual for Microsoft Fabric. Privacy-protected, unstructured data now production-ready inside Microsoft Fabric. Tonic.ai, a leader in synthetic data solutions for AI development, announced the general availability of Tonic Textual for Microsoft Fabric, enabling enterprises to detect, sanitize, and prepare unstructured text data for AI and machine learning directly within Fabric. By bringing AI-powered entity detection, redaction, and high-fidelity synthetic text generation into the Fabric ecosystem, Tonic Textual makes sensitive documents, transcripts, and other unstructured text safe to use for production AI workflows while maintaining strong compliance and governance. With Fabric now widely adopted across global enterprises, Tonic Textual empowers data teams to transform sensitive text, from contracts and customer conversations to clinical notes and support transcripts, into privacy-protected, AI-ready datasets without moving data outside of the secure OneLake environment. "Enterprises have mountains of unstructured text data, but privacy and risk concerns create hesitancy around how to use them. With the GA of Tonic Textual in Microsoft Fabric, companies can finally operationalize their data for AI with confidence," said Adam Kamor, Co-founder and Head of Engineering at Tonic.ai. "This integration brings together world-class privacy tooling with the scale and governance of Fabric, giving teams the visibility, control, and speed they need to innovate." Unlocking value from unstructured data at enterprise scale Historically, unstructured data posed a significant barrier to downstream AI use cases due to the complexity of de-identifying sensitive entities like names, IDs, addresses, and healthcare identifiers. Manual or homegrown solutions are slow, error-prone, and difficult to scale, especially in regulated industries such as finance, healthcare, and government. Mar 19, 2026 Prev Next 1 of 42,687 "AI success depends on trusted, governed data, including unstructured text," said Dipti Borkar, Vice President and General Manager of Microsoft OneLake and Fabric ISVs. "AI capabilities in Fabric along with Tonic Textual together unlock backlogs of text and sensitive documents for AI development allowing organizations to move from experimentation to production of agentic workloads with confidence.The General Availability of Tonic Textual in Microsoft Fabric unlocks backlogs of text and sensitive documents for AI development allowing organizations to move from experimentation to production with confidence." With Tonic Textual for Microsoft Fabric, organizations can now: - Automate redaction and synthetic replacement of sensitive text directly inside Fabric. - Preserve compliance with HIPAA, GDPR, CCPA, and other regulatory frameworks. - Keep governance and security centralized within OneLake and the broader Fabric platform. - Scale AI workflows across analytics, RAG (retrieval-augmented generation), model training, and reporting. This integration brings powerful entity detection, customizable redaction policies, and high-fidelity synthetic text generation, all while enabling IT and compliance teams to enforce governance and lineage controls native to the Fabric ecosystem. Broad industry impact Organizations in regulated sectors are already leveraging Tonic Textual and Fabric to expand the footprint of trusted data across their AI initiatives. By unlocking secure access to previously untapped unstructured sources, such as sensitive notes, call center transcripts, and legal documents, enterprises can accelerate innovation without compromising privacy. Getting started Tonic Textual for Microsoft Fabric is generally available worldwide and can be added directly from the Workload Hub in the Microsoft Fabric console. Existing and new customers can begin operationalizing unstructured data for secure AI and analytics workflows today. [To share your insights with Aithority, please write to [email protected]]
Tonic.ai has announced the general availability of Tonic Textual for Microsoft Fabric, enabling enterprises to detect, sanitize and prepare unstructured text data for AI and machine learning directly within Fabric. The integration allows organisations to transform sensitive documents, transcripts and clinical notes into privacy-protected datasets without moving data outside the secure OneLake environment. The solution automates redaction and synthetic replacement of sensitive text whilst preserving compliance with HIPAA, GDPR and CCPA frameworks. Organisations can now operationalise previously untapped unstructured sources, such as call centre transcripts and legal documents, for AI workflows. Founded in 2018, Tonic.ai provides synthetic data solutions for AI development and works with customers including Comcast, eBay and UnitedHealthcare. The integration is now available worldwide through the Workload Hub in Microsoft Fabric.
Tonic.ai has acquired Fabricate, the cutting-edge, schema-first synthetic data tool built by Mockaroo.
Tonic.ai, a San Francisco-based platform for privacy-preserving data synthesis, has acquired Fabricate, a synthetic data generation product by Mockaroo. The deal amount was undisclosed. Mark Brocato, Fabricate's creator and Mockaroo CEO, will join Tonic.ai to lead product development. This acquisition enhances Tonic.ai's offerings, providing comprehensive synthetic data solutions for software and AI teams, supporting structured and unstructured data, and AI-ready data pipelines.
The Endless Thirst For More DataAs the world became more digitized, it started producing and requiring increasing amounts of data. This poses a problem, as said data is often associated with real people and real companies that might have serious privacy concerns.This has become an even bigger issue with the emergence of AI, which is able not just to do statistical analysis on batches of data but also to comb through and analyze the dataset in-depth at all levels, from an individual person to billions of numerical entries.Data is now so essential to the modern economy that demand for real, high-quality data has grown exponentially. At the same time, stricter data privacy rules and ever-larger AI models have made gathering and labeling real data increasingly difficult or impractical. – IBM ResearchThis is why synthetic data was invented as a solution. Those data replicate real-world data but do not contain any private data that could cause issues. They also can be modified and adapted to specific use cases, rare situations, or anything the statistician or tester using them might need.Here too, AI has been transformative