Full-Time
Updated on 2/12/2025
Transforms unstructured data for machine learning
Senior, Expert
Remote in USA
Remote position with occasional travel; Bay Area residents preferred.
You match the following Unstructured's candidate preferences
Employers are more likely to interview you if you match these preferences:
Unstructured.io transforms raw natural language data into formats suitable for machine learning applications, serving developers and data scientists. It provides open-source libraries and APIs for creating custom preprocessing pipelines that handle various unstructured data types like HTML and PDFs. Unlike competitors, Unstructured.io emphasizes rapid orchestration of these pipelines and secure integration with downstream services. The company's goal is to simplify the use of unstructured data at scale, enabling organizations to make informed, data-driven decisions.
Company Size
51-200
Company Stage
Series B
Total Funding
$63.2M
Headquarters
San Francisco, California
Founded
2022
Help us improve and share your feedback! Did you find this helpful?
Remote Work Options
Unlimited Paid Time Off
Home Office Stipend
Health Insurance
Dental Insurance
Vision Insurance
Professional Development Budget
Intelligent Document Processing provider Unstructured has announced the launch of its new enterprise ETL (extract, transform, and load) platform that automates the complex process of transforming unstructured data in any format and from any source into organizations' GenAI stack.
SANTA CLARA, Calif., Dec. 3, 2024 /PRNewswire/ -- Couchbase, Inc. (NASDAQ: BASE), the developer data platform for critical applications in our AI world, today announced financial results for its third quarter ended October 31, 2024. "I'm pleased with the continued operational progress of the entire Couchbase team," said Matt Cain, Chair, President and CEO of Couchbase. "We delivered top- and bottom-line results that exceeded our outlook, and we achieved another significant milestone with Capella, which now represents 15.1% of our ARR and one third of our customer base. I remain highly confident in our outlook and ability to achieve our objectives in fiscal 2025."
Capgemini, Confluent, IBM, QuantumBlack, AI by McKinsey, and Unstructured join the MongoDB AI Applications Program (MAAP) ecosystem to help organizations make an impact with AI. MongoDB, Meta collaborating to support developers with Meta models and the end-to-end MAAP technology stack. Leading autism and intellectual and developmental disability software provider CentralReach using MAAP to improve AI-powered care platform
Don’t miss OpenAI, Chevron, Nvidia, Kaiser Permanente, and Capital One leaders only at VentureBeat Transform 2024. Gain essential insights about GenAI and expand your network at this exclusive three day event. Learn More. The finalists for the Innovation Showcase will be at Transform: Putting AI to work, July 9-11 in San Francisco. Six companies have been selected to showcase their generative artificial intelligence (AI) products or features that are most likely to disrupt the enterprise.Those selected to present will do so in front of an invite-only audience of 400 industry decision-makers, and receive direct feedback from a panel of enterprise tech analysts, brand executives and others.The 2024 Innovation Showcase finalists are:. Countdown to VB Transform 2024Join enterprise leaders in San Francisco from July 9 to 11 for our flagship AI event. Connect with peers, explore the opportunities and challenges of Generative AI, and learn how to integrate AI applications into your industry
Additionally, DataStax has partnered with unstructured.io to provide structure to unstructured content before it is vectorized, resulting in increased accuracy and precision.
Don’t miss OpenAI, Chevron, Nvidia, Kaiser Permanente, and Capital One leaders only at VentureBeat Transform 2024. Gain essential insights about GenAI and expand your network at this exclusive three day event. Learn More. Retrieval Augmented Generation (RAG) is key to enterprise usage of generative AI, but it’s not as easy as just simply connecting a Large Language Model (LLM) to a database.DataStax is looking to help solve the challenge of enabling RAG for enterprise production deployments, with series of technologies announced today. DataStax is perhaps best known for its commercially supported version of the Apache Cassandra database, known as a DataStax Astra DB. In the last year, DataStax has increasingly focussed on enabling gen AI and specifically RAG, adding vector database search support alongside a data API to build gen AI RAG apps. Now DataStax is pushing further into enterprise RAG, with the release of Langflow 1.0 for building RAG and AI agent workflows
AI-focused big data startup Unstructured raises $40M to make raw data LLM-ready - SiliconANGLE
AI technology company Unstructured raises $40 million in second VC round, led by Menlo Ventures.
SAN FRANCISCO, Dec. 6, 2023 /PRNewswire/ - Yurts Technologies Inc. (Yurts), an enterprise-focused Generative AI integration platform, has officially announced an up to $16 million contract with the United States Special Operations Command (USSOCOM).
Weaviate has integrated with unstructured data management tool UnstructuredIO to ingest your PDFs and GitHub to read and import files directly from your repos.