Full-Time

Forward Deployed AI Engineer

Confirmed live in the last 24 hours

DatologyAI

DatologyAI

11-50 employees

Automated data curation for AI training

Compensation Overview

$180k - $250k/yr

Mid, Senior

Company Does Not Provide H1B Sponsorship

San Carlos, CA, USA

In-office 4 days a week; relocation assistance for employees moving to the Bay Area.

Category
Applied Machine Learning
Deep Learning
AI & Machine Learning
Required Skills
Kubernetes
Microsoft Azure
Python
Apache Spark
Docker
AWS
Data Analysis
Google Cloud Platform
Requirements
  • 4+ years of experience in software or customer engineering roles, with a strong emphasis on customer-facing engagements.
  • Strong programming skills in Python (or equivalent), with experience in data processing frameworks (e.g., Spark, Ray, etc).
  • Familiarity with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).
  • Ability to troubleshoot complex systems and work in fast-paced, customer-facing environments.
  • Excellent communication skills with the ability to translate technical concepts for diverse audiences.
Responsibilities
  • Embed deeply with strategic customers to understand their data curation needs, business challenges, and technical requirements in detail.
  • Prepare detailed scopes of work and project plans for proof-of-concept prototypes and full production deployments.
  • Work hands-on with customers' technical teams as a technical expert and trusted advisor to drive projects to completion on their infrastructure.
  • Troubleshoot and resolve technical challenges related to data pipelines, model training, and infrastructure scalability.
  • Collaborate closely with GTM, Engineering, and Research teams to ensure seamless customer experiences, project success, and actionable product feedback.
  • Provide hands-on support, training, and documentation to enable customer success.
Desired Qualifications
  • Experience working with ML/AI workflows, data pipelines, or large-scale data systems is a plus.

DatologyAI specializes in automated data curation tools that enhance the training of Generative AI models. Its technology automatically selects high-quality data while removing irrelevant or harmful data points, which improves the accuracy and performance of AI models and reduces training costs. The company offers its services to businesses and organizations that rely on AI, allowing them to integrate these tools into their existing data systems with minimal changes. DatologyAI's business model is usage-based, enabling clients to scale their AI capabilities as their data needs grow. The company has received recognition for its technology and has secured funding to support its mission, which is to help businesses train better AI models more efficiently and cost-effectively.

Company Size

11-50

Company Stage

Series A

Total Funding

$57.7M

Headquarters

Redwood City, California

Founded

2023

Simplify Jobs

Simplify's Take

What believers are saying

  • Rising demand for data curation tools as AI models grow in complexity.
  • Opportunities in AI ethics and bias reduction align with industry trends.
  • Expansion into non-tech industries increases potential client base.

What critics are saying

  • Competition from established AI companies investing in data curation.
  • Over-reliance on venture capital funding may lead to financial instability.
  • Emerging privacy regulations could limit data curation capabilities.

What makes DatologyAI unique

  • DatologyAI specializes in automated data curation for GenAI model training.
  • Their technology removes redundant and harmful data, enhancing AI model accuracy.
  • Integration with existing infrastructures is seamless, requiring minimal code adjustments.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Health Insurance

Dental Insurance

Vision Insurance

401(k) Company Match

Unlimited Paid Time Off

Annual Wellness Stipend

Annual Learning and Development Stipend

Relocation Assistance

Growth & Insights and Company News

Headcount

6 month growth

-4%

1 year growth

11%

2 year growth

66%
SiliconANGLE
May 9th, 2024
DatologyAI raises $46M to streamline AI model training data diets

DatologyAI raises $46M to streamline AI model training data diets - SiliconANGLE

Datology AI
Feb 23rd, 2024
Introducing DatologyAI — Making models better through better data, automatically

Models are what they eat. AI models trained on large-scale datasets have demonstrated jaw-dropping abilities and have the power to transform every aspect of our daily lives, from work to play. This massive leap in capabilities has largely been driven by corresponding increases in the amount of data we train models on, shifting from millions of data points several years ago to billions or trillions of data points today. As a result, these models are a reflection of the data on which they’re train

SiliconANGLE
Feb 23rd, 2024
DatologyAI raises $11.65M to automate data curation for more efficient AI training

DatologyAI raises $11.65M to automate data curation for more efficient AI training.

TechCrunch
Feb 22nd, 2024
DatologyAI is building tech to automatically curate AI training datasets | TechCrunch

A new startup, DatologyAI, claims to be able to automatically curate the massive data sets on which AI models train.