Full-Time

Applied AI Engineer

SafetyKit

AI-driven Trust and Safety risk management

No salary listed

San Francisco, CA, USA

In Person

Category
AI & Machine Learning (2)
Responsibilities
  • Turn language models into useful, functional, and beautiful products.
  • Scope and implement entirely novel applications of AI agents (adversarial webcrawling, weapons identification, red-teaming, ingredient extraction, etc.).
  • Architect and implement workflows that power SafetyKit's risk AI agents.
  • Design and conduct rigorous experiments to evaluate models' relative strengths, weaknesses, and optimal use cases.
  • Use codegen models to move 10x faster than you did two years ago.
Desired Qualifications
  • You love language models.
  • You have the patience to be rigorous.
  • You act like an owner in everything you do.
  • People tell you that you pick things up unusually quickly.
  • You love the weird little details of life in our customers' companies.
  • You care about your work and you care about moving fast. A lot.
  • You know more than you need to know. You have an insatiable curiosity.

SafetyKit provides AI-powered Trust and Safety software as a service for online marketplaces and platforms to manage risk and protect users. It analyzes user reports, including audio transcripts and images, automates triage and routing, and integrates with existing CRM systems to categorize and flag tickets. It differentiates itself with a tightly integrated, scalable SaaS solution focused on media-heavy Trust and Safety workflows and seamless CRM integration. Its goal is to reduce platform risk, prevent abuse, and protect users at scale through AI-assisted risk management.

Company Size

N/A

Company Stage

Seed

Total Funding

$2.5M

Headquarters

San Francisco, California

Founded

2023

Simplify's Take

What believers are saying

  • $27 million funding accelerates product development and market expansion across marketplaces.
  • Agentic AI integrates CPSC, ASTM, and Visa/Mastercard compliance frameworks simultaneously across regions.
  • Network detection flags fraud rings and repeated violations by analyzing entire entity relationships.

What critics are saying

  • OpenAI's moderation API outperforms SafetyKit in accuracy and cost, driving customer defection.
  • YC S25 competitor TrustGuard undercuts pricing at 40% lower subscription fees for marketplaces.
  • Hallucination scandals in audio analysis could trigger wrongful bans, lawsuits, and contract terminations.

What makes SafetyKit unique

  • Transparent, policy-grounded AI decisions with explainable reasoning versus black-box ML systems.
  • Multimodal grooming detection analyzing conversation patterns, evasion tactics, and off-platform migration attempts.
  • Rapid policy deployment in hours via LLM interpretation of complex regional and regulatory variations.

Benefits

Remote Work Options

Flexible Work Hours

Health Insurance

401(k) Retirement Plan

401(k) Company Match

Paid Vacation

Paid Sick Leave

Paid Holidays

Hybrid Work Options

Wellness Program

Mental Health Support

Gym Membership

Conference Attendance Budget

Professional Development Budget

Family Planning Benefits

Fertility Treatment Support

Parental Leave

Adoption Assistance

Childcare Support

Employee Discounts

Phone/Internet Stipend

Home Office Stipend

Stock Options

Company Equity