Full-Time

Software Engineer

Safeguards Foundations, Internal Tooling

Anthropic

Anthropic

5,001-10,000 employees

Develops reliable, interpretable AI systems

Compensation Overview

£255k - £325k/yr

London, UK

Hybrid

Location-based hybrid policy requires 25% time in office.

Category
Software Engineering (1)
Required Skills
Python
React.js
TypeScript
REST APIs
Requirements
  • Have 4+ years of experience as a software engineer, with meaningful time spent building internal tools, operations platforms, or back-office products
  • Are comfortable using agentic coding tools (e.g. Claude Code) as a core part of your workflow, and can direct them to ship well-tested, production-quality software at a high cadence without lowering the bar (our stack is mostly React/TypeScript and Python)
  • Take a product-minded approach to internal users: you work with the people using your tools, watch where they struggle, and fix it
  • Are results-oriented, with a bias towards flexibility and impact
  • Pick up slack, even if it goes outside your job description
  • Communicate clearly with non-engineering stakeholders and can explain technical trade-offs to operations and policy partners
  • Care about the societal impacts of your work and want to apply your engineering skills directly to AI safety
Responsibilities
  • Design, build, and maintain the internal review and enforcement tooling used by Safeguards analysts — including case queues, content review surfaces, decision/audit logging, and account-actioning workflows
  • Understand user workflows and establish tooling for well processes that may be distributed across a number of tools and UIs
  • Develop the ‘base layer’ of reusable APIs, data storage, and backend services that let new review workflows be stood up quickly and safely
  • Partner with operations and policy teams to understand reviewer pain points, then translate them into clear product improvements that reduce handling time and decision error
  • Integrate tooling with upstream detection systems and downstream enforcement infrastructure so that flagged behaviour flows cleanly from signal → human review → action
  • Build in the guardrails that sensitive internal tools require: granular permissions, audit trails, data-access controls, and reviewer wellbeing features (e.g. content blurring, exposure limits)
  • Instrument the tools you ship — surfacing metrics on queue health, reviewer throughput, and decision quality so the team can see what's working
  • Contribute to the Foundations team's shared platform and on-call responsibilities
Desired Qualifications
  • Have built tooling in a trust & safety, content moderation, fraud, integrity, or risk-operations setting
  • Have experience designing case-management or workflow systems (queues, SLAs, escalation paths, audit logs)
  • Have worked with sensitive data and understand the privacy, access-control, and reviewer-wellbeing considerations that come with it
  • Have experience with Google Cloud Platform or Amazon Web Services, PostgreSQL/BigQuery, and CI/CD in a production environment
  • Have used large language models as a building block inside operational tools (e.g. assisted triage, summarisation, or classification in the review loop)
  • Representative projects: Rebuilding the analyst review queue so cases are routed by severity and skill, with full decision history and one-click escalation; Shipping a unified account-investigation view that pulls signals from multiple detection systems into a single, permissioned surface; Adding content-obfuscation and exposure-tracking features to protect reviewers working with harmful material; Building an internal labeling tool that feeds high-quality ground truth back to the detection and research teams

Anthropic focuses on AI research to build reliable, interpretable, and steerable AI systems. Its main product, Claude, is an AI assistant designed to handle tasks at any scale for clients across industries, delivered through deployment and licensing along with specialized AI R&D services. Claude works by combining natural language processing, human feedback, reinforcement learning, and interpretability techniques to produce a capable, controllable AI assistant that can assist with a wide range of tasks. The company differentiates itself from competitors by prioritizing safety, transparency, and controllability—emphasizing reliability, interpretability of model behavior, and user-controlled steerability in its AI systems. Anthropic’s goal is to make AI systems that people can trust and efficiently use to improve operations and decision-making across sectors.

Company Size

5,001-10,000

Company Stage

Late Stage VC

Total Funding

$77.3B

Headquarters

San Francisco, California

Founded

2021

Simplify Jobs

Simplify's Take

What believers are saying

  • Claude for Small Business targets 36M US firms via QuickBooks and Canva integrations.
  • Japan's three megabanks access Claude Mythos by May 2026 end, expanding finance revenue.
  • Thomson Reuters MCP links Claude to 1.9B Westlaw documents for legal dominance.

What critics are saying

  • Japan FSA's 36-entity group imposes Mythos cybersecurity audits within 6 months.
  • Legal hallucinations trigger malpractice suits against Freshfields using Claude live.
  • Thomson Reuters captures enterprise legal revenue, sidelining Anthropic plugins.

What makes Anthropic unique

  • Anthropic prioritizes AI safety through interpretable and steerable Claude models.
  • Claude integrates Model Context Protocol for grounded legal and business workflows.
  • Constitutional AI framework ensures Claude aligns with human values and transparency.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Anthropic who can refer or advise you

Benefits

Flexible Work Hours

Paid Vacation

Parental Leave

Hybrid Work Options

Company Equity

Growth & Insights and Company News

Headcount

6 month growth

-3%

1 year growth

-3%

2 year growth

1%
Ars Technica
Apr 21st, 2026
Mozilla: Anthropic's Mythos AI model finds 271 zero-day bugs in Firefox 150

Mozilla has discovered 271 security vulnerabilities in Firefox 150 using early access to Anthropic's Mythos Preview AI model. The findings represent a significant increase from the 22 bugs detected by Anthropic's Opus 4.6 model in Firefox 148 last month. Firefox CTO Bobby Holley said Mythos is "every bit as capable" as the world's best security researchers, whilst eliminating the need to "concentrate many months of costly human effort to find a single bug". He believes AI tools like Mythos tilt the cybersecurity balance towards defenders by making vulnerability discovery cheaper. Anthropic released Mythos Preview to a limited group of industry partners earlier this month. Mozilla CTO Raffi Krikorian argues such tools are particularly crucial for open source projects, which often rely on insufficient volunteer maintenance for security.

Bloomberg L.P.
Apr 21st, 2026
Anthropic's Mythos AI sparks fear and hope over cybersecurity threats to global finance

Anthropic's new AI model Mythos has sparked concern amongst policymakers at International Monetary Fund meetings over its potential to accelerate sophisticated cyberattacks on the global financial system. However, its developers argue the technology could provide banks with their strongest defence yet. What distinguishes Mythos is its ability to chain multiple security weaknesses into coordinated attacks, effectively automating complex cyber intrusions. This capability could significantly expand the pool of potential attackers in the near term. The model's creators emphasise a longer-term benefit: the same technology could enable banks to detect and patch vulnerabilities faster than ever, potentially shifting the balance towards defenders if widely adopted. The dual-use nature of Mythos has created both panic and optimism in boardrooms and governments regarding global financial system security.

Bloomberg L.P.
Apr 17th, 2026
Indian fintechs push Anthropic for early access to 'dangerous' Mythos AI model

Indian fintech companies including One97 Communications, Razorpay Software and Pine Labs are pushing Anthropic for early access to Mythos, the AI model that has raised global concerns about cyberattack risks. The firms want to test Mythos on their own systems to detect vulnerabilities following Anthropic's announcement of a limited rollout. The San Francisco-based AI developer considers the model too dangerous for wider release but major Indian financial technology companies are seeking early access to assess potential security threats to their platforms.

Bloomberg L.P.
Apr 16th, 2026
US government prepares to give federal agencies access to Anthropic's Mythos AI model

The US government is preparing to provide major federal agencies with access to Anthropic's new AI model, Mythos, according to a memo reviewed by Bloomberg News. Gregory Barbaccia, federal chief information officer at the White House Office of Management and Budget, informed Cabinet department officials on Tuesday that OMB is establishing protections to enable agencies to use the closely guarded AI tool. The move comes amid concerns that the powerful model could significantly increase cybersecurity risks. OMB is working to set up appropriate safeguards before rolling out access to the system across government departments.

Bloomberg L.P.
Apr 16th, 2026
Anthropic's Mythos AI model raises cybersecurity alarms for banks and governments

Anthropic's new Mythos AI model is causing concern among banks, tech giants and governments over its potential implications for cybersecurity and the internet's future. The model has prompted a scramble amongst major institutions to understand its capabilities and risks. Details about the specific features raising alarms were not disclosed in the source material.