Full-Time

Threat Investigator

Trust & Safety

Posted on 12/21/2024

Anthropic

Anthropic

1,001-5,000 employees

Develops reliable and interpretable AI systems

Enterprise Software
AI & Machine Learning

Compensation Overview

$130k - $295kAnnually

Mid, Senior

H1B Sponsorship Available

Seattle, WA, USA + 2 more

More locations: San Francisco, CA, USA | New York, NY, USA

Remote-friendly with travel required.

Category
Applied Machine Learning
Natural Language Processing (NLP)
AI & Machine Learning
Required Skills
Python
SQL
Data Analysis
Requirements
  • Have experience in technical analysis and investigations, including skills in SQL and Python
  • Have experience with large language models and a deep understanding of AI technology
  • Have subject matter expertise in abusive user behavior detection, for example influence operations, coordinated inauthentic behavior patterns, and/or cyber threat intelligence
  • Can derive insights from large amounts of data to make key decisions and recommendations
  • Have experience conducting threat actor profiling and utilizing threat intelligence frameworks
  • Have strong project management skills and the ability to build processes from the ground up
  • Possess excellent communication skills to collaborate with cross-functional teams
Responsibilities
  • Analyze the deployment of our products and services to identify how these systems are being misused or abused, with a particular focus on influence operations
  • Develop abuse signals and tracking strategies to proactively detect adversarial actors
  • Study trends internally and in the broader ecosystem to anticipate how systems could be misused or manipulated for harm in the future, generating and publishing reports
  • Create actionable intelligence reports on new attack vectors, vulnerabilities, and threat actor TTPs targeting LLM systems
  • Utilize the results of deep dive investigations to implement systematic changes to our safety approach to mitigate harm
  • Keep abreast of the latest industry risks, vulnerabilities, and issues related to the use of language models and generative AI; identify opportunities for improvement to our policies, controls, and enforcement mechanisms
  • Forecast how abuse actors will leverage new advances in AI technology and inform safety by design strategies
  • Build and maintain relationships with external threat intelligence partners and information sharing communities
  • Work with cross-functional team members to build out our threat intelligence program, establishing processes, tools, and best practices

Anthropic focuses on creating reliable and interpretable AI systems. Its main product, Claude, serves as an AI assistant that can manage tasks for clients across various industries. Claude utilizes advanced techniques in natural language processing, reinforcement learning, and code generation to perform its functions effectively. What sets Anthropic apart from its competitors is its emphasis on making AI systems that are not only powerful but also understandable and controllable by users. The company's goal is to enhance operational efficiency and improve decision-making for its clients through the deployment and licensing of its AI technologies.

Company Stage

Growth Equity (Venture Capital)

Total Funding

$11.5B

Headquarters

San Francisco, California

Founded

2021

Growth & Insights
Headcount

6 month growth

52%

1 year growth

283%

2 year growth

1205%
Simplify Jobs

Simplify's Take

What believers are saying

  • Growing demand for AI systems with comprehensive contextual knowledge and reasoning abilities.
  • Increased interest in AI models addressing the 'sameness problem' in generative content.
  • Trend towards more versatile and autonomous AI systems like Google's Gemini 2.0.

What critics are saying

  • Increased competition from OpenAI's new ChatGPT features like screen sharing.
  • Google's Gemini 2.0 poses a direct threat to Anthropic's market position.
  • Writer's Palmyra Creative model sets a new standard for creativity in AI.

What makes Anthropic unique

  • Anthropic focuses on AI safety, transparency, and alignment with human values.
  • Claude is designed to handle tasks of any size across various industries.
  • Anthropic emphasizes reliable, interpretable, and steerable AI systems.

Help us improve and share your feedback! Did you find this helpful?