Full-Time

Reinforcement Learning Engineer

MLabs

MLabs

51-200 employees

No salary listed

New York, NY, USA

In Person

Category
Quantitative Finance (1)
Required Skills
Risk Management
Reinforcement Learning
Requirements
  • Production Experience: Proven track record of deploying autonomous learning systems into production environments that directly controlled capital, pricing, traffic, or resources. Candidates must be able to demonstrate a deep understanding of system failures and subsequent remediation.
  • Risk Management: Hands-on experience designing and enforcing hard risk limits, such as capital caps, loss bounds, and circuit breakers, within a live financial or resource-based system.
  • Evaluation Loop Mastery: Experience building policy evaluation loops from scratch, including simulators, replay, counterfactuals, and shadow deployments, prior to live rollout.
  • Empirical Judgment: Ability to make and defend pragmatic technical tradeoffs (e.g., opting for heuristics over RL or bandits over deep RL) based on empirical results rather than theoretical preference.
  • Operational Independence: Demonstrated experience as the primary owner of a complex ML system within a lean environment, operating without the support of dedicated research organizations or external ML platforms.
  • Work Style: Comfort with an intense, fast-paced environment where expectations are high and impact is immediate. Our client operates primarily in-person.
Responsibilities
  • Own the design, shipment, and iteration of an RL-driven trading agent that utilizes real capital to drive ecosystem engagement.
  • Design reward functions and policies that align strictly with product goals while implementing and enforcing absolute downside risk constraints.
  • Build robust evaluation and validation frameworks, including simulation and offline analysis, to minimize reliance on live sequential testing.
  • Manage the safe transition of existing heuristic-based production systems toward advanced learning-based approaches.
  • Serve as the sole RL expert within a small, high-caliber team, maintaining responsibility for the entire lifecycle—from data modeling and deployment to monitoring and safety safeguards.

Company Size

51-200

Company Stage

N/A

Total Funding

N/A

Headquarters

London, United Kingdom

Founded

2018

Simplify Jobs

Simplify's Take

What believers are saying

  • Plutus V3 SOPs enable 30% faster program execution.
  • New primitives BLS12-381 support Ethereum smart contract ports.
  • CIP-1694 governance exposes voting features on SanchoNet.

What critics are saying

  • Intersect grants delay Plutarch updates past May 31, blocking governance.
  • Incomplete bitwise primitives hinder Ethereum porting in 6-12 months.
  • IOG dominance sidelines MLabs innovations in 12-24 months.

What makes MLabs unique

  • MLabs collaborates with IOG on Plutus V3 launch on SanchoNet.
  • Ply library serializes Plutarch validators into CIP-57 blueprints.
  • Upgrading tools like LambdaBuffers for Conway era and Plutus V3.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at MLabs who can refer or advise you

Benefits

Remote Work Options

Hybrid Work Options

Flexible Work Hours

Phone/Internet Stipend

Home Office Stipend

Wellness Program

Mental Health Support

Conference Attendance Budget

Professional Development Budget

Stock Options

Company Equity

401(k) Retirement Plan

401(k) Company Match

Family Planning Benefits

Fertility Treatment Support

Adoption Assistance

Parental Leave

Paid Vacation

Paid Holidays

Paid Sick Leave

Paid Holidays

Health Insurance

Dental Insurance

Vision Insurance

Life Insurance

Disability Insurance

HSA/FSA

Remote Work Options

Company News

Cryphedge
Feb 13th, 2024
Plutus V3 goes live on SanchoNet

Input Output Global has been working on it in collaboration with MLabs.