About Resolve AI
Resolve is building AI that operates as a Production Engineer. It investigates and resolves incidents, and handles operational tasks enhancing system reliability, and making on-call stress-free.
Our founders (Spiros Xanthos and Mayank Agarwal) are the core creators of OpenTelemetry and led Splunk Observability. They have 2 successful exits to Splunk and VMware.
We raised a $35M Seed round in September of 2024 from Greylock, Unusual Ventures and a group of AI and tech pioneers, including: Paul Daugherty (CTO at Accenture), Jeff Dean (Chief Scientist, Google DeepMind), Thomas Dohmke (CEO, GitHub), Matt Garman (CEO, AWS), Reid Hoffman (Founder, LinkedIn) and Fei Fei Li (Professor, Stanford).
Let’s make machines be on-call for humans, not the other way around.
Infrastructure Engineer:
We are seeking a Software Engineer with a strong infrastructure focus. This role is essential to building the foundation of scalable, reliable, and secure systems that will power our platform as we continue to grow. You will be instrumental in designing, developing and maintaining the infrastructure that supports our services, with a focus on scalability, security, reliability, efficiency and operational excellence.
Responsibilities:
Design and implement critical infrastructure components, ensuring the systems are scalable, secure, and resilient.
Collaborate closely with product teams to understand infrastructure needs for customer-facing features and ensure the platform meets product demands.
Own infrastructure automation and orchestration, leveraging cloud-native technologies (e.g., AWS, Kubernetes) to drive efficiencies and reduce manual intervention.
Build higher-order abstractions freeing product teams from the low-level infrastructure details and accelerating product development.
Participate in on-call rotations, post-incident reviews, and other operational duties to ensure service delivery quality.
Contribute to a culture of continuous improvement and automation across infrastructure, deployment pipelines, and system monitoring.
Qualifications:
Infrastructure Expertise: Proven experience in building, scaling, and optimizing infrastructure for high-concurrency, high-availability environments.
Cloud & Kubernetes Knowledge: Experienced with cloud platforms (especially AWS) and Kubernetes, with a strong understanding of databases and messaging systems for secure, scalable cloud-native applications.
Automation & Security: Skilled in automating infrastructure operations and enforcing security and compliance best practices.
Experience: 4+ years of relevant industry experience with a focus on infrastructure; Bachelor’s degree, or 2+ years with a Master’s degree.
We are an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, sex, gender, gender identity, sexual orientation, protected veteran status, disability, age, and other characteristics protected by law.