Full-Time

Staff AI Infrastructure Engineer

Posted on 8/26/2024

Normal Computing

Normal Computing

51-200 employees

Develops generative AI for enterprises

Enterprise Software
AI & Machine Learning

Senior

New York, NY, USA

Category
DevOps & Infrastructure
Site Reliability Engineering
Cloud Engineering
DevOps Engineering
Required Skills
Chef
Kubernetes
Microsoft Azure
Python
Puppet
CloudFormation
AWS
Go
Terraform
Ansible
Google Cloud Platform
Requirements
  • Bachelor's degree or higher in Computer Science, Engineering, or a related field
  • 6+ years of experience in infrastructure engineering, with a focus on machine learning, distributed systems, and cloud computing
  • Experience writing highly performant services and strong knowledge of Golang and/or Python
  • Specific expertise with managing and evolving Kubernetes in production and any cloud platform like GCP, AWS, Azure
  • Expertise in monitoring & alerting, scalable testing, automation, CI/CD frameworks and best practices, infrastructure-as-code (Terraform, CloudFormation), and configuration management tools (Ansible, Puppet, Chef)
  • Leadership and collaboration qualities, enthusiasm for real-world, responsible impact
  • Excellent problem-solving and “getting things done” skills, and a proven ability to troubleshoot and optimize complex systems
  • Strong written and verbal communication skills, with the ability to explain complex concepts to both technical and non-technical stakeholders across research and product
Responsibilities
  • Collaborating closely with software and research engineers to optimize and productionize training, experimentation, and enterprise product deployment infrastructure
  • Designing, implementing, and maintaining individual microservices that collectively form complex, scalable backend systems
  • Implementing tools, libraries and frameworks to speed up and enable new research and productization
  • Be a part of planning and performing rapid prototyping of machine learning techniques applied to real-world scientific and enterprise semiconductor design and engineering problems
  • Make improvements to model architectures, training, simulation, and compilation procedures
  • Practice sustainable incident response and blameless postmortems
  • Staying up-to-date with the latest advancements in AI, ML, and infrastructure technologies
  • Mentoring and guiding junior colleagues, nurturing a collaborative, growth-oriented environment that promotes knowledge sharing and professional development

Normal Computing develops generative AI specifically for critical enterprise applications, focusing on large-scale enterprises like Fortune 500 companies in sectors such as semiconductor manufacturing, supply chain management, banking, and government. Their technology, based on Probabilistic AI, utilizes statistical analysis to predict outcomes, allowing businesses to have greater control over the reliability, adaptivity, and auditability of their AI models. This approach addresses the significant risks that have hindered AI adoption in these industries. Unlike many competitors, Normal Computing tailors its AI solutions to meet the specific needs of its clients, operating on a subscription or contract basis. The goal of Normal Computing is to mitigate risks associated with AI implementation, making it a more viable option for critical applications in high-stakes environments.

Company Stage

Grant

Total Funding

$80.7M

Headquarters

New York City, New York

Founded

2022

Growth & Insights
Headcount

6 month growth

-1%

1 year growth

-5%

2 year growth

6%
Simplify Jobs

Simplify's Take

What believers are saying

  • Selected for ARIA's £50M Scaling Compute Program, boosting funding and visibility.
  • Raised $8.5M in seed funding to advance Probabilistic AI technology.
  • Growing demand for AI solutions in risk-averse industries like banking and government.

What critics are saying

  • Competition from IBM and Intel in thermodynamic computing could challenge market position.
  • Global semiconductor shortages may impact hardware scaling capabilities.
  • AI regulatory changes in the EU and US could increase compliance costs.

What makes Normal Computing unique

  • Normal Computing uses Probabilistic AI for reliable, adaptive enterprise applications.
  • Their thermodynamic computer enhances AI reliability and efficiency, a first in the industry.
  • Founded by ex-Google Brain and X engineers, they have a strong AI development pedigree.

Help us improve and share your feedback! Did you find this helpful?

Benefits

Flexible Work Hours

INACTIVE