Simplify Logo

Full-Time

Machine Learning Infrastructure Engineer

Posted on 9/12/2023

Normal Computing

Normal Computing

1-10 employees

Generative AI platform for enterprise applications

Hardware
Enterprise Software
AI & Machine Learning

Mid

New York, NY, USA

Category
Applied Machine Learning
AI & Machine Learning
DevOps & Infrastructure
DevOps Engineering Management
Required Skills
Chef
Kubernetes
Microsoft Azure
Python
Puppet
Apache Flink
NoSQL
Tensorflow
Pytorch
Apache Spark
SQL
Apache Kafka
Docker
AWS
Pandas
Terraform
Ansible
Hadoop
NumPy
Google Cloud Platform
Requirements
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 3+ years of experience in infrastructure engineering with a focus on machine learning, distributed systems, and cloud computing
  • Experience with at least one programming language such as Python or C++
  • Experience using TensorFlow, PyTorch, Jax, NumPy, Pandas, or similar ML/scientific libraries
  • Leadership and collaboration qualities
  • Strong problem-solving skills
  • Strong written and verbal communication skills
  • Knowledge of containerization technologies (Docker, Kubernetes) and cloud platforms like GCP, AWS, Azure
  • Familiarity with ML infrastructure tools and technologies such as Ray, MLflow, Kubeflow, Flyte, or similar platforms
  • Understanding of CI/CD pipelines, infrastructure-as-code (Terraform, CloudFormation), and configuration management tools (Ansible, Puppet, Chef)
  • Experience with big data technologies like Hadoop, Spark, or Flink
  • Familiarity with data storage and processing systems (SQL/NoSQL, Kafka)
  • Passion for staying up-to-date with advancements in AI, ML, and infrastructure technologies
Responsibilities
  • Collaborating with ML research scientists and engineers to optimize and productionize pipelines and workflows
  • Implementing tools, libraries, and frameworks to enable new research
  • Collaborating with Thermodynamic Hardware scientists and engineers to integrate a novel simulation stack and compilation engine
  • Rapid prototyping of machine learning techniques applied to real-world problems
  • Improving model architectures, training, simulation, and compilation procedures
  • Reporting and presenting software developments, experimental results, and analysis
  • Contributing to documentation or educational content and adapting based on feedback
  • Staying up-to-date with industry trends and technologies
  • Mentoring and guiding junior colleagues

Normal Computing offers a generative AI platform for critical enterprise applications, leveraging Probabilistic AI to enhance reliability, adaptivity, and auditability of AI models. The company's technology, developed by former members of Google Brain, Palantir, and Alphabet X, focuses on enabling transformative value in high-stakes enterprise applications through novel full-stack probabilistic machine learning infrastructure driven by thermodynamic physics.

Company Stage

Seed

Total Funding

$8.5M

Headquarters

New York City, New York

Founded

2022

Growth & Insights
Headcount

6 month growth

16%

1 year growth

12%

2 year growth

500%
INACTIVE