Full-Time

Devops Engineer

Confirmed live in the last 24 hours

Lightning AI

Lightning AI

51-200 employees

AI development platform for coding and deployment

AI & Machine Learning

Compensation Overview

$120k - $215kAnnually

Junior, Mid

Palo Alto, CA, USA

Hybrid role with a two-day in-office requirement.

Category
DevOps & Infrastructure
DevOps Engineering
Required Skills
Kubernetes
Microsoft Azure
Git
Docker
CloudFormation
AWS
Go
Prometheus
Jenkins
Terraform
Ansible
Development Operations (DevOps)
CircleCI
Google Cloud Platform
Requirements
  • Proven experience as a DevOps Engineer or in a similar role, with a deep understanding of cloud infrastructure (AWS, GCP, or Azure).
  • Expertise in CI/CD tools such as Jenkins, CircleCI, GitHub, or GitLab.
  • Ability to code in golang
  • Experience with infrastructure as code tools like Terraform, Ansible, or CloudFormation.
  • Familiarity with containerization technologies like Docker and Kubernetes.
  • Knowledge of monitoring and logging tools such as Prometheus, Grafana, or ELK stack.
  • A strong security mindset with experience in managing secure cloud environments.
  • Excellent problem-solving skills, attention to detail, and ability to work in a fast-paced, collaborative environment.
Responsibilities
  • Design, build, and maintain scalable infrastructure for deploying, monitoring, and automating our cloud environments.
  • Collaborate closely with development teams to ensure seamless integration and delivery of new features.
  • Implement and manage CI/CD pipelines to improve deployment frequency and reduce manual intervention.
  • Monitor system performance, identify bottlenecks, and develop strategies to improve reliability and performance.
  • Ensure security best practices are followed across infrastructure and deployment processes.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Stay up to date with the latest industry trends and tools to drive innovation in DevOps practices.

Lightning AI offers a platform for developing artificial intelligence applications, covering the entire process from ideation to deployment. It provides tools for coding, prototyping, and training AI models on GPUs, all accessible through a web browser without setup. The platform operates on a subscription model, featuring a cloud-based AI Studio that allows users to code on CPUs, debug on GPUs, and scale their projects. Key features include PyTorch Lightning and Lit-GPT, aimed at optimizing and scaling AI models for developers and enterprises.

Company Stage

Seed

Total Funding

$57M

Headquarters

New York City, New York

Founded

2015

Growth & Insights
Headcount

6 month growth

11%

1 year growth

23%

2 year growth

15%
Simplify Jobs

Simplify's Take

What believers are saying

  • The availability of Lightning AI Studio in AWS Marketplace can lead to greater productivity and faster development times for AI applications.
  • Thunder's ability to significantly speed up training and reduce costs can attract more developers and enterprises to Lightning AI's platform.
  • The strategic collaboration with AWS can provide optimized performance and first-class support, making Lightning AI a preferred choice for building and deploying AI products.

What critics are saying

  • The competitive landscape in AI development platforms is intense, with major players like Google and Microsoft posing significant threats.
  • Dependence on AWS for cloud services could limit flexibility and expose Lightning AI to risks associated with changes in AWS policies or pricing.

What makes Lightning AI unique

  • Lightning AI's integration with AWS Marketplace provides a seamless procurement process and flexible billing options, setting it apart from competitors.
  • The launch of Thunder, a source-to-source compiler for PyTorch, offers up to 40% speed-up in training large language models, a significant advantage over unoptimized code.
  • Strategic collaboration with AWS and support for Amazon EC2 Trn1 instances powered by AWS Trainium accelerators enhances Lightning AI's enterprise-grade cloud-based platform.

Help us improve and share your feedback! Did you find this helpful?