Full-Time

Platform Engineer

Cloud Infrastructure

Posted on 11/30/2023

Stability AI

51-200 employees

Develops open-source AI models for image generation

Data & Analytics
Hardware
Enterprise Software
AI & Machine Learning

Compensation Overview

$133k - $247k Annually

Junior, Mid, Senior

United States

Category
DevOps & Infrastructure
Software Engineering
Required Skills
Kubernetes
Python
Docker
TypeScript
AWS
Google Cloud Platform
Requirements
  • Strong experience in cloud computing and API development, and a deep understanding of High-Performance Computing (HPC) environments, particularly in an AWS setting.
  • Proficiency in programming languages such as Python and TypeScript, essential for API development and integration within AWS and/or Cloudflare environments.
  • Demonstrated expertise in API design, implementation, and maintenance, ensuring security and performance best practices within AWS and Cloudflare.
  • Knowledge of containerization technologies (e.g., Docker, Kubernetes) for deployment of APIs within AWS, Cloudflare, and HPC systems.
  • Familiarity with authentication and authorization protocols (e.g., OAuth, JWT) to ensure secure data exchange between AWS, Cloudflare, and HPC environments.
  • Strong problem-solving skills and the ability to troubleshoot complex issues related to API integrations in a hybrid cloud-HPC setup, particularly in AWS and Cloudflare environments.
  • Excellent communication and collaboration skills to work effectively with diverse teams and stakeholders in AWS and Cloudflare ecosystems.
Responsibilities
  • Design, develop, and maintain robust APIs that facilitate communication and data exchange between cloud-based services, particularly AWS, and HPC environments.
  • Collaborate with cross-functional teams to understand the unique requirements of both cloud-based services and HPC systems, ensuring that the APIs developed meet the specific needs of these environments.
  • Implement best practices for API design, including security, scalability, and performance optimization to ensure efficient interaction between cloud services and HPC clusters.
  • Use services such as Cloudflare to enhance API performance, security, and reliability in cloud-to-HPC communication, optimizing for speed and resilience.
  • Work closely with HPC engineers to identify and address integration challenges, striving for seamless connectivity between diverse systems and cloud-based platforms.
  • Drive innovation by proposing and implementing new API strategies, enhancing the efficiency and functionality of data exchange between AWS, GCP, Cloudflare, and on-premises HPC environments.
  • Create comprehensive documentation and provide training to internal teams on the use and integration of developed APIs, focusing on AWS and Cloudflare environments.
  • Monitor API performance and address issues related to data transfer, ensuring reliability and consistent operation between AWS, Cloudflare, and HPC systems.
  • Collaborate with the security team to ensure that the APIs comply with industry standards and best practices for data privacy and protection, especially in AWS and Cloudflare environments.

Stability AI specializes in developing open-source AI models, particularly Stable Diffusion XL, for text-to-image generation and other applications.

Company Stage

Seed

Total Funding

$176M

Headquarters

London, United Kingdom

Founded

2019

Growth & Insights
Headcount

6 month growth

-14%

1 year growth

-5%

2 year growth

300%
INACTIVE