Full-Time

Senior Performance Engineer-Pretraining

Aleph Alpha

Aleph Alpha

201-500 employees

Full-stack generative AI for sovereign enterprises

No salary listed

Bellheim, Germany

Hybrid

US Citizenship, US Top Secret Clearance Required

Category
AI & Machine Learning (2)
,
Required Skills
Python
CUDA
Pytorch
Requirements
  • Are proficient in Python and the PyTorch library.
  • Have a strong engineering background in parallel and/or distributed systems with proven track record of excellence.
  • Have hands-on experience with modern machine learning techniques (especially large language models and their life cycle).
  • Deeply understand the CUDA programming model.
  • Have experience in distributed programming with APIs like NCCL or MPI.
  • Have experience analysing profiling traces with tools such as PyTorch Profiler and Nvidia Nsight.
  • Please note this role requires regular on-site collaboration in Heidelberg as a member of the Training Efficiency Team.
Responsibilities
  • End-to-End Optimization: Profile training loops using PyTorch Profiler, Nsight Systems and Nsight Compute to identify system- and kernel-level bottlenecks in order to maximize model throughput.
  • Distributed Strategy and Topology: Configure and tune composite parallelism strategies (e.g. TP, DP, HSDP/FSDP, EP), optimizing load balance, minimizing critical-path bottlenecks, and managing communication-to-computation trade-offs for large-scale LLM training.
  • Hardware-Aware Modeling: Partner with AI Researchers to define model architectures for hardware efficiency without compromising convergence.
Desired Qualifications
  • Contributions to modern distributed training frameworks (e.g., TorchTitan, Megatron-LM, DeepSpeed).
  • Familiarity with low-precision training formats(MXFP4, MXFP8) and their impact on numerical stability and throughput.
  • A deep understanding of NCCL communication primitives, NVSHMEM or CUDA IPC and their performance.
  • A proven track record of implementing and optimising modern transformer-based model training.
  • A proven track record working on the NVIDIA Blackwell architecture.

Aleph Alpha builds sovereign, human-centric artificial intelligence in Europe. It offers PhariaAI, a full-stack generative AI suite that lets enterprises and governments design and manage bespoke AI use cases, delivered as PhariaAI-as-a-Service. The product works by providing a modular platform where customers specify their AI needs, deploy models, and govern usage within secure, policy-driven environments—often through partnerships with other firms to reach more clients. Compared with global players, Aleph Alpha differentiates itself by focusing on European sovereignty and trust, tailoring solutions for security-critical applications, and operating through a collaboration-based business model rather than a single product sale. Its goal is to provide trustworthy, enterprise-grade AI for government and industry, offering a European alternative to dominant AI providers.

Company Size

201-500

Company Stage

Series B

Total Funding

$641.1M

Headquarters

Heidelberg, Germany

Founded

2019

Simplify Jobs

Simplify's Take

What believers are saying

  • April 2026 Cohere merger values combined entity at $20B, backed by Schwarz Group's $600M.
  • Schwarz Group stake rises to 20%, securing retail sector partnerships and compute via Stackit.
  • thingsTHINKING acquisition bolsters language software for industrial and financial clients.

What critics are saying

  • January 2026 layoffs cut 50 jobs, triggering talent exodus that cripples PhariaAI development.
  • Cohere merger dilutes European sovereignty, diverting clients to OpenAI within 6 months.
  • Schwarz veto power forces retail pivot, alienating government clients by 2027.

What makes Aleph Alpha unique

  • PhariaAI delivers sovereign AI for German government deployments with on-premise data control.
  • Luminous models provide explainable AI, making learned patterns visible for enterprise compliance.
  • Pharia-1 LLM-7B open-source release under Apache 2.0 fosters developer ecosystem.

Help us improve and share your feedback! Did you find this helpful?

Your Connections

People at Aleph Alpha who can refer or advise you

Benefits

Paid Vacation

Wellness Program

Mental Health Support

401(k) Retirement Plan

Subsidized Germany-wide transportation ticket

Budget for additional technical equipment

Flexible Work Hours

Virtual Stock Option Plan

Growth & Insights and Company News

Headcount

6 month growth

2%

1 year growth

8%

2 year growth

14%
Heise Medien GmbH & Co. KG
Jan 30th, 2026
Schwarz Group buys Bosch's stake in Aleph Alpha, expands control to 20%

The Schwarz Group, owner of Lidl and Kaufland, is acquiring Bosch Ventures' stake in German AI startup Aleph Alpha, increasing its holding to approximately 20%. The deal's financial terms were not disclosed and remains subject to regulatory approval. Aleph Alpha raised $500 million in a Series B round in late 2023, with both Schwarz Group and Bosch Ventures participating. The retail group previously held nearly 14% whilst Bosch owned around 6%. Both investors secured special veto rights upon entry. Once heralded as Germany's AI champion, Aleph Alpha has struggled against US competition and now focuses primarily on public sector AI services. Founder Jonas Andrulis stepped down as CEO in October, retaining a 28% stake. New co-CEOs Reto Spörri and Ilhan Scheer are reportedly planning to cut approximately 50 positions as part of a restructuring effort.

Handelsblatt
Apr 24th, 2025
Aleph Alpha Acquires thingsTHINKING GmbH

Aleph Alpha, a company specializing in artificial intelligence, has acquired Thingsthinking, a language software specialist based in Karlsruhe. This acquisition aims to enhance Aleph Alpha's offerings for the industrial and financial sectors.

Tech.eu
Mar 21st, 2025
German Ai Startup Langdock Mulls Us Move

A German AI startup backed by General Catalyst is considering opening a US office, its first overseas office.Founded in 2023, Langdock has an office in Berlin but its executives are spending a considerable amount of time in New York and San Francisco.Lennard Schmidt, Langdock CEO and co-founder, says the US is “more dynamic when it comes to AI than Europe”.He said: “I think for us it’s still on the table if we move to the US.”Schmidt said that should the startup open an office in the US, it would retain its Berlin office.Schmidt also pointed out that Germany was behind the US and UK in commercialising AI research.He said: “In Germany we have these very established institutions around doing deep research, what we do lack is the commercial aspect of taking that research and commercialising some of it.“I feel the UK is in a better position right now, but also Paris, given their ties to the big American companies that have research facilities there.”Germany has some well-known AI companies, such as Helsing, Aleph Alpha and Black Forest Labs.Langdock is looking to capitalise on the fervour around ChatGPT and other LLMs while addressing employer concerns around data sharing and compliance when introducing AI chatbots into the workforce.Langdock has built what is essentially a model agnostic chatbot, which sits between the LLM and a business, that it says employers can roll out centrally, securely, and compliantly to its employees, saying it gives businesses “peace of mind”.Its tech, is says, basically addresses concerns a business might have when introducing ChatGPT, Claude or Gemini into the workforce.Schmidt, who along with his co-founders Jonas Beisswenger and Tobias Kemkes, attended Berlin startup university CODE, says: “We take away all the concerns about what happens to the data because we have essentially all the compliance in place for that in terms of contract work and providers we work with.”One potential benefit of Langdock, whose $3m seed fundraise last year was also backed by La Famiglia and Y Combinator, is that its clients, which include US pharma giant Merck and payment startup Mondu, are not tied to using one LLM for life, but can chop and change as they see fit.While it is relatively easy for individual users to swap models, it is harder for enterprises given compliance and regulatory challenges.Merck, for instance, uses Langdock as its AI base layer, which Merck calls MyGPT (with the Merck chatbot looking very similar to ChatGPT), which is rolled out to around 23,000 employees, about 45 per cent of its workforce.Langdock, which has 15 employees, has European and US clients, but Schmidt points out European firms have more regulatory and data concerns than US firms, given the relative tightness of the rules, along with concerns about data sharing

Verslo žinios
Jan 24th, 2025
Aleph Alpha raises $500M funding round

A startup based in Heidelberg, Germany, named Aleph Alpha, secured $500 million in Series B funding. Of this, €110 million was invested in the company's capital, while the remainder was allocated as research grants. The investment was led by prominent German business giants, including Schwarz Group, SAP, and Bosch Ventures. Despite this significant backing, Aleph Alpha struggled to maintain the pace set by its competitor, OpenAI.

Business Insider
Nov 20th, 2024
Deutsche Bank steigt bei Aleph Alpha ein

Die Deutsche Bank investiert in KI-Startup Aleph Alpha und sichert sich zwei Prozent der Anteile. Auch Earlybird und Burda bauen ihre Beteiligungen aus.