
Work Here?
Work Here?
Work Here?
Black Forest Labs builds AI-powered image generation tools. Its flagship model, FLUX.1, delivers strong prompt adherence, high visual quality, and diverse outputs, and is offered through partnerships and licensing for commercial use, as well as customized enterprise solutions. They serve a broad audience from individual developers to large enterprises. For non-commercial use, models like 1 dev and 1 schnell are available on platforms such as HuggingFace and under Apache 2.0, providing options for local development and personal use. The company differentiates itself through a combination of high-quality image generation, flexible access (enterprise licensing, partnerships, and non-commercial options), and a clear focus on practical deployment. The goal is to advance AI-based image generation and make reliable, high-quality models accessible to a wide range of users and applications.
Industries
Data & Analytics
Enterprise Software
AI & Machine Learning
Company Size
51-200
Company Stage
Series B
Total Funding
$431M
Headquarters
Freiburg, Germany
Founded
2024
Help us improve and share your feedback! Did you find this helpful?
Total Funding
$431M
Above
Industry Average
Funded Over
3 Rounds
Industry standards
Health Insurance
Dental Insurance
Vision Insurance
Paid Vacation
Paid Sick Leave
Paid Holidays
Flexible Work Hours
Hybrid Work Options
Remote Work Options
401(k) Retirement Plan
401(k) Company Match
Wellness Program
Gym Membership
Mental Health Support
Conference Attendance Budget
Professional Development Budget
Training Programs
Tuition Reimbursement
Stock Options
Company Equity
Phone/Internet Stipend
Home Office Stipend
Family Planning Benefits
Fertility Treatment Support
Adoption Assistance
Parental Leave
Relocation Assistance
Meal Benefits
Wellness Program
Employee Discounts
Company Social Events
Black Forest Labs' new Self-Flow technique makes training multimodal AI models 2.8x more efficient. March 4, 2026 To create coherent images or videos, generative AI diffusion models like Stable Diffusion or FLUX have typically relied on external "teachers" - frozen encoders like CLIP or DINOv2 - to provide the semantic understanding they couldn't learn on their own. But this reliance has come at a cost: a "bottleneck" where scaling up the model no longer yields better results because the external teacher has hit its limit. Keep Watching Today, German AI startup Black Forest Labs (maker of the FLUX series of AI image models) has announced a potential end to this era of academic borrowing with the release of Self-Flow, a self-supervised flow matching framework that allows models to learn representation and generation simultaneously. By integrating a novel Dual-Timestep Scheduling mechanism, Black Forest Labs has demonstrated that a single model can achieve state-of-the-art results across images, video, and audio without any external supervision. The technology: breaking the "semantic gap" The fundamental problem with traditional generative training is that it's a "denoising" task. The model is shown noise and asked to find an image; it has very little incentive to understand what the image is, only what it looks like. To fix this, researchers have previously "aligned" generative features with external discriminative models. However, Black Forest Labs argues this is fundamentally flawed: these external models often operate on misaligned objectives and fail to generalize across different modalities like audio or robotics. The Labs' new technique, Self-Flow, introduces an "information asymmetry" to solve this. Using a technique called Dual-Timestep Scheduling, the system applies different levels of noise to different parts of the input. The student receives a heavily corrupted version of the data, while the teacher - an Exponential Moving Average (EMA) version of the model itself - sees a "cleaner" version of the same data. The student is then tasked not just with generating the final output, but with predicting what its "cleaner" self is seeing - a process of self-distillation where the teacher is at layer 20 and the student is at layer 8. This "Dual-Pass" approach forces the model to develop a deep, internal semantic understanding, effectively teaching itself how to see while it learns how to create. Product implications: faster, sharper, and multi-modal. The practical results of this shift are stark. According to the research paper, Self-Flow converges approximately 2.8x faster than the REpresentation Alignment (REPA) method, the current industry standard for feature alignment. Perhaps more importantly, it doesn't plateau; as compute and parameters increase, Self-Flow continues to improve while older methods show diminishing returns. The leap in training efficiency is best understood through the lens of raw computational steps: while standard "vanilla" training traditionally requires 7 million steps to reach a baseline performance level, REPA shortened that journey to just 400,000 steps, representing a 17.5x speedup. Black Forest Labs' Self-Flow framework pushes this frontier even further, operating 2.8x faster than REPA to hit the same performance milestone in roughly 143,000 steps. Taken together, this evolution represents a nearly 50x reduction in the total number of training steps required to achieve high-quality results, effectively collapsing what was once a massive resource requirement into a significantly more accessible and streamlined process. Black Forest Labs showcased these gains through a 4B parameter multi-modal model. Trained on a massive dataset of 200M images, 6M videos, and 2M audio-video pairs, the model demonstrated significant leaps in three key areas: * Typography and text rendering: One of the most persistent "tells" of AI images has been garbled text. Self-Flow significantly outperforms vanilla flow matching in rendering complex, legible signs and labels, such as a neon sign correctly spelling "FLUX is multimodal". * Temporal consistency: In video generation, Self-Flow eliminates many of the "hallucinated" artifacts common in current models, such as limbs that spontaneously disappear during motion. * Joint video-audio synthesis: Because the model learns representations natively, it can generate synchronized video and audio from a single prompt, a task where external "borrowed" representations often fail because an image-encoder doesn't understand sound. In terms of quantitative metrics, Self-Flow achieved superior results over competitive baselines. On Image FID, the model scored 3.61 compared to REPA's 3.92. For video (FVD), it reached 47.81 compared to REPA's 49.59, and in audio (FAD), it scored 145.65 against the vanilla baseline's 148.87. From pixels to planning: the path to world models. The announcement concludes with a look toward world models - AI that doesn't just generate pretty pictures but understands the underlying physics and logic of a scene for planning and robotics. By fine-tuning a 675M parameter version of Self-Flow on the RT-1 robotics dataset, researchers achieved significantly higher success rates in complex, multi-step tasks in the SIMPLER simulator. While standard flow matching struggled with complex "Open and Place" tasks, often failing entirely, the Self-Flow model maintained a steady success rate, suggesting that its internal representations are robust enough for real-world visual reasoning. Implementation and engineering details. For researchers looking to verify these claims, Black Forest Labs has released an inference suite on GitHub specifically for ImageNet 256x256 generation. The project, primarily written in Python, provides the SelfFlowPerTokenDiT model architecture based on SiT-XL/2. Engineers can utilize the provided sample.py script to generate 50,000 images for standard FID evaluation. The repository highlights that a key architectural modification in this implementation is per-token timestep conditioning, which allows each token in a sequence to be conditioned on its specific noising timestep. During training, the model utilized BFloat16 mixed precision and the AdamW optimizer with gradient clipping to maintain stability. Licensing and availability. Black Forest Labs has made the research paper and official inference code available via GitHub and their research portal. While this is currently a research preview, the company's track record with the FLUX model family suggests these innovations will likely find their way into their commercial API and open-weights offerings in the near future. For developers, the move away from external encoders is a massive win for efficiency. It eliminates the need to manage separate, heavy models like DINOv2 during training, simplifying the stack and allowing for more specialized, domain-specific training that isn't beholden to someone else's "frozen" understanding of the world. Takeaways for enterprise technical decision-makers and adopters. For enterprises, the arrival of Self-Flow represents a significant shift in the cost-benefit analysis of developing proprietary AI. While the most immediate beneficiaries are organizations training large-scale models from scratch, the research demonstrates that the technology is equally potent for high-resolution fine-tuning. Because the method converges nearly three times faster than current standards, companies can achieve state-of-the-art results with a fraction of the traditional compute budget. This efficiency makes it viable for enterprises to move beyond generic off-the-shelf solutions and develop specialized models that are deeply aligned with their specific data domains, whether that involves niche medical imaging or proprietary industrial sensor data. The practical applications for this technology extend into high-stakes industrial sectors, most notably robotics and autonomous systems. By leveraging the framework's ability to learn "world models," enterprises in manufacturing and logistics can develop vision-language-action (VLA) models that possess a superior understanding of physical space and sequential reasoning. In simulation tests, Self-Flow allowed robotic controllers to successfully execute complex, multi-object tasks - such as opening a drawer to place an item inside - where traditional generative models failed. This suggests that the technology is a foundational tool for any enterprise seeking to bridge the gap between digital content generation and real-world physical automation. Beyond performance gains, Self-Flow offers enterprises a strategic advantage by simplifying the underlying AI infrastructure. Most current generative systems are "Frankenstein" models that require complex, external semantic encoders often owned and licensed by third parties. By unifying representation and generation into a single architecture, Self-Flow allows enterprises to eliminate these external dependencies, reducing technical debt and removing the "bottlenecks" associated with scaling third-party teachers. This self-contained nature ensures that as an enterprise scales its compute and data, the model's performance scales predictably in lockstep, providing a clearer ROI for long-term AI investments.
Black Forest Labs unleashes Flux.2 [klein]: open source AI image generation hits sub-second speeds. Black Forest Labs' open source Flux.2 [klein] model achieves sub-second AI image generation, challenging proprietary models with unprecedented speed and accessibility. The world of generative AI just got a major speed boost. Black Forest Labs has officially launched Flux.2 [klein], a new open source model designed to generate high-quality AI images in under a second. This release signals a significant shift in the accessibility and speed of text-to-image generation, challenging proprietary models that often require substantial cloud resources for rapid output. Key takeaways. * Flux.2 [klein] achieves sub-second image generation, dramatically improving workflow speed for creators. * Being open source, the model lowers the barrier to entry for developers wanting to build customized AI visual tools. * This speed leap forces competitors to re-evaluate their efficiency benchmarks for generative models. * The technical architecture focuses on distillation and efficiency, moving beyond sheer parameter count. What happened. Black Forest Labs unveiled Flux.2 [klein], an image generation model that leverages novel techniques to drastically cut down the time required to render complex visuals from text prompts. While many leading models take several seconds, even on powerful hardware, Flux.2 aims for near-instantaneous results. This speed isn't achieved by cutting corners on quality. The team focused on refining the underlying diffusion process, making each step of the generation pipeline incredibly efficient. Think of it like optimizing a complex assembly line; instead of slowing down production, they made every machine work faster without sacrificing the final product's integrity. Why this matters. Speed is the next major battleground in generative AI, moving beyond just raw capability. For professional workflows, seconds matter, translating directly into cost savings and faster iteration cycles. Open source availability is the critical differentiator here. By releasing Flux.2 as open source, Black Forest Labs is democratizing high-speed image synthesis. Previously, achieving this level of speed often required licensing proprietary APIs or running massive, expensive infrastructure. Now, smaller studios or individual developers can integrate cutting-edge speed directly into their applications without massive overhead. This release echoes the impact of early Stable Diffusion releases - making powerful tools accessible outside the walled gardens of major tech companies. It forces giants like Midjourney and OpenAI to justify their proprietary speed advantages or risk being outpaced on efficiency metrics. What's next. Techfeed24 anticipate an immediate surge in local, on-device AI image generation tools. If a model can run this fast locally, the need for constant cloud connectivity diminishes, which is huge for privacy and offline capabilities. Furthermore, this efficiency opens doors for real-time applications, such as dynamic texture generation in video games or instant visual mockups during client meetings. The sub-second benchmark is a new baseline for what consumers will expect from consumer-facing AI tools. The bottom line. Flux.2 [klein] isn't just a faster model; it's an open source catalyst forcing the entire generative AI ecosystem to prioritize efficiency. Black Forest Labs has delivered a powerful tool that lowers the cost of entry while raising the speed ceiling for visual AI creation.
Black Forest Labs launches FLUX.2 klein AI models. What's new? Black Forest Labs debuted FLUX.2 [Klein] 4B via Apache 2.0 and 9B via FLUX non commercial license with sub second inference on consumer GPUs; Black Forest Labs has just introduced the FLUX.2 [klein] model family, targeting developers, creators, and research teams that demand fast, high-quality image generation and editing. The launch includes both the 4B and 9B model variants, with the 4B model released under Apache 2.0 for maximum accessibility and the 9B under the FLUX Non-Commercial License. These models are public and available now for use and testing, with demos open to all. FLUX.2 [klein] stands out for its sub-second inference speeds, capable of generating or editing images in less than 0.5 seconds, while fitting on consumer GPUs with as little as 13GB VRAM. Technical highlights include: * Unified architecture supporting text-to-image, image editing, and multi-reference generation. * Quantized FP8 and NVFP4 versions that reduce VRAM usage by up to 55%. Unlike previous releases, the [klein] models deliver photorealistic outputs with high diversity and production-ready APIs. Benchmarks indicate the 9B model matches or surpasses much larger competitors in both speed and visual quality, and the quantized models offer even broader hardware support. Black Forest Labs is known for pioneering efficient and accessible AI models, and with FLUX.2 [klein], the company is advancing its mission of powering real-time, agent-ready visual intelligence. Early testers have praised the combination of speed, versatility, and open access, noting the potential for rapid prototyping and integration into diverse workflows.
12-19 AI model releases. Google launches Gemini 3 Flash: A speed-optimized AI model that's 3x faster than Gemini 2.5 Pro, with PhD-level reasoning at lower costs ($0.50/1M input tokens). It's now the default for the Gemini app, Search's AI Mode, Vertex AI, and developer tools, enabling high-throughput agentic workflows. OpenAI debuts GPT-5.2 Thinking and GPT-5.2-Codex: New models focused on advanced reasoning and coding, with Codex optimized for developer tasks. Also, GPT Image-1.5 powers faster image generation with precise edits, better text rendering, and availability to all users. Kakao open-sources Kanana-2: An advanced LLM optimized for agentic AI, with enhanced performance and efficiency for in-house applications. Mistral AI releases Devstral 2 and Mistral 3 models: Devstral 2 is a coding-focused model with Vibe CLI for open-source terminal AI development. Mistral 3 includes small dense models (14B, 8B) for advanced reasoning and efficiency. NVIDIA unveils Nemotron 3 family: Open models including Nemotron 3 Nano 30B-A3B, designed for efficient custom AI agent development with 1M token context windows and hybrid architectures. Luma AI introduces new video models: Includes a start-to-end frame video generation model and AI video editing that preserves actor performance. Meta releases SAM Audio: A multimodal sound separation model for advanced audio processing. Microsoft debuts TRELLIS 2: An image-to-3D model for generating detailed 3D assets. Alibaba launches Wan2.6 and Qwen Code v0.5.0: Wan2.6 is a multimodal video model; Qwen Code enhances coding capabilities. Black Forest Labs (bfl_ml) unveils Flux.2 Max: A high-resolution image generation model, integrated into tools like Adobe Photoshop. Rakuten releases Japan's largest LLM: Aimed at enterprise-scale applications. Recent arXiv uploads (December 19, 2025) highlight advancements in computer vision, AI reasoning, and multimodal processing. Key ones include: The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text by Hanlin Wang et al.: A framework for multimodal event generation in scene synthesis. Generative Refocusing: Flexible Defocus Control from a Single Image: Enables post-capture depth-of-field adjustments using generative models. Next-Embedding Prediction Makes Strong Vision Learners: Self-supervised learning via embedding prediction for improved vision representations. Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification: Framework to identify and fix gaps in AI models for fairness and robustness. EasyV2V: A High-quality Instruction-based Video Editing Framework by Jinjie Mai et al.: Instruction-driven pipeline for high-quality video manipulation. DVGT: Driving Visual Geometry Transformer by Sicheng Zuo et al.: Transformer for driving scene perception integrating geometry and vision. AdaTooler-V: Adaptive Tool-Use for Images and Videos by Chaoyang Wang et al.: Adaptive framework for tool integration in multimodal processing. Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning: Combines GANs and RL to boost LLM reasoning. StereoPilot: Learning Unified and Efficient Stereo Conversion via Generative Priors by Guibao Shen et al.: Generative approach for efficient stereo vision and depth estimation. Constructive Circuit Amplification: Improving Math Reasoning in LLMs via Targeted Sub-Network Updates: Novel training for enhancing mathematical reasoning in LLMs. From broader sources: MIT-IBM Watson AI Lab's expressive architecture for better LLM state tracking and reasoning over long texts.
Black Forest Labs secures 300 million US dollars in Series B. 300 million US dollars for visual AI. Black Forest Labs completes Series B and plans the next leap in innovation. The German-American AI startup Black Forest Labs has closed a Series B financing round of 300 million US dollars. This brings the valuation to 3.25 billion US dollars (post-money). The company develops next-generation models for visual artificial intelligence and intends to use the new funding to significantly expand its research and development activities. From start-up to influential technology provider. Black Forest Labs was founded in 2024 with the aim of developing powerful and intuitively controllable image models. These are systems that do not react exclusively to prompt execution, but interpret and creatively implement the user's intentions. Their FLUX series models are among the most popular open image models on Hugging Face and are widely used in numerous enterprise platforms such as Fal.ai, Replicate and TogetherAI. International technology companies such as Adobe, Canva, Meta and Microsoft are also integrating the models to enable new creative applications. Visual intelligence as a long-term vision. While the FLUX models are already widely used, the company is working on an overarching goal: models that combine visual perception, image generation, memory functions and reasoning. The aim is to lay the foundations for genuine "visual intelligence", which would go far beyond traditional image generation. Investors expand commitment. The round is led by Salesforce Ventures and Anjney Midha (AMP). Other investors include Temasek, Bain Capital Ventures, Air Street Capital, Visionaries Club, Canva and Figma Ventures. Existing investors such as a16z, NVIDIA, Northzone, Creandum, Earlybird VC, BroadLight Capital and General Catalyst are also increasing their commitment. The new funds will flow directly into the further expansion of the research program. Strong team of AI pioneers. The core team at Black Forest Labs includes leading developers behind Latent Diffusion, Stable Diffusion and FLUX. With locations in Freiburg and San Francisco, the company is expanding its team and is looking for additional specialists for research, development and infrastructure.
Find jobs on Simplify and start your career today
Industries
Data & Analytics
Enterprise Software
AI & Machine Learning
Company Size
51-200
Company Stage
Series B
Total Funding
$431M
Headquarters
Freiburg, Germany
Founded
2024
Find jobs on Simplify and start your career today