Robots Are Finally Starting to Work
For decades, general-purpose robotics seemed perpetually out of reach — a dream deferred by vertically integrated complexity, data scarcity, and the sheer cost of iteration. Now, Physical Intelligence claims to have cracked the code with a foundation model that can control any robot to do any task, bridging the gap between research demos and real-world deployments in warehouses and laundromats. Co-founder Kuang Vang reveals how cross-embodiment training, cloud-hosted inference, and tight partnerships with vertical robotics startups like Weave and Ultra are driving a «Cambrian explosion» of companies that no longer need 20 years of robotics expertise to ship autonomous systems. Is this the GPT-1 moment for robotics, or are we still years away from true general-purpose autonomy?
Points clés
A generalist model trained on 10+ robot platforms outperforms specialists by 50%, proving cross-embodiment training unlocks scaling laws for robotics.
Physical Intelligence hosts models in the cloud with ~50ms latency, eliminating the need for expensive on-robot compute and enabling rapid updates across fleets.
Real deployments with Weave (laundry folding) and Ultra (e-commerce packing) achieved break-even economics in weeks, not years, using mixed-autonomy systems.
The playbook for vertical robotics startups: understand workflow, pick cheap hardware, collect data, run mixed-autonomy trials, hit break-even, then scale robots.
Zero-shot task execution on unseen robots is emerging, though tasks still require human oversight — full autonomy remains a peeling-the-onion process, not a sudden leap.
En bref
Physical Intelligence has built a foundation model that runs in the cloud, works across dozens of robot types, and is already deployed in real warehouses and laundromats — dramatically lowering the barrier for founders to build profitable vertical robotics companies without deep hardware expertise.
The GPT-1 Moment for Robotics
Physical Intelligence aims to build a single model controlling any robot for any task.
Physical Intelligence's mission is audacious: create a foundation model that can control any robot to perform any physically feasible task at a performance level useful to people in all walks of life. Co-founder Kuang Vang frames this not as a single «ChatGPT moment» but as an incremental peeling of the onion — starting with a strong base model imbued with common-sense knowledge, deploying mixed-autonomy systems (like self-driving cars today), and iteratively improving through exposure to real-world edge cases. One day, the system wakes up fully autonomous.
Historically, robotics has been a graveyard of broken promises because it demands mastery of three pillars: semantics (understanding the world), planning (deciding what to do), and real-time control (executing actions in a changing environment). Language models unlocked semantics; papers like SayCan, RT-2, and PaLM-E brought common sense into robots by adapting vision-language models to «speak robot language.» The breakthrough came with cross-embodiment training: the Open X-Embodiment dataset trained a single model on 10+ robot platforms, and the generalist outperformed specialists by 50 percent. That result shattered the old paradigm that each robot needed bespoke engineering.
Physical Intelligence now deploys models hosted entirely in the cloud, querying API endpoints in real time during high-frequency control loops. By burying inference time within the robot's execution cycle and using «real-time chunking» (pre-computing overlapping action sequences), the team eliminated the need for expensive on-robot compute. Kuang admits he has never seen some partner robots in person and intentionally avoids learning their internal systems — proving the model can parachute into any hardware stack and just work.
«It still blows my mind… I didn't know if this would exist even in my entire lifetime.»
Folding laundry proves robots can handle infinite variability in deformable objects.
“It still like blows my mind to see a robot actually folding laundry because I remember until basically until chat GPT I didn't know if this would exist even in my entire lifetime because like folding laundry I mean it's it's always been like the Turing test for robotics because there's no way to like deterministically program a system the way that you did like preAI to do this because the space is like so infinite.”
Real-World Deployments: Weave and Ultra
Emergent Zero-Shot Capabilities (Unpublished)
PI's next models execute complex tasks without any task-specific training data.
Emergent Zero-Shot Capabilities (Unpublished)
Kuang teased unpublished results showing zero-shot task execution across multiple robot types — tasks that last year required hundreds of hours of data collection. These aren't trivial demos: they involve precision, multi-object reasoning, and robust generalization. The team intentionally tested diverse task «flavors» to rule out luck, and the property appears general across the model.
The Playbook for Vertical Robotics Startups
Six steps to profitably deploy robots without deep robotics expertise or capital.
Understand the Workflow Identify where inserting a robot into an existing process will make the biggest economic difference. Don't try to replace humans wholesale — find the highest-value insertion point.
Choose Cheap, Scrappy Hardware You don't need ultra-precise, expensive robots. Foundation models are reactive enough to compensate for hardware inaccuracies, so optimize for cost and iteration speed.
Collect Data and Run Evals Build the ability to collect task-specific data and run real-world evaluations. Evaluation infrastructure scales super-linearly with task complexity, so invest early.
Deploy Mixed Autonomy Launch with a human-in-the-loop system where operators intervene when the robot fails. This lets you ship quickly while the model learns from corrections.
Hit Economic Break-Even Optimize the system until deploying one more robot doesn't lose money. Break-even is the unlock for scaling fleet size without bleeding capital.
Scale the Robot Fleet Once unit economics work, add robots aggressively. The cost of starting a robotics business has collapsed; focus on differentiation, not vertical integration.
Key Numbers Behind the Breakthrough
Cross-embodiment, cloud inference latency, and data scale define the new paradigm.
Why Robotics Was Hard — and Why It Isn't Anymore
Data scarcity, hardware fragmentation, and vertical integration collapsed as barriers.
Robotics historically failed because it demanded full vertical integration: custom hardware, proprietary autonomy stacks, safety certifications, and direct customer relationships. The upfront capital and expertise required kept the industry in a «mainframe era» — only giant enterprises could afford deployment. Data was the bottleneck: there was no «internet of robot data» to train on, and even within a single lab, no two robots were identical due to hardware drift, software updates, and component variations.
Physical Intelligence's insight was to treat hardware heterogeneity as a feature, not a bug. By training cross-embodiment models from day one, the team discovered emergent abstractions: the model learned how to control «a robot,» not just «this specific robot.» That meant old data from drifted hardware remained useful, and new robot types could be onboarded without retraining from scratch. Cloud-hosted inference further decoupled model development from hardware constraints, letting founders iterate on software without worrying about obsolete compute units on每个robot.
The result is a new equation for startup founders: you no longer need a PhD in robotics, a custom-built robot, or years of classical-control engineering. You need workflow understanding, scrappy hardware procurement, data-collection rigor, and the ability to run mixed-autonomy trials. Physical Intelligence provides the foundation model; vertical startups provide domain expertise and go-to-market. Kuang explicitly framed this as enabling a «Cambrian explosion» — not hyperbole, but a measured prediction that hundreds of robotics companies will emerge in the next few years.
The Founding Team and Company Structure
Personnes
Glossaire
Avertissement : Ceci est un résumé généré par IA d'une vidéo YouTube à des fins éducatives et de référence. Il ne constitue pas un conseil en investissement, financier ou juridique. Vérifiez toujours les informations auprès des sources originales avant de prendre des décisions. TubeReads n'est pas affilié au créateur de contenu.