Discussion about this post

User's avatar
Stavros's avatar

I suppose this is an article meant to target VC types or something because it lacks the usual depth we are used to from Sergei. While I appreciate the discussion, I can't help but point out it in essence a tautology. «The best thing to train on is the thing itself» is immediately self-evident. A more nuanced discussion we are avoiding here is «how much» of the real thing do we need? In your tennis example, you can get pretty strong at playing tennis if you just do drills, then simply transition to playing strong real players. Still, the bulk of the skill development is done on surrogates. I don't think we need every robot to be Roger Federer, although it's a great goal. (I am a tennis fanatic). By the by, I think we can both agree the solution is not a farm of cheap arm operators in a third-world country collecting datasets from scratch for every single thing they can imagine we would like to do. There is no escaping the inductive biases of humans -- they exist in all solutions! Hand-held grippers, simulator design, and 1-to-1 same robot demonstrations. I come from the LLM world, but I dabble in robotics -- please refute me if you fancy!

Expand full comment
Jeremiah Coholich's avatar

This article makes me wonder: what do we consider to be “real-world” data? A model trained on robot data from human teleoperation will learn to solve problems in the way that a human teleoperating a robot would. What are the "gold labels" for real-world robot data? Is it data or gradients from real-world RL? Or will it be from offline RL/success-filtering on data from deployed “spork” models (the data flywheel)?

Expand full comment
10 more comments...

No posts