
Human toddlers are inspiring new approaches to robot learning

It’s an exciting time for robotic learning. Organizations have spent decades building complex datasets and pioneering different ways to teach systems to perform new tasks. It seems we’re on the cusp of some real breakthroughs when it comes to deploying technology that can adapt and learn on the fly.

Over the past year, we’ve seen a large number of fascinating studies. Take VRB (Vision-Robotics Bridge), which Carnegie Mellon University showcased back in June. The system can apply what it learns from YouTube videos to different environments, so a programmer doesn’t have to account for every possible variation.

Last month, Google’s DeepMind robotics team showed off its own impressive work, in the form of RT-2 (Robotic Transformer 2). The system is able to abstract away the minutiae of performing a task. In the example given, telling a robot to throw away a piece of trash doesn’t require a programmer to teach the robot to identify specific pieces of trash, pick them up and throw them away in order to perform a seemingly simple (for humans, at least) task.




Additional research highlighted by CMU this week compares its work to early-stage human learning. Specifically, the robotic AI agent is compared to a three-year-old toddler. For context, the learning is broken up into two categories: active and passive.

Passive learning in this instance is teaching a system to perform a task by showing it videos or training it on the aforementioned datasets. Active learning is exactly what it sounds like — going out and performing a task and adjusting until you get it right.

RoboAgent, which is a joint effort between CMU and Meta AI (yes, that Meta), combines these two types of learning, much as a human would. Here that means observing tasks being performed via the internet, coupled with active learning by way of remotely teleoperating the robot. According to the team, the system is able to take learnings from one environment and apply them to another, similar to the VRB system mentioned above.

“An agent capable of this sort of learning moves us closer to a general robot that can complete a variety of tasks in diverse unseen settings and continually evolve as it gathers more experiences,” Shubham Tulsiani of CMU’s Robotics Institute says. “RoboAgent can quickly train a robot using limited in-domain data while relying primarily on abundantly available free data from the internet to learn a variety of tasks. This could make robots more useful in unstructured settings like homes, hospitals and other public spaces.”

One of the cooler bits of all of this is the fact that the dataset is open source and universally accessible. It’s also designed to be used with readily available, off-the-shelf robotics hardware, meaning researchers and companies alike can both utilize and build out a growing trove of robot data and skills.

“RoboAgents are capable of much richer complexity of skills than what others have achieved,” says the Robotics Institute’s Abhinav Gupta. “We’ve shown a greater diversity of skills than anything ever achieved by a single real-world robotic agent with efficiency and a scale of generalization to unseen scenarios that is unique.”

Image Credits: CMU

This is all super promising stuff when it comes to building and deploying multipurpose robotics systems with an eye toward eventual general-purpose robots. The goal is to create technology that can move beyond the repetitive machines in highly structured environments that we tend to think of when we think of industrial robots. Actual real-world use and scaling are, of course, a lot easier said than done.

We are much closer to the beginning than the end when it comes to these approaches to robotic learning, but we’re moving through an exciting period for emerging multipurpose systems.



source https://techcrunch.com/2023/08/08/human-toddlers-are-inspiring-new-approaches-to-robot-learning/
