Robot Technology News
ROBO SPACE
Engineering household robots to have a little common sense
illustration only
Engineering household robots to have a little common sense
by Jennifer Chu | MIT News
Boston MA (SPX) Mar 26, 2024

From wiping up spills to serving up food, robots are being taught to carry out increasingly complicated household tasks. Many such home-bot trainees are learning through imitation; they are programmed to copy the motions that a human physically guides them through.

It turns out that robots are excellent mimics. But unless engineers also program them to adjust to every possible bump and nudge, robots don't necessarily know how to handle these situations, short of starting their task from the top.

Now MIT engineers are aiming to give robots a bit of common sense when faced with situations that push them off their trained path. They've developed a method that connects robot motion data with the "common sense knowledge" of large language models, or LLMs.

Their approach enables a robot to logically parse many given household task into subtasks, and to physically adjust to disruptions within a subtask so that the robot can move on without having to go back and start a task from scratch - and without engineers having to explicitly program fixes for every possible failure along the way.

"Imitation learning is a mainstream approach enabling household robots. But if a robot is blindly mimicking a human's motion trajectories, tiny errors can accumulate and eventually derail the rest of the execution," says Yanwei Wang, a graduate student in MIT's Department of Electrical Engineering and Computer Science (EECS). "With our method, a robot can self-correct execution errors and improve overall task success."

Wang and his colleagues detail their new approach in a study they will present at the International Conference on Learning Representations (ICLR) in May. The study's co-authors include EECS graduate students Tsun-Hsuan Wang and Jiayuan Mao, Michael Hagenow, a postdoc in MIT's Department of Aeronautics and Astronautics (AeroAstro), and Julie Shah, the H.N. Slater Professor in Aeronautics and Astronautics at MIT.

Language task
The researchers illustrate their new approach with a simple chore: scooping marbles from one bowl and pouring them into another. To accomplish this task, engineers would typically move a robot through the motions of scooping and pouring - all in one fluid trajectory. They might do this multiple times, to give the robot a number of human demonstrations to mimic.

"But the human demonstration is one long, continuous trajectory," Wang says.

The team realized that, while a human might demonstrate a single task in one go, that task depends on a sequence of subtasks, or trajectories. For instance, the robot has to first reach into a bowl before it can scoop, and it must scoop up marbles before moving to the empty bowl, and so forth. If a robot is pushed or nudged to make a mistake during any of these subtasks, its only recourse is to stop and start from the beginning, unless engineers were to explicitly label each subtask and program or collect new demonstrations for the robot to recover from the said failure, to enable a robot to self-correct in the moment.

"That level of planning is very tedious," Wang says.

Instead, he and his colleagues found some of this work could be done automatically by LLMs. These deep learning models process immense libraries of text, which they use to establish connections between words, sentences, and paragraphs. Through these connections, an LLM can then generate new sentences based on what it has learned about the kind of word that is likely to follow the last.

For their part, the researchers found that in addition to sentences and paragraphs, an LLM can be prompted to produce a logical list of subtasks that would be involved in a given task. For instance, if queried to list the actions involved in scooping marbles from one bowl into another, an LLM might produce a sequence of verbs such as "reach," "scoop," "transport," and "pour."

"LLMs have a way to tell you how to do each step of a task, in natural language. A human's continuous demonstration is the embodiment of those steps, in physical space," Wang says. "And we wanted to connect the two, so that a robot would automatically know what stage it is in a task, and be able to replan and recover on its own."

Mapping marbles
For their new approach, the team developed an algorithm to automatically connect an LLM's natural language label for a particular subtask with a robot's position in physical space or an image that encodes the robot state. Mapping a robot's physical coordinates, or an image of the robot state, to a natural language label is known as "grounding." The team's new algorithm is designed to learn a grounding "classifier," meaning that it learns to automatically identify what semantic subtask a robot is in - for example, "reach" versus "scoop" - given its physical coordinates or an image view.

"The grounding classifier facilitates this dialogue between what the robot is doing in the physical space and what the LLM knows about the subtasks, and the constraints you have to pay attention to within each subtask," Wang explains.

The team demonstrated the approach in experiments with a robotic arm that they trained on a marble-scooping task. Experimenters trained the robot by physically guiding it through the task of first reaching into a bowl, scooping up marbles, transporting them over an empty bowl, and pouring them in. After a few demonstrations, the team then used a pretrained LLM and asked the model to list the steps involved in scooping marbles from one bowl to another. The researchers then used their new algorithm to connect the LLM's defined subtasks with the robot's motion trajectory data. The algorithm automatically learned to map the robot's physical coordinates in the trajectories and the corresponding image view to a given subtask.

The team then let the robot carry out the scooping task on its own, using the newly learned grounding classifiers. As the robot moved through the steps of the task, the experimenters pushed and nudged the bot off its path, and knocked marbles off its spoon at various points. Rather than stop and start from the beginning again, or continue blindly with no marbles on its spoon, the bot was able to self-correct, and completed each subtask before moving on to the next. (For instance, it would make sure that it successfully scooped marbles before transporting them to the empty bowl.)

"With our method, when the robot is making mistakes, we don't need to ask humans to program or give extra demonstrations of how to recover from failures," Wang says. "That's super exciting because there's a huge effort now toward training household robots with data collected on teleoperation systems. Our algorithm can now convert that training data into robust robot behavior that can do complex tasks, despite external perturbations."

Research Report:"Grounding Language Plans in Demonstrations Through Counter-Factual Perturbations"

Related Links
Computer Science and Artificial Intelligence Laboratory
All about the robots on Earth and beyond!

Subscribe Free To Our Daily Newsletters
Tweet

RELATED CONTENT
The following news reports may link to other Space Media Network websites.
ROBO SPACE
ESA to build digital Chat assistant powered by EO data
Paris, France (SPX) Mar 26, 2024
The European Space Agency (ESA), in collaboration with technology partners, is embarking on an ambitious project to develop artificial intelligence (AI) applications designed to transform the way we retrieve information from Earth observation data. This initiative aims to create a digital assistant capable of producing scientifically accurate responses based on verified data, answering complex questions about environmental and geographical phenomena. One of the project's highlights is the I*STAR p ... read more

ROBO SPACE
Drones adapt mid-mission with revolutionary software integration

Black Sea fleet unleashes waves of drones on Ukraine after strike on Russian navy

Mira Aerospace and VEDA Aeronautics Partner to Launch Specialized HAPS Technology in India

Cheap drones 'cannot match' artillery power in Ukraine: experts

ROBO SPACE
UC San Diego Scientists Unveil Plant-Based Polymers that Biodegrade Microplastics in Months

Large language models use a surprisingly simple mechanism to retrieve some stored knowledge

Stanford revolutionizing material science wih shapeshifting nanoparticles

New Study Unveils Inadequacies in Traditional Theories of Van Allen Belts

ROBO SPACE
New OLED material design from St Andrews is enhancing brightness and efficiency

Profits fall for China's top chipmaker as sanctions bite

NIMS Unveils Revolutionary N-Channel Diamond Transistor for Extreme Conditions

TokyoU develops scalable processor for optimal problem solving

ROBO SPACE
France eyes spent uranium plant to bypass Russia: ministry

Future nuclear power reactors could rely on molten salts - but what about corrosion?

GE Vernova and UK Industry Explore Small Modular Reactor Deployment at Sheffield Conference

Russian strike severs power line to Ukraine nuclear plant

ROBO SPACE
Five Chinese dam workers, driver killed in Pakistan suicide attack

Sins of the fathers: Children of IS left to rot in Syria camp

Torture part of Russia's war policy in Ukraine: UN expert

Displaced Mozambicans recall terror of new jihadist attacks

ROBO SPACE
Research highlights Australia's carbon credit 'catastrophe'

Iraq to import electricity from Jordan

Poorer countries need money before raising climate targets: COP29 head

Sweden off-track to meet climate goals: expert agency

ROBO SPACE
Dig deep: US bets on geothermal to become renewable powerhouse

Setting a laser like sight on a path to practical fusion

Unveiling a new class of plasma waves: implications for fusion energy

KULR Technology Secures Key Contract with Nanoracks to Boost Space Battery Innovation

ROBO SPACE
Shenzhou 17 astronauts complete China's first in-space repair job

Tiangong Space Station's Solar Wings Restored After Spacewalk Repair by Shenzhou XVII Team

BIT advances microbiological research on Chinese Space Station

Chang'e 6 and new rockets highlight China's packed 2024 space agenda

Subscribe Free To Our Daily Newsletters




The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.