Robot Technology News
ROBO SPACE
New technique helps robots pack objects into a tight space
MIT researchers are using generative AI models to help robots more efficiently solve complex object manipulation problems, such as packing a box with different objects.
New technique helps robots pack objects into a tight space
by Adam Zewe for MIT News
Boston MA (SPX) Oct 17, 2023

Anyone who has ever tried to pack a family-sized amount of luggage into a sedan-sized trunk knows this is a hard problem. Robots struggle with dense packing tasks, too.

For the robot, solving the packing problem involves satisfying many constraints, such as stacking luggage so suitcases don't topple out of the trunk, heavy objects aren't placed on top of lighter ones, and collisions between the robotic arm and the car's bumper are avoided.

Some traditional methods tackle this problem sequentially, guessing a partial solution that meets one constraint at a time and then checking to see if any other constraints were violated. With a long sequence of actions to take, and a pile of luggage to pack, this process can be impractically time consuming.

MIT researchers used a form of generative AI, called a diffusion model, to solve this problem more efficiently. Their method uses a collection of machine-learning models, each of which is trained to represent one specific type of constraint. These models are combined to generate global solutions to the packing problem, taking into account all constraints at once.

Their method was able to generate effective solutions faster than other techniques, and it produced a greater number of successful solutions in the same amount of time. Importantly, their technique was also able to solve problems with novel combinations of constraints and larger numbers of objects, that the models did not see during training.

Due to this generalizability, their technique can be used to teach robots how to understand and meet the overall constraints of packing problems, such as the importance of avoiding collisions or a desire for one object to be next to another object. Robots trained in this way could be applied to a wide array of complex tasks in diverse environments, from order fulfillment in a warehouse to organizing a bookshelf in someone's home.

"My vision is to push robots to do more complicated tasks that have many geometric constraints and more continuous decisions that need to be made - these are the kinds of problems service robots face in our unstructured and diverse human environments. With the powerful tool of compositional diffusion models, we can now solve these more complex problems and get great generalization results," says Zhutian Yang, an electrical engineering and computer science graduate student and lead author of a paper on this new machine-learning technique.

Her co-authors include MIT graduate students Jiayuan Mao and Yilun Du; Jiajun Wu, an assistant professor of computer science at Stanford University; Joshua B. Tenenbaum, a professor in MIT's Department of Brain and Cognitive Sciences and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL); Tomas Lozano-Perez, an MIT professor of computer science and engineering and a member of CSAIL; and senior author Leslie Kaelbling, the Panasonic Professor of Computer Science and Engineering at MIT and a member of CSAIL. The research will be presented at the Conference on Robot Learning.

Constraint complications
Continuous constraint satisfaction problems are particularly challenging for robots. These problems appear in multistep robot manipulation tasks, like packing items into a box or setting a dinner table. They often involve achieving a number of constraints, including geometric constraints, such as avoiding collisions between the robot arm and the environment; physical constraints, such as stacking objects so they are stable; and qualitative constraints, such as placing a spoon to the right of a knife.

There may be many constraints, and they vary across problems and environments depending on the geometry of objects and human-specified requirements.

To solve these problems efficiently, the MIT researchers developed a machine-learning technique called Diffusion-CCSP. Diffusion models learn to generate new data samples that resemble samples in a training dataset by iteratively refining their output.

To do this, diffusion models learn a procedure for making small improvements to a potential solution. Then, to solve a problem, they start with a random, very bad solution and then gradually improve it.

For example, imagine randomly placing plates and utensils on a simulated table, allowing them to physically overlap. The collision-free constraints between objects will result in them nudging each other away, while qualitative constraints will drag the plate to the center, align the salad fork and dinner fork, etc.

Diffusion models are well-suited for this kind of continuous constraint-satisfaction problem because the influences from multiple models on the pose of one object can be composed to encourage the satisfaction of all constraints, Yang explains. By starting from a random initial guess each time, the models can obtain a diverse set of good solutions.

Working together
For Diffusion-CCSP, the researchers wanted to capture the interconnectedness of the constraints. In packing for instance, one constraint might require a certain object to be next to another object, while a second constraint might specify where one of those objects must be located.

Diffusion-CCSP learns a family of diffusion models, with one for each type of constraint. The models are trained together, so they share some knowledge, like the geometry of the objects to be packed.

The models then work together to find solutions, in this case locations for the objects to be placed, that jointly satisfy the constraints.

"We don't always get to a solution at the first guess. But when you keep refining the solution and some violation happens, it should lead you to a better solution. You get guidance from getting something wrong," she says.

Training individual models for each constraint type and then combining them to make predictions greatly reduces the amount of training data required, compared to other approaches.

However, training these models still requires a large amount of data that demonstrate solved problems. Humans would need to solve each problem with traditional slow methods, making the cost to generate such data prohibitive, Yang says.

Instead, the researchers reversed the process by coming up with solutions first. They used fast algorithms to generate segmented boxes and fit a diverse set of 3D objects into each segment, ensuring tight packing, stable poses, and collision-free solutions.

"With this process, data generation is almost instantaneous in simulation. We can generate tens of thousands of environments where we know the problems are solvable," she says.

Trained using these data, the diffusion models work together to determine locations objects should be placed by the robotic gripper that achieve the packing task while meeting all of the constraints.

They conducted feasibility studies, and then demonstrated Diffusion-CCSP with a real robot solving a number of difficult problems, including fitting 2D triangles into a box, packing 2D shapes with spatial relationship constraints, stacking 3D objects with stability constraints, and packing 3D objects with a robotic arm.

Their method outperformed other techniques in many experiments, generating a greater number of effective solutions that were both stable and collision-free.

In the future, Yang and her collaborators want to test Diffusion-CCSP in more complicated situations, such as with robots that can move around a room. They also want to enable Diffusion-CCSP to tackle problems in different domains without the need to be retrained on new data.

"Diffusion-CCSP is a machine-learning solution that builds on existing powerful generative models," says Danfei Xu, an assistant professor in the School of Interactive Computing at the Georgia Institute of Technology and a Research Scientist at NVIDIA AI, who was not involved with this work. "It can quickly generate solutions that simultaneously satisfy multiple constraints by composing known individual constraint models. Although it's still in the early phases of development, the ongoing advancements in this approach hold the promise of enabling more efficient, safe, and reliable autonomous systems in various applications."

This research was funded, in part, by the National Science Foundation, the Air Force Office of Scientific Research, the Office of Naval Research, the MIT-IBM Watson AI Lab, the MIT Quest for Intelligence, the Center for Brains, Minds, and Machines, Boston Dynamics Artificial Intelligence Institute, the Stanford Institute for Human-Centered Artificial Intelligence, Analog Devices, JPMorgan Chase and Co., and Salesforce.

Research Report:"Compositional Diffusion-Based Continuous Constraint Solvers"

Related Links
Computer Science and Artificial Intelligence Laboratory
All about the robots on Earth and beyond!

Subscribe Free To Our Daily Newsletters
Tweet

RELATED CONTENT
The following news reports may link to other Space Media Network websites.
ROBO SPACE
China's Baidu says its upgraded AI bot rivals latest ChatGPT
Beijing (AFP) Oct 17, 2023
Chinese internet giant Baidu unveiled the newest version of its AI chatbot ERNIE on Tuesday, claiming it rivals the capabilities of OpenAI's latest ChatGPT. Baidu led Chinese tech firms in rolling out generative artificial intelligence apps, which are trained on vast amounts of data and interactions with users to answer questions, even complex ones, in human-like language. Released to the public in August, ERNIE marked a major step for China's tech sector, which is aiming to compete with the lik ... read more

ROBO SPACE
Raytheon's KuRFS and Coyote detect and defeat UAS targets

US-led troops in Iraq reportedly targeted by suicide drone

Drones shot down in attack on US troops in Syria: Iraq condemns attacks on US forces

Enhancing the safety and efficacy of drone flights in polar regions

ROBO SPACE
Researchers developing 'revolutionary' multi-material for light-based 3D printing

Tightbeam tech set to revolutionize Global Marine Internet through Aalyria-HICO Partnership

NASA's First Two-way End-to-End Laser Communications System

Light-powered multi-level memory tech revolutionizes data processing

ROBO SPACE
Taiwan's TSMC reports profit drop in third quarter

From a five-layer graphene sandwich, a rare electronic state emerges

Tech giants Foxconn, Nvidia announce they are building 'AI factories'

US tightens curbs on AI chip exports to China

ROBO SPACE
Framatome Space: A New Player in Space Exploration and Nuclear Power

Bulgaria to get two US-built nuclear reactors

Electrons are quick-change artists in molten salts, chemists show

France insists on nuclear for 'green' hydrogen

ROBO SPACE
As hostilities grow in Lebanon today, U.S. officials remember '83 Beirut attack

Germany arrests Iraqi Islamic State war crimes suspect

Russia loses bid to regain seat on UN rights body

NYPD tightens security at churches, synagogues after Hamas attacks in Israel

ROBO SPACE
IEA: World must add 50,000 miles of power lines by 2040 to hit climate targets

EU states strike deal on electricity market reform

EU strives for common position ahead of COP28

UK climate shift makes its harder to reach net zero: IMF

ROBO SPACE
Remaking an old Swedish oil depot into a giant underground 'thermos'

Revolutionizing energy storage: Metal nanoclusters for stable lithium-sulfur batteries

A cheaper, safer alternative to lithium-ion batteries: Aqueous rechargeable batteries

Sustainable living technology

ROBO SPACE
China discloses tasks of Shenzhou-17 crewed space mission

Next-generation rocket for China's manned space missions on track

Final rehearsal for Shenzhou XVII flight completed

Chinese sci-fi fans over the moon at Chengdu Worldcon

Subscribe Free To Our Daily Newsletters




The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.