Robot Technology News
ROBO SPACE
A simpler method for learning to control a robot
Researchers from MIT and Stanford University created a machine-learning method that can derive a controller for a robot, drone, or autonomous vehicle that is more effective at following a stable trajectory than other methods. This technique could help, for instance, a drone to closely follow a downhill skier despite being buffeted by strong winds.
by Adam Zewe for MIT News
Boston MA (SPX) Jul 28, 2023

Researchers from MIT and Stanford University have devised a new machine-learning approach that could be used to control a robot, such as a drone or autonomous vehicle, more effectively and efficiently in dynamic environments where conditions can change rapidly.

This technique could help an autonomous vehicle learn to compensate for slippery road conditions to avoid going into a skid, allow a robotic free-flyer to tow different objects in space, or enable a drone to closely follow a downhill skier despite being buffeted by strong winds.

The researchers' approach builds structure from control theory into the process of learning a model, which yields an effective method for controlling complex dynamics, such as those caused by wind acting on a flying vehicle. One way to think about this structure is as a hint that helps guide how to control a system.

"The focus of our work is to learn intrinsic structure in the dynamics of the system that can be leveraged to design more effective, stabilizing controllers," says Navid Azizan, the Esther and Harold E. Edgerton Assistant Professor in the MIT Department of Mechanical Engineering and the Institute for Data, Systems, and Society (IDSS), and a member of the Laboratory for Information and Decision Systems (LIDS). "By jointly learning the system's dynamics and these unique control-oriented structures from data, we're able to naturally create controllers that function much more effectively in the real world."

Using this structure in a learned model, the researchers' technique extracts an effective controller directly from the model, whereas other machine-learning methods require a controller to be derived or learned separately in additional steps. With this structure, their approach can also learn an effective controller from less data than other approaches. This could help their learning-based control system achieve better performance faster in rapidly changing environments.

"This work tries to strike a balance between identifying structure in your system and just learning a model from data," says lead author Spencer M. Richards, a graduate student at Stanford University. "Our approach is inspired by how roboticists use physics to derive simpler models for robots. Physical analysis of these models often yields a useful structure for the purposes of control - one that you might miss if you just tried to naively fit a model to data. Instead, we try to identify similarly useful structure from data that indicates how to implement your control logic."

Additional authors of the paper are Jean-Jacques Slotine, professor of mechanical engineering and of brain and cognitive sciences at MIT, and Marco Pavone, associate professor of aeronautics and astronautics at Stanford. The research will be presented at the International Conference on Machine Learning (ICML).

Learning a controller
Determining the best way to control a robot to accomplish a given task can be a difficult problem, even when researchers know how to model everything about the system.

A controller is the logic that enables a drone to follow a desired trajectory, for example. This controller would tell the drone how to adjust its rotor forces to compensate for the effect of winds that can knock it off a stable path to reach its goal.

This drone is a dynamical system - a physical system that evolves over time. In this case, its position and velocity change as it flies through the environment. If such a system is simple enough, engineers can derive a controller by hand.

Modeling a system by hand intrinsically captures a certain structure based on the physics of the system. For instance, if a robot were modeled manually using differential equations, these would capture the relationship between velocity, acceleration, and force. Acceleration is the rate of change of velocity over time, and it is determined by the robot's mass and the forces applied to it.
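The relationship between force, acceleration, and velocity described above can be sketched in a few lines of code. This is a minimal illustration, not the researchers' model: it assumes a one-dimensional point mass under a constant force, and the function name, time step, and numeric values are hypothetical choices made for the example.

```python
def simulate_point_mass(mass, force, dt=0.01, steps=100):
    """Integrate a hand-derived model (a = F / m) forward in time from rest."""
    position, velocity = 0.0, 0.0
    for _ in range(steps):
        acceleration = force / mass    # Newton's second law
        velocity += acceleration * dt  # acceleration changes velocity
        position += velocity * dt      # velocity changes position
    return position, velocity


# Hypothetical values: a 2 kg body pushed by a constant 4 N force for 1 second.
position, velocity = simulate_point_mass(mass=2.0, force=4.0)
print(f"position={position:.3f} m, velocity={velocity:.3f} m/s")
```

The structure here (force divided by mass gives acceleration) comes from physics rather than from data, which is the kind of built-in structure a hand-derived model carries.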

But often the system is too complex to be exactly modeled by hand. Aerodynamic effects, like the way swirling wind pushes a flying vehicle, are notoriously difficult to derive manually, Richards explains. Researchers would instead take measurements of the drone's position, velocity, and rotor speeds over time, and use machine learning to fit a model of this dynamical system to the data. But these approaches typically don't learn a control-based structure. This structure is useful in determining how to best set the rotor speeds to direct the motion of the drone over time.
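As a rough illustration of fitting a dynamics model to measurements, the sketch below uses ordinary least squares on a toy one-dimensional system. Everything in it is an assumption made for the example: the linear model form (acceleration = drag * velocity + gain * input), the synthetic data, and the names `fit_linear_dynamics`, `TRUE_DRAG`, and `TRUE_GAIN`. Real aerodynamic effects are far more complex, and the article does not describe the model class the researchers used.

```python
import math


def fit_linear_dynamics(velocities, inputs, accelerations):
    """Least-squares fit of accel = a * v + b * u using 2x2 normal equations."""
    svv = sum(v * v for v in velocities)
    svu = sum(v * u for v, u in zip(velocities, inputs))
    suu = sum(u * u for u in inputs)
    sva = sum(v * acc for v, acc in zip(velocities, accelerations))
    sua = sum(u * acc for u, acc in zip(inputs, accelerations))
    det = svv * suu - svu * svu  # nonzero when v and u are not collinear
    a = (sva * suu - sua * svu) / det
    b = (svv * sua - svu * sva) / det
    return a, b


# Synthetic "measurements" generated from known (hypothetical) parameters.
TRUE_DRAG, TRUE_GAIN = -0.5, 2.0
velocities = [0.1 * k for k in range(-5, 6)]
inputs = [math.sin(k) for k in range(-5, 6)]
accelerations = [TRUE_DRAG * v + TRUE_GAIN * u for v, u in zip(velocities, inputs)]

drag, gain = fit_linear_dynamics(velocities, inputs, accelerations)
print(f"fitted drag={drag:.3f}, fitted gain={gain:.3f}")
```

A fit like this recovers how the state evolves, but by itself it says nothing about how to choose the input to steer the system, which is the gap the article describes.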

Once they have modeled the dynamical system, many existing approaches also use data to learn a separate controller for the system.

"Other approaches that try to learn dynamics and a controller from data as separate entities are a bit detached philosophically from the way we normally do it for simpler systems. Our approach is more reminiscent of deriving models by hand from physics and linking that to control," Richards says.

Identifying structure
The team from MIT and Stanford developed a technique that uses machine learning to learn the dynamics model, but in such a way that the model has some prescribed structure that is useful for controlling the system.

With this structure, they can extract a controller directly from the dynamics model, rather than using data to learn an entirely separate model for the controller.

"We found that beyond learning the dynamics, it's also essential to learn the control-oriented structure that supports effective controller design. Our approach of learning state-dependent coefficient factorizations of the dynamics has outperformed the baselines in terms of data efficiency and tracking capability, proving to be successful in efficiently and effectively controlling the system's trajectory," Azizan says.
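To give a feel for why a state-dependent coefficient (SDC) factorization is useful for control, here is a minimal sketch on a scalar toy system. It is not the researchers' learned model or their controller: the dynamics (x' = x^3 + u), the hand-written factorization A(x) = x^2, the cost weights, and the use of a textbook state-dependent Riccati equation to compute the feedback gain are all assumptions made for illustration. The point is only that once the dynamics are written as A(x) * x + B * u, a stabilizing feedback law can be computed from A(x) directly, without learning a separate controller.

```python
import math


def sdc_factor(x):
    """Hypothetical SDC factorization: f(x) = x**3 is written as A(x) * x."""
    return x * x  # A(x) = x**2


def riccati_gain(a, b=1.0, q=1.0, r=1.0):
    """Positive root p of the scalar Riccati equation
    2*a*p - (b*p)**2 / r + q = 0, returned as the feedback gain b*p/r."""
    p = r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)
    return b * p / r


def simulate_closed_loop(x0, dt=0.001, steps=5000):
    """Euler-integrate x' = x**3 + u with u = -gain(A(x)) * x."""
    x = x0
    for _ in range(steps):
        u = -riccati_gain(sdc_factor(x)) * x  # controller read off from A(x)
        x += (x ** 3 + u) * dt
    return x


final_state = simulate_closed_loop(1.5)
print(f"state after 5 s of feedback control: {final_state:.5f}")
```

Without the input, this toy system grows without bound from the same starting point; with the gain derived from A(x), the state decays toward zero.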

When they tested this approach, their controller closely followed desired trajectories, outpacing all the baseline methods. The controller extracted from their learned model nearly matched the performance of a ground-truth controller, which is built using the exact dynamics of the system.

"By making simpler assumptions, we got something that actually worked better than other complicated baseline approaches," Richards adds.

The researchers also found that their method was data-efficient, which means it achieved high performance even with little data. For instance, it could effectively model a highly dynamic rotor-driven vehicle using only 100 data points. Methods that used multiple learned components saw their performance drop much faster with smaller datasets.

This efficiency could make their technique especially useful in situations where a drone or robot needs to learn quickly in rapidly changing conditions.

Plus, their approach is general and could be applied to many types of dynamical systems, from robotic arms to free-flying spacecraft operating in low-gravity environments.

In the future, the researchers are interested in developing models that are more physically interpretable, and that would be able to identify very specific information about a dynamical system, Richards says. This could lead to better-performing controllers.

"Despite its ubiquity and importance, nonlinear feedback control remains an art, making it especially suitable for data-driven and learning-based methods. This paper makes a significant contribution to this area by proposing a method that jointly learns system dynamics, a controller, and control-oriented structure," says Nikolai Matni, an assistant professor in the Department of Electrical and Systems Engineering at the University of Pennsylvania, who was not involved with this work. "What I found particularly exciting and compelling was the integration of these components into a joint learning algorithm, such that control-oriented structure acts as an inductive bias in the learning process. The result is a data-efficient learning process that outputs dynamic models that enjoy intrinsic structure that enables effective, stable, and robust control. While the technical contributions of the paper are excellent themselves, it is this conceptual contribution that I view as most exciting and significant."

This research is supported, in part, by the NASA University Leadership Initiative and the Natural Sciences and Engineering Research Council of Canada.

Research Report: "Learning Control-Oriented Dynamical Structure from Data"

