Robot Technology News
ROBO SPACE
Want better AI? Get input from a human expert
Many AI activities perform better is augmented with a human worker.
Want better AI? Get input from a human expert
by Staff Writers
Richland WA (SPX) Nov 21, 2023

Can AI be trusted? The question pops up wherever AI is used or discussed-which, these days, is everywhere. It's a question that even some AI systems ask themselves. Many machine-learning systems create what experts call a "confidence score," a value that reflects how confident the system is in its decisions. A low score tells the human user that there is some uncertainty about the recommendation; a high score indicates to the human user that the system, at least, is quite sure of its decisions. Savvy humans know to check the confidence score when deciding whether to trust the recommendation of a machine-learning system.

Scientists at the Department of Energy's Pacific Northwest National Laboratory have put forth a new way to evaluate an AI system's recommendations. They bring human experts into the loop to view how the ML performed on a set of data. The expert learns which types of data the machine-learning system typically classifies correctly, and which data types lead to confusion and system errors. Armed with this knowledge, the experts then offer their own confidence score on future system recommendations.

The result of having a human look over the shoulder of the AI system? Humans predicted the AI system's performance more accurately.

Minimal human effort-just a few hours-evaluating some of the decisions made by the AI program allowed researchers to vastly improve on the AI program's ability to assess its decisions. In some analyses by the team, the accuracy of the confidence score doubled when a human provided the score.

The PNNL team presented its results at a recent meeting of the Human Factors and Ergonomics Society in Washington, D.C., part of a session on human-AI robot teaming.

"If you didn't develop the machine-learning algorithm in the first place, then it can seem like a black box," said Corey Fallon, the lead author of the study and an expert in human-machine interaction. "In some cases, the decisions seem fine. In other cases, you might get a recommendation that is a real head-scratcher. You may not understand why it's making the decisions it is."

The grid and AI
It's a dilemma that power engineers working with the electric grid face. Their decisions based on reams of data that change every instant keep the lights on and the nation running. But power engineers may be reluctant to turn over decision-making authority to machine-learning systems.

"There are hundreds of research papers about the use of machine learning in power systems, but almost none of them are applied in the real world. Many operators simply don't trust ML. They have domain experience-something that ML can't learn," said coauthor Tianzhixi "Tim" Yin.

The researchers at PNNL, which has a world-class team modernizing the grid, took a closer look at one machine-learning algorithm applied to power systems. They trained the SVM (support-vector machine) algorithm on real data from the grid's Eastern Interconnection in the U.S. The program looked at 124 events, deciding whether a generator was malfunctioning, or whether the data was showing other types of events that are less noteworthy.

The algorithm was 85% reliable in its decisions. Many of its errors occurred when there were complex power bumps or frequency shifts. Confidence scores created with a human in the loop were a marked improvement over the system's assessment of its own decisions. The human expert's input predicted the algorithm's decisions with much greater accuracy.

More human, better machine learning
Fallon and Yin call the new score an "Expert-Derived Confidence" score, or EDC score.

They found that, on average, when humans weighed in on the data, their EDC scores predicted model behavior that the algorithm's confidence scores couldn't predict.

"The human expert fills in gaps in the ML's knowledge," said Yin. "The human provides information that the ML did not have, and we show that that information is significant. The bottom line is that we've shown that if you add human expertise to the ML results, you get much better confidence."

The work by Fallon and Yin was funded by PNNL through an initiative known as MARS-Mathematics for Artificial Reasoning in Science. The effort is part of a broader effort in artificial intelligence at PNNL. The initiative brought together Fallon, an expert on human-machine teaming and human factors research, and Yin, a data scientist and an expert on machine learning.

"This is the type of research needed to prepare and equip an AI-ready workforce," said Fallon. "If people don't trust the tool, then you've wasted your time and money. You've got to know what will happen when you take a machine learning model out of the laboratory and put it to work in the real world.

"I'm a big fan of human expertise and of human-machine teaming. Our EDC scores allow the human to better assess the situation and make the ultimate decision."

Research Report:Method For Generating Expert Derived Confidence Scores

Related Links
Pacific Northwest National Laboratory
All about the robots on Earth and beyond!

Subscribe Free To Our Daily Newsletters
Tweet

RELATED CONTENT
The following news reports may link to other Space Media Network websites.
ROBO SPACE
System mimics human muscles to prevent slippage on robotic rovers
Tokyo, Japan (SPX) Nov 16, 2023
In an era where planetary exploration increasingly relies on unmanned rovers, ensuring their safe and effective navigation across challenging extraterrestrial terrains is paramount. This is especially true in environments like Mars or the Moon, where surfaces are often covered with regolith-a fine, loose material that can significantly impair rover mobility. Addressing this challenge, researchers from Japan's Shibaura Institute of Technology (SIT) have introduced a groundbreaking system designed to enha ... read more

ROBO SPACE
US warship shoots down drone launched by Yemen's Huthis

US Reaper shot down off Yemeni coast

Two drone attacks in Iraq target global coalition: official

Drone attack targets US-led anti-jihadist coalition in Iraq

ROBO SPACE
Climate conspiracy theories flourish ahead of COP28

NASA's Deep Space Optical Comm Demo Sends, Receives First Data

Rice researcher scans tropical forest with mixed-reality device

Japan PM says experts to talk in China seafood row

ROBO SPACE
US chip curbs trip up China's AI-hungry tech giants

Alibaba cancels cloud service spinoff over US chip restrictions

First 2D semiconductor with 1000 transistors developed at EPFL Switzerland

Atomic dance gives rise to a magnet

ROBO SPACE
Europe's largest nuclear reactor offline after glitch

Europe's largest nuclear reactor restarts after fault

US opens way for nuclear investment in energy-hungry Philippines

Sweden plans huge investment in nuclear power

ROBO SPACE
DHS Secretary Alejandro Mayorkas warns of elevated risk of attack on U.S.

French army to pitch tents in Paris for Olympics: military

Treasury levels new sanctions against Hamas-affiliated individuals, groups

Germany bans support for Hamas

ROBO SPACE
Indonesia unveils investment plan for $20 bn energy transition pact

EU says climate funding should not rely on 1992 calculations

European banks lack transparency on green finance: NGO

Rich nations 'likely' met $100 bn climate finance goal: OECD

ROBO SPACE
A novel approach to energy storage by University of Cordoba

Researchers aim to make cheaper fuel cells a reality

BMW probes Moroccan cobalt supplier over pollution claims

The secret to longer lasting batteries might be in how soap works, new study says

ROBO SPACE
New scientific experimental samples from China's space station return to Earth

Shenzhou XVI crew return after 'very cool journey'

Chinese astronauts return to Earth with fruitful experimental results

Chinese astronauts return to Earth after 'successful' mission

Subscribe Free To Our Daily Newsletters




The content herein, unless otherwise known to be public domain, are Copyright 1995-2024 - Space Media Network. All websites are published in Australia and are solely subject to Australian law and governed by Fair Use principals for news reporting and research purposes. AFP, UPI and IANS news wire stories are copyright Agence France-Presse, United Press International and Indo-Asia News Service. ESA news reports are copyright European Space Agency. All NASA sourced material is public domain. Additional copyrights may apply in whole or part to other bona fide parties. All articles labeled "by Staff Writers" include reports supplied to Space Media Network by industry news wires, PR agencies, corporate press officers and the like. Such articles are individually curated and edited by Space Media Network staff on the basis of the report's information value to our industry and professional readership. Advertising does not imply endorsement, agreement or approval of any opinions, statements or information provided by Space Media Network on any Web page published or hosted by Space Media Network. General Data Protection Regulation (GDPR) Statement Our advertisers use various cookies and the like to deliver the best ad banner available at one time. All network advertising suppliers have GDPR policies (Legitimate Interest) that conform with EU regulations for data collection. By using our websites you consent to cookie based advertising. If you do not agree with this then you must stop using the websites from May 25, 2018. Privacy Statement. Additional information can be found here at About Us.