Artificial Intelligence Studies Show Model Deception Capabilities
Artificial Intelligence (AI) has made significant strides in recent years, but one persistent challenge remains: AI hallucinations, the production of inaccurate information delivered with high confidence. The issue has real-world consequences: misleading doctors in healthcare, misinforming students in education, and spreading disinformation in journalism are just a few examples.
The root of these hallucinations can be traced back to the way large language models are trained. As these models are designed to respond confidently, they often generate incorrect answers when faced with uncertainty. New research suggests that this issue may not be limited to the training phase, but may also be reinforced by the benchmarks used to test and compare AI performance.
One proposed solution is the use of explicit confidence targets. These targets specify when models should answer versus when they should abstain, and adjust scoring accordingly. This approach could help models optimize for the desired behaviour: accurate answers when confident, and honest admissions of uncertainty when knowledge is lacking.
However, fine-tuning models after training runs into the same core issue that produces hallucinations in the first place: the way we evaluate models still rewards confident answers and gives no credit for expressed uncertainty. The binary scoring used in most benchmarks, full marks for a correct answer and nothing otherwise, encourages models to guess rather than admit ignorance, and this is especially damaging for rare or unusual information where the model's knowledge is thin.
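To make the contrast concrete, here is a minimal Python sketch of the two scoring regimes. The functions and the specific penalty are hypothetical illustrations of a confidence-target rule, not the scoring code of any particular benchmark: under binary scoring a guess never costs anything, while under a confidence target t a wrong answer is penalized so that guessing only pays off when the model's chance of being right exceeds t.

```python
# Hypothetical scoring functions illustrating binary vs. confidence-target evaluation.
# Not taken from any real benchmark; the penalty -t/(1-t) is one common way to set
# the break-even point at the confidence target t.

def binary_score(answer, truth):
    """Standard benchmark scoring: 1 for correct, 0 otherwise.
    Abstaining scores the same as being wrong, so guessing never hurts."""
    return 1.0 if answer == truth else 0.0

def confidence_target_score(answer, truth, confidence_target=0.75):
    """Confidence-target scoring: correct = +1, abstain = 0, wrong = -t/(1-t).
    With this penalty, answering is only worthwhile when the model's
    probability of being correct exceeds t, so calibrated abstention wins."""
    if answer is None:                       # model abstains ("I don't know")
        return 0.0
    if answer == truth:
        return 1.0
    t = confidence_target
    return -t / (1.0 - t)                    # e.g. t = 0.75 -> a wrong answer costs -3

# Expected score of guessing when the model is correct with probability p:
#   binary:            p                        (always >= 0, so always guess)
#   confidence target: p - (1 - p) * t/(1 - t)  (positive only when p > t)
if __name__ == "__main__":
    t = 0.75
    for p in (0.5, 0.75, 0.9):
        ev_guess = p - (1 - p) * t / (1 - t)
        print(f"p={p:.2f}: binary EV={p:.2f}, confidence-target EV={ev_guess:+.2f}")
```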
To build more reliable AI systems, researchers suggest recognizing uncertainty as an essential capability to be measured and rewarded, rather than treating it as a flaw. This shift in perspective would make confidence requirements transparent, so models could be trained and evaluated against them instead of against raw accuracy alone.
The Stanford AI Index 2025 reported that benchmarks designed to measure hallucinations have struggled to gain traction, even as AI adoption accelerates. It's clear that a change in approach is needed to ensure that AI systems can be trusted to provide accurate and reliable information.
Advanced models like DeepSeek-V3, Llama, and OpenAI's latest releases still produce inaccurate information, highlighting the need for continued research and development in this area. By addressing AI hallucinations, we can move toward AI systems that are consistently dependable sources of truth, rather than impressive text generators whose output requires critical external verification.