Read the Beforeitsnews.com story here. Advertise at Before It's News here.
Profile image
By Freedom Bunker
Contributor profile | More stories
Story Views
Now:
Last hour:
Last 24 hours:
Total:

AI Models Still Far From AGI-Level Reasoning: Apple Researchers

% of readers think this story is Fact. Add your two cents.


AI Models Still Far From AGI-Level Reasoning: Apple Researchers

Authored by Martin Young via CoinTelegraph.com,

The race to develop artificial general intelligence (AGI) still has a long way to run, according to Apple researchers who found that leading AI models still have trouble reasoning. 

Recent updates to leading AI large language models (LLMs) such as OpenAI’s ChatGPT and Anthropic’s Claude have included large reasoning models (LRMs), but their fundamental capabilities, scaling properties, and limitations “remain insufficiently understood,” said the Apple researchers in a June paper called “The Illusion of Thinking.” 

They noted that current evaluations primarily focus on established mathematical and coding benchmarks, “emphasizing final answer accuracy.” 

However, this evaluation does not provide insights into the reasoning capabilities of the AI models, they said. 

The research contrasts with an expectation that artificial general intelligence is just a few years away.

Apple researchers test “thinking” AI models

The researchers devised different puzzle games to test “thinking” and “non-thinking” variants of Claude Sonnet, OpenAI’s o3-mini and o1, and DeepSeek-R1 and V3 chatbots beyond the standard mathematical benchmarks. 

They discovered that “frontier LRMs face a complete accuracy collapse beyond certain complexities,” don’t generalize reasoning effectively, and their edge disappears with rising complexity, contrary to expectations for AGI capabilities.

“We found that LRMs have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across puzzles.”

Verification of final answers and intermediate reasoning traces (top chart), and charts showing non-thinking models are more accurate at low complexity (bottom charts). Source: Apple Machine Learning Research 

AI chatbots are overthinking, say researchers

They found inconsistent and shallow reasoning with the models and also observed overthinking, with AI chatbots generating correct answers early and then wandering into incorrect reasoning.

The researchers concluded that LRMs mimic reasoning patterns without truly internalizing or generalizing them, which falls short of AGI-level reasoning.

“These insights challenge prevailing assumptions about LRM capabilities and suggest that current approaches may be encountering fundamental barriers to generalizable reasoning.”

Illustration of the four puzzle environments. Source: Apple

The race to develop AGI

AGI is the holy grail of AI development, a state where the machine can think and reason like a human and is on a par with human intelligence. 

In January, OpenAI CEO Sam Altman said the firm was closer to building AGI than ever before. “We are now confident we know how to build AGI as we have traditionally understood it,” he said at the time. 

In November, Anthropic CEO Dario Amodei said that AGI would exceed human capabilities in the next year or two. “If you just eyeball the rate at which these capabilities are increasing, it does make you think that we’ll get there by 2026 or 2027,” he said.  

Tyler Durden Mon, 06/09/2025 – 17:10


Source: https://freedombunker.com/2025/06/09/ai-models-still-far-from-agi-level-reasoning-apple-researchers/


Before It’s News® is a community of individuals who report on what’s going on around them, from all around the world.

Anyone can join.
Anyone can contribute.
Anyone can become informed about their world.

"United We Stand" Click Here To Create Your Personal Citizen Journalist Account Today, Be Sure To Invite Your Friends.

Before It’s News® is a community of individuals who report on what’s going on around them, from all around the world. Anyone can join. Anyone can contribute. Anyone can become informed about their world. "United We Stand" Click Here To Create Your Personal Citizen Journalist Account Today, Be Sure To Invite Your Friends.


LION'S MANE PRODUCT


Try Our Lion’s Mane WHOLE MIND Nootropic Blend 60 Capsules


Mushrooms are having a moment. One fabulous fungus in particular, lion’s mane, may help improve memory, depression and anxiety symptoms. They are also an excellent source of nutrients that show promise as a therapy for dementia, and other neurodegenerative diseases. If you’re living with anxiety or depression, you may be curious about all the therapy options out there — including the natural ones.Our Lion’s Mane WHOLE MIND Nootropic Blend has been formulated to utilize the potency of Lion’s mane but also include the benefits of four other Highly Beneficial Mushrooms. Synergistically, they work together to Build your health through improving cognitive function and immunity regardless of your age. Our Nootropic not only improves your Cognitive Function and Activates your Immune System, but it benefits growth of Essential Gut Flora, further enhancing your Vitality.



Our Formula includes: Lion’s Mane Mushrooms which Increase Brain Power through nerve growth, lessen anxiety, reduce depression, and improve concentration. Its an excellent adaptogen, promotes sleep and improves immunity. Shiitake Mushrooms which Fight cancer cells and infectious disease, boost the immune system, promotes brain function, and serves as a source of B vitamins. Maitake Mushrooms which regulate blood sugar levels of diabetics, reduce hypertension and boosts the immune system. Reishi Mushrooms which Fight inflammation, liver disease, fatigue, tumor growth and cancer. They Improve skin disorders and soothes digestive problems, stomach ulcers and leaky gut syndrome. Chaga Mushrooms which have anti-aging effects, boost immune function, improve stamina and athletic performance, even act as a natural aphrodisiac, fighting diabetes and improving liver function. Try Our Lion’s Mane WHOLE MIND Nootropic Blend 60 Capsules Today. Be 100% Satisfied or Receive a Full Money Back Guarantee. Order Yours Today by Following This Link.


Report abuse

Comments

Your Comments
Question   Razz  Sad   Evil  Exclaim  Smile  Redface  Biggrin  Surprised  Eek   Confused   Cool  LOL   Mad   Twisted  Rolleyes   Wink  Idea  Arrow  Neutral  Cry   Mr. Green

MOST RECENT
Load more ...

SignUp

Login

Newsletter

Email this story
Email this story

If you really want to ban this commenter, please write down the reason:

If you really want to disable all recommended stories, click on OK button. After that, you will be redirect to your options page.