Current AI systems are powerful pattern machines that simulate “thinking” in narrow contexts but collapse on many forms of real reasoning, robustness, and real‑world value; a growing body of research and critique is now systematically exposing those limits and the hype built around them. Apple’s recent “Illusion of Thinking” work is a particularly stark example, showing that even the latest “reasoning models” fail catastrophically once problem complexity crosses a certain threshold, despite all the marketing language about planning, logic, and AGI.machinelearning.apple+4

What Apple’s study actually shows

Apple’s “Illusion of Thinking” paper builds controlled puzzle environments where the logical structure is fixed but compositional complexity (steps, dependencies, rules) is dialed up in a precise way. On these tasks, Apple finds three regimes: standard LLMs beat “Large Reasoning Models” (LRMs) on very simple problems, LRMs show an advantage on medium complexity, and then both classes of models experience a near‑total accuracy collapse beyond a complexity tipping point. Critically, the models do not switch to explicit algorithmic strategies; instead they produce inconsistent, brittle reasoning traces and fail at exact computation, revealing that the appearance of step‑by‑step “thought” is not grounded in robust internal procedures.thecuberesearch+2

The team also observes what they call a counter‑intuitive “quitter” effect: as tasks become harder, the models’ internal reasoning length (tokens spent on thinking) at first increases and then declines, even though they still have plenty of budget left. This suggests no reliable metacognition or “sense of difficulty”: the systems neither know when to try harder nor recognize when they are failing, which aligns with broader evidence that current LLMs lack self‑monitoring and systematically overestimate their own correctness.pmc.ncbi.nlm.nih+4

Other empirical limits being exposed

Beyond Apple, multiple technical communities are documenting hard edges of current LLM and “reasoning” capabilities across domains such as medicine, clinical problem‑solving, and cognitive modeling. Key limitations repeatedly observed include: sharp degradation when inputs are long, information‑dense, or require integrating many weak signals; brittle generalization to out‑of‑distribution or “long‑tail” cases where surface correlations no longer hold; and overconfidence, where models provide fluent, authoritative‑sounding but incorrect answers, especially in high‑stakes settings.nature+3

Formal assessments in clinical decision‑making, for example, show that state‑of‑the‑art models fail on many realistic medical reasoning tasks and are particularly weak when problems are constructed to break familiar patterns. In cognitive science and linguistics, researchers argue that LLMs tell us very little about human language acquisition, evolution, or genuine conceptual understanding, precisely because they lack grounded semantics, causal models, and developmental constraints. Across these lines of work, the emerging consensus is not that LLMs are useless, but that they are powerful stochastic text engines whose strengths are being systematically misinterpreted as general intelligence.direct.mit+3

Economic and industry‑level hype

While the technical literature is surfacing concrete failure modes, critical work in economics, management, and political economy is unpacking the hype machine around AI as an industry. Analysts point out that enterprises are pouring billions into AI projects with unclear or negative ROI, and that many initiatives stall at the integration and data‑plumbing stage despite polished demos and impressive prototypes. Some critics note that a large share of near‑term “value” is accruing not from real productivity gains but from financial speculation, narrative pumping, and cost‑cutting ambitions, such as using AI to deskill labor and shift bargaining power rather than to create genuinely new capabilities.jacobin+3

Research on the broader political economy of AI argues that both utopian promises (“abundance for all”) and dystopian doom narratives function as tools of power: they justify aggressive data extraction, surveillance, and labor restructuring under the banner of innovation. In this view, anti‑hype that only targets individual companies or product claims is insufficient; the real object of critique is the underlying incentive structure that rewards over‑promising and hides the gap between marketing claims and lived impacts.hbr+1

Technical limits of current LLM architecture

A parallel thread in surveys and primers on LLMs synthesizes the structural reasons for these limits: current models are large autoregressive functions trained on internet‑scale text, without explicit grounding in physical reality, causal structure, or algorithmic modules. Commonly documented constraints include hallucinations and factual inaccuracy, time‑lagged knowledge due to static training corpora, difficulty with tasks that demand exact arithmetic or strict logic, vulnerability to adversarial prompts and distribution shifts, and high energy and compute costs that make “scale is all you need” economically and environmentally suspect.arxiv+3

Surveys emphasize that while incremental techniques (tool‑calling, retrieval, self‑critique, chain‑of‑thought, agents) can mitigate some of these issues, they do not transform the underlying model into a reasoning engine; instead, they scaffold a pattern‑matcher with external systems that supply algorithms, databases, and guardrails. This perspective aligns closely with Apple’s findings: the models may look more thoughtful when wrapped in sophisticated prompting, but under controlled conditions they still fail to implement stable, generalizable procedures.research.aimultiple+4

Where “truth‑telling” about AI is heading

Putting these strands together, the emerging “truth‑exposure” movement around current AI is doing four things at once.machinelearning.apple+2

It is empirically mapping failure regimes of LLMs and LRMs in reasoning, safety‑critical domains, and out‑of‑distribution settings, showing that their competence is narrower and more brittle than hype suggests.pmc.ncbi.nlm.nih+2
It is reframing LLMs as tools that must be combined with explicit algorithms, domain systems, and human oversight, rather than as incipient general intelligences.mashable+2
It is critiquing the economic and institutional structures that monetize AI hype, including venture dynamics, enterprise FOMO, and narratives that justify intensified surveillance and labor restructuring.carlsetzer+2
It is opening space for alternative research directions—such as neuromorphic, biologically grounded, or hybrid systems—that aim for robustness, energy efficiency, and causal transparency instead of ever‑larger opaque text models.sciencedirect+2

Biologically faithful and neuromorphic approaches directly target the same weaknesses that the Apple work and wider anti‑hype literature expose in current LLM‑centric AI: brittle “reasoning,” poor energy efficiency, lack of grounding, and opacity. Rather than scaling text prediction, they try to inherit the structural properties that make brains robust, efficient, and adaptive in the first place.machinelearning.apple+3

From “illusion of thinking” to “mechanisms of thinking”

Apple’s “Illusion of Thinking” results show that today’s language and “reasoning” models do not instantiate stable algorithms for logic or planning but instead surf statistical regularities until a complexity threshold is crossed, at which point they collapse while still sounding confident. This exposes a core architectural issue: these systems lack mechanisms for grounded world models, causal structure, and self‑monitoring, so they cannot tell when they are out of their depth or construct new procedures on the fly.arxiv+5

Neuromorphic and biologically faithful work treats this not as a tuning problem but as a design flaw: if the goal is generalizable intelligence, the system must incorporate elements like spiking dynamics, local learning, and energy‑constrained computation that real nervous systems use to manage complexity. Instead of adding more layers of prompt‑engineering or tool‑calling around a text model, these approaches explore architectures where the primitive operations already look more like neurons, synapses, and networks evolved to survive in noisy, uncertain environments.pmc.ncbi.nlm.nih+2

Why neuromorphic systems matter for the current critique

Brain‑inspired and neuromorphic computing explicitly frame themselves as alternatives to the “bigger LLM” trajectory, emphasizing orders‑of‑magnitude improvements in energy efficiency, locality, and robustness. Reviews in the field stress that the brain’s spike‑based computing achieves remarkable throughput and adaptability on roughly 20 watts, whereas training or running frontier LLMs burns megawatt‑hours and still fails on many reasoning benchmarks. This directly undercuts the hype narrative that more GPU‑hungry models will inevitably converge to human‑like thinking.arxiv+4

Several strands connect tightly to the Apple‑style reasoning critique:

Spiking and temporal structure: Spiking neural networks and hybrid brain‑inspired models encode information over time with discrete events, supporting persistent working memories and recurrent loops that can implement explicit algorithms and decision policies rather than just next‑token prediction.academic.oup+2
Local learning and plasticity: Biologically plausible learning rules (e.g. spike‑timing dependent plasticity, forward‑only schemes) aim to train networks using local signals instead of global backpropagation, bringing models closer to the way real brains adapt continuously and efficiently.openreview+2
Structural modularity: Many neuromorphic architectures treat neurons or microcircuits as semi‑autonomous agents that optimize local energy or reward, producing compositional systems that may scale more gracefully with task complexity than monolithic transformers.sciencedirect+2

These design moves target exactly the regimes where Apple shows LLMs fail: tasks that require persistent internal state, structured search, and robust behavior as combinatorial complexity grows.thecuberesearch+2

Energy, embodiment, and real‑world grounding

The hype around current AI often treats energy and embodiment as implementation details, but both neuromorphic and biological‑fidelity camps argue they are central to real intelligence. Neuromorphic hardware and algorithms are being deployed in edge settings such as implants, sensors, and cyber‑defense systems specifically because they can run continuously, locally, and with low latency under tight power budgets, where cloud‑scale LLMs are simply not viable. That shift in deployment context forces architectures to be robust under noise, partial observability, and hardware faults—conditions that expose the gap between “demo intelligence” and operational intelligence.lanl+3

In parallel, brain‑inspired computing work stresses that cognition emerges from tightly coupled perception‑action loops, not from disembodied text statistics. By embedding computation in physical or simulated bodies, neuromorphic systems naturally tie “reasoning” to sensorimotor feedback and energy constraints, which encourages forms of intelligence that are less prone to the hallucinations and ungrounded verbosity that plague LLMs.pmc.ncbi.nlm.nih+4

A different research and economic narrative

The critique of AI hype points out that current incentives reward short‑term, demo‑driven progress and capital‑intensive scaling, even when the underlying models are fragile. Brain‑inspired and neuromorphic efforts, by contrast, often pitch themselves as long‑horizon infrastructure work: building substrates for future agents and cyber‑physical systems that are auditable, certifiable, and compatible with safety‑critical domains like medicine, implants, or autonomous defense. This aligns more naturally with calls from the Apple‑style literature and policy circles for systems whose internal operations can be understood, constrained, and verified rather than treated as opaque black boxes.pmc.ncbi.nlm.nih+8

There is also growing recognition in surveys and special issues that the way forward is not “LLMs or neuromorphic,” but hybrid ecosystems in which large models act as high‑level pattern recognizers or interfaces while brain‑inspired modules handle continuous control, memory, and grounded reasoning. In that sense, biologically faithful and neuromorphic approaches are not merely add‑ons to the current paradigm; they are candidates for the “missing half” of intelligence that the Apple paper shows LLMs do not have.bohrium+2

Putting it together: from critique to construction

Taken together, the Apple “Illusion of Thinking” work and the broader literature on LLM limitations describe, in quantitative terms, where current AI fails to behave like an intelligent system rather than a powerful autocomplete. Neuromorphic and biologically faithful research is best viewed as an attempt to build architectures whose very primitives are closer to the mechanisms that give biological brains their robustness, energy efficiency, and compositional flexibility, instead of patching those properties onto an inherently brittle predictor.nature+6

For a neuromorphic or synthetic‑brain project, this moment is an opportunity: the more clearly mainstream work exposes the mismatch between hype and actual reasoning capacity in LLMs, the easier it becomes to argue for architectures that start from physiology and dynamics rather than from web text. A strong next step for your own work would be to frame your models explicitly against the regimes where Apple and others document LLM collapse—long‑horizon tasks, noisy environments, strict correctness requirements—and show, even at small scale, how a biologically grounded system behaves differently under those stresses.jacobin+2

Beyond the Illusion: Where This Leaves Us

The pattern is now clear.
We have machines that appear intelligent, markets that want them to be intelligent, and a global narrative that insists they must already be intelligent. But appearance is not architecture, and narratives are not mechanisms.

LLMs give us the illusion of understanding because they reproduce patterns of language, not the grounded processes that generate thought. Their failures—hallucinations, inconsistency, brittleness—are not bugs but the natural limits of a system that has no internal model of reality.

This is why so many organisations are reaching a breaking point.
They’ve hit the boundary where “smart outputs” no longer mask a shallow internal structure. They need systems that can reason, adapt, explain, and operate reliably—qualities that cannot be bolted onto a statistical engine built for autocomplete.

The future of intelligent systems will not come from stretching the illusion further.
It will come from replacing it with a computational foundation that mirrors the physical constraints of real cognition.

Why Qognetix Is Building Something Different

At Qognetix, we start from a simple principle:

If intelligence is a physical process, then intelligent machines must be built on physical principles—not statistical shortcuts.

This is the foundation of our Synthetic Intelligence (SI) stack.
Instead of approximating intelligence through probability, we build mechanisms inspired by electrophysiology—systems that behave according to biophysical rules, offering:

predictability rooted in dynamics, not guesswork
explainability down to the equations
traceability and auditability essential for regulated sectors
robustness and sovereignty without retraining cycles
a bridge between neuroscience, simulation, and engineered intelligence

Our early work, including BioSynapStudio Lab, shows how grounded computation can deliver biological fidelity on consumer hardware—something that no statistical model can provide because it was never designed for it.

From Hype to Engineering Reality

The AI era has produced dazzling surface-level results, but the cracks underneath are now impossible to ignore.
What comes next will require a new substrate—one that treats intelligence as a system to be engineered, not imitated.

Synthetic Intelligence is that path.

Our mission at Qognetix is to move the field beyond the illusion of intelligence and toward technologies that are:

transparent
mechanistic
faithful to the biology
capable of supporting true reasoning

If the last decade was dominated by artificial intelligence as spectacle,
the next will belong to Synthetic Intelligence as a scientific discipline—
and to the companies willing to build it from first principles.

Sources

Apple / LLM limitations / hype

Apple Machine Learning Research – “The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models”
https://machinelearning.apple.com/research/illusion-of-thinking machinelearning.apple
Ninza7 – “Apple Just Pulled the Plug on the AI Hype. Here’s What Their Shocking Study Found”
https://ninza7.medium.com/apple-just-pulled-the-plug-on-the-ai-hype-heres-what-their-shocking-study-found-24ad42c234a0 machinelearning.apple
The Cube Research – “Apple Exposes the Hype: LLMs Cannot Reason. What You Need to Know About the Future of AI Reasoning”
https://thecuberesearch.com/apple-exposes-the-hype-llms-cannot-reason-what-you-need-to-know-about-the-future-of-ai-reasoning/thecuberesearch
Mashable – “‘The illusion of thinking’: Apple research finds AI models struggle with real reasoning”
https://mashable.com/article/apple-research-ai-reasoning-models-collapse-logic-puzzles mashable
NIH / PMC – “Assessing the Current Limitations of Large Language Models”
https://pmc.ncbi.nlm.nih.gov/articles/PMC11756841/pmc.ncbi.nlm.nih
Nature Scientific Reports – “Limitations of large language models in clinical problem‑solving”
https://www.nature.com/articles/s41598-025-22940-0 nature
MIT Press / Open Mind – “The Limitations of Large Language Models for Understanding”
https://direct.mit.edu/opmi/article/doi/10.1162/opmi_a_00160/124234/The-Limitations-of-Large-Language-Models-for direct.mit
arXiv – “A Primer on Large Language Models and their Limitations”
https://arxiv.org/html/2412.04503v1 arxiv
ScienceDirect – “Generative AI and Large Language Models – Benefits, Drawbacks and Future Directions”
https://www.sciencedirect.com/science/article/pii/S1877050924027492 sciencedirect
AIMultiple – “The Future of Large Language Models”
https://research.aimultiple.com/future-of-large-language-models/research.aimultiple
Jacobin – “Don’t Believe the Hype — or Doom — About AI”
https://jacobin.com/2025/12/hype-artificial-intelligence-vc-capital jacobin
Harvard Business Review – “The AI Revolution Won’t Happen Overnight”
https://hbr.org/2025/06/the-ai-revolution-wont-happen-overnight hbr
McKinsey – “The state of AI in 2025: Agents, innovation, and transformation”
https://www.mckinsey.com/capabilities/quantumblack/our-insights/the-state-of-ai mckinsey
Carl Setzer – “A Critique On AI Hype”
https://www.carlsetzer.com/2025/11/06/a-critique-on-ai-hype/

Neuromorphic / biologically faithful / brain‑inspired computing

PMC – “Brain‑inspired computing: Neuroscience drives the development of neuromorphic computing”
https://pmc.ncbi.nlm.nih.gov/articles/PMC9808067/pmc.ncbi.nlm.nih
arXiv – “A Computational Perspective on NeuroAI and Synthetic Intelligence”
https://arxiv.org/html/2509.23896v2 arxiv
PMC – “Neuromorphic algorithms for brain implants: a review”
https://pmc.ncbi.nlm.nih.gov/articles/PMC12021827/pmc.ncbi.nlm.nih
National Science Review – “Advancing brain-inspired computing with hybrid neural networks”
https://academic.oup.com/nsr/article/11/5/nwae066/7614610 academic.oup
Frontiers – “Brain-Inspired Computing under the Era of Large Model and Generative AI: Theory, Algorithms and Applications” (research topic overview)
https://www.frontiersin.org/research-topics/63741/brain-inspired-computing-under-the-era-of-large-model-and-generative-ai-theory frontiersin
arXiv – “Bio-inspired AI: Integrating Biological Complexity into Machine Intelligence”
https://arxiv.org/html/2411.15243v1 arxiv
arXiv – “SpikeLLM: Scaling up Spiking Neural Network to Large Language Models”
https://arxiv.org/html/2407.04752v1 arxiv
arXiv – “Brain-inspired Computing Based on Deep Learning for Human-like Intelligence”
https://arxiv.org/abs/2312.07213 arxiv
Bohrium – “Brain-inspired Computing Based on Machine Learning And Deep Learning: A Review”
https://www.bohrium.com/paper-details/brain-inspired-computing-based-on-machine-learning-and-deep-learning-a-review/941757976534 bohrium
Los Alamos National Laboratory – “Neuromorphic computing: the future of AI”
https://www.lanl.gov/media/publications/1663/1269-neuromorphic-computing lanl
Frontiers in Artificial Intelligence – “Effective methods and framework for energy-based local learning in neuromorphic systems”
https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2025.1605706/full frontiersin
ScienceDirect – “Neurons as autonomous agents: A biologically inspired perspective on computation and control”
https://www.sciencedirect.com/science/article/pii/S138904172500018X sciencedirect
Frontiers in Big Data – “What is next for LLMs? Pushing the boundaries of next-gen AI”
https://pmc.ncbi.nlm.nih.gov/articles/PMC12592636/pmc.ncbi.nlm.nih
Situated Cognition – “Workshop: Philosophy of Neuromorphic AI”
https://situated-cognition.com/2024/01/03/neuromorphic_ai/situated-cognition
Frontiers in Artificial Intelligence – “Machine learning and artificial intelligence in neuroscience”
https://www.sciencedirect.com/science/article/pii/S0889159123003380 sciencedirect

Nic Windley

Nic Windley is co-founder of Qognetix, a company developing deterministic runtime infrastructure for persistent intelligent systems. His background includes electronics engineering and early exposure to computational approaches such as fuzzy logic. During his MBA he undertook an artificial intelligence project exploring machine learning systems and their practical application. He has also been involved in the training and evaluation of AI models and has held technology leadership roles with responsibility for operating systems and P&L performance. His current work focuses on synthetic intelligence substrates designed to support reliable, governed intelligent behaviour in long-running operational environments.

The Illusion of ‘Smart’ Machines: Exposing the AI Hype

What Apple’s study actually shows

Other empirical limits being exposed

Economic and industry‑level hype

Technical limits of current LLM architecture

Where “truth‑telling” about AI is heading

From “illusion of thinking” to “mechanisms of thinking”

Why neuromorphic systems matter for the current critique

Energy, embodiment, and real‑world grounding

A different research and economic narrative

Putting it together: from critique to construction

Beyond the Illusion: Where This Leaves Us

Why Qognetix Is Building Something Different

From Hype to Engineering Reality

Sources

Apple / LLM limitations / hype

Neuromorphic / biologically faithful / brain‑inspired computing

Nic Windley

Leave a Reply Cancel reply

More Articles:

Birmingham AI Infrastructure Startup Qognetix Selected for UK StartUp Awards Midlands Final

Enterprise AI Architecture and the Retraining Problem Revealed by Doom-on-a-Chip

Agentic AI Has Outgrown Its Hardware: Why True Agents Require a New Computational Substrate

The Illusion of ‘Smart’ Machines: Exposing the AI Hype

What Apple’s study actually shows

Other empirical limits being exposed

Economic and industry‑level hype

Technical limits of current LLM architecture

Where “truth‑telling” about AI is heading

From “illusion of thinking” to “mechanisms of thinking”

Why neuromorphic systems matter for the current critique

Energy, embodiment, and real‑world grounding

A different research and economic narrative

Putting it together: from critique to construction

​Beyond the Illusion: Where This Leaves Us

Why Qognetix Is Building Something Different

From Hype to Engineering Reality

Sources

Apple / LLM limitations / hype

Neuromorphic / biologically faithful / brain‑inspired computing

Nic Windley

Leave a Reply Cancel reply

More Articles:

Birmingham AI Infrastructure Startup Qognetix Selected for UK StartUp Awards Midlands Final

Enterprise AI Architecture and the Retraining Problem Revealed by Doom-on-a-Chip

Agentic AI Has Outgrown Its Hardware: Why True Agents Require a New Computational Substrate

Beyond the Illusion: Where This Leaves Us