LLM Reasoning Model Classification

Nvidia’s new technique cuts LLM reasoning costs by 8x without losing accuracy

Nvidia researchers developed dynamic memory sparsification (DMS), a technique that compresses the KV cache in large language models by up to 8x while maintaining reasoning accuracy — and it can be ...

VentureBeat

Meta researchers open the LLM black box to repair flawed AI reasoning

Researchers at Meta FAIR and the University of Edinburgh have developed a new technique that can predict the correctness of a large language model's (LLM) reasoning and even intervene to fix its ...

MIT Technology Review

Why OpenAI’s new model is such a big deal

The bulk of LLM progress until now has been language-driven. This new model enters the realm of complex reasoning, with implications for physics, coding, and more. This story is from The Algorithm, ...

Communications of the ACM

Formal Reasoning Meets LLMs: Toward AI for Mathematics and Verification

A marriage of formal methods and LLMs seeks to harness the strengths of both.

SiliconANGLE

OpenAI, Anthropic release new reasoning-optimized language models

OpenAI and Anthropic PBC, two of the leading artificial intelligence model providers, today both introduced new large language models optimized for reasoning tasks. OpenAI’s new algorithms, ...

SiliconANGLE

OpenAI makes its o3-mini reasoning model generally available

OpenAI today made its o3-mini large language model generally available for ChatGPT users and developers. Word of the launch leaked a few hours earlier. According to Wired, OpenAI brought o3-mini’s ...

TechCrunch

OpenAI announces new o3 models

OpenAI saved its biggest announcement for the last day of its 12-day “shipmas” event. On Friday, the company unveiled o3, the successor to the o1 “reasoning” model it released earlier in the year. o3 ...

ExtremeTech

Apple Study Reveals 'Fragility' of LLM Reasoning Capabilities

For years, even the best chatbots in the world were hard-pressed to succeed in the Turing Test, an assessment of whether an AI can pass as a human intelligence. Today's powerful generative artificial ...

The Register on MSN

Microsoft boffins figured out how to break LLM safety guardrails with one simple prompt

Chaos-inciting fake news right this way. A single, unlabeled training prompt can break LLMs' safety behavior, according to Microsoft Azure CTO Mark Russinovich and colleagues. They published a research ...

Unite.AI

Why the “Best LLM for Marketing” Doesn’t Exist

Every new large language model release arrives with the same promises: bigger context windows, stronger reasoning, and better benchmark performance. Then, before long, AI-savvy marketers feel a ...

Forbes

The AI Model Showdown — Which LLM Deserves Your Trust?

The battle for AI dominance is heating up — with ChatGPT, Gemini, Claude and Perplexity each vying for enterprise trust in a high-stakes showdown of power, security and performance. In the artificial ...
