LLM Diffusion Models - Search News

Inception Launches Mercury 2, the Fastest Reasoning LLM - 5x Faster Than Leading Speed-Optimized LLMs, with Dramatically Lower Inference Cost

Inception, the company behind the first commercial diffusion large language models (dLLMs), today announced the launch of Mercury 2, the fastest reasoning LLM and first reasoning dLLM. Mercury 2 ...

Mercury 2 : World’s Fastest Reasoning AI Model Built for Production Applications

The new Mercury 2 AI model uses diffusion reasoning to generate 1,000 tokens per second; it runs about 5x faster than Haiku, speed limits are ...

eWeek

Need for Speed: Mercury 2 Is 13x Faster Than Claude Haiku

Mercury 2 introduces diffusion LLMs to text, delivering 10x faster speeds for AI agents and production workflows without sacrificing reasoning power.

SiliconANGLE

Microsoft reportedly develops LLM series that can rival OpenAI, Anthropic models

Microsoft Corp. has developed a series of large language models that can rival algorithms from OpenAI and Anthropic PBC, multiple publications reported today. Sources told Bloomberg that the LLM ...

InfoWorld

Google unveils open source projects for generative AI

Google at Google Cloud Next 24 unveiled three open source projects for building and running generative AI models. The company also introduced new large language models to its MaxText project of ...

SDxCentral

DeepSeek looks to offload simple LLM tasks to save billions of parameters

A little over a year after it upended the tech industry, DeepSeek is back with another apparent breakthrough: a means to stop current large language models (LLMs) from wasting computational depth on ...

InfoWorld

LiteLLM: An open-source gateway for unified LLM access

LiteLLM allows developers to integrate a diverse range of LLM models as if they were calling OpenAI’s API, with support for fallbacks, budgets, rate limits, and real-time monitoring of API calls. The ...

The Hacker News

How Exposed Endpoints Increase Risk Across LLM Infrastructure

Exposed endpoints quietly expand attack surfaces across LLM infrastructure. Learn why endpoint privilege management is important to AI security.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results