Apple open-sourced DiffuCoder, a diffusion large language model (dLLM) fine-tuned for coding tasks. DiffuCoder is based on Qwen-2.5-Coder and outperforms other code-specific LLMs on several coding ...
NVIDIA's diffusion language model Nemotron TwoTower achieves 2.42x LLM inference throughput without a full retraining run, ...
Deploying DFlash block diffusion on NVIDIA hardware accelerates autoregressive LLMs during latency-sensitive inference.
Both models trade word-by-word generation for parallel denoising. Only one of them does it without losing intelligence in the ...
With so much money flooding into AI startups, it’s a good time to be an AI researcher with an idea to test out. And if the idea is novel enough, it might be easier to get the resources you need as an ...
Rather than generating text word by word, Google's experimental open-source model drafts entire passages simultaneously using diffusion, resulting in up to 4x faster inference.
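The contrast between word-by-word generation and drafting a passage in parallel can be illustrated with a toy sketch. Nothing here comes from any of the models named in these items: `toy_predict` is a hypothetical stand-in for a model that always guesses correctly, and `tokens_per_step` is an assumed parameter. The sketch only counts how many model calls each decoding style needs, so the ratio it produces is an artifact of the toy and not a measured speedup.

```python
MASK = "<mask>"

def toy_predict(sequence, target):
    """Hypothetical stand-in for a model: returns a guess for every position.
    A real model would condition on `sequence`; this toy just returns the target."""
    return list(target)

def autoregressive_decode(target):
    """Emit one token per model call, left to right."""
    out, calls = [], 0
    while len(out) < len(target):
        guess = toy_predict(out, target)
        calls += 1
        out.append(guess[len(out)])
    return out, calls

def parallel_denoise_decode(target, tokens_per_step=4):
    """Start fully masked; each model call fills several masked positions at once.
    Assumption: positions are filled left to right. Real diffusion decoders
    typically pick positions by confidence and may revise earlier choices."""
    seq, calls = [MASK] * len(target), 0
    while MASK in seq:
        guess = toy_predict(seq, target)
        calls += 1
        masked = [i for i, tok in enumerate(seq) if tok == MASK]
        for i in masked[:tokens_per_step]:
            seq[i] = guess[i]
    return seq, calls

target = "def add ( a , b ) : return a + b".split()
ar_out, ar_calls = autoregressive_decode(target)
pd_out, pd_calls = parallel_denoise_decode(target, tokens_per_step=4)
print(f"autoregressive calls: {ar_calls}, parallel denoising calls: {pd_calls}")
```

Both decoders produce the same sequence here; the parallel one simply needs fewer model calls because each call commits several tokens. In practice the trade-off is between how many tokens are committed per step and output quality.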
Cursor, a San Francisco AI coding platform from startup Anysphere valued at $29.3 billion, has launched Composer 2, a new fine-tuned variant of the Chinese open-source model Kimi K2.5, now available inside ...
Cursor’s new Composer model, built for low-latency agentic coding, completes most iterations in under 30 seconds, according to Anysphere. Anysphere has introduced Cursor 2.0, an update to the AI ...