Coding and Decoding Reasoning

Fable 5 Breach Leaks Cryptic AI Chain of Thought Shorthand

Fable 5's chain of thought has leaked, showing math-like shorthand, while its three-layer defense classifiers block most jailbreak attempts.

Tech Times

NVIDIA Diffusion LLM Hits 2.42x Throughput Without Retraining: Nemotron TwoTower Released

NVIDIA diffusion language model Nemotron TwoTower achieves 2.42x LLM inference throughput without a full retraining run, ...

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

Tech Times

DeepSeek Releases DSpark: Speculative Decoding Makes V4 Up to 85 Percent Faster

DeepSeek speculative decoding framework DSpark went live June 27 on V4-Flash and V4-Pro, reporting up to 85 percent faster ...

Developer Tech

NVIDIA: DFlash block diffusion accelerates autoregressive LLMs

Deploying DFlash block diffusion on NVIDIA hardware accelerates autoregressive LLMs during latency-sensitive inference.

10d

What is GLM-5.2: China’s AI model challenging Anthropic’s Claude Fable 5 in coding and long-context reasoning

In recent days, a new large language model from China has started circulating through technical circles with an unusual mix ...

The Financial Express

What is GLM-5.2? Chinese AI model making Silicon Valley sit up again

Explore the Chinese open-source AI model challenging OpenAI and Anthropic with powerful coding abilities, agentic workflows, ...

Developer Tech

What is GLM-5.2? Z.ai targets coding agents

Z.ai’s GLM-5.2 is an open-source model aimed at long-context coding-agent workflows, with support for a one million-token ...

16d

Z.ai pitches GLM-5.2 for long-running software engineering tasks

The open-source model combines a one-million-token context window with architectural updates aimed at lowering the cost of repository-scale AI coding.

16d

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

It allows engineering teams to host frontier-level AI on their own sovereign infrastructure, entirely eliminating vendor lock ...

decrypt

China's Xiaomi MiMo Is Now 15X Faster Than ChatGPT and Claude

Add Decrypt as your preferred source to see more of our stories on Google. Xiaomi and inference partner TileRT have broken 1,000 tokens per second on a 1-trillion-parameter model, a first at that ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results