Coding Dicoding Reasoning Most

10d

What is GLM-5.2: China’s AI model challenging Anthropic’s Claude Fable 5 in coding and long-context reasoning

In recent days, a new large language model from China has started circulating through technical circles with an unusual mix ...

DeepSeek open sources DSpark, a new framework to speed up LLM inference by up to 85%

DSpark can make decoding faster, but acceptance quality still determines how much speed the system actually realizes.

Tech Times

NVIDIA Diffusion LLM Hits 2.42x Throughput Without Retraining: Nemotron TwoTower Released

NVIDIA diffusion language model Nemotron TwoTower achieves 2.42x LLM inference throughput without a full retraining run, ...

Tech Times

Compile Once, Run Offline: New AI Method Matches 32B Models With a 23MB File

Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2 ...

17d

Z.ai’s open-weights GLM-5.2 beats GPT-5.5 on multiple long-horizon coding benchmarks for 1/6th the cost

It allows engineering teams to host frontier-level AI on their own sovereign infrastructure, entirely eliminating vendor lock ...

11h

Waterloo's PAW compiles task specs into 23MB LoRA adapters a 600M-parameter model runs entirely offline.

Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2, 2026, a system that compiles any natural-language task spec into a 23MB ...

10d

Explained: How China is narrowing the AI gap with the US one model at a time

Just when the AI industry’s attention seemed fixed on OpenAI, Google and Anthropic, a new Chinese model has stolen the ...

i-SCOOP

Token minimizing, how to cut LLM costs without losing quality

Token minimizing is the fastest way to lower LLM costs and latency. Learn practical techniques: prompt trimming, compaction, ...

18don MSN

Intel gets a $170 billion AI reason to matter again

Intel’s AI comeback case now has a $170 billion hook.

EE World Online

Why small language models win at the Edge

By Pietro Antonio Ciclese, Senior Technical Marketing Engineer, Ambarella The workloads that generate the most commercial ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results