Query Processing System

New agentic memory framework uses 118K tokens per query. LangMem burns through 3.26M.

NUS researchers' MRAgent framework reduces LLM agent memory retrieval to 118K tokens per query — vs. 3.26M for LangMem — ...

Waterloo's PAW compiles task specs into 23MB LoRA adapters a 600M-parameter model runs entirely offline.

Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2, 2026, a system that compiles any natural-language task spec into a 23MB ...

Interesting Engineering on MSN

New safety-critical cockpit automation engine unveiled to minimize flight plan data errors

Artificial intelligence is rapidly finding its way into nearly every industry, but aviation has ...

Tech Times

Compile Once, Run Offline: New AI Method Matches 32B Models With a 23MB File

Local AI inference at 32B-parameter quality, no cloud API required: University of Waterloo researchers released PAW on July 2 ...

InfoWorld

EDB converges analytics on Postgres to support AI agents

EDB’s Postgres-first approach to converged analytics could appeal to enterprises seeking greater control over data, ...

Tech Xplore on MSN

Next-generation database reduces AI hallucinations and improves accuracy by 78%

One of the greatest weaknesses of AI agents that read and understand vast amounts of enterprise data is "hallucination"—the generation of plausible-sounding but factually incorrect information. KAIST ...

Tech Times

OpenAI Halves Inference Costs With Software Alone: GPUs Drop to Hundreds

OpenAI inference cost reduction cut ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, ...

OpenAI engineers cut ChatGPT guest traffic to a few hundred Nvidia GPUs, with no new hardware deployed.

OpenAI inference cost reduction cut ChatGPT guest traffic from tens of thousands of Nvidia GPUs to just a couple hundred, using software optimization alone. Engineers achieved more than 50% savings ...

InfoWorld

AWS aims to lower log analytics costs with new analytics engine for managed OpenSearch

The new engine could let enterprises retain more telemetry data for compliance and incident response at lower cost, although migration work and missing query support may slow adoption, analysts say.

6don MSN

Nvidia Stock Hasn't Been This Cheap in 7 Years. Is This the Ultimate Buying Opportunity?

Nvidia's declining stock price and rapidly growing earnings have led to a very attractive valuation.

Communications of the ACM

The LLVM Compiler Infrastructure

LLVM powers the core development tools, operating systems, and most applications at Apple Computer, where it long ago ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results