OpenAI has found a way to reduce its inference costs by roughly 50%, a development that could reshape the economics of running large language models at scale. Inference is the process of actually ...