PPO Algorithm Paper - Search News

The PERC-Peds rule: a promising step towards structured pulmonary embolism risk assessment in children

Pulmonary embolism accounts for 10–15% of all paediatric venous thromboembolism (VTE) events, with an annual incidence of 0·14–0·9 per 100 000 children.1,2 Although rare, pulmonary embolism is a ...

IEEE

PPO-Based Hierarchical Intelligent Tactical Decision-Making Algorithm for UAV Air Combat

Abstract: To address high dynamics, strong uncertainty, and decision-dimensional explosion in air combat, this paper constructs a PPO-based hierarchical tactical decision-making algorithm (PHT-PPO) ...

Microsoft

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Proximal Policy Optimization (PPO) is central to aligning Large Language Models (LLMs) in reasoning tasks with verifiable rewards. However, standard token-level PPO struggles in this setting due to ...

the-decoder

Alibaba's Qwen team makes AI models think deeper with new algorithm

Alibaba's Qwen team has developed a new training algorithm for reasoning models that assigns different weights to individual tokens based on how much each step influences the subsequent chain of ...

Frontiers

LG-H-PPO: offline hierarchical PPO for robot path planning on a latent graph

The path planning capability of autonomous robots in complex environments is crucial for their widespread application in the real world. However, long-term decision-making and sparse reward signals ...

GitHub

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

Code for reproducing the results in the VinePPO paper. This codebase also provides performant implementation (leveraging vLLM as inference engine*) of popular RL and RL-free baselines (such as PPO, ...

News Medical

Brain cells beat AI in learning speed and efficiency

Researchers have demonstrated that brain cells learn faster and carry out complex networking more effectively than machine learning by comparing how both a Synthetic Biological Intelligence (SBI) ...

techxplore

Brain cells learn faster than machine learning, research reveals

Researchers have demonstrated that brain cells learn faster and carry out complex networking more effectively than machine learning by comparing how both a Synthetic Biological Intelligence (SBI) ...

EurekAlert!

Brain cells learn faster than machine learning, new research reveals

Melbourne, Australia - 12 August 2025 - Researchers have demonstrated that brain cells learn faster and carry out complex networking more effectively than machine learning by comparing how both a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results