News

Wang Haifeng detailed three key technological upgrades of the Wenxin Large Model X1.1 at the conference. First is the application of reinforcement learning technology. Through reinforcement learning ...
The paper “Deep Reinforcement Learning-based Multi-Objective Scheduling for Distributed Heterogeneous Hybrid Flow Shops with Blocking Constraints,” authored by Xueyan Sun, Weiming Shen, Jiaxin ...
DeepSeek-R1’s Monday release has sent shockwaves through the AI community, disrupting assumptions about what’s required to achieve cutting-edge AI performance. This story focuses on exactly ...
Deep learning model personalises adaptive therapy for prostate cancer: A deep reinforcement learning model designed to tailor adaptive treatment schedules could double the time to progression for ...
Reinforcement learning techniques could be the keys to integrating robots — who use machine learning to output more than words — into the real world.
Examples of AI models trained with reinforcement learning include Pluribus, a poker-playing bot, and DeepMind's AlphaGo, which plays Go.