Reinforcement Learning in Action: From Foundations to Frontier AI

★★★★★ 4.2 84 reviews

$129.33
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by www.techfitsl.com
We aim to show you accurate product information. Manufacturers, suppliers and others provide what you see here.

Product details

Management number 231977363
Release Date 2026/06/18
List Price $51.73
Model Number 231977363

Reinforcement learning (RL) has become the engine behind some of the most significant advances in modern artificial intelligence, from defeating world champions in Go to aligning large language models with human preferences. Yet despite its central role, RL remains poorly understood by many practitioners who work with these systems daily. Reinforcement Learning in Action: From Foundations to Frontiers bridges the gap between classical RL theory and the cutting-edge techniques driving today’s AI breakthroughs.

The book traces a complete path from Markov Decision Processes and Bellman equations through deep RL methods (DQN, REINFORCE, Actor-Critic, PPO) to the modern landscape of LLM alignment (RLHF, DPO, SimPO, KTO), reasoning optimization (GRPO, VinePPO, MCTS), and agentic systems with tool use, memory, and multi-turn planning. A distinguishing feature is the book’s consistent five-layer pedagogical structure: each algorithm is presented with its key characteristics, a full mathematical derivation, an honest assessment of its advantages and limitations, a complete from-scratch Python/PyTorch implementation in which variable names match the equations, and a hands-on case study with reproducible experiments.

Case studies progress from Grid World navigation and CartPole control to fine-tuning language models with DPO on the HuggingFace ecosystem, training reasoning models with GRPO on mathematical benchmarks, and building a full agentic customer support system. Written for ML engineers, researchers, and advanced students, this book provides both the conceptual depth and implementation fluency needed to understand, build, and extend the RL systems shaping the future of AI.

ISBN10 1041131410
ISBN13 978-1041131410
Edition 1st
Language English
Publisher CRC Press
Item Weight 1.74 pounds
Print length 352 pages
Publication date October 14, 2026

Customer ratings & reviews

4.2 out of 5
★★★★★
84 ratings | 34 reviews
5 stars: 78% (66)
4 stars: 6% (5)
3 stars: 3% (3)
2 stars: 2% (2)
1 star: 11% (9)

There are currently no written reviews for this product.