• Contact

  • Newsletter

  • About us

  • Delivery options

  • Prospero Book Market Podcast

  • News

  • 0
    Reinforcement Learning: Theory and Python Implementation

    Reinforcement Learning by Xiao, Zhiqing;

    Theory and Python Implementation

      • GET 20% OFF

      • The discount is only available for 'Alert of Favourite Topics' newsletter recipients.
      • Publisher's listprice EUR 80.24
      • The price is estimated because at the time of ordering we do not know what conversion rates will apply to HUF / product currency when the book arrives. In case HUF is weaker, the price increases slightly, in case HUF is stronger, the price goes lower slightly.

        34 037 Ft (32 416 Ft + 5% VAT)
      • Discount 20% (cc. 6 807 Ft off)
      • Discounted price 27 229 Ft (25 933 Ft + 5% VAT)

    34 037 Ft

    db

    Availability

    Estimated delivery time: In stock at the publisher, but not at Prospero's office. Delivery time approx. 3-5 weeks.
    Not in stock at Prospero.

    Why don't you give exact delivery time?

    Delivery time is estimated on our previous experiences. We give estimations only, because we order from outside Hungary, and the delivery time mainly depends on how quickly the publisher supplies the book. Faster or slower deliveries both happen, but we do our best to supply as quickly as possible.

    Product details:

    • Edition number 2024
    • Publisher Springer
    • Date of Publication 29 September 2024
    • Number of Volumes 1 pieces, Book

    • ISBN 9789811949326
    • Binding Hardback
    • No. of pages559 pages
    • Size 235x155 mm
    • Language English
    • Illustrations 1 Illustrations, black & white; 60 Illustrations, color
    • 641

    Categories

    Short description:

    Reinforcement Learning: Theory and Python Implementation is a tutorial book on reinforcement learning, with explanations of both theory and applications. Starting from a uniform mathematical framework, this book derives the theory of modern reinforcement learning systematically and introduces all mainstream reinforcement learning algorithms such as PPO, SAC, and MuZero. It also covers key technologies of GPT training such as RLHF, IRL, and PbRL. Every chapter is accompanied by high-quality implementations, and all implementations of deep reinforcement learning algorithms are with both TensorFlow and PyTorch. Codes can be found on GitHub along with their results and are runnable on a conventional laptop with either Windows, macOS, or Linux.



    This book is intended for readers who want to learn reinforcement learning systematically and apply reinforcement learning to practical applications. It is also ideal to academical researchers who seek theoretical foundation or algorithm enhancement in their cutting-edge AI research.

    More

    Long description:

    Reinforcement Learning: Theory and Python Implementation is a tutorial book on reinforcement learning, with explanations of both theory and applications. Starting from a uniform mathematical framework, this book derives the theory of modern reinforcement learning systematically and introduces all mainstream reinforcement learning algorithms such as PPO, SAC, and MuZero. It also covers key technologies of GPT training such as RLHF, IRL, and PbRL. Every chapter is accompanied by high-quality implementations, and all implementations of deep reinforcement learning algorithms are with both TensorFlow and PyTorch. Codes can be found on GitHub along with their results and are runnable on a conventional laptop with either Windows, macOS, or Linux.



    This book is intended for readers who want to learn reinforcement learning systematically and apply reinforcement learning to practical applications. It is also ideal to academical researchers who seek theoretical foundation or algorithm enhancement in their cutting-edge AI research.

    More

    Table of Contents:

    Chapter 1. Introduction of Reinforcement Learning (RL).- Chapter 2. MDP: Markov Decision Process.- Chapter 3. Model-based Numerical Iteration.- Chapter 4. MC: Monte Carlo Learning.- Chapter 5. TD: Temporal Difference Learning.- Chapter 6. Function Approximation.- Chapter 7. PG: Policy Gradient.- Chapter 8. AC: Actor?Critic.- Chapter 9. DPG: Deterministic Policy Gradient.- Chapter 10. Maximum-Entropy RL.- Chapter 11. Policy-based Gradient-Free Algorithms.- Chapter 12. Distributional RL.- Chapter 13. Minimize Regret.- Chapter 14. Tree Search.- Chapter 15. More Agent?Environment Interfaces.- Chapter 16. Learn from Feedback and Imitation Learning.

    More
    Recently viewed
    previous
    Reinforcement Learning: Theory and Python Implementation

    Reinforcement Learning: Theory and Python Implementation

    Xiao, Zhiqing;

    34 037 HUF

    Reconstructing Shakespeare in the Nordic Countries: National Revival and Interwar Politics, 1870 ? 1940

    Reconstructing Shakespeare in the Nordic Countries: National Revival and Interwar Politics, 1870 ? 1940

    Keinänen, Nely; Sivefors, Per; (ed.)

    40 488 HUF

    next