.. _sphx_glr_auto_examples_gradient-based-onpolicy: Gradient-based on-policy learning agents ---------------------------------------- The following examples showcase how to use gradient-based Reinforcement Learning techniques (in particular, Q-learning and Deterministic Policy Gradient) to train a Model Predictive Controller (MPC) scheme for a simple task in an on-policy fashion. .. raw:: html
.. thumbnail-parent-div-open .. raw:: html
.. only:: html .. image:: /auto_examples/gradient-based-onpolicy/images/thumb/sphx_glr_q_learning_thumb.png :alt: :ref:`sphx_glr_auto_examples_gradient-based-onpolicy_q_learning.py` .. raw:: html
On-policy Q-learning
.. raw:: html
.. only:: html .. image:: /auto_examples/gradient-based-onpolicy/images/thumb/sphx_glr_dpg_thumb.png :alt: :ref:`sphx_glr_auto_examples_gradient-based-onpolicy_dpg.py` .. raw:: html
On-policy Deterministic Policy Gradient
.. thumbnail-parent-div-close .. raw:: html
.. toctree:: :hidden: /auto_examples/gradient-based-onpolicy/q_learning /auto_examples/gradient-based-onpolicy/dpg