.. _sphx_glr_auto_examples_gradient-based-onpolicy:
Gradient-based on-policy learning agents
----------------------------------------
The following examples showcase how to use gradient-based Reinforcement Learning
techniques (in particular, Q-learning and Deterministic Policy Gradient) to train a
Model Predictive Controller (MPC) scheme for a simple task in an on-policy fashion.
.. raw:: html
.. thumbnail-parent-div-open
.. raw:: html
.. only:: html
.. image:: /auto_examples/gradient-based-onpolicy/images/thumb/sphx_glr_q_learning_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_gradient-based-onpolicy_q_learning.py`
.. raw:: html
On-policy Q-learning
.. raw:: html
.. only:: html
.. image:: /auto_examples/gradient-based-onpolicy/images/thumb/sphx_glr_dpg_thumb.png
:alt:
:ref:`sphx_glr_auto_examples_gradient-based-onpolicy_dpg.py`
.. raw:: html
On-policy Deterministic Policy Gradient
.. thumbnail-parent-div-close
.. raw:: html
.. toctree::
:hidden:
/auto_examples/gradient-based-onpolicy/q_learning
/auto_examples/gradient-based-onpolicy/dpg