mpcrl.core.callbacks.AgentCallbackMixin#

class mpcrl.core.callbacks.AgentCallbackMixin[source]#

Class with callbacks for agents.

In particular, this class defines the following callbacks:

on_mpc_failure, invoked when an MPC solver fails
on_validation_start, invoked when validation starts (see mpcrl.Agent.evaluate)
on_validation_end, invoked when validation ends
on_episode_start, invoked when a training or validation episode starts
on_episode_end, invoked when a training or validation episode ends
on_env_step, invoked when a training or validation episode steps, i.e., after gymnasium.Env.step
on_timestep_end, invoked when the current simulation’s time step reaches an end, i.e., after having stepped the environment and done all the internal computations according to the algorithm.

Methods

`on_env_step`(env, episode, timestep)	Callback called after each call to `gymnasium.Env.step`.
`on_episode_end`(env, episode, rewards)	Callback called at the end of each episode in the training or evaluation process (see `mpcrl.Agent.evaluate`, `mpcrl.LearningAgent.train` and `mpcrl.LearningAgent.train_offpolicy`).
`on_episode_start`(env, episode, state)	Callback called at the beginning of each episode in the training or validation process (see `mpcrl.Agent.evaluate`, `mpcrl.LearningAgent.train` and `mpcrl.LearningAgent.train_offpolicy`).
`on_mpc_failure`(episode, timestep, status, raises)	Callback in case of failure of the MPC solver.
`on_timestep_end`(env, episode, timestep)	Callback called at the end of each time iteration.
`on_validation_end`(env, returns)	Callback called at the end of the validation process (see `mpcrl.Agent.evaluate`).
`on_validation_start`(env)	Callback called at the beginning of the validation process (see `mpcrl.Agent.evaluate`)

on_env_step(env, episode, timestep)[source]#

Callback called after each call to gymnasium.Env.step.

Parameters:

Return type:

None

on_episode_end(env, episode, rewards)[source]#

Callback called at the end of each episode in the training or evaluation process (see mpcrl.Agent.evaluate, mpcrl.LearningAgent.train and mpcrl.LearningAgent.train_offpolicy).

Parameters:

Return type:

None

on_episode_start(env, episode, state)[source]#

Callback called at the beginning of each episode in the training or validation process (see mpcrl.Agent.evaluate, mpcrl.LearningAgent.train and mpcrl.LearningAgent.train_offpolicy).

Parameters:

Return type:

None

on_mpc_failure(episode, timestep, status, raises)[source]#

Callback in case of failure of the MPC solver.

Parameters:

episodeint: Number of the episode when the failure happened.
timestepint or None: Timestep of the current episode when the failure happened. Can be None, in case the error occurs inter-episodically or no notion of time step is available.
statusstr: Status of the solver that failed.
raisesbool: Whether the failure should be raised as exception (True) or as a warning (False).

Return type: