RLtoolbox Function

Mc_Off_Policy_Improve - Search for an optimal policy.

Calling Sequence

Pi=Mc_Off_Policy_Improve(Episode, Pi, Q, Actionlist)

Parameters

Description

Searches for an optimal policy for the environment described by the other parameters.
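
The page does not say how the policy is improved. As a rough illustration only, the sketch below shows one common form of policy improvement used in Monte Carlo control: for every state visited in an episode, the policy is made greedy with respect to the current action-value estimates. It is written in Python and is not the toolbox's implementation. The function name greedy_policy_improve, the data layouts (an episode as a list of (state, action, reward) tuples, Q as a dictionary keyed by (state, action), Pi as a dictionary from state to action) and the default value of 0.0 for unseen pairs are all assumptions made for this example.

```python
def greedy_policy_improve(episode, pi, q, action_list):
    """Hypothetical sketch: make pi greedy w.r.t. q for states seen in episode.

    episode     -- list of (state, action, reward) tuples (assumed layout)
    pi          -- dict mapping state -> action (assumed layout)
    q           -- dict mapping (state, action) -> estimated value (assumed layout)
    action_list -- list of all available actions
    """
    new_pi = dict(pi)  # copy, so the caller's policy is left untouched
    for state, _action, _reward in episode:
        # Choose the action with the highest estimated value.
        # Unseen (state, action) pairs default to 0.0 (an assumption);
        # ties go to the action listed first in action_list.
        new_pi[state] = max(action_list, key=lambda a: q.get((state, a), 0.0))
    return new_pi


# Toy data, invented purely for illustration.
episode = [("s0", "left", 0.0), ("s1", "right", 1.0)]
q = {
    ("s0", "left"): 0.2, ("s0", "right"): 0.5,
    ("s1", "left"): 0.1, ("s1", "right"): 0.9,
}
pi = {"s0": "left", "s1": "left"}
action_list = ["left", "right"]

new_pi = greedy_policy_improve(episode, pi, q, action_list)
print(new_pi)
```

The argument order mirrors the calling sequence (Episode, Pi, Q, Actionlist), but whether the toolbox function updates only visited states, or how it breaks ties, is not stated on this page.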

Examples

See Also

none