RLtoolbox Function
On policy Evaluation
Calling Sequence
- Returns=Mc_On_Policy_Evaluation(Episode, Returns)
Parameters
- Episode
: List of each action-value during episode
- Returns
: All returns (rewards) for episodes. (This parameter is also returned)
Description
Evaluate the policy given in each Episodes, and returned the modified Returns.
Examples
None
See Also
Mc On Policy Improvement