RLtoolbox Function

Off policy Evaluation

Calling Sequence

[N, D, Q]=Mc_Off_Policy_Evaluation(Episode, tau, PiPrime, N, D, Q)

Parameters

Description

Evaluate the policy given in each Episodes, and returned the modified Returns.

Examples

See Also

Mc Off Policy Improvement