RLtoolbox Function

Value_Iteration - Search an optimal policy.

Calling Sequence

[Pi,V,T_V,]=Value_Iteration(NbStates, NbActions, TransProb, Rewards, Gamma, Actions_States)

Parameters

Description

Search an optimal policy for the environnment described by the others parameters.

Examples

See Also

Iter_Policy_Improv