# Value Determination with General Function Approximators

### Vassilis Papavassiliou and Stuart Russell

### EECS Department

University of California, Berkeley

Technical Report No. UCB/CSD-98-1005

May 1998

A new algorithm is described for value determination in Markov decision processes. The algorithm works with arbitrary approximate representations of the value function. We show that if the approximating family is agnostically PAC-learnable, then the algorithm converges to a solution that is close to the globally optimal solution in the approximating family.

