Least-Square Policy Improvements (LSPI) with Radial Basis Function (RBF) See Lagoudakis, M. G. & Parr, R. (2001) for details. References Lagoudakis, M. G., & Parr, R. (2001). Model-free least-squares policy iteration. NeurIPS.