B. Scherrer. "Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris." Journal of Machine Learning Research, 2013, Vol. 14, 1175-1221. jmlr pdf hal slides
--- This is a revised version of: B. Scherrer. "Performance Bounds for Lambda Policy Iteration and Application to the Game of Tetris." Tech report. pdf hal
B. Scherrer and B. Lesner. "On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Discounted Markov Decision Processes." NIPS 2012. pdf hal slides
--- This is an extended version of: B. Scherrer. On the Use of Non-Stationary Policies for Stationary Infinite-Horizon Discounted Markov Decision Processes." Tech report. pdf hal
M. Geist, B. Scherrer, A. Lazaric and M. Ghavamzadeh. "A Dantzig Selector for Temporal Difference Learning." ICML 2012. pdf hal
V. Gabillon, A. Lazaric, M. Ghavamzadeh and B. Scherrer. "Classification-based Policy Iteration with a Critic." ICML 2011. pdf hal
B. Scherrer. "Should one compute the Temporal Difference fix point or minimize the Bellman Residual? The unified oblique projection view." ICML 2010. pdf hal slides
C. Thiéry and B. Scherrer. "Least-Squares Lambda Policy Iteration: Bias-Variance Trade-off in Control Problems." ICML 2010. pdf hal slides
--- Related tech report: B. Scherrer and C. Thiéry. "Performance bound for Approximate Optimistic Policy Iteration." Tech report. pdf hal
M. Petrik and B. Scherrer. "Biasing Approximate Dynamic Programming with a Lower Discount Factor." NIPS 2008. pdf hal
C. Thiéry and B. Scherrer. "Building Controllers for Tetris." International Computer Games Association Journal 32 (2009) 3-11. pdf hal
C. Thiéry and B. Scherrer. "Improvements on Learning Tetris with Cross Entropy." International Computer Games Association Journal 32 (2009). pdf hal
C. Thiéry and B. Scherrer. "Construction d'un joueur artificiel pour Tetris." Revue d'Intelligence Artificielle 23, 2-3 (2009) 387-407. pdf hal
A. Dutech, B. Scherrer and C. Thiéry. "La carotte et le bâton... et Tetris." Interstices (2008). html on-line game
B. Girau, A. Boumaza, B. Scherrer and C.-T. Huitzil. "Block-synchronous harmonic control for scalable trajectory planning." Robotics, Automation and Control I-Tech Publications (2008). pdf hal
A. Boumaza and B. Scherrer. "Convergence and Rate of Convergence of a Foraging Ant Model." CEC 2007. pdf hal
--- Extended version: A. Boumaza and B. Scherrer. "Convergence and rate of convergence of simple ant models. Tech report. pdf hal
A. Boumaza and B. Scherrer. "Optimal control subsumes harmonic control." ICRA 2007. pdf hal
--- Extended version: A. Boumaza and B. Scherrer. "Optimal control subsumes harmonic control." Tech report. pdf hal slides
B. Scherrer. "Asynchronous Neurocomputing for optimal control and reinforcement learning with large state spaces." Neurocomputing 23 (2005). pdf hal