Home
Papers
Research
Courses
People
|
Papers
More information about research areas is available in
research descriptions.
You also can view papers by partially overlapping categories
Reinforcement learning papers
- S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari.
Convergence results for single-step on-policy reinforcement-learning
algorithms.
Machine Learning, 38(3):287, 2000.
[postscript], [gzipped postscript]
- S. Singh, T. Jaakkola, and M. Jordan.
Reinforcement learning with soft state aggregation.
In Advances in Neural Information Processing Systems 7, 1994.
[postscript], [gzipped postscript]
- T. Jaakkola, S. Singh, and M. Jordan.
Reinforcement learning algorithm for partially observable markov
decision problems.
In Advances in Neural Information Processing Systems 7, 1994.
[postscript], [gzipped postscript]
- S. Singh, T. Jaakkola, and M. Jordan.
Learning without state estimation in partially observable
environments.
In Proceedings of the Eleventh Machine Learning Conference,
1994.
[postscript], [gzipped postscript]
- T. Jaakkola, M. Jordan, and S. Singh.
On the convergence of stochastic iterative dynamic programming
algorithms.
Neural Computation, 6(6):1185--1201, 1994.
[postscript], [gzipped postscript]
- T. Jaakkola, M. Jordan, and S. Singh.
Convergence of stochastic iterative dynamic programming algorithms.
In Advances in Neural Information Processing Systems 6, 1993.
[postscript], [gzipped postscript]
|