![]() |
On December 18, 2008, I successfully defended my thesis. Thank you to everyone that helped me through this process. | |
Publications | ||
|
Provably Efficient Learning with Typed Parametric Models Emma Brunskill, Bethany R. Leffler, Lihong Li, Michael L. Littman, and Nicholas Roy Journal of Machine Learning Research, 10:1955--1988, 2009. |
||
|
To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuous-valued domains must use a representation that is both sufficiently powerful to capture important domain characteristics, and yet simultaneously allows generalization, or sharing, among experiences. Our algorithm balances this tradeoff by using a stochastic, switching, parametric dynamics representation. We argue that this model characterizes a number of significant, real-world domains, such as robot navigation across varying terrain. We prove that this representational assumption allows our algorithm to be probably approximately correct with a sample complexity that scales polynomially with all problem-specific quantities including the state-space dimension. We also explicitly incorporate the error introduced by approximate planning in our sample complexity bounds, in contrast to prior Probably Approximately Correct (PAC) Markov Decision Processes (MDP) approaches, which typically assume the estimated MDP can be solved exactly. Our experimental results on constructing plans for driving to work using real car trajectory data, as well as a small robot experiment on navigating varying terrain, demonstrate that our dynamics representation enables us to capture real-world dynamics in a sufficient manner to produce good performance. |
||
| The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning. Carlos Diuk, Lihong Li, and Bethany R. Leffler Proceedings of the International Conference on Machine Learning. June 2009. |
||
| The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an algorithm, known as the Adaptive k-Meteorologists Algorithm, analyze its sample-complexity upper bound, and give a matching lower bound. Second, this algorithm is used to create a new reinforcement-learning algorithm for factoredstate problems that enjoys significant improvement over the previous state-of-the-art algorithm. Finally, we apply the Adaptive k-Meteorologists Algorithm to remove a limiting assumption in an existing reinforcement-learning algorithm. The effectiveness of our approaches is demonstrated empirically in a couple benchmark domains as well as a robotics navigation problem. | ||
![]() |
Efficient Learning of Dynamics Models using Terrain Classification Bethany R. Leffler, Christopher R. Mansley, and Michael L. Littman Proceedings of the International Workshop on Evolutionary and Reinforcement Learning for Autonomous Robot Systems. July, 2008. BibTex | |
|
Terrain classification in robotics has heavily focused on determining a region for traversal, while also labeling obstacles. Our work attempts to expand this essentially binary viewpoint and to use terrain classifiers as an indicator for switching between a set of system dynamics. By learning multiple models of the system dynamics, the robot is able to assess alternative paths based on traversal costs of different terrain types instead of strict distance metrics. We demonstrate a system that reliably learns an optimal control policy using this additional terrain information and contrast it with several systems based on more traditional methods that fail to reliably complete the same task. | ||
| CORL: A Continuous-State Offset-Dynamics Reinforcement Learner Emma Brunskill, Bethany R. Leffler, Lihong Li, Michael L. Littman, and Nicholas Roy Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI-08) , 2008. BibTex Continuous state spaces and stochastic, switching dynamics characterize a number of rich, real-world domains, such as robot navigation across varying terrain. We describe a reinforcement-learning algorithm for learning in these domains and prove for certain environments the algorithm is probably approximately correct with a sample complexity that scales polynomially with the state-space dimension. Unfortunately, no optimal planning techniques exist in general for such problems; instead we use fitted value iteration to solve the learned MDP, and include the error due to approximate planning in our bounds. Finally, we report an experiment using a robotic car driving over varying terrain to demonstrate that these dynamics representations adequately capture real-world dynamics and that our algorithm can be used to efficiently solve such problems. |
||
![]() |
Efficient Reinforcement Learning with Relocatable Action Models Bethany R. Leffler, Michael L.Littman, and Timothy Edmunds Proceedings of the Twenty-Second Conference on Artificial Intelligence. July, 2007. BibTex | |
|
Realistic domains for learning possess regularities that make it possible to generalize experience across related states. This paper explores an environment-modeling framework that represents transitions as state-independent outcomes that are common to all states that share the same type. We analyze a set of novel learning problems that arise in this framework, providing lower and upper bounds. We single out one particular variant of practical interest and provide an efficient algorithm and experimental results in both simulated and robotic environments. | ||
| Movie | ||
![]() |
Efficient Exploration with Latent Structure Bethany R. Leffler, Michael L. Littman, Alexander L. Strehl, and Thomas J. Walsh. Proceedings of Robotics: Science and Systems. Cambridge, USA. June 2005. BibTex | |
When interacting with a new environment, a robot can improve its online performance by efficiently exploring the effects of its actions. The efficiency of exploration can be expanded significantly by modeling and using latent structure to generalize experiences. We provide a theoretical development of the problem of exploration with latent structure, analyze several algorithms and prove matching lower bounds. We demonstrate our algorithmic ideas on a simple robot car repeatedly traversing a path with two different surface properties. |
Links |
|
| Perceptual Science Program | ||
| Rutgers Laboratory for Real Life Reinforcement Learning (RL3) | ||
| Women In Machine Learning Workshop | ||