My DBLP Entry
- Journal/Conference
Papers
- Thomas J. Walsh,
Ali Nouri, Lihong Li, and
Michael L. Littman: Planning and learning
in environments with delayed feedback. In the Journal of Autonomous Agents and
Multi-Agent Systems, 2008.
A preliminary version appeared in ECML-07.
- Lihong
Li: A worst-case comparison between temporal
difference and residual gradient with linear function approximation. In ICML-08, Helsinki,
Finland,
July, 2008. [slides][poster]
- Lihong
Li, Michael L. Littman, and Thomas J. Walsh: Knows
what it knows: A framework for self-aware learning. In ICML-08, Helsinki,
Finland,
July, 2008. [slides][poster] Co-winner of the “Best Student Paper” Award.
- Ronald Parr, Lihong Li, Gavin Taylor, Christopher
Painter-Wakefield, and Michael L. Littman: An analysis of linear models, linear
value-function approximation, and feature selection for reinforcement
learning. In ICML-08, Helsinki, Finland,
July, 2008. [Ron’s slides][poster]
- Emma Brunskill,
Bethany R. Leffler, Lihong Li,
Michael L. Littman, and Nicholas Roy: CORL: A continuous-state offset-dynamics
reinforcement learner.
In UAI-08, Helsinki, Finland, July, 2008. [Emma’s slides]
- Lihong
Li and Michael L. Littman: Efficient value-function
approximation via online linear regression. In AI&Math-08, Fort
Lauderdale, FL,
January, 2008. [slides]
- Jennifer Wortman,
Yevgeniy Vorobeychik, Lihong Li,
and John Langford: Maintaining equilibria during
exploration in sponsored search auctions. In WINE-07, San Diego, CA,
December, 2007. Also appears
in LNCS 4858. A longer version with proofs is here. [Jenn’s slides].
- Lihong
Li, Vadim Bulitko, and Russell Greiner: Focus
of attention in reinforcement learning. In the Journal of Universal Computer Science, 13(9):1246-1269,
November, 2007.
- Thomas J. Walsh,
Ali Nouri, Lihong Li,
and Michael L. Littman: Planning
and learning in environments with delayed feedback.
In ECML-07, Warsaw, Poland, September, 2007. Also appears in LNCS 4701. [Tom’s slides]
- Ronald Parr,
Christopher Painter-Wakefield, Lihong Li,
and Michael L. Littman: Analyzing
feature generation for value-function approximation.
In ICML-07, Corvallis, OR,
June, 2007. [poster]
- Alexander L. Strehl,
Lihong Li, Eric Wiewiora,
John Langford, and Michael L. Littman: PAC
model-free reinforcement learning. In ICML-06, Pittsburgh, PA,
June, 2006. [slides]
- Alexander L. Strehl,
Lihong Li, and
Michael L. Littman: Incremental
model-based learners with formal learning-time guarantees. In UAI-06, Cambridge, MA,
July, 2006.
- Lihong
Li, Thomas J. Walsh, and Michael L. Littman: Towards a unified theory of state
abstraction for MDPs. In AI&Math-06, Fort Lauderdale, FL, January, 2006. [Tom’s slides]
- Lihong
Li and Michael L. Littman: Lazy approximation for solving continuous
finite-horizon MDPs. In AAAI-05, pages 1175-1180, Pittsburgh, PA,
2005. [slides]
- Lihong
Li, Vadim Bulitko, and Russell Greiner: Batch reinforcement learning with state
importance. In ECML-04, pages 566-568, Pisa, Italy,
2004. Also appears in LNCS 3201. [poster]
- Vadim Bulitko, Lihong Li, Russell Greiner, and
Ilya Levner: Lookahead
pathologies for single agent search. In IJCAI-03, pages
1531-1533, Acapulco, Mexico, August, 2003. [poster]
- Lihong
Li, Vadim Bulitko, Russell Greiner, and Ilya Levner: Improving
an adaptive image interpretation system by leveraging. In the Eighth
Australian and New Zealand
Conference on Intelligent Information Systems, Sydney, Australia,
December 2003.
- Ilya Levner, Vadim
Bulitko, Lihong Li,
Greg Lee, and Russell Greiner: Learning
robust object recognition strategies. In the Eighth Australian
and New Zealand
Conference on Intelligent Information Systems, Sydney, Australia,
December 2003.
- Ilya Levner, Vadim
Bulitko, Lihong Li,
Greg Lee, and Russell Greiner: Automated
feature extraction for object recognition. In Image and Vision
Computing'03 New Zealand,
Palmerston North, New
Zealand, November 2003.
- Ilya Levner, Vadim
Bulitko, Lihong Li,
Greg Lee, and Russell Greiner: Towards
automated creation of image interpretation systems. In the Sixteenth
Australian Joint Conference on Artificial Intelligence, pages 653-665,
Perth, Australia, December 2003.
Also appears in LNCS 2903.
- Lihong Li, Min Shao,
Zhenkun Zheng, Chuan He, and Zhi-Hui Du: Typical XML
document transformation methods and an application system. Computer
Science, 30(2): 40-44, China, 2003.
- Min Shao, Lihong Li, Zhenkun Zheng, Chuan
He, Peng Liu, Yu Chen, Zhi-Hui Du, and Sanli Li: XML and its application in clusters THNPSC-2.
In Proc. Chinese Symposium on High Performance Computing and
Application, Shanghai,
China,
2001.
- Min Shao, Lihong Li, Zhenkun Zheng, and
Chuan He: Practical Programming in XML. Tsinghua University
Press, Beijing,
China,
Dec. 2002. ISBN 7-900643-85-0.

- Lihong
Li, Michael L. Littman, and Thomas J.
Walsh: Knows what it knows: A framework for
self-aware learning.
To appear in European
Workshop on Reinforcement Learning, France, July, 2008. Also appeared in ICML-08.
- Lihong Li: Reinforcement learning via online
linear regression. In New York Academy of Sciences
Symposium on Machine Learning, October, 2007. [poster]
- Thomas J. Walsh, Ali Nouri,
and Lihong Li: Planning and learning in
environments with delays.
In
New York
Academy of
Sciences Symposium on Machine Learning, October, 2007. [poster]
- Ronald Parr, Christopher
Painter-Wakefield, Lihong Li,
and Michael L. Littman: Analyzing
feature generation for value function approximation. In Snowbird Learning Workshop, San Juan,
Puerto Rico, March, 2007.
- Alexander L. Strehl, Lihong
Li, Eric Wiewiora, John Langford, and Michael L.
Littman: PAC model-free
reinforcement learning.
In New York
Academy of
Sciences Symposium on Machine Learning, October, 2006. Winner
of the “Best Student Paper” Award. A conference version appeared in
ICML-06.
- Alexander L. Strehl, Lihong
Li, and Michael L. Littman: PAC reinforcement learning bounds for RTDP
and Rand-RTDP. In AAAI-06 Workshop on Learning for Search, Boston, MA,
July, 2006. Also appeared as an AAAI technical report.
- Thomas J. Walsh, Lihong
Li, and Michael L. Littman: Transferring state abstractions
between MDPs. In ICML-06 Workshop on Structural Knowledge
Transfer in Machine Learning, Pittsburgh,
PA, June, 2006. [Tom’s slides]
- Lihong Li and Jin Zhu: Algorithm description. In NIPS-05
Workshop on Reinforcement Learning Benchmarks and Bake-offs II,
Whistler, BC, Canada,
December, 2005.
- Lihong Li, Vadim Bulitko,
and Russell Greiner: Focus of attention
in sequential decision making. In AAAI-04 Workshop on Learning
and Planning in Markov Processes --- Advances and Challenges, CA,
July, 2004. Also published as an AAAI technical report.
- Lihong Li, Vadim Bulitko,
Russell Greiner, and Ilya Levner: Automated
learning distance metrics for the kNN. In NIPS-03 Workshop
on Approximate Nearest Neighbors Methods for Learning and Vision,
Whistler, BC, Canada,
December 2003.
- Vadim Bulitko, Greg Lee, Ilya
Levner, and Lihong Li:
Open challenges in learning vision systems.
In NIPS-03 Workshop on the Open Challenges in Cognitive Vision,
Whistler, BC, Canada,
December 2003.
- Vadim Bulitko, Lihong Li, Greg Lee, Russell
Greiner, and Ilya Levner: Adaptive
image interpretation: A spectrum of machine learning problems. In ICML-03
Workshop on the Continuum from Labeled to Unlabeled Data in Machine
Learning and Data Mining, Washington
D.C., US,
August, 2003.
- Lihong
Li: Focus of
attention in reinforcement learning. Master's thesis, Department of Computing Science, University of Alberta, Edmonton,
Alberta, Canada, 2004.
- Lihong
Li: Design and Implementation of an Agent
Communication Module based on KQML. Bachelor Degree Thesis,
Department of Computer Science
and Technology, Tsinghua
University, Beijing,
China,
2002.
- Technical/Unpublished
Reports
- John Langford, Lihong
Li, and Tong Zhang: Sparse online learning via truncated
gradient. In arXiv:0806.4686,
June, 2008.
- Lihong Li and Michael L. Littman: Prioritized sweeping converges to the
optimal value function.
Technical report DCS-TR-631, Department of Computer Science,
Rutgers University, May 2008.
- Alexander L. Strehl, Lihong
Li, and Michael L. Littman: PAC reinforcement learning bounds for RTDP
and Rand-RTDP. In AAAI technical report WS-06-11, pages
50-56, July 2006. Also presented at an AAAI-06 workshop.
- Lihong Li, Michael L.
Littman, and Alexander L. Strehl: A model-free
reinforcement learning algorithm with low computational and sample
complexity. Technical report DCS-TR-591, Department of Computer Science, Rutgers University,
December, 2005.
- Lihong Li and Michael L.
Littman: Lazy approximation: A new
approach for solving continuous finite-horizon MDPs. Technical report DCS-TR-577, Department of Computer
Science, Rutgers
University, May
2005.
- Lihong Li, Vadim Bulitko,
and Russell Greiner: Focus of attention
in sequential decision making. AAAI technical report WS-04-08,
pages 43-48, July 2004. Also presented at an AAAI-04 workshop.