Reinforcement-routing information page
This page gives pointers to several papers concerning the use of
reinforcement-learning techniques to solve problems in network
routing.
Papers
Justin A. Boyan and Michael L. Littman. Packet routing in dynamically
changing networks: A reinforcement learning approach. In Jack
D. Cowan, Gerald Tesauro, and Joshua Alspector, editors, Advances
in Neural Information Processing Systems, volume 6, pages
671--678. Morgan Kaufmann, San Francisco CA, 1993. (abstract, postscript)
Michael L. Littman and Justin A. Boyan. A distributed reinforcement
learning scheme for network routing. In Joshua Alspector, Rodney
Goodman, and Timothy X. Brown, editors, Proceedings of the 1993
International Workshop on Applications of Neural Networks to
Telecommunications, pages 45--51. Lawrence Erlbaum Associates,
Hillsdale NJ, 1993. (abstract, postscript, Bellcore's
copy)
Michael Littman and Justin Boyan. A distributed reinforcement
learning scheme for network routing. Technical Report CMU-CS-93-16,
School of Computer Science, Carnegie Mellon University, Pittsburgh PA,
July, 1993. (abstract, postscript, Mercury's
copy, CMU's
copy)
John W. Bates. Packet routing and reinforcement learning: Estimating
shortest paths in dynamic graphs. Unpublished manuscript, 1995.
(postscript)
Samuel P.M. Choi and Dit-Yan Yeung. Predictive Q-routing: A
memory-based reinforcement learning approach to adaptive traffic
control. To appear in Advances in Neural Information Processing
Systems 8, D. S. Touretzky, M. C. Mozer, M. E. Hasselmo, eds., MIT
Press, 1996. In press. (abstract, compressed
postscript)
Shailesh Kumar and Risto Miikkulainen. Dual Reinforcement Q-Routing:
An On-line adaptive routing algorithm. In C. H. Dagli, M. Akay,
O. Ersoy, B. R. Fernandez and A. Smith (editors), Smart
Engineering Systems: Neural Networks, Fuzzy Logic, Data Mining, and
Evolutionary Programming: Volume 7 in Intelligent Engineering
Systems Through Artificial Neural Networks (ANNIE-97, St. Louis, MO),
231-238. New York: ASME Press, 1997. (abstract
page, compressed
postscript)
Kumar, S. 1998. Confidence Based Dual Reinforcement Q-routing.
Master's thesis, Dept. of Comp. Sci, The University of Texas at
Austin.
Devika Subramanian, Peter Druschel, and Johnny Chen. Ants and
reinforcement learning: A case study in routing in dynamic networks,
In Proceedings of IJCAI-97, 1997. (postscript)
Ann Nowe has also done some work on Q routing, which was presented at
CONALD in Pittsburgh, June 1998.
Other links
For more information, contact Michael Littman: mlittman@cs.duke.edu.
Last update: Thu Jul 18 07:35:39 EDT 1996