• Sorted by Date • Classified by Publication Type • Sorted by First Author Last Name • Classified by Research Category •
Matthew E. Taylor and Peter
Stone. Behavior Transfer for Value-Function-Based Reinforcement Learning. In Proceedings of the Fourth International
Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), pp. 53–59, July 2005. 25% acceptance rate.
AAMAS-2005.
Superseded by the journal article Transfer
Learning via Inter-Task Mappings for Temporal Difference Learning.
Temporal difference (TD) learning methods have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been shown to exhibit some desirable properties in theory, but have often been found very slow in practice. A key feature of TD methods is that they represent policies in terms of value functions. In this paper we introduce behavior transfer, a novel approach to speeding up TD learning by transferring the learned value function from one task to a second related task. We present experimental results showing that autonomous learners are able to learn one multiagent task and then use behavior transfer to markedly reduce the total training time for a more complex task.
@InProceedings{AAMAS05-taylor, author="Matthew E.\ Taylor and Peter Stone", title="Behavior Transfer for Value-Function-Based Reinforcement Learning", booktitle="Proceedings of the Fourth International Joint Conference on Autonomous Agents and Multiagent Systems ({AAMAS})", month="July",year="2005", pages="53--59", abstract={ Temporal difference (TD) learning methods have become popular reinforcement learning techniques in recent years. TD methods have had some experimental successes and have been shown to exhibit some desirable properties in theory, but have often been found very slow in practice. A key feature of TD methods is that they represent policies in terms of value functions. In this paper we introduce \emph{behavior transfer}, a novel approach to speeding up TD learning by transferring the learned value function from one task to a second related task. We present experimental results showing that autonomous learners are able to learn one multiagent task and then use behavior transfer to markedly reduce the total training time for a more complex task. }, note = {25% acceptance rate.}, wwwnote={<a href="http://www.aamas2005.nl/">AAMAS-2005</a>.<br> Superseded by the journal article <a href="http://cs.lafayette.edu/~taylorm/Publications/b2hd-JMLR07-taylor.html">Transfer Learning via Inter-Task Mappings for Temporal Difference Learning</a>.}, }
Generated by bib2html.pl (written by Patrick Riley ) on Thu Jul 24, 2014 16:09:10