TDγ: Re-evaluating Complex Backups in Temporal Difference Learning
by G.D. Konidaris, S. Niekum and P.S. Thomas
Reference:
TDγ: Re-evaluating Complex Backups in Temporal Difference Learning (G.D. Konidaris, S. Niekum and P.S. Thomas), In Advances in Neural Information Processing Systems 24, 2011.
Bibtex Entry:
@inproceedings{Konidaris11c,
 author = {G.D. Konidaris and S. Niekum and P.S. Thomas},
 title = {TD$_{\gamma}$: Re-evaluating Complex Backups in Temporal Difference Learning},
 booktitle = {Advances in Neural Information Processing Systems 24},
 pages = {2402--2410},
 month = {December},
 year = 2011,
 keywords = {Reinforcement Learning},
 url = {http://lis.csail.mit.edu/pubs/konidaris-nips11.pdf}
}