Return to search

Average cost temporal-difference learning

John N. Tsitsiklis and Benjamin Van Roy. / Includes bibliographical references (p. 23). / Supported by NSF. DMI-9625489 Supported by AFOSR grant. F49620-95-1-0219

Identiferoai:union.ndltd.org:MIT/oai:dspace.mit.edu:1721.1/3455
Date January 1997
ContributorsTsitsiklis, John N., Van Roy, Benjamin., Massachusetts Institute of Technology. Laboratory for Information and Decision Systems.
PublisherMassachusetts Institute of Technology, Laboratory for Information and Decision Systems
Source SetsM.I.T. Theses and Dissertation
LanguageEnglish
Detected LanguageEnglish
Format23 p., 1689354 bytes, application/pdf
RelationLIDS-P ; 2390

Page generated in 0.002 seconds