OrDB
Paper
  Data
Tsitsiklis JN, Van_Roy B (2002)
On average versus discounted reward temporal-difference learning
Mach Learn 49:179-191
Other categories referring to Tsitsiklis JN, Van_Roy B (2002)
Paper.References   (1)
Revisions: 1
Last Time: 7/21/2006 12:30:28 PM
Reviewer: System Administrator
Owner: System Administrator