OrDB
Paper
  Data
Baxter J, Bartlett PL, Weaver L (2001)
Experiments with infinite-horizon, policy-gradient estimation
15
351
381
2001
J Artif Intel Res
Other categories referring to Baxter J, Bartlett PL, Weaver L (2001)
Paper.References   (2)
Revisions: 1
Last Time: 12/24/2007 4:36:45 PM
Reviewer: System Administrator
Owner: System Administrator