Back to Search

Hierarchical Relative Entropy Policy Search

AUTHOR Neumann Gerhard; Daniel Christian; Daniel, Christian et al.
PUBLISHER AV Akademikerverlag (01/04/2014)
PRODUCT TYPE Paperback (Paperback)

Description
Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, such hierarchical structures cannot be exploited by current policy search algorithms. We concentrate on a basic, but highly relevant hierarchy - the mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not only one solution to a problem but many. We base our algorithm on a recently proposed information theoretic policy search method, which addresses the exploitation-exploration trade-off by limiting the loss of information between policy updates.
Show More
Product Format
Product Details
ISBN-13: 9783639475999
ISBN-10: 3639475992
Binding: Paperback or Softback (Trade Paperback (Us))
Content Language: English
More Product Details
Page Count: 68
Carton Quantity: 104
Product Dimensions: 6.00 x 0.16 x 9.00 inches
Weight: 0.25 pound(s)
Country of Origin: US
Subject Information
BISAC Categories
Computers | Information Technology
Descriptions, Reviews, Etc.
publisher marketing
Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, such hierarchical structures cannot be exploited by current policy search algorithms. We concentrate on a basic, but highly relevant hierarchy - the mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not only one solution to a problem but many. We base our algorithm on a recently proposed information theoretic policy search method, which addresses the exploitation-exploration trade-off by limiting the loss of information between policy updates.
Show More
List Price $34.51
Your Price  $34.16
Paperback