Created by W.Langdon from gp-bibliography.bib Revision:1.8010
Graduate Sch. of Frontier Sci., Univ. of Tokyo, Chiba, Japan.
Reinforcement learning (RL) is outside the GP loop. Q-table 168 or 238 states, RL ten hours or 6 hours. 6 way IF. GP run ten minutes. Target and box are colour coded. Precise simulation of both robots is not possible. p 330 GP {"}learning some general knowledge...not limited to a particular robot{"}.",
Genetic Programming entries for Shotaro Kamio Hitoshi Iba