Publication | Open Access
Discovering Rubik's Cube Subgroups using Coevolutionary GP
Citations: 12
References: 21
Year: 2016
Venue: Unknown
Artificial Intelligence, Engineering, Machine Learning, Cube Subgroups, Multi-agent Learning, Intelligent Systems, GP Policy Search, Computational Topology, Coxeter Group, Data Science, Robot Learning, Computational Geometry, Mechanism Design, Multi-agent Planning, Sequential Decision Making, Computer Science, Graph Theory, AI Planning, Business, Direct Policy Discovery, Topological Combinatorics
This work reports on an approach to direct policy discovery (a form of reinforcement learning) using genetic programming (GP) for the 3 by 3 by 3 Rubik's Cube. Specifically, a synthesis of two approaches is proposed: 1) a previous group theoretic formulation is used to suggest a sequence of objectives for developing solutions to different stages of the overall task; and 2) a hierarchical formulation of GP policy search is utilized in which policies adapted for an earlier objective are explicitly transferred to aid the construction of policies for the next objective. The resulting hierarchical organization of policies explicitly demonstrates task decomposition and policy reuse. Algorithmically, the process makes use of a recursive call to a common approach for maintaining a diverse population of GP individuals and then learns how to reuse subsets of programs (policies) developed against the earlier objective. Other than the two objectives, we do not explicitly identify how to decompose the task or mark specific policies for reuse. Moreover, at the end of evolution we return a population solving 100% of 17,675,698 different initial Cubes for the two objectives currently in use.
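The two-stage reuse idea described above can be illustrated with a toy sketch. This is not the authors' implementation. Everything in it is an assumption made for illustration: the "puzzle" is the set of permutations of four items rather than a Cube, policies are plain lookup tables rather than GP programs, and simple truncation selection with mutation stands in for the GP search and its diversity maintenance. The only part taken from the abstract is the structure: policies are first evolved against an earlier objective, then frozen, and a second search learns which of those frozen policies to defer to in each state while pursuing the next objective.

```python
import itertools
import random

# Toy stand-in for the Cube (assumption): states are permutations of (0, 1, 2, 3).
MOVES = [
    lambda s: (s[1], s[0], s[2], s[3]),  # swap positions 0 and 1
    lambda s: (s[0], s[2], s[1], s[3]),  # swap positions 1 and 2
    lambda s: (s[0], s[1], s[3], s[2]),  # swap positions 2 and 3
]
STATES = list(itertools.permutations(range(4)))
GOAL = (0, 1, 2, 3)


def objective_1(s):
    # Earlier objective: reach the subset of states with item 0 already in place.
    return s[0] == 0


def objective_2(s):
    # Next objective: the fully ordered state.
    return s == GOAL


def run(policy, act, state, objective, max_steps=8):
    # Follow the policy until the objective holds or the step budget runs out.
    for _ in range(max_steps):
        if objective(state):
            return True
        state = act(policy[state], state)
    return objective(state)


def fitness(policy, act, objective):
    # Fraction of all start states from which the policy reaches the objective.
    return sum(run(policy, act, s, objective) for s in STATES) / len(STATES)


def random_policy(n_actions, rng):
    return {s: rng.randrange(n_actions) for s in STATES}


def mutate(policy, n_actions, rng):
    # Copy the parent and re-draw the action for three randomly chosen states.
    child = dict(policy)
    for s in rng.sample(STATES, 3):
        child[s] = rng.randrange(n_actions)
    return child


def evolve(n_actions, act, objective, rng, pop_size=20, generations=30):
    # Truncation selection plus mutation (a placeholder for the GP search).
    pop = [random_policy(n_actions, rng) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda p: fitness(p, act, objective), reverse=True)
        parents = pop[: pop_size // 2]
        pop = parents + [mutate(rng.choice(parents), n_actions, rng) for _ in parents]
    pop.sort(key=lambda p: fitness(p, act, objective), reverse=True)
    return pop


def act_primitive(a, s):
    # Level 1: an action is a primitive move.
    return MOVES[a](s)


def make_act_reuse(level1):
    # Level 2: an action is the index of a frozen level-1 policy; the move
    # actually applied is whatever that policy would do in the current state.
    def act(a, s):
        return MOVES[level1[a][s]](s)
    return act


rng = random.Random(0)
level1 = evolve(len(MOVES), act_primitive, objective_1, rng)[:5]
act_reuse = make_act_reuse(level1)
level2 = evolve(len(level1), act_reuse, objective_2, rng)
best = level2[0]

f1 = fitness(level1[0], act_primitive, objective_1)
f2 = fitness(best, act_reuse, objective_2)
print("objective 1 solved fraction:", f1)
print("objective 2 solved fraction:", f2)
```

Note that the level-2 search never touches primitive moves directly: its action set is the list of frozen level-1 policies, so any decomposition of the task emerges from which earlier policies it chooses to call in which states. In this toy version nothing guarantees that the retained level-1 policies are diverse enough to solve the second objective everywhere, which is one reason a diversity-preserving population matters in the approach the abstract describes.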