Date of Award
Winter 1996
Project Type
Dissertation
Program or Major
Engineering
Degree Name
Doctor of Philosophy
First Advisor
W Thomas Miller, III
Abstract
This thesis presents a study of biped dynamic walking using reinforcement learning. A hardware biped robot was built. It uses low gear ratio DC motors in order to provide free leg movements. The Self Scaling Reinforcement learning algorithm was developed in order to deal with the problem of reinforcement learning in continuous action domains. A new learning architecture was designed to solve complex control problems. It uses different modules that consist of simple controllers and small neural networks. The architecture allows for easy incorporation of modules that represent new knowledge, or new requirements for the desired task. Control experiments were carried out using a simulator and the physical biped. The biped learned dynamic walking on flat surfaces without any previous knowledge about its dynamic model.
Recommended Citation
Benbrahim, Hamid, "Biped dynamic walking using reinforcement learning" (1996). Doctoral Dissertations. 1926.
https://scholars.unh.edu/dissertation/1926