Intelligent control of an autonomous vehicle

Bibliographic Details
Main Author: San, Linn.
Other Authors: Er Meng Joo
Format: Theses and Dissertations
Language: English
Published: 2009
Subjects:
Online Access: http://hdl.handle.net/10356/18792
Institution: Nanyang Technological University
Description
Summary: Reinforcement learning is a learning methodology in which a learner develops its knowledge through trial-and-error interaction with a dynamic environment. Depending on how the learner acts in the environment, it receives only a “reward” or a “punishment” rather than “instructive” information. Among reinforcement learning methods, Q-Learning is the most popular algorithm owing to its simplicity and well-developed theory. However, Q-Learning cannot generalize over large state and action spaces. A practical learning agent requires a compact representation to generalize its experience in a continuous domain, and much research has addressed this generalization issue. The Fuzzy Q-Learning (FQL) approach was proposed in [18] as a representation of Q-Learning for the continuous domain. The greatest achievement of FQL is that it enables the original Q-Learning to handle continuous states and actions by means of fuzzy logic, which is regarded as a systematic mathematical approach to emulating the human way of thinking. Building a fuzzy system can be decomposed into two phases, namely structure identification and parameter identification. Structure identification concerns partitioning the input space and determining the number of fuzzy rules, while parameter identification involves determining the parameters of the premises and consequents. The FQL approach is well defined only for parameter identification and does not address structure identification.
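The split between the two phases can be illustrated with a minimal Python sketch of one fuzzy Q-learning update. It is not the thesis's implementation and may differ in detail from the formulation in [18]. The one-dimensional state, the three triangular fuzzy sets, the action set, and all names and default values (CENTERS, ACTIONS, WIDTH, alpha, gamma) are assumptions chosen for illustration. The structure (the rule partition) is fixed by hand, and only the consequent q-values are learned.

```python
import random

# Hand-fixed structure (an assumption): three triangular fuzzy sets on a
# 1-D state in [0, 1], each rule choosing among three candidate actions.
CENTERS = [0.0, 0.5, 1.0]
ACTIONS = [-1.0, 0.0, 1.0]
WIDTH = 0.5


def firing_strengths(x):
    """Normalized triangular memberships of state x, one per rule."""
    x = min(1.0, max(0.0, x))  # clamp so at least one rule always fires
    raw = [max(0.0, 1.0 - abs(x - c) / WIDTH) for c in CENTERS]
    total = sum(raw)
    return [m / total for m in raw]


def select_actions(q, epsilon, rng=random):
    """Pick one action index per rule, epsilon-greedy on that rule's q-values."""
    chosen = []
    for row in q:
        if rng.random() < epsilon:
            chosen.append(rng.randrange(len(row)))
        else:
            chosen.append(row.index(max(row)))
    return chosen


def global_action(x, chosen):
    """Continuous action: firing-strength-weighted blend of the rule actions."""
    phi = firing_strengths(x)
    return sum(p * ACTIONS[a] for p, a in zip(phi, chosen))


def fql_step(q, x, chosen, reward, x_next, alpha=0.1, gamma=0.9):
    """One temporal-difference update of the consequent parameters q (in place)."""
    phi = firing_strengths(x)
    phi_next = firing_strengths(x_next)
    # Q-value of the blended action actually taken in state x.
    q_now = sum(p * q[i][a] for i, (p, a) in enumerate(zip(phi, chosen)))
    # Value of the next state: each rule contributes its best q-value.
    v_next = sum(p * max(row) for p, row in zip(phi_next, q))
    td_error = reward + gamma * v_next - q_now
    # Each rule's chosen q-value moves in proportion to how strongly it fired.
    for i, a in enumerate(chosen):
        q[i][a] += alpha * td_error * phi[i]
    return td_error
```

The number and placement of the fuzzy sets never change during learning in this sketch, which is the limitation the summary points to: a structure identification step would have to decide CENTERS and the number of rules from data.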