Autonomous strategy planning under uncertainty
| dc.contributor | Graduate Program in Computer Engineering. | |
| dc.contributor.advisor | Akın, H. Levent. | |
| dc.contributor.author | Sardağ, Alp. | |
| dc.date.accessioned | 2023-03-16T10:05:45Z | |
| dc.date.available | 2023-03-16T10:05:45Z | |
| dc.date.issued | 2006. | |
| dc.description.abstract | A real world environment is often partially observable for agents either because of noisy sensors or incomplete perception. Autonomous strategy planning under uncer- tainty has two major challenges. The 歔rst one is autonomous segmentation of the state space for a given task, and the second, emerging complex behaviors, that deal with each state segment. This thesis proposes three new approaches, namely ARKAQ-Learning, KAFAQ-Learning and KBVI, that handle both challenges by utilizing combinations of various techniques. ARKAQ makes use of ART2-A Networks augmented with Kalman Filters and Q-Learning. KAFAQ is a 歔nite state automaton using Kalman 歔lters and Q-Learning. KBVI uses Monte Carlo methods and introduces a new technique to calculate Q-values for continuous domains. All are online algorithms with relatively low space and time complexity. The algorithms were run for some well-known Partially Observable Markov Decision Process problems, where the problem of representing the value function is more di±cult than the discrete case because inputs are continuous distributions. The algorithms could reveal the hidden states, mapping non-Markovian observations to internal belief states, and also could construct an approximate optimal policy on the internal belief state space. | |
| dc.format.extent | 30cm. | |
| dc.format.pages | xvi, 135 leaves; | |
| dc.identifier.other | CMPE 2006 S27 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14908/12472 | |
| dc.publisher | Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2006. | |
| dc.relation | Includes appendices. | |
| dc.relation | Includes appendices. | |
| dc.subject.lcsh | Reinforcement learning (Machine learning) | |
| dc.subject.lcsh | Markov processes. | |
| dc.title | Autonomous strategy planning under uncertainty |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- b1425655.000518.001.PDF
- Size:
- 862.06 KB
- Format:
- Adobe Portable Document Format