# Q-Learning
+ Successfully implemented Q-Learning for a simple robot navigation problem of a robot moving on a 5 x 5 grid with
one arbitrary goal (reward of +10) and three arbitrary obstacles (reward of -10)
+ Algorithm used is:
1. Initialize q-table with all zeroes
2. Loop
+ Pick a random start state, action(s,a) transition
+ make transistion from(s,a) -> s’
+ receive reward
+ Update Q(s,a) <- R(s,a) + gamma*(max(Q(s’,for all actions possible from next state)))
+ Run from main.m
Kinonoyomeo
- 粉丝: 91
- 资源: 1万+
最新资源
- 数据库课程设计-仓库管理系统中文最新版本
- 技术资料分享TF卡资料很好的技术资料.zip
- 技术资料分享TF介绍很好的技术资料.zip
- 10、安徽省大学生学科和技能竞赛A、B类项目列表(2019年版).xlsx
- 9、教育主管部门公布学科竞赛(2015版)-方喻飞
- C语言-leetcode题解之83-remove-duplicates-from-sorted-list.c
- C语言-leetcode题解之79-word-search.c
- C语言-leetcode题解之78-subsets.c
- C语言-leetcode题解之75-sort-colors.c
- C语言-leetcode题解之74-search-a-2d-matrix.c
资源上传下载、课程学习等过程中有任何疑问或建议,欢迎提出宝贵意见哦~我们会及时处理!
点击此处反馈