项目作者: Rakshith6

项目描述 :
(Prioritized experience replay, random uniform replay) with tabular-Q for blind cliffwalk problem introduced as a motivating example in the publication Schaul et al., 2015
高级语言: Python
项目地址: git://github.com/Rakshith6/PrioritizedExperienceReplay_CliffWalk_Schaul2015.git