Departmental Bulletin Paper: リカレントネットを用いた強化学習アルゴリズム -ネットワーク学習則BPTTとRTRLとの性能比較-
A REINFORCEMENT LEARNING ALGORITHM USING RECURRENT NEURAL NETWORK: COMPARING THE PERFORMANCE OF BPTT AND RTRL NETWORK LEARNING RULES

蔡, 詩祐  ,  SAI, Shiyu

(56), 2015-03-24, Hosei University Graduate School of Science and Engineering / Engineering (法政大学大学院理工学・工学研究科)
ISSN:2187-9923
NII Bibliographic ID (NCID): AA12677220
Description
Goto and Shibata (2010) proposed a reinforcement learning algorithm that uses a recurrent neural network trained with Back Propagation Through Time (BPTT). The algorithm autonomously acquires prediction functions for tasks that are difficult to accomplish without them. To verify the method's effectiveness, they used episodic tasks in which the starting state and the terminal state could be given explicitly. In the real world, however, there are many continuing tasks for which the starting and terminal states cannot be specified explicitly. This study evaluates the performance of the previous method on continuous tasks and presents a new method that uses Real Time Recurrent Learning (RTRL), which allows the neural network to learn in real time. The results indicate that the previous method performs well even on continuous tasks. The new method using RTRL, on the other hand, was inferior to the previous method on both continuous and episodic tasks.
Read full text

http://repo.lib.hosei.ac.jp/bitstream/10114/10539/1/13R4111.pdf
