Page 185 - 《软件学报》2021年第11期

P. 185

陈子璇等:一种基于广义异步值迭代的规划网络模型 3511

[21] van Hasselt H. Double Q-learning. In: Proc. of the Advances in Neural Information Processing Systems (NIPS). 2016. 2613−2621.
[22] van Hasselt H, Guez A, Silver D. Deep reinforcement learning with double Q-learning. In: Proc. of the AAAI Conf. on Artificial
Intelligence (AAAI). 2016. 2094−2100.
[23] Defferrard M, Bresson X, Vandergheynst P. Convolutional neural networks on graphs with fast localized spectral filtering. In: Proc.
of the Advances in Neural Information Processing Systems (NIPS). 2016. 3837−3845.
[24] Niepert M, Ahmed M, Kutzkov K. Learning convolutional neural networks for graphs. In: Proc. of the Int’l Conf. on Machine
Learning (ICML). 2016. 2014−2023.
[25] Franceschi L, Niepert M, Pontil M, He X. Learning discrete structures for graph neural networks. In: Proc. of the Int’l Conf. on
Machine Learning (ICML). 2019. 1972−1982.

附中文参考文献:
[1] 孙志军,薛磊,许阳明,王正,深度学习研究综述.计算机应用研究,2012,29(8):2806−2810.
[2] 刘全,翟建伟,章宗长,钟珊,周倩,章鹏,徐进.深度强化学习综述.计算机学报,2018,41(1):1−27.

陈子璇(1996－),女,博士生,CCF 学生会潘致远(1993－),男,硕士,主要研究领域为
员,主要研究领域为强化学习,智能规划. 强化学习.

章宗长(1985－),男,博士,副教授,CCF 高张琳婧(1995－),女,硕士,主要研究领域为
级会员,主要研究领域为强化学习,智能规强化学习.
划,多智能体系统.

180 181 182 183 184 185 186 187 188 189 190