Research on intelligent control algorithm of coal gangue sorting robot armbased on reinforcement learning

ZHANG Yongchao; YU Zhiwei; DING Lili

doi:10.13272/j.issn.1671-251x.2020080047

Volume 47 Issue 1

Turn off MathJax

Article Contents

Abstract

Journal of Mine Automation > 2021 > 47(1): 36-42. > DOI: 10.13272/j.issn.1671-251x.2020080047

ZHANG Yongchao, YU Zhiwei, DING Lili. Research on intelligent control algorithm of coal gangue sorting robot armbased on reinforcement learning[J]. Journal of Mine Automation, 2021, 47(1): 36-42. DOI: 10.13272/j.issn.1671-251x.2020080047

Citation:

PDF (1866 KB)

Research on intelligent control algorithm of coal gangue sorting robot armbased on reinforcement learning

College of Mechanical and Electronic Engineering, Shandong University of Science and Technology, Qingdao 266590, China

More Information

Graphical Abstract

Abstract

Abstract

The problems of the traditional gangue sorting robot arm control algorithms such as the grasping function method and the dynamic target grasping algorithm based on Ferrary method are relying on an accurate environment model and lacking adaptivity in the control process. At the same time, the problems of the traditional intelligent control algorithms such as deep deterministic policy gradient (DDPG) are excessive output actions and sparse rewards that are easily covered. In order to solve these problems, this study improves the neural network structure and reward function in the traditional DDPG algorithm, and proposes an improved DDPG algorithm based on reinforcement learning, which is suitable for handling six-degree-of-freedom gangue sorting robot arms. After the gangue enters the working space of the robot arm, the improved DDPG algorithm can make decisions according to the gangue position and robot arm state returned by the corresponding sensor, and can output a set of joint angle state control quantity to the corresponding motion controller. The algorithm can control the movement of the robot arm according to the gangue position and joint angle state control quantity, so that the robot arm moves to the nearby gangue to conduct gangue sorting. The simulation results show that compared with the traditional DDPG algorithm, the improved DDPG algorithm has the advantages of model-free versatility and adaptive learning of grasping pose in interaction with the environment. Moreover, the improved algorithm can be the first to converge to the maximum reward value encountered during exploration. The robot arm controlled by the improved DDPG algorithm has better policy generalization, smaller joint angle state control output and higher gangue sorting efficiency.
- coal preparation,
- coal gangue sorting,
- sorting robot,
- robot arm,
- joint angle state control,
- reinforcement learning,
- reward function,
- DDPG algorithm

FullText(HTML)

References (0)

[1]	FENG Zhanke, QIAN Wang, PENG Jianchuan. Slope stability evaluation based on comprehensive weight and TOPSIS[J]. Journal of Mine Automation, 2023, 49(S1): 133-137.
[2]	CHEN Xiaolin. Staged multi-index comprehensive evaluation method for fire risk of coal working face[J]. Journal of Mine Automation, 2022, 48(7): 90-95, 104. DOI: 10.13272/j.issn.1671-251x.2021120083
[3]	REN Zihui, CHEN Zepeng, WU Xinzhong, QIAN Xiaoyu, LI Ang. Research on health evaluation of mine ventilation system[J]. Journal of Mine Automation, 2021, 47(9): 70-76.. DOI: 10.13272/j.issn.1671-251x.2020120047
[4]	YU Liya, ZHAO Yongfang, ZHANG Lingyun, CHEN Guangbo. Coal and gas outburst risk evaluation based on cloud model and D -S theory[J]. Journal of Mine Automation, 2020, 46(11): 106-112. DOI: 10.13272/j.issn.1671 -251x.2020040029
[5]	WEI Yinshang, JIA Yuquan, WANG Yibo, DONG Dingwen. Research on grading control of mine ventilation system[J]. Journal of Mine Automation, 2018, 44(12): 30-33. DOI: 10.13272/j.issn.1671-251x.2018050017
[6]	GONG Dali. Application of combination weighting method in coal mine safety risk analysis[J]. Journal of Mine Automation, 2018, 44(10): 94-99. DOI: 10.13272/j.issn.1671-251x.17348
[7]	HE Yaoyi. Discussion on evaluation index system and architecture of smart mine[J]. Journal of Mine Automation, 2017, 43(9): 16-20. DOI: 10.13272/j.issn.1671-251x.2017.09.003
[8]	LIU Yejiao, TIAN Zhichao, LIU Hong, REN Yuhui. Research of disaster resistance ability evaluation of mine ventilation system[J]. Journal of Mine Automation, 2015, 41(4): 44-47. DOI: 10.13272/j.issn.1671-251x.2015.04.012
[9]	CHENG Lei, ZHENG Xiaopeng, WANG Dongdong. Determination of evaluation indexes of ventilation effect and their membership functions of working face with high volume fraction gas[J]. Journal of Mine Automation, 2014, 40(8): 10-14. DOI: 10.13272/j.issn.1671-251x.2014.08.003
[10]	YANG Ying-di, ZHANG Guo-shu, QIN Ru-xiang. Development of General Evaluation Software of Indicator System Method and Its Applicatio[J]. Journal of Mine Automation, 2009, 35(3): 36-39.

Cited By

Get Citation

PDF

XML

Article Metrics

Article views (140) PDF downloads (20)

Research on intelligent control algorithm of coal gangue sorting robot armbased on reinforcement learning

Abstract

Related Articles

Catalog

Article Metrics

Related

Research on intelligent control algorithm of coal gangue sorting robot armbased on reinforcement learning

Abstract

Related Articles

Catalog

Article Metrics

Related

Export File

Citation

Format

Content