An improved deep reinforcement learning approach: A case study for optimisation of berth and yard scheduling for bulk cargo terminal

T. Ai,L. Huang, R. J. Song, H. F. Huang, F. Jiao, W. G. Ma

ADVANCES IN PRODUCTION ENGINEERING & MANAGEMENT(2023)

引用 0|浏览0
暂无评分
摘要
The cornerstone of port production operations is ship handling, necessitating judicious allocation of diverse production resources to enhance the efficiency of loading and unloading operations. This paper introduces an optimisation method based on deep reinforcement learning to schedule berths and yards at a bulk cargo terminal. A Markov Decision Process model is formulated by analysing scheduling processes and unloading operations in bulk port imports business. The study presents an enhanced reinforcement learning algorithm called PS-D3QN (Prioritised Experience Replay and Softmax strategy-based Dueling Double Deep Q-Network), amalgamating the strengths of the Double DQN and Dueling DQN algorithms. The proposed solution is evaluated using actual port data and benchmarked against the other two algorithms mentioned in this paper. The numerical experiments and comparative analysis substantiate that the PS-D3QN algorithm significantly enhances the efficiency of berth and yard scheduling in bulk terminals, reduces the cost of port operation, and eliminates errors associated with manual scheduling. The algorithm presented in this paper can be tailored to address scheduling issues in the fields of production and manufacturing with suitable adjustments, including problems like the job shop scheduling problem and its extensions.
更多
查看译文
关键词
Bulk cargo terminal,Scheduling,Optimisation,Markov decision process (MDP) model,Deep reinforcement learning,Prioritised experience replay and softmax strategy-based dueling,Double deep Q-network (PS-D3QN)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要