Issue |
Wuhan Univ. J. Nat. Sci.
Volume 27, Number 6, December 2022
|
|
---|---|---|
Page(s) | 531 - 538 | |
DOI | https://doi.org/10.1051/wujns/2022276531 | |
Published online | 10 January 2023 |
CLC number: TP 391
Manufacturing Resource Scheduling Based on Deep Q-Network
School of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
† To whom correspondence should be addressed. E-mail: tjcad-zhaoxd@tongji.edu.cn
Received:
29
September
2022
To optimize machine allocation and task dispatching in smart manufacturing factories, this paper proposes a manufacturing resource scheduling framework based on reinforcement learning (RL). The framework formulates the entire scheduling process as a multi-stage sequential decision problem, and further obtains the scheduling order by the combination of deep convolutional neural network (CNN) and improved deep Q-network (DQN). Specifically, with respect to the representation of the Markov decision process (MDP), the feature matrix is considered as the state space and a set of heuristic dispatching rules are denoted as the action space. In addition, the deep CNN is employed to approximate the state-action values, and the double dueling deep Q-network with prioritized experience replay and noisy network (D3QPN2) is adopted to determine the appropriate action according to the current state. In the experiments, compared with the traditional heuristic method, the proposed method is able to learn high-quality scheduling policy and achieve shorter makespan on the standard public datasets.
Key words: smart manufacturing / job shop scheduling / convolutional neural network / deep Q-network
Biography: ZHANG Yufei, female, Master candidate, research direction: resource optimization allocation, reinforcement learning. E-mail: zhang_yufei@tongji.edu.cn
Supported by the National Key Research and Development Plan (2019YFB1706401)
© Wuhan University 2022
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.