Application of reinforcement learning to production system

The primary goal for this research is to obtain the optimal or near-optimal joint production and maintenance scheduling policy by means of reinforcement learning. In this research, we adopted reinforcement algorithm to control the feeding interval and the maintenance state of upstream station in pro...

Full description

Saved in:
Bibliographic Details
Main Author: Jiang, Zhijin
Other Authors: Rajesh Piplani
Format: Theses and Dissertations
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/75935
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-75935
record_format dspace
spelling sg-ntu-dr.10356-759352023-03-11T17:18:23Z Application of reinforcement learning to production system Jiang, Zhijin Rajesh Piplani School of Mechanical and Aerospace Engineering DRNTU::Engineering::Systems engineering The primary goal for this research is to obtain the optimal or near-optimal joint production and maintenance scheduling policy by means of reinforcement learning. In this research, we adopted reinforcement algorithm to control the feeding interval and the maintenance state of upstream station in production system. With the help of this algorithm, the work-in-process(WIP) in the production system can be limited to a reasonable level and machines are preventively maintained to be functional. By balancing the reward and cost from WIP, maintenance and the idle loss of bottleneck machine, the reinforcement learning algorithm is able to find the acceptable policy for adjusting the feeding rate and scheduling the preventive maintenance for upstream machine. However reinforcement learning involves in a lot of parameters and in practice parameters may range widely from cases to cases. There are totally five experiments performed in this research, the first and the second is the validation experiments and the third and forth is to explain the property of the algorithm. the fifth experiment describes how fast the algorithm can learn to achieve the target state of upstream station. The developed model consists of reinforcement learning based, decision-making agents with simulation model of the integrated production system. The smart agent determine the optimal or near-optimal action for each system state by interacting with their environment. Master of Science (Supply Chain & Logistics) 2018-08-07T05:37:11Z 2018-08-07T05:37:11Z 2018 Thesis http://hdl.handle.net/10356/75935 en 74 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Systems engineering
spellingShingle DRNTU::Engineering::Systems engineering
Jiang, Zhijin
Application of reinforcement learning to production system
description The primary goal for this research is to obtain the optimal or near-optimal joint production and maintenance scheduling policy by means of reinforcement learning. In this research, we adopted reinforcement algorithm to control the feeding interval and the maintenance state of upstream station in production system. With the help of this algorithm, the work-in-process(WIP) in the production system can be limited to a reasonable level and machines are preventively maintained to be functional. By balancing the reward and cost from WIP, maintenance and the idle loss of bottleneck machine, the reinforcement learning algorithm is able to find the acceptable policy for adjusting the feeding rate and scheduling the preventive maintenance for upstream machine. However reinforcement learning involves in a lot of parameters and in practice parameters may range widely from cases to cases. There are totally five experiments performed in this research, the first and the second is the validation experiments and the third and forth is to explain the property of the algorithm. the fifth experiment describes how fast the algorithm can learn to achieve the target state of upstream station. The developed model consists of reinforcement learning based, decision-making agents with simulation model of the integrated production system. The smart agent determine the optimal or near-optimal action for each system state by interacting with their environment.
author2 Rajesh Piplani
author_facet Rajesh Piplani
Jiang, Zhijin
format Theses and Dissertations
author Jiang, Zhijin
author_sort Jiang, Zhijin
title Application of reinforcement learning to production system
title_short Application of reinforcement learning to production system
title_full Application of reinforcement learning to production system
title_fullStr Application of reinforcement learning to production system
title_full_unstemmed Application of reinforcement learning to production system
title_sort application of reinforcement learning to production system
publishDate 2018
url http://hdl.handle.net/10356/75935
_version_ 1761781426593726464