Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable w...

Full description

Saved in:

Bibliographic Details
Main Authors:	YU, Vincent F., SALSABILA, Nabila Y., LIN, Shih-W, GUNAWAN, Aldy
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Team orienteering problem with time windows Set orienteering problem Simulated annealing Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/8265 https://ink.library.smu.edu.sg/context/sis_research/article/9268/viewcontent/SimulatedAnnealing_STOPTW_av.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-9268
record_format	dspace
spelling	sg-smu-ink.sis_research-92682023-11-10T08:52:00Z Simulated annealing with reinforcement learning for the set team orienteering problem with time windows YU, Vincent F. SALSABILA, Nabila Y. LIN, Shih-W GUNAWAN, Aldy This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer's time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SARL) algorithm is developed. The proposed SARL incorporates the core concepts of reinforcement learning, utilizing the ε-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SARL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SARL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SARL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SARL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights. 2024-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8265 info:doi/10.1016/j.eswa.2023.121996 https://ink.library.smu.edu.sg/context/sis_research/article/9268/viewcontent/SimulatedAnnealing_STOPTW_av.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Team orienteering problem with time windows Set orienteering problem Simulated annealing Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Team orienteering problem with time windows Set orienteering problem Simulated annealing Databases and Information Systems
spellingShingle	Team orienteering problem with time windows Set orienteering problem Simulated annealing Databases and Information Systems YU, Vincent F. SALSABILA, Nabila Y. LIN, Shih-W GUNAWAN, Aldy Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
description	This research investigates the Set Team Orienteering Problem with Time Windows (STOPTW), a new variant of the well-known Team Orienteering Problem with Time Windows and Set Orienteering Problem. In the STOPTW, customers are grouped into clusters. Each cluster is associated with a profit attainable when a customer in the cluster is visited within the customer's time window. A Mixed Integer Linear Programming model is formulated for STOPTW to maximizing total profit while adhering to time window constraints. Since STOPTW is an NP-hard problem, a Simulated Annealing with Reinforcement Learning (SARL) algorithm is developed. The proposed SARL incorporates the core concepts of reinforcement learning, utilizing the ε-greedy algorithm to learn the fitness values resulting from neighborhood moves. Numerical experiments are conducted to assess the performance of SARL, comparing the results with those obtained by CPLEX and Simulated Annealing (SA). For small instances, both SARL and SA algorithms outperform CPLEX by obtaining eight optimal solutions and 12 better solutions. For large instances, both algorithms obtain better solutions to 28 out of 29 instances within shorter computational times compared to CPLEX. Overall, SARL outperforms SA by resulting in lower gap percentages within the same computational times. Specifically, SARL outperforms SA in solving 13 large STOPTW benchmark instances. Finally, a sensitivity analysis is conducted to derive managerial insights.
format	text
author	YU, Vincent F. SALSABILA, Nabila Y. LIN, Shih-W GUNAWAN, Aldy
author_facet	YU, Vincent F. SALSABILA, Nabila Y. LIN, Shih-W GUNAWAN, Aldy
author_sort	YU, Vincent F.
title	Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
title_short	Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
title_full	Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
title_fullStr	Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
title_full_unstemmed	Simulated annealing with reinforcement learning for the set team orienteering problem with time windows
title_sort	simulated annealing with reinforcement learning for the set team orienteering problem with time windows
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/8265 https://ink.library.smu.edu.sg/context/sis_research/article/9268/viewcontent/SimulatedAnnealing_STOPTW_av.pdf
_version_	1783955660987170816

Simulated annealing with reinforcement learning for the set team orienteering problem with time windows

Similar Items