Q-Learning based framework for solving the stochastic E-waste collection problem
Electrical and Electronic Equipment (EEE) has evolved into a gateway for accessing technological innovations. However, EEE imposes substantial pressure on the environment due to the shortened life cycles. E-waste encompasses discarded EEE and its components which are no longer in use. This study foc...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9753 https://ink.library.smu.edu.sg/context/sis_research/article/10753/viewcontent/978_3_031_57712_3_4.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10753 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-107532024-12-16T03:19:22Z Q-Learning based framework for solving the stochastic E-waste collection problem NGUYEN, Dang Viet Anh GUNAWAN, Aldy MISIR, Mustafa VANSTEENWEGEN, Pieter Electrical and Electronic Equipment (EEE) has evolved into a gateway for accessing technological innovations. However, EEE imposes substantial pressure on the environment due to the shortened life cycles. E-waste encompasses discarded EEE and its components which are no longer in use. This study focuses on the e-waste collection problem and models it as a Vehicle Routing Problem with a heterogeneous fleet and a multi-period planning problem with time windows as well as stochastic travel times. Two different Q-learning-based methods are designed to enhance the search procedure for finding solutions. The first method involves utilizing the state-action value to determine the order of multiple improvement operators within the GRASP framework. The second one involves a hyperheuristic that extracts a stochastic policy to select heuristic operators during the search. Computational experiments demonstrate that both methods perform competitively with state-of-the-art methods in newly-generated small-sized instances, while the performance gap widens as the size of the problem instances increases. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9753 https://ink.library.smu.edu.sg/context/sis_research/article/10753/viewcontent/978_3_031_57712_3_4.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University E-waste collection Vehicle routing problem GRASP framework Q-learning Artificial Intelligence and Robotics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
E-waste collection Vehicle routing problem GRASP framework Q-learning Artificial Intelligence and Robotics |
spellingShingle |
E-waste collection Vehicle routing problem GRASP framework Q-learning Artificial Intelligence and Robotics NGUYEN, Dang Viet Anh GUNAWAN, Aldy MISIR, Mustafa VANSTEENWEGEN, Pieter Q-Learning based framework for solving the stochastic E-waste collection problem |
description |
Electrical and Electronic Equipment (EEE) has evolved into a gateway for accessing technological innovations. However, EEE imposes substantial pressure on the environment due to the shortened life cycles. E-waste encompasses discarded EEE and its components which are no longer in use. This study focuses on the e-waste collection problem and models it as a Vehicle Routing Problem with a heterogeneous fleet and a multi-period planning problem with time windows as well as stochastic travel times. Two different Q-learning-based methods are designed to enhance the search procedure for finding solutions. The first method involves utilizing the state-action value to determine the order of multiple improvement operators within the GRASP framework. The second one involves a hyperheuristic that extracts a stochastic policy to select heuristic operators during the search. Computational experiments demonstrate that both methods perform competitively with state-of-the-art methods in newly-generated small-sized instances, while the performance gap widens as the size of the problem instances increases. |
format |
text |
author |
NGUYEN, Dang Viet Anh GUNAWAN, Aldy MISIR, Mustafa VANSTEENWEGEN, Pieter |
author_facet |
NGUYEN, Dang Viet Anh GUNAWAN, Aldy MISIR, Mustafa VANSTEENWEGEN, Pieter |
author_sort |
NGUYEN, Dang Viet Anh |
title |
Q-Learning based framework for solving the stochastic E-waste collection problem |
title_short |
Q-Learning based framework for solving the stochastic E-waste collection problem |
title_full |
Q-Learning based framework for solving the stochastic E-waste collection problem |
title_fullStr |
Q-Learning based framework for solving the stochastic E-waste collection problem |
title_full_unstemmed |
Q-Learning based framework for solving the stochastic E-waste collection problem |
title_sort |
q-learning based framework for solving the stochastic e-waste collection problem |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9753 https://ink.library.smu.edu.sg/context/sis_research/article/10753/viewcontent/978_3_031_57712_3_4.pdf |
_version_ |
1819113128420442112 |