Training multilayer neural network based on optimal control theory for limited computational resources

Backpropagation (BP)-based gradient descent is the general approach to train a neural network with a multilayer perceptron. However, BP is inherently slow in learning, and it sometimes traps at local minima, mainly due to a constant learning rate. This pre-fixed learning rate regularly leads the BP...

Full description

Saved in:

Bibliographic Details
Main Authors:	Alkawaz, Ali Najem, Kanesan, Jeevan, Khairuddin, Anis Salwa Mohd, Badruddin, Irfan Anjum, Kamangar, Sarfaraz, Hussien, Mohamed, Baig, Maughal Ahmed Ali, Ahammad, N. Ameer
Format:	Article
Published:	MDPI 2023
Subjects:	T Technology (General) TK Electrical engineering. Electronics Nuclear engineering
Online Access:	http://eprints.um.edu.my/38665/
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Malaya

id	my.um.eprints.38665
record_format	eprints
spelling	my.um.eprints.386652024-11-12T06:47:10Z http://eprints.um.edu.my/38665/ Training multilayer neural network based on optimal control theory for limited computational resources Alkawaz, Ali Najem Kanesan, Jeevan Khairuddin, Anis Salwa Mohd Badruddin, Irfan Anjum Kamangar, Sarfaraz Hussien, Mohamed Baig, Maughal Ahmed Ali Ahammad, N. Ameer T Technology (General) TK Electrical engineering. Electronics Nuclear engineering Backpropagation (BP)-based gradient descent is the general approach to train a neural network with a multilayer perceptron. However, BP is inherently slow in learning, and it sometimes traps at local minima, mainly due to a constant learning rate. This pre-fixed learning rate regularly leads the BP network towards an unsuccessful stochastic steepest descent. Therefore, to overcome the limitation of BP, this work addresses an improved method of training the neural network based on optimal control (OC) theory. State equations in optimal control represent the BP neural network's weights and biases. Meanwhile, the learning rate is treated as the input control that adapts during the neural training process. The effectiveness of the proposed algorithm is evaluated on several logic gates models such as XOR, AND, and OR, as well as the full adder model. Simulation results demonstrate that the proposed algorithm outperforms the conventional method in terms of improved accuracy in output with a shorter time in training. The training via OC also reduces the local minima trap. The proposed algorithm is almost 40% faster than the steepest descent method, with a marginally improved accuracy of approximately 60%. Consequently, the proposed algorithm is suitable to be applied on devices with limited computation resources, since the proposed algorithm is less complex, thus lowering the circuit's power consumption. MDPI 2023-02 Article PeerReviewed Alkawaz, Ali Najem and Kanesan, Jeevan and Khairuddin, Anis Salwa Mohd and Badruddin, Irfan Anjum and Kamangar, Sarfaraz and Hussien, Mohamed and Baig, Maughal Ahmed Ali and Ahammad, N. Ameer (2023) Training multilayer neural network based on optimal control theory for limited computational resources. Mathematics, 11 (3). ISSN 2227-7390, DOI https://doi.org/10.3390/math11030778 <https://doi.org/10.3390/math11030778>. 10.3390/math11030778
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	T Technology (General) TK Electrical engineering. Electronics Nuclear engineering
spellingShingle	T Technology (General) TK Electrical engineering. Electronics Nuclear engineering Alkawaz, Ali Najem Kanesan, Jeevan Khairuddin, Anis Salwa Mohd Badruddin, Irfan Anjum Kamangar, Sarfaraz Hussien, Mohamed Baig, Maughal Ahmed Ali Ahammad, N. Ameer Training multilayer neural network based on optimal control theory for limited computational resources
description	Backpropagation (BP)-based gradient descent is the general approach to train a neural network with a multilayer perceptron. However, BP is inherently slow in learning, and it sometimes traps at local minima, mainly due to a constant learning rate. This pre-fixed learning rate regularly leads the BP network towards an unsuccessful stochastic steepest descent. Therefore, to overcome the limitation of BP, this work addresses an improved method of training the neural network based on optimal control (OC) theory. State equations in optimal control represent the BP neural network's weights and biases. Meanwhile, the learning rate is treated as the input control that adapts during the neural training process. The effectiveness of the proposed algorithm is evaluated on several logic gates models such as XOR, AND, and OR, as well as the full adder model. Simulation results demonstrate that the proposed algorithm outperforms the conventional method in terms of improved accuracy in output with a shorter time in training. The training via OC also reduces the local minima trap. The proposed algorithm is almost 40% faster than the steepest descent method, with a marginally improved accuracy of approximately 60%. Consequently, the proposed algorithm is suitable to be applied on devices with limited computation resources, since the proposed algorithm is less complex, thus lowering the circuit's power consumption.
format	Article
author	Alkawaz, Ali Najem Kanesan, Jeevan Khairuddin, Anis Salwa Mohd Badruddin, Irfan Anjum Kamangar, Sarfaraz Hussien, Mohamed Baig, Maughal Ahmed Ali Ahammad, N. Ameer
author_facet	Alkawaz, Ali Najem Kanesan, Jeevan Khairuddin, Anis Salwa Mohd Badruddin, Irfan Anjum Kamangar, Sarfaraz Hussien, Mohamed Baig, Maughal Ahmed Ali Ahammad, N. Ameer
author_sort	Alkawaz, Ali Najem
title	Training multilayer neural network based on optimal control theory for limited computational resources
title_short	Training multilayer neural network based on optimal control theory for limited computational resources
title_full	Training multilayer neural network based on optimal control theory for limited computational resources
title_fullStr	Training multilayer neural network based on optimal control theory for limited computational resources
title_full_unstemmed	Training multilayer neural network based on optimal control theory for limited computational resources
title_sort	training multilayer neural network based on optimal control theory for limited computational resources
publisher	MDPI
publishDate	2023
url	http://eprints.um.edu.my/38665/
_version_	1816130404209393664

Training multilayer neural network based on optimal control theory for limited computational resources

Similar Items