Fractional ordering of activation functions for neural networks: A case study on Texas wind turbine
Activation functions play an important role in deep learning models by introducing non-linearity to the output of a neuron, enabling the network to learn complex patterns and non-linear relationships in data and make predictions on more complex tasks. Deep learning models� most commonly used activ...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Published: |
Elsevier Ltd
2024
|
Online Access: | http://scholars.utp.edu.my/id/eprint/37730/ https://www.scopus.com/inward/record.uri?eid=2-s2.0-85174329387&doi=10.1016%2fj.engappai.2023.107308&partnerID=40&md5=7374f922ed8f76dff4dddff12c075d70 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Petronas |
Summary: | Activation functions play an important role in deep learning models by introducing non-linearity to the output of a neuron, enabling the network to learn complex patterns and non-linear relationships in data and make predictions on more complex tasks. Deep learning models� most commonly used activation functions are Purelin, Sigmoid, Tansig, Rectified Linear Unit (ReLU), and Exponential Linear Unit (ELU), which exhibit limitations such as non-differentiability, vanishing gradients, and neuron inactivity with negative values. These functions are typically defined over a finite range, and their outputs are integers or real numbers. Using fractional calculus in designing activation functions for neural networks has shown promise in improving the performance of deep learning models in specific applications. These activation functions can capture more complex non-linearities than traditional integer-order activation functions, improving performance on tasks such as image classification and time series prediction. This paper focuses on deriving and testing linear and non-linear fractional-order forms of activation functions and their variants. The linear activation function includes Purelin. In contrast, the non-linear activation functions are Binary Step, Sigmoid, Tansig, ReLU, ELU, Gaussian Error Linear Unit (GELU), Hexpo, and their variants. Besides, the standard formula has been implemented and used in developing the fractional-order linear activation function. Furthermore, various expansion series, such as Euler and Maclaurin, have been used to design non-linear fractional-order activation functions and their variants. The single- and multi-layer fractional-order neural network models have been developed using the designed fractional-order activation functions. The simulation study uses developed fractional-order neural network models for predicting the Texas wind turbine systems� generated power. The performance of single and multi-layer fractional-order neural network models has been evaluated by changing the activation functions in the hidden layer while keeping the Purelin function constant at the output layer. Experiments on neural network models demonstrate that the designed fractional-order activation functions outperform traditional functions like Sigmoid, Tansig, ReLU, ELU, and their variants, effectively addressing limitations. © 2023 Elsevier Ltd |
---|