Time-inconsistent control problems, path-dependent PDEs, and neural network approximation
There are two main parts of this thesis: Time-Inconsistent Control (TIC) problems (Chapters 1 and 2) and the neural network approximation (Chapters 3 and 4). In the first part of the thesis, we are interested in constructing and characterizing the equilibria of TIC problems, while in the second part...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163165 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163165 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1631652023-02-28T23:58:52Z Time-inconsistent control problems, path-dependent PDEs, and neural network approximation Nguwi, Jiang Yu Nicolas Privault School of Physical and Mathematical Sciences NPRIVAULT@ntu.edu.sg Science::Mathematics There are two main parts of this thesis: Time-Inconsistent Control (TIC) problems (Chapters 1 and 2) and the neural network approximation (Chapters 3 and 4). In the first part of the thesis, we are interested in constructing and characterizing the equilibria of TIC problems, while in the second part of the thesis, we apply the neural network approach to solve path-dependent Partial Differential Equations (PDEs) and derive the generalization error bounds of 2-layer neural networks. More precisely, in Chapter 1 we extend the construction of equilibria for mean-variance portfolio and linear-quadratic problems available in the literature to a large class of mean-field continuous-time TIC problems. Our approach relies on a time discretization of the control problem using n-person games characterized by Backward Stochastic Differential Equations (BSDEs). The existence of equilibria is proved by applying weak convergence arguments to the n-person games solutions. Then, we present the numerical results by approximating n-person games with finite Markov chains. We also show the small-time uniqueness of equilibria under Lipschitz assumptions on the Hamiltonian function. In Chapter 2, we derive a characterization of equilibria using the Malliavin calculus. For this, we replace the classical duality analysis of adjoint BSDEs by the Malliavin integration by parts. This results in a necessary and sufficient Malliavin maximum principle, which is more explicit than classical maximum principle. We apply the results to the linear-quadratic TIC problem and the generalized Merton problem. In Chapter 3, we use the neural network function to approximate path-dependent PDE solutions, and derive the error bounds of the approximation scheme. In the literature, path-dependent PDEs are solved by estimating the conditional expectations of their probabilistic representation using the regression. However, the regression approach requires a careful selection of the functional basis, which may not be possible in many cases. Our neural network approach, on the other hand, does not require such basis selection. In our numerical examples of two-person zero-sum game, Asian and barrier option pricing, our algorithm appears to be more accurate, especially in large dimensions. In Chapter 4, we derive the generalization error bound of a 2-layer neural network trained by the stochastic gradient method. Our bounds require no boundedness conditions on the loss function or on its gradients, and can be computed prior to the training of the model. In the numerical simulations, we confirm our dimension-free bound in the case of independent test data. Doctor of Philosophy 2022-11-29T00:01:53Z 2022-11-29T00:01:53Z 2022 Thesis-Doctor of Philosophy Nguwi, J. Y. (2022). Time-inconsistent control problems, path-dependent PDEs, and neural network approximation. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/163165 https://hdl.handle.net/10356/163165 10.32657/10356/163165 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Mathematics |
spellingShingle |
Science::Mathematics Nguwi, Jiang Yu Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
description |
There are two main parts of this thesis: Time-Inconsistent Control (TIC) problems (Chapters 1 and 2) and the neural network approximation (Chapters 3 and 4). In the first part of the thesis, we are interested in constructing and characterizing the equilibria of TIC problems, while in the second part of the thesis, we apply the neural network approach to solve path-dependent Partial Differential Equations (PDEs) and derive the generalization error bounds of 2-layer neural networks.
More precisely, in Chapter 1 we extend the construction of equilibria for mean-variance portfolio and linear-quadratic problems available in the literature to a large class of mean-field continuous-time TIC problems. Our approach relies on a time discretization of the control problem using n-person games characterized by Backward Stochastic Differential Equations (BSDEs). The existence of equilibria is proved by applying weak convergence arguments to the n-person games solutions. Then, we present the numerical results by approximating n-person games with finite Markov chains. We also show the small-time uniqueness of equilibria under Lipschitz assumptions on the Hamiltonian function.
In Chapter 2, we derive a characterization of equilibria using the Malliavin calculus. For this, we replace the classical duality analysis of adjoint BSDEs by the Malliavin integration by parts. This results in a necessary and sufficient Malliavin maximum principle, which is more explicit than classical maximum principle. We apply the results to the linear-quadratic TIC problem and the generalized Merton problem.
In Chapter 3, we use the neural network function to approximate path-dependent PDE solutions, and derive the error bounds of the approximation scheme. In the literature, path-dependent PDEs are solved by estimating the conditional expectations of their probabilistic representation using the regression. However, the regression approach requires a careful selection of the functional basis, which may not be possible in many cases. Our neural network approach, on the other hand, does not require such basis selection. In our numerical examples of two-person zero-sum game, Asian and barrier option pricing, our algorithm appears to be more accurate, especially in large dimensions.
In Chapter 4, we derive the generalization error bound of a 2-layer neural network trained by the stochastic gradient method. Our bounds require no boundedness conditions on the loss function or on its gradients, and can be computed prior to the training of the model. In the numerical simulations, we confirm our dimension-free bound in the case of independent test data. |
author2 |
Nicolas Privault |
author_facet |
Nicolas Privault Nguwi, Jiang Yu |
format |
Thesis-Doctor of Philosophy |
author |
Nguwi, Jiang Yu |
author_sort |
Nguwi, Jiang Yu |
title |
Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
title_short |
Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
title_full |
Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
title_fullStr |
Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
title_full_unstemmed |
Time-inconsistent control problems, path-dependent PDEs, and neural network approximation |
title_sort |
time-inconsistent control problems, path-dependent pdes, and neural network approximation |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163165 |
_version_ |
1759857893984698368 |