IMPROVING TENSORFLOW’S MEMORY SWAPPING

The ever-increasing sizes of deep learning models and datasets increase the need for memory, as insufficient memory can abort a training run. Compounding the problem, deep learning training tends to run on GPUs rather than CPUs for speed, and a GPU generally has far less memory than a CPU. One solution is memory swapping: freeing memory by temporarily moving data to another memory pool, in this case from GPU memory to CPU memory. However, since moving data takes time (the larger the data, the longer the transfer), memory swapping can significantly lengthen training. An ideal swapping scheme therefore has to be selective about how much data to swap and which data to swap. TensorFlow, a machine learning framework, features memory swapping that, based on our analysis of its implementation, can be improved to reduce the training slowdown it causes. The improvement prioritizes earlier tensors (a tensor is the basic data unit in TensorFlow), which, because of a property of backpropagation, increases the asynchronicity of execution and ultimately reduces training duration. It is implemented by modifying the part of TensorFlow's kernel responsible for memory swapping. Originally, TensorFlow's memory swapping always swaps the latest tensor; after the improvement, earlier tensors are swapped first. In experiments on Char-RNN models with various hyperparameters and datasets (285 KB and 4.4 MB), the improvement reduces training duration by up to around 3% in certain cases. Moreover, because the change is made at the kernel level, it is transparent to TensorFlow's end users.
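
The intuition behind the prioritization can be illustrated with a small sketch. Backpropagation computes gradients in reverse order of the forward pass, so tensors produced earliest are generally the last to be reused; swapping those out leaves the most time to copy them back asynchronously before they are needed, overlapping the transfer with computation. The sketch below is not TensorFlow's kernel code: the Tensor record, its fields, and the swap_candidates helper are hypothetical, and it only contrasts a latest-first selection policy with the proposed earliest-first policy.

from dataclasses import dataclass

@dataclass
class Tensor:
    # Hypothetical stand-in for a forward-pass tensor tracked by the swapper.
    name: str
    step: int    # position in the forward pass (lower = produced earlier)
    nbytes: int  # memory footprint on the GPU

def swap_candidates(tensors, bytes_needed, earliest_first=True):
    """Pick tensors to swap from GPU to CPU memory until enough bytes are freed.

    earliest_first=True models the proposed policy: tensors produced earliest
    in the forward pass are reused last during backpropagation, so swapping
    them maximizes the window for asynchronous swap-in.
    earliest_first=False models a latest-first policy for comparison.
    """
    order = sorted(tensors, key=lambda t: t.step, reverse=not earliest_first)
    chosen, freed = [], 0
    for t in order:
        if freed >= bytes_needed:
            break
        chosen.append(t)
        freed += t.nbytes
    return chosen

if __name__ == "__main__":
    activations = [Tensor(f"layer{i}_out", step=i, nbytes=64 << 20) for i in range(8)]
    need = 128 << 20  # suppose the GPU allocator needs 128 MB back
    print([t.name for t in swap_candidates(activations, need)])                        # earliest-first
    print([t.name for t in swap_candidates(activations, need, earliest_first=False)])  # latest-first

Under these assumptions, the earliest-first policy frees the same amount of memory as latest-first but chooses tensors whose swap-in can be hidden behind the longest stretch of backward computation.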


Bibliographic Details
Main Author: Alvaro, Devin
Format: Final Project
Language: Indonesian
Online Access:https://digilib.itb.ac.id/gdl/view/39023
Institution: Institut Teknologi Bandung
id id-itb.:39023
topic memory swapping, TensorFlow, asynchronous, backpropagation
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia