A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits

This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structur...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu, Xue-Xin, Yu, Hao, Tan, Sheldon X.-D.
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2014
Subjects:
Online Access:https://hdl.handle.net/10356/103119
http://hdl.handle.net/10220/19253
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-103119
record_format dspace
spelling sg-ntu-dr.10356-1031192020-03-07T14:00:33Z A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits Liu, Xue-Xin Yu, Hao Tan, Sheldon X.-D. School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to 8x speedup. Accepted version 2014-04-11T07:54:28Z 2019-12-06T21:06:02Z 2014-04-11T07:54:28Z 2019-12-06T21:06:02Z 2013 2013 Journal Article Liu, X.-X., Yu, H., & Tan, S. X.-D. (2014). A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, (99),1-1. 1063-8210 https://hdl.handle.net/10356/103119 http://hdl.handle.net/10220/19253 10.1109/TVLSI.2014.2309606 en IEEE transactions on very large scale integration (VLSI) systems © 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TVLSI.2014.2309606]. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic DRNTU::Engineering::Electrical and electronic engineering
spellingShingle DRNTU::Engineering::Electrical and electronic engineering
Liu, Xue-Xin
Yu, Hao
Tan, Sheldon X.-D.
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
description This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to 8x speedup.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Liu, Xue-Xin
Yu, Hao
Tan, Sheldon X.-D.
format Article
author Liu, Xue-Xin
Yu, Hao
Tan, Sheldon X.-D.
author_sort Liu, Xue-Xin
title A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
title_short A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
title_full A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
title_fullStr A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
title_full_unstemmed A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
title_sort gpu-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
publishDate 2014
url https://hdl.handle.net/10356/103119
http://hdl.handle.net/10220/19253
_version_ 1681042613129445376