A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits
This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structur...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2014
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/103119 http://hdl.handle.net/10220/19253 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-103119 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1031192020-03-07T14:00:33Z A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits Liu, Xue-Xin Yu, Hao Tan, Sheldon X.-D. School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to 8x speedup. Accepted version 2014-04-11T07:54:28Z 2019-12-06T21:06:02Z 2014-04-11T07:54:28Z 2019-12-06T21:06:02Z 2013 2013 Journal Article Liu, X.-X., Yu, H., & Tan, S. X.-D. (2014). A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, (99),1-1. 1063-8210 https://hdl.handle.net/10356/103119 http://hdl.handle.net/10220/19253 10.1109/TVLSI.2014.2309606 en IEEE transactions on very large scale integration (VLSI) systems © 2013 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TVLSI.2014.2309606]. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering Liu, Xue-Xin Yu, Hao Tan, Sheldon X.-D. A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
description |
This paper presents a new parallel shooting-Newton method based on a graphic processing unit (GPU)-accelerated periodic Arnoldi shooting solver (GAPAS) for fast periodic steady-state analysis of radio frequency/millimeter-wave integrated circuits. The new algorithm first explores a periodic structure of the state matrix by using a periodic Arnoldi algorithm for computing the resulting structured Krylov subspace in the generalized minimal residual (GMRES) solver. The resulting periodic Arnoldi shooting method is very amenable for massive parallel computing, such as GPUs. Second, the periodic Arnoldi-based GMRES solver in the shooting-Newton method is parallelized on the recent NVIDIA Tesla GPU platforms. We further explore CUDA GPUs features, such as coalesced memory access and overlapping transfers with computation to boost the efficiency of the resulting parallel GAPAS method. Experimental results from several industrial examples show that when compared with the state-of-the-art implicit GMRES method under the same accuracy, the new parallel shooting-Newton method can lead up to 8x speedup. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Liu, Xue-Xin Yu, Hao Tan, Sheldon X.-D. |
format |
Article |
author |
Liu, Xue-Xin Yu, Hao Tan, Sheldon X.-D. |
author_sort |
Liu, Xue-Xin |
title |
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
title_short |
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
title_full |
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
title_fullStr |
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
title_full_unstemmed |
A GPU-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
title_sort |
gpu-accelerated parallel shooting algorithm for analysis of radio frequency and microwave integrated circuits |
publishDate |
2014 |
url |
https://hdl.handle.net/10356/103119 http://hdl.handle.net/10220/19253 |
_version_ |
1681042613129445376 |