Parallelizing discrete geodesic algorithms with perfect efficiency
This paper presents a new method for parallelizing geodesic algorithms on triangle meshes. Using the half-edge data structure, we define the propagation dependency graph to characterize data dependency in computing geodesics. Then, we design an active strategy such that the vertices and half-edges o...
Saved in:
Main Authors: | , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/151965 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-151965 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1519652021-07-08T04:59:15Z Parallelizing discrete geodesic algorithms with perfect efficiency Ying, Xiang Huang, Caibao Fu, Xuzhou He, Ying Yu, Ruiguo Wang, Jianrong Yu, Mei School of Computer Science and Engineering Engineering::Computer science and engineering Discrete Geodesics Autonomous Wavefront Propagation This paper presents a new method for parallelizing geodesic algorithms on triangle meshes. Using the half-edge data structure, we define the propagation dependency graph to characterize data dependency in computing geodesics. Then, we design an active strategy such that the vertices and half-edges on the wavefront take the initiative to collect their input data and then propagate windows and update geodesic information in their own memory space. As a result, all the read and write operations can be carried out simultaneously. Our method, named AWP, works for both exact (e.g., the CH algorithm) and approximate (e.g., the fast marching method) geodesic algorithms. Our implementation on various NVIDIA GPUs exhibit perfect linear speedup, i.e., doubling the computational power (i.e., FLOPS) doubles the speed. We prove that the AWP-CH algorithm runs in O(n2∕min(C,n)) time, where n and C are the numbers of faces and cores, respectively. Evaluation on GTX Titan XP shows that AWP-CH empirically runs in np time, p∈[1.25,1.35], for real-world models with n≤107 and anisotropy measure τ≤2.0. Thanks to its perfect efficiency and the trend of increasing the number of processors in graphics hardware, we believe that the actual performance of AWP can be further improved in the near future. Ministry of Education (MOE) This project is partially supported by the National Natural Science Foundation of China (Grant No. 61402322), Tianjin Research Program of Application Foundation and Advanced Technology, China (Grant No, 15JCQNJC00300) and Singapore Ministry of Education Grant RG26/17. 2021-07-08T04:59:15Z 2021-07-08T04:59:15Z 2019 Journal Article Ying, X., Huang, C., Fu, X., He, Y., Yu, R., Wang, J. & Yu, M. (2019). Parallelizing discrete geodesic algorithms with perfect efficiency. Computer Aided Design, 115, 161-171. https://dx.doi.org/10.1016/j.cad.2019.05.023 0010-4485 https://hdl.handle.net/10356/151965 10.1016/j.cad.2019.05.023 2-s2.0-85066400280 115 161 171 en RG26/17 Computer Aided Design © 2019 Elsevier Ltd. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Discrete Geodesics Autonomous Wavefront Propagation |
spellingShingle |
Engineering::Computer science and engineering Discrete Geodesics Autonomous Wavefront Propagation Ying, Xiang Huang, Caibao Fu, Xuzhou He, Ying Yu, Ruiguo Wang, Jianrong Yu, Mei Parallelizing discrete geodesic algorithms with perfect efficiency |
description |
This paper presents a new method for parallelizing geodesic algorithms on triangle meshes. Using the half-edge data structure, we define the propagation dependency graph to characterize data dependency in computing geodesics. Then, we design an active strategy such that the vertices and half-edges on the wavefront take the initiative to collect their input data and then propagate windows and update geodesic information in their own memory space. As a result, all the read and write operations can be carried out simultaneously. Our method, named AWP, works for both exact (e.g., the CH algorithm) and approximate (e.g., the fast marching method) geodesic algorithms. Our implementation on various NVIDIA GPUs exhibit perfect linear speedup, i.e., doubling the computational power (i.e., FLOPS) doubles the speed. We prove that the AWP-CH algorithm runs in O(n2∕min(C,n)) time, where n and C are the numbers of faces and cores, respectively. Evaluation on GTX Titan XP shows that AWP-CH empirically runs in np time, p∈[1.25,1.35], for real-world models with n≤107 and anisotropy measure τ≤2.0. Thanks to its perfect efficiency and the trend of increasing the number of processors in graphics hardware, we believe that the actual performance of AWP can be further improved in the near future. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Ying, Xiang Huang, Caibao Fu, Xuzhou He, Ying Yu, Ruiguo Wang, Jianrong Yu, Mei |
format |
Article |
author |
Ying, Xiang Huang, Caibao Fu, Xuzhou He, Ying Yu, Ruiguo Wang, Jianrong Yu, Mei |
author_sort |
Ying, Xiang |
title |
Parallelizing discrete geodesic algorithms with perfect efficiency |
title_short |
Parallelizing discrete geodesic algorithms with perfect efficiency |
title_full |
Parallelizing discrete geodesic algorithms with perfect efficiency |
title_fullStr |
Parallelizing discrete geodesic algorithms with perfect efficiency |
title_full_unstemmed |
Parallelizing discrete geodesic algorithms with perfect efficiency |
title_sort |
parallelizing discrete geodesic algorithms with perfect efficiency |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/151965 |
_version_ |
1705151293174054912 |