Shape fitting problems in the presence of outliers

The thesis studies three planar shape fitting problems in computational geometry, the unit square/disk cover, the minimum enclosing parallelogram and the Tukey/- convex layers. For all the three problems, we allow presence of noisy outliers, whose removal could improve the quality of the output. In...

Full description

Saved in:
Bibliographic Details
Main Author: Guo, Zhengyang
Other Authors: -
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148408
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148408
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics
spellingShingle Science::Mathematics
Guo, Zhengyang
Shape fitting problems in the presence of outliers
description The thesis studies three planar shape fitting problems in computational geometry, the unit square/disk cover, the minimum enclosing parallelogram and the Tukey/- convex layers. For all the three problems, we allow presence of noisy outliers, whose removal could improve the quality of the output. In convention, the covered and uncovered points are called inliers and outliers, respectively. We let n denote the size of the input point set and t the number of possible outliers. With the target to cover no less than (n − t) points of the input, we consider the minimum number of unit disks, a parallelogram of minimum area and a convex polygon of minimum area. In Chapter 3, we study the unit square/disk cover problem. Even when there is no outlier, both problems are NP-hard [57]. By extending the well-known shifting strategy [57] to the cases of outliers, we present an approximation algorithm to unit square cover, which covers at least (n − t − δt) points with at most 2 times of the optimal number of squares. We also give a (7/2, 1 + δ) bi-criteria approximation algorithm whose time complexity is O(n 7 t + δ −1ntlog n). We also introduce a restricted version of unit disk cover, where the centers of the disks are restricted to be among the input points. This restricted version is also NP-hard and we give a (1 + 6/ √ 5 + 1/`, 1 + δ) bi-criteria approximation algorithm whose runtime is O(`n4 t + δ −1 `mtlog n). Next in Chapter 4, we study the minimum enclosing parallelogram with outliers. By developing a method similar to rotating calipers [97], we can obtain all the possible positions of the parallelogram sides. Combining the different side positions, we present an exact algorithm with O |Ut+1(X)| 2 t 4 + n 2 log n runtime, with the assumption that no three points are collinear. Here Ut+1(X) denotes the first (t+1) Tukey layers of the input, consisting of the points whose Tukey depth [98] are at most (t + 1). The algorithm is quadratic in n and could be prohibitively slow for pratical use. To overcome such issue, we give a sampling algorithm with runtime O(n+poly(log n, t, 1/δ)), which with high probability finds a parallelogram covering at least (1 − δ)(n − t) points. The area of the parallelogram is no more than the optimal value. In Section 4.6, we show our exact algorithm can be adapted to the partial minimum enclosing rectangle, whose orientation can be arbitrary, improving the best known time complexity O(n 2 t 2 + n 2 tlog n) in [26] to O(n 2 t 2 + n 2 log n). In Chapter 5, we study the average case complexity of two shape fitting algorithms, both in the cases of outliers. The first is our exact algorithms for minimum enclosing parallelogram with outliers, introduced in Chapter 4. As proved in Theorem 4.12, the time complexity of the algorithm to MEPt depends on the size of the first t Tukey layers, which is denoted by |U[t](X)|. We therefore turn to study the expected size of the first t Tukey layers of the input. We show that E[U[t](X)] = O(ktlog(n/t)), when the n points are independently and uniformly sampled from a convex polygon with k vertices. Consequently, the average case complexity is O(kt5n log(n/k) + n 2 log n). In the same spirit, we also study the expected size of the first t convex layers. We show that E[V[t](X)] = O(kt3 log(n/(kt2 ))) and by this result we can compute the average case complexity of the algorithm for convex hull with outliers in [5], which is O ( n log n + k 4t2t (3t){t t3 log nkt2).
author2 -
author_facet -
Guo, Zhengyang
format Thesis-Doctor of Philosophy
author Guo, Zhengyang
author_sort Guo, Zhengyang
title Shape fitting problems in the presence of outliers
title_short Shape fitting problems in the presence of outliers
title_full Shape fitting problems in the presence of outliers
title_fullStr Shape fitting problems in the presence of outliers
title_full_unstemmed Shape fitting problems in the presence of outliers
title_sort shape fitting problems in the presence of outliers
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148408
_version_ 1759855993133465600
spelling sg-ntu-dr.10356-1484082023-02-28T23:47:44Z Shape fitting problems in the presence of outliers Guo, Zhengyang - School of Physical and Mathematical Sciences Yi Li yili@ntu.edu.sg Science::Mathematics The thesis studies three planar shape fitting problems in computational geometry, the unit square/disk cover, the minimum enclosing parallelogram and the Tukey/- convex layers. For all the three problems, we allow presence of noisy outliers, whose removal could improve the quality of the output. In convention, the covered and uncovered points are called inliers and outliers, respectively. We let n denote the size of the input point set and t the number of possible outliers. With the target to cover no less than (n − t) points of the input, we consider the minimum number of unit disks, a parallelogram of minimum area and a convex polygon of minimum area. In Chapter 3, we study the unit square/disk cover problem. Even when there is no outlier, both problems are NP-hard [57]. By extending the well-known shifting strategy [57] to the cases of outliers, we present an approximation algorithm to unit square cover, which covers at least (n − t − δt) points with at most 2 times of the optimal number of squares. We also give a (7/2, 1 + δ) bi-criteria approximation algorithm whose time complexity is O(n 7 t + δ −1ntlog n). We also introduce a restricted version of unit disk cover, where the centers of the disks are restricted to be among the input points. This restricted version is also NP-hard and we give a (1 + 6/ √ 5 + 1/`, 1 + δ) bi-criteria approximation algorithm whose runtime is O(`n4 t + δ −1 `mtlog n). Next in Chapter 4, we study the minimum enclosing parallelogram with outliers. By developing a method similar to rotating calipers [97], we can obtain all the possible positions of the parallelogram sides. Combining the different side positions, we present an exact algorithm with O |Ut+1(X)| 2 t 4 + n 2 log n runtime, with the assumption that no three points are collinear. Here Ut+1(X) denotes the first (t+1) Tukey layers of the input, consisting of the points whose Tukey depth [98] are at most (t + 1). The algorithm is quadratic in n and could be prohibitively slow for pratical use. To overcome such issue, we give a sampling algorithm with runtime O(n+poly(log n, t, 1/δ)), which with high probability finds a parallelogram covering at least (1 − δ)(n − t) points. The area of the parallelogram is no more than the optimal value. In Section 4.6, we show our exact algorithm can be adapted to the partial minimum enclosing rectangle, whose orientation can be arbitrary, improving the best known time complexity O(n 2 t 2 + n 2 tlog n) in [26] to O(n 2 t 2 + n 2 log n). In Chapter 5, we study the average case complexity of two shape fitting algorithms, both in the cases of outliers. The first is our exact algorithms for minimum enclosing parallelogram with outliers, introduced in Chapter 4. As proved in Theorem 4.12, the time complexity of the algorithm to MEPt depends on the size of the first t Tukey layers, which is denoted by |U[t](X)|. We therefore turn to study the expected size of the first t Tukey layers of the input. We show that E[U[t](X)] = O(ktlog(n/t)), when the n points are independently and uniformly sampled from a convex polygon with k vertices. Consequently, the average case complexity is O(kt5n log(n/k) + n 2 log n). In the same spirit, we also study the expected size of the first t convex layers. We show that E[V[t](X)] = O(kt3 log(n/(kt2 ))) and by this result we can compute the average case complexity of the algorithm for convex hull with outliers in [5], which is O ( n log n + k 4t2t (3t){t t3 log nkt2). Doctor of Philosophy 2021-04-28T08:34:38Z 2021-04-28T08:34:38Z 2021 Thesis-Doctor of Philosophy Guo, Z. (2021). Shape fitting problems in the presence of outliers. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148408 https://hdl.handle.net/10356/148408 10.32657/10356/148408 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University