Screen content image evaluation and processing

Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many c...

Full description

Saved in:

Bibliographic Details
Main Author:	Yang, Huan
Other Authors:	Lin Weisi
Format:	Theses and Dissertations
Language:	English
Published:	2015
Subjects:	DRNTU::Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/65397
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-65397
record_format	dspace
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	DRNTU::Engineering::Computer science and engineering
spellingShingle	DRNTU::Engineering::Computer science and engineering Yang, Huan Screen content image evaluation and processing
description	Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many challenges on SCI processing, such as acquisition, segmentation, compression, transmission, quality evaluation, etc. SCIs have different characteristics from natural scene images and scanned document images, which result in the fact that existing classical image processing methods cannot effectively process SCIs. Hence, specialized algorithms for SCI processing are much desired. Currently, there is no much research work in the literature for SCI processing. In this research work, we try to understand the basic properties of SCIs and focus on address- ing challenging problems in the following three aspects, i.e., segmentation, compression and perceptual quality assessment of SCIs. SCI segmentation, which aims to distinguish texts from other components, is a fun- damental step in various SCI processing techniques. In this research work, we rstly propose a coarse-to- ne framework to segment texts with arbitrary scales and orien- tations from other components in SCIs. A Local Image Activity Measure (LIAM) is designed to enhance the di erence between textual and pictorial regions and eliminate most of pictorial regions with low frequency. In order to remove survived pictorial regions (mistaken as texts), a new Scale and Orientation Invariant Grouping (SOIG) algorithm is proposed to construct Textual Connected Components (TCCs) with uni- form geometrical features. False positive components are nally ltered out by three veri cation criteria. The proposed text segmentation algorithm can maintain integrity of texts with varied scales and orientations, which bene ts the compression and evalu- ation procedures for SCIs.It has been demonstrated that traditional coding methods with a single basic func- tion, such as JPEG and JPEG2000, cannot achieve good performance for SCI com- pression due to the intensive high frequency variations in textual regions. In this work, a novel SCI compression scheme is proposed by using di erent basis functions to en- code di erent components respectively. A tailored text dictionary for textual image representation is learned via a modi ed dictionary learning method, i.e., K-Singular Value Decomposition (K-SVD). Compared with the Discrete Cosine Transform (DCT) based representation, textual representation derived from the tailored text dictionary is much sparser, which provides more probability to e ectively encode SCIs. The pro- posed coding scheme achieves much higher coding performance than existing standard coding methods, especially for SCIs with large percentage of textual regions. To evaluate the visual quality of the processed SCIs by compression and other processing, we present a study on perceptual quality assessment of SCIs. A large SCI Quality Assessment Database (SIQAD) is constructed with the visual quality scores ob- tained through subjective testing. Besides, we investigate the correlations between the subjective scores of di erent regions, which reveals the impact of textual and pictorial regions to the overall visual quality. A new SCI Perceptual Quality Assessment (SPQA) scheme is also proposed to automatically evaluate the visual quality of distorted SCIs, by taking into account the di erent properties and contributions of textual and pic- torial regions. Compared with the start-of-the-art Image Quality Assessment (IQA) methods, the proposed SPQA achieves much higher consistency with subjective results.
author2	Lin Weisi
author_facet	Lin Weisi Yang, Huan
format	Theses and Dissertations
author	Yang, Huan
author_sort	Yang, Huan
title	Screen content image evaluation and processing
title_short	Screen content image evaluation and processing
title_full	Screen content image evaluation and processing
title_fullStr	Screen content image evaluation and processing
title_full_unstemmed	Screen content image evaluation and processing
title_sort	screen content image evaluation and processing
publishDate	2015
url	https://hdl.handle.net/10356/65397
_version_	1759857528314789888
spelling	sg-ntu-dr.10356-653972023-03-04T00:42:55Z Screen content image evaluation and processing Yang, Huan Lin Weisi School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many challenges on SCI processing, such as acquisition, segmentation, compression, transmission, quality evaluation, etc. SCIs have different characteristics from natural scene images and scanned document images, which result in the fact that existing classical image processing methods cannot effectively process SCIs. Hence, specialized algorithms for SCI processing are much desired. Currently, there is no much research work in the literature for SCI processing. In this research work, we try to understand the basic properties of SCIs and focus on address- ing challenging problems in the following three aspects, i.e., segmentation, compression and perceptual quality assessment of SCIs. SCI segmentation, which aims to distinguish texts from other components, is a fun- damental step in various SCI processing techniques. In this research work, we rstly propose a coarse-to- ne framework to segment texts with arbitrary scales and orien- tations from other components in SCIs. A Local Image Activity Measure (LIAM) is designed to enhance the di erence between textual and pictorial regions and eliminate most of pictorial regions with low frequency. In order to remove survived pictorial regions (mistaken as texts), a new Scale and Orientation Invariant Grouping (SOIG) algorithm is proposed to construct Textual Connected Components (TCCs) with uni- form geometrical features. False positive components are nally ltered out by three veri cation criteria. The proposed text segmentation algorithm can maintain integrity of texts with varied scales and orientations, which bene ts the compression and evalu- ation procedures for SCIs.It has been demonstrated that traditional coding methods with a single basic func- tion, such as JPEG and JPEG2000, cannot achieve good performance for SCI com- pression due to the intensive high frequency variations in textual regions. In this work, a novel SCI compression scheme is proposed by using di erent basis functions to en- code di erent components respectively. A tailored text dictionary for textual image representation is learned via a modi ed dictionary learning method, i.e., K-Singular Value Decomposition (K-SVD). Compared with the Discrete Cosine Transform (DCT) based representation, textual representation derived from the tailored text dictionary is much sparser, which provides more probability to e ectively encode SCIs. The pro- posed coding scheme achieves much higher coding performance than existing standard coding methods, especially for SCIs with large percentage of textual regions. To evaluate the visual quality of the processed SCIs by compression and other processing, we present a study on perceptual quality assessment of SCIs. A large SCI Quality Assessment Database (SIQAD) is constructed with the visual quality scores ob- tained through subjective testing. Besides, we investigate the correlations between the subjective scores of di erent regions, which reveals the impact of textual and pictorial regions to the overall visual quality. A new SCI Perceptual Quality Assessment (SPQA) scheme is also proposed to automatically evaluate the visual quality of distorted SCIs, by taking into account the di erent properties and contributions of textual and pic- torial regions. Compared with the start-of-the-art Image Quality Assessment (IQA) methods, the proposed SPQA achieves much higher consistency with subjective results. COMPUTER ENGINEERING 2015-09-08T05:12:48Z 2015-09-08T05:12:48Z 2015 2015 Thesis Yang, H. (2015). Screen content image evaluation and processing. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/65397 10.32657/10356/65397 en 140 p. application/pdf

Screen content image evaluation and processing

Similar Items