Screen content image evaluation and processing

Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many c...

Full description

Saved in:
Bibliographic Details
Main Author: Yang, Huan
Other Authors: Lin Weisi
Format: Theses and Dissertations
Language:English
Published: 2015
Subjects:
Online Access:https://hdl.handle.net/10356/65397
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-65397
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Yang, Huan
Screen content image evaluation and processing
description Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many challenges on SCI processing, such as acquisition, segmentation, compression, transmission, quality evaluation, etc. SCIs have different characteristics from natural scene images and scanned document images, which result in the fact that existing classical image processing methods cannot effectively process SCIs. Hence, specialized algorithms for SCI processing are much desired. Currently, there is no much research work in the literature for SCI processing. In this research work, we try to understand the basic properties of SCIs and focus on address- ing challenging problems in the following three aspects, i.e., segmentation, compression and perceptual quality assessment of SCIs. SCI segmentation, which aims to distinguish texts from other components, is a fun- damental step in various SCI processing techniques. In this research work, we rstly propose a coarse-to- ne framework to segment texts with arbitrary scales and orien- tations from other components in SCIs. A Local Image Activity Measure (LIAM) is designed to enhance the di erence between textual and pictorial regions and eliminate most of pictorial regions with low frequency. In order to remove survived pictorial regions (mistaken as texts), a new Scale and Orientation Invariant Grouping (SOIG) algorithm is proposed to construct Textual Connected Components (TCCs) with uni- form geometrical features. False positive components are nally ltered out by three veri cation criteria. The proposed text segmentation algorithm can maintain integrity of texts with varied scales and orientations, which bene ts the compression and evalu- ation procedures for SCIs.It has been demonstrated that traditional coding methods with a single basic func- tion, such as JPEG and JPEG2000, cannot achieve good performance for SCI com- pression due to the intensive high frequency variations in textual regions. In this work, a novel SCI compression scheme is proposed by using di erent basis functions to en- code di erent components respectively. A tailored text dictionary for textual image representation is learned via a modi ed dictionary learning method, i.e., K-Singular Value Decomposition (K-SVD). Compared with the Discrete Cosine Transform (DCT) based representation, textual representation derived from the tailored text dictionary is much sparser, which provides more probability to e ectively encode SCIs. The pro- posed coding scheme achieves much higher coding performance than existing standard coding methods, especially for SCIs with large percentage of textual regions. To evaluate the visual quality of the processed SCIs by compression and other processing, we present a study on perceptual quality assessment of SCIs. A large SCI Quality Assessment Database (SIQAD) is constructed with the visual quality scores ob- tained through subjective testing. Besides, we investigate the correlations between the subjective scores of di erent regions, which reveals the impact of textual and pictorial regions to the overall visual quality. A new SCI Perceptual Quality Assessment (SPQA) scheme is also proposed to automatically evaluate the visual quality of distorted SCIs, by taking into account the di erent properties and contributions of textual and pic- torial regions. Compared with the start-of-the-art Image Quality Assessment (IQA) methods, the proposed SPQA achieves much higher consistency with subjective results.
author2 Lin Weisi
author_facet Lin Weisi
Yang, Huan
format Theses and Dissertations
author Yang, Huan
author_sort Yang, Huan
title Screen content image evaluation and processing
title_short Screen content image evaluation and processing
title_full Screen content image evaluation and processing
title_fullStr Screen content image evaluation and processing
title_full_unstemmed Screen content image evaluation and processing
title_sort screen content image evaluation and processing
publishDate 2015
url https://hdl.handle.net/10356/65397
_version_ 1759857528314789888
spelling sg-ntu-dr.10356-653972023-03-04T00:42:55Z Screen content image evaluation and processing Yang, Huan Lin Weisi School of Computer Engineering Centre for Multimedia and Network Technology DRNTU::Engineering::Computer science and engineering Screen Content Image (SCI) is a typical kind of compound images that contain texts, graphics and pictures concurrently. With the rapid development of digital devices and computing techniques, SCIs have increasingly appeared in multi-client communication systems. The related applications bring many challenges on SCI processing, such as acquisition, segmentation, compression, transmission, quality evaluation, etc. SCIs have different characteristics from natural scene images and scanned document images, which result in the fact that existing classical image processing methods cannot effectively process SCIs. Hence, specialized algorithms for SCI processing are much desired. Currently, there is no much research work in the literature for SCI processing. In this research work, we try to understand the basic properties of SCIs and focus on address- ing challenging problems in the following three aspects, i.e., segmentation, compression and perceptual quality assessment of SCIs. SCI segmentation, which aims to distinguish texts from other components, is a fun- damental step in various SCI processing techniques. In this research work, we rstly propose a coarse-to- ne framework to segment texts with arbitrary scales and orien- tations from other components in SCIs. A Local Image Activity Measure (LIAM) is designed to enhance the di erence between textual and pictorial regions and eliminate most of pictorial regions with low frequency. In order to remove survived pictorial regions (mistaken as texts), a new Scale and Orientation Invariant Grouping (SOIG) algorithm is proposed to construct Textual Connected Components (TCCs) with uni- form geometrical features. False positive components are nally ltered out by three veri cation criteria. The proposed text segmentation algorithm can maintain integrity of texts with varied scales and orientations, which bene ts the compression and evalu- ation procedures for SCIs.It has been demonstrated that traditional coding methods with a single basic func- tion, such as JPEG and JPEG2000, cannot achieve good performance for SCI com- pression due to the intensive high frequency variations in textual regions. In this work, a novel SCI compression scheme is proposed by using di erent basis functions to en- code di erent components respectively. A tailored text dictionary for textual image representation is learned via a modi ed dictionary learning method, i.e., K-Singular Value Decomposition (K-SVD). Compared with the Discrete Cosine Transform (DCT) based representation, textual representation derived from the tailored text dictionary is much sparser, which provides more probability to e ectively encode SCIs. The pro- posed coding scheme achieves much higher coding performance than existing standard coding methods, especially for SCIs with large percentage of textual regions. To evaluate the visual quality of the processed SCIs by compression and other processing, we present a study on perceptual quality assessment of SCIs. A large SCI Quality Assessment Database (SIQAD) is constructed with the visual quality scores ob- tained through subjective testing. Besides, we investigate the correlations between the subjective scores of di erent regions, which reveals the impact of textual and pictorial regions to the overall visual quality. A new SCI Perceptual Quality Assessment (SPQA) scheme is also proposed to automatically evaluate the visual quality of distorted SCIs, by taking into account the di erent properties and contributions of textual and pic- torial regions. Compared with the start-of-the-art Image Quality Assessment (IQA) methods, the proposed SPQA achieves much higher consistency with subjective results. COMPUTER ENGINEERING 2015-09-08T05:12:48Z 2015-09-08T05:12:48Z 2015 2015 Thesis Yang, H. (2015). Screen content image evaluation and processing. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/65397 10.32657/10356/65397 en 140 p. application/pdf