The old newspaper project

In this project, we will create a newspaper image recognition app based on AI which uses Computer Vision, Optical Character Recognition (OCR) technology and Natural Language Processing (NLP) models to recognize, classify and extract information from images of newspapers. The project follows an itera...

Full description

Saved in:
Bibliographic Details
Main Author: Li, JiaGeng
Other Authors: Ling Keck Voon
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181681
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181681
record_format dspace
spelling sg-ntu-dr.10356-1816812024-12-13T15:45:41Z The old newspaper project Li, JiaGeng Ling Keck Voon School of Electrical and Electronic Engineering EKVLING@ntu.edu.sg Computer and Information Science In this project, we will create a newspaper image recognition app based on AI which uses Computer Vision, Optical Character Recognition (OCR) technology and Natural Language Processing (NLP) models to recognize, classify and extract information from images of newspapers. The project follows an iterative process; it begins with the investigation of Tesseract OCR and Yolov8 in a pre-alpha version to extract text and analyze layout. Having faced limitations we shifted to advanced implementations such as PaddleOCR along with LayoutParser for styling newspaper layouts and document parsing and using GPT-3 models for detailed outline writing of the extracted content., In this report, we describe the project context, technology choices, how we collected data is followed by some challenges we faced while building the application and how we solved those issues to improve the application's functionality. Bachelor's degree 2024-12-13T11:59:04Z 2024-12-13T11:59:04Z 2024 Final Year Project (FYP) Li, J. (2024). The old newspaper project. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181681 https://hdl.handle.net/10356/181681 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
spellingShingle Computer and Information Science
Li, JiaGeng
The old newspaper project
description In this project, we will create a newspaper image recognition app based on AI which uses Computer Vision, Optical Character Recognition (OCR) technology and Natural Language Processing (NLP) models to recognize, classify and extract information from images of newspapers. The project follows an iterative process; it begins with the investigation of Tesseract OCR and Yolov8 in a pre-alpha version to extract text and analyze layout. Having faced limitations we shifted to advanced implementations such as PaddleOCR along with LayoutParser for styling newspaper layouts and document parsing and using GPT-3 models for detailed outline writing of the extracted content., In this report, we describe the project context, technology choices, how we collected data is followed by some challenges we faced while building the application and how we solved those issues to improve the application's functionality.
author2 Ling Keck Voon
author_facet Ling Keck Voon
Li, JiaGeng
format Final Year Project
author Li, JiaGeng
author_sort Li, JiaGeng
title The old newspaper project
title_short The old newspaper project
title_full The old newspaper project
title_fullStr The old newspaper project
title_full_unstemmed The old newspaper project
title_sort old newspaper project
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181681
_version_ 1819113015321034752