Digital library for Thai astronomical history study on French document resource

© 2019 Association for Computing Machinery. Thai history in the era of the Ayutthaya Kingdom was mostly documented by French missionaries during the 17th-18th centuries. A huge amount of resources in the form of manuscripts, books, and microfilms is preserved and provided by several institutions such as Bibliotèq...

Bibliographic Details
Main Authors: Papangkorn Inkeaw, Jeerayut Chaijaruwanich, Boonrucksar Soonthornthum
Format: Conference Proceeding
Published: 2020
Subjects: Computer Science
Online Access: https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078502356&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/67695
Institution: Chiang Mai University
id th-cmuir.6653943832-67695
record_format dspace
spelling th-cmuir.6653943832-67695 2020-04-02T15:01:38Z Digital library for Thai astronomical history study on French document resource Papangkorn Inkeaw Jeerayut Chaijaruwanich Boonrucksar Soonthornthum Computer Science © 2019 Association for Computing Machinery. Thai history in the era of the Ayutthaya Kingdom was mostly documented by French missionaries during the 17th-18th centuries. A huge amount of resources in the form of manuscripts, books, and microfilms is preserved and provided by several institutions such as the Bibliothèque nationale de France. Nowadays, advances in digital technology allow us to access these resources publicly. Many resources have been digitized in the form of scanned images. This work aims to establish our own specialized digital library for the study of Thai astronomical history. A document management system was developed that includes data acquisition and collection management. To access the knowledge behind the texts, the scanned images were transformed into a machine-readable format by optical character recognition (OCR). A search engine was implemented to allow historians to find pieces of relevant information from keywords. In our circumstances, Thai historians may not be able to read French. We integrated automatic French-to-English translation using machine translation techniques. Our system provides historians with e-books of the original French historical documents in English. To automatically extract knowledge from the texts, we apply natural language processing to identify named entities, such as names of persons, places, and events. This enables historians to explore meaningful concepts via the indices of the texts. The indices were also automatically linked to Wikipedia as an existing knowledge pool. Our project still has some limitations, including the OCR, machine translation, and named-entity recognition processes, which remain challenging problems in computer science research.
2020-04-02T15:01:38Z 2020-04-02T15:01:38Z 2019-10-25 Conference Proceeding 2-s2.0-85078502356 10.1145/3369199.3369236 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078502356&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/67695
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Computer Science
description © 2019 Association for Computing Machinery. Thai history in the era of the Ayutthaya Kingdom was mostly documented by French missionaries during the 17th-18th centuries. A huge amount of resources in the form of manuscripts, books, and microfilms is preserved and provided by several institutions such as the Bibliothèque nationale de France. Nowadays, advances in digital technology allow us to access these resources publicly. Many resources have been digitized in the form of scanned images. This work aims to establish our own specialized digital library for the study of Thai astronomical history. A document management system was developed that includes data acquisition and collection management. To access the knowledge behind the texts, the scanned images were transformed into a machine-readable format by optical character recognition (OCR). A search engine was implemented to allow historians to find pieces of relevant information from keywords. In our circumstances, Thai historians may not be able to read French. We integrated automatic French-to-English translation using machine translation techniques. Our system provides historians with e-books of the original French historical documents in English. To automatically extract knowledge from the texts, we apply natural language processing to identify named entities, such as names of persons, places, and events. This enables historians to explore meaningful concepts via the indices of the texts. The indices were also automatically linked to Wikipedia as an existing knowledge pool. Our project still has some limitations, including the OCR, machine translation, and named-entity recognition processes, which remain challenging problems in computer science research.
format Conference Proceeding
author Papangkorn Inkeaw
Jeerayut Chaijaruwanich
Boonrucksar Soonthornthum
title Digital library for Thai astronomical history study on French document resource
publishDate 2020
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85078502356&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/67695
_version_ 1681426682842447872