Word by word labelling of Romanized Sindhi text by using online python tool
Sindhi is one of the most ancient languages in the world and it has its own written and spoken scripts. After the rigorous study it was found that a lot of research work has been done in different languages, but word by word labelling of Sindhi language had not been done yet. In this research study...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English English |
Published: |
The Science and Information (SAI) Organization
2022
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/100654/1/100654_Word%20by%20word%20labelling.pdf http://irep.iium.edu.my/100654/2/100654_Word%20by%20word%20labelling_SCOPUS.pdf http://irep.iium.edu.my/100654/ https://thesai.org/Publications/ViewPaper?Volume=13&Issue=8&Code=IJACSA&SerialNo=31 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
id |
my.iium.irep.100654 |
---|---|
record_format |
dspace |
spelling |
my.iium.irep.1006542023-12-29T02:18:00Z http://irep.iium.edu.my/100654/ Word by word labelling of Romanized Sindhi text by using online python tool Sodhar, Irum Naz Buller, Abdul Hafeez Sulaiman, Suriani Sodhar, Anam Naz QA75 Electronic computers. Computer science Sindhi is one of the most ancient languages in the world and it has its own written and spoken scripts. After the rigorous study it was found that a lot of research work has been done in different languages, but word by word labelling of Sindhi language had not been done yet. In this research study, word labelling was done on 100 sentences of Romanized Sindhi texts using Python online tool. The dataset was collected from different sources which include Sindhi newspaper, blogs and social media webpages. From this dataset, a rule-based model has been applied for the Parts-of-Speech (POS) tagging of the Romanized Sindhi sentences. A total of 624 words of Romanized Sindhi texts were tested and successfully tagged by the SindhiNLP tool in which 482 words were tagged as nouns and pronouns, 92 words tagged as verbs and 50 words tagged as determinants. The Science and Information (SAI) Organization 2022 Article PeerReviewed application/pdf en http://irep.iium.edu.my/100654/1/100654_Word%20by%20word%20labelling.pdf application/pdf en http://irep.iium.edu.my/100654/2/100654_Word%20by%20word%20labelling_SCOPUS.pdf Sodhar, Irum Naz and Buller, Abdul Hafeez and Sulaiman, Suriani and Sodhar, Anam Naz (2022) Word by word labelling of Romanized Sindhi text by using online python tool. International Journal of Advanced Computer Science and Applications, 13 (8). pp. 262-267. ISSN 2158-107X E-ISSN 2156-5570 https://thesai.org/Publications/ViewPaper?Volume=13&Issue=8&Code=IJACSA&SerialNo=31 10.14569/IJACSA.2022.0130831 |
institution |
Universiti Islam Antarabangsa Malaysia |
building |
IIUM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
International Islamic University Malaysia |
content_source |
IIUM Repository (IREP) |
url_provider |
http://irep.iium.edu.my/ |
language |
English English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Sodhar, Irum Naz Buller, Abdul Hafeez Sulaiman, Suriani Sodhar, Anam Naz Word by word labelling of Romanized Sindhi text by using online python tool |
description |
Sindhi is one of the most ancient languages in the world and it has its own written and spoken scripts. After the rigorous study it was found that a lot of research work has been done in different languages, but word by word labelling of Sindhi
language had not been done yet. In this research study, word
labelling was done on 100 sentences of Romanized Sindhi texts using Python online tool. The dataset was collected from different sources which include Sindhi newspaper, blogs and social media webpages. From this dataset, a rule-based model has been applied for the Parts-of-Speech (POS) tagging of the Romanized Sindhi sentences. A total of 624 words of Romanized Sindhi texts were tested and successfully tagged by the SindhiNLP tool in which 482 words were tagged as nouns and pronouns, 92 words tagged as verbs and 50 words tagged as determinants. |
format |
Article |
author |
Sodhar, Irum Naz Buller, Abdul Hafeez Sulaiman, Suriani Sodhar, Anam Naz |
author_facet |
Sodhar, Irum Naz Buller, Abdul Hafeez Sulaiman, Suriani Sodhar, Anam Naz |
author_sort |
Sodhar, Irum Naz |
title |
Word by word labelling of Romanized Sindhi text by using online python tool |
title_short |
Word by word labelling of Romanized Sindhi text by using online python tool |
title_full |
Word by word labelling of Romanized Sindhi text by using online python tool |
title_fullStr |
Word by word labelling of Romanized Sindhi text by using online python tool |
title_full_unstemmed |
Word by word labelling of Romanized Sindhi text by using online python tool |
title_sort |
word by word labelling of romanized sindhi text by using online python tool |
publisher |
The Science and Information (SAI) Organization |
publishDate |
2022 |
url |
http://irep.iium.edu.my/100654/1/100654_Word%20by%20word%20labelling.pdf http://irep.iium.edu.my/100654/2/100654_Word%20by%20word%20labelling_SCOPUS.pdf http://irep.iium.edu.my/100654/ https://thesai.org/Publications/ViewPaper?Volume=13&Issue=8&Code=IJACSA&SerialNo=31 |
_version_ |
1787131857017503744 |