Arabic word stemming algorithms and retrieval effectiveness
Documents retrieval in Information Retrieval Systems (IRS) is generally about retrieving of relevant documents pertaining to information needs. The more the system able to understand the contents of documents the more effective will be the retrieval outcomes. But understanding of the contents is a v...
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Conference Paper |
Published: |
2023
|
Subjects: | |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Tenaga Nasional |
id |
my.uniten.dspace-29940 |
---|---|
record_format |
dspace |
spelling |
my.uniten.dspace-299402024-04-17T10:51:48Z Arabic word stemming algorithms and retrieval effectiveness Sembok T.M.T. Ata B.A. 9268900400 36837446100 Artificial intelligence Information retrieval Natural language processing Algorithms Artificial intelligence Information retrieval systems Natural language processing systems Semantics Vector spaces Experimental system Morphological analysis NAtural language processing Relevant documents Retrieval effectiveness Retrieval process Semantic technologies Vector space models Information retrieval Documents retrieval in Information Retrieval Systems (IRS) is generally about retrieving of relevant documents pertaining to information needs. The more the system able to understand the contents of documents the more effective will be the retrieval outcomes. But understanding of the contents is a very complex task. Conventional IRS applies algorithms that can only approximate the meaning of document contents through keywords approach using vector space model. Keywords may be unstemmed or stemmed. When keywords are stemmed and conflated in retrieval process, we are a step forwards in applying semantic technology in IRS. Word stemming is a process in morphological analysis under natural language processing, before syntactic and semantic analysis. We have developed algorithms for Arabic stemming and incorporated it in our experimental system in order to measure retrieval effectiveness. The results have shown that the retrieval effectiveness has increased when stemming is used. Final 2023-12-29T07:43:40Z 2023-12-29T07:43:40Z 2013 Conference Paper 2-s2.0-84887864770 https://www.scopus.com/inward/record.uri?eid=2-s2.0-84887864770&partnerID=40&md5=7dab76731f4eb220f927d55c385e5316 https://irepository.uniten.edu.my/handle/123456789/29940 3 LNECS 1577 1582 Scopus |
institution |
Universiti Tenaga Nasional |
building |
UNITEN Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Tenaga Nasional |
content_source |
UNITEN Institutional Repository |
url_provider |
http://dspace.uniten.edu.my/ |
topic |
Artificial intelligence Information retrieval Natural language processing Algorithms Artificial intelligence Information retrieval systems Natural language processing systems Semantics Vector spaces Experimental system Morphological analysis NAtural language processing Relevant documents Retrieval effectiveness Retrieval process Semantic technologies Vector space models Information retrieval |
spellingShingle |
Artificial intelligence Information retrieval Natural language processing Algorithms Artificial intelligence Information retrieval systems Natural language processing systems Semantics Vector spaces Experimental system Morphological analysis NAtural language processing Relevant documents Retrieval effectiveness Retrieval process Semantic technologies Vector space models Information retrieval Sembok T.M.T. Ata B.A. Arabic word stemming algorithms and retrieval effectiveness |
description |
Documents retrieval in Information Retrieval Systems (IRS) is generally about retrieving of relevant documents pertaining to information needs. The more the system able to understand the contents of documents the more effective will be the retrieval outcomes. But understanding of the contents is a very complex task. Conventional IRS applies algorithms that can only approximate the meaning of document contents through keywords approach using vector space model. Keywords may be unstemmed or stemmed. When keywords are stemmed and conflated in retrieval process, we are a step forwards in applying semantic technology in IRS. Word stemming is a process in morphological analysis under natural language processing, before syntactic and semantic analysis. We have developed algorithms for Arabic stemming and incorporated it in our experimental system in order to measure retrieval effectiveness. The results have shown that the retrieval effectiveness has increased when stemming is used. |
author2 |
9268900400 |
author_facet |
9268900400 Sembok T.M.T. Ata B.A. |
format |
Conference Paper |
author |
Sembok T.M.T. Ata B.A. |
author_sort |
Sembok T.M.T. |
title |
Arabic word stemming algorithms and retrieval effectiveness |
title_short |
Arabic word stemming algorithms and retrieval effectiveness |
title_full |
Arabic word stemming algorithms and retrieval effectiveness |
title_fullStr |
Arabic word stemming algorithms and retrieval effectiveness |
title_full_unstemmed |
Arabic word stemming algorithms and retrieval effectiveness |
title_sort |
arabic word stemming algorithms and retrieval effectiveness |
publishDate |
2023 |
_version_ |
1806426083849928704 |