Using text mining for information extraction / Saliza Ramly
The growth of the Internet and the availability of very large amounts of documents online that contain valuable information, have caused the need for tools to assist the users to extract the relevant information from the bundle of information without having to read them all, and also to retrieve...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
Faculty of Computer and Mathematical Sciences
2007
|
Subjects: | |
Online Access: | http://ir.uitm.edu.my/id/eprint/956/1/TD_SALIZA%20RAMLY%20CS%2007_5%20P01.pdf http://ir.uitm.edu.my/id/eprint/956/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Mara |
Language: | English |
id |
my.uitm.ir.956 |
---|---|
record_format |
eprints |
spelling |
my.uitm.ir.9562018-11-07T08:12:06Z http://ir.uitm.edu.my/id/eprint/956/ Using text mining for information extraction / Saliza Ramly Ramly, Saliza Electronic Computers. Computer Science The growth of the Internet and the availability of very large amounts of documents online that contain valuable information, have caused the need for tools to assist the users to extract the relevant information from the bundle of information without having to read them all, and also to retrieve it in a fast and effective. An e-mail is composed of date, e-mail address, subject, body of the e-mail, and so on. It is possible for the body to include pictures, sounds, and programs, but usually the body is mainly composed of textual data. Thus, it is possible to use text mining techniques in order to analyze e-mails. The research focuses on the email of students in Faculty of Information Technology and Quantitative Sciences (FTMSK). There are three objectives of the research that have been achieved. The survey was conducted to achieve the first objective. The second objective was achieved through content analysis and website observation. Researcher was identified the basic techniques that usually used and tabulate it in form of table. A number of organizations that have been done some development on text miner as their commercial product also have been identified. Finally, the third objective of the research was achieved through the development of a tool using text mining techniques. Furthermore, the Prototyping Methodology is chosen in order to develop the system. The researcher identified appropriate techniques from the past researches and existing text mining tool. As a result, categorization, clustering and summarization techniques was selected and applied for Text Mining Application Tool, TMAT development. Faculty of Computer and Mathematical Sciences 2007 Thesis NonPeerReviewed text en http://ir.uitm.edu.my/id/eprint/956/1/TD_SALIZA%20RAMLY%20CS%2007_5%20P01.pdf Ramly, Saliza (2007) Using text mining for information extraction / Saliza Ramly. Degree thesis, Universiti Teknologi MARA. |
institution |
Universiti Teknologi Mara |
building |
Tun Abdul Razak Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Mara |
content_source |
UiTM Institutional Repository |
url_provider |
http://ir.uitm.edu.my/ |
language |
English |
topic |
Electronic Computers. Computer Science |
spellingShingle |
Electronic Computers. Computer Science Ramly, Saliza Using text mining for information extraction / Saliza Ramly |
description |
The growth of the Internet and the availability of very large amounts of documents
online that contain valuable information, have caused the need for tools to assist the
users to extract the relevant information from the bundle of information without
having to read them all, and also to retrieve it in a fast and effective. An e-mail is
composed of date, e-mail address, subject, body of the e-mail, and so on. It is possible
for the body to include pictures, sounds, and programs, but usually the body is mainly
composed of textual data. Thus, it is possible to use text mining techniques in order to
analyze e-mails. The research focuses on the email of students in Faculty of
Information Technology and Quantitative Sciences (FTMSK). There are three
objectives of the research that have been achieved. The survey was conducted to
achieve the first objective. The second objective was achieved through content
analysis and website observation. Researcher was identified the basic techniques that
usually used and tabulate it in form of table. A number of organizations that have been
done some development on text miner as their commercial product also have been
identified. Finally, the third objective of the research was achieved through the
development of a tool using text mining techniques. Furthermore, the Prototyping
Methodology is chosen in order to develop the system. The researcher identified
appropriate techniques from the past researches and existing text mining tool. As a
result, categorization, clustering and summarization techniques was selected and
applied for Text Mining Application Tool, TMAT development. |
format |
Thesis |
author |
Ramly, Saliza |
author_facet |
Ramly, Saliza |
author_sort |
Ramly, Saliza |
title |
Using text mining for information extraction / Saliza Ramly |
title_short |
Using text mining for information extraction / Saliza Ramly |
title_full |
Using text mining for information extraction / Saliza Ramly |
title_fullStr |
Using text mining for information extraction / Saliza Ramly |
title_full_unstemmed |
Using text mining for information extraction / Saliza Ramly |
title_sort |
using text mining for information extraction / saliza ramly |
publisher |
Faculty of Computer and Mathematical Sciences |
publishDate |
2007 |
url |
http://ir.uitm.edu.my/id/eprint/956/1/TD_SALIZA%20RAMLY%20CS%2007_5%20P01.pdf http://ir.uitm.edu.my/id/eprint/956/ |
_version_ |
1685648111697920000 |