Text mining with minimal human supervision

In a National Basketball Association match, head coaches often have to make good and timely decisions in the best interest of his team. To make good decisions, it is important that coaches know and recognize his players’ past performances and records.This report explores the various aspects in the c...

Full description

Saved in:
Bibliographic Details
Main Author: Ling, Hong Yao.
Other Authors: School of Computer Engineering
Format: Final Year Project
Language:English
Published: 2011
Subjects:
Online Access:http://hdl.handle.net/10356/44614
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In a National Basketball Association match, head coaches often have to make good and timely decisions in the best interest of his team. To make good decisions, it is important that coaches know and recognize his players’ past performances and records.This report explores the various aspects in the creation and implementation of the system. The main objective of this project is to develop a system, NBA Automated Extraction System (NAXS), which uses the text mining technology and it follows the manual approach of identifying data patterns. The system automatically crawls the NBA Web site to search for games as specified by the user and extract useful information from these games. The proposed system was evaluated for the precision of the extraction procedure though various tests. These tests comprise of data taken from 3 months and include the extraction of i) The number of games as well as ii) The extraction of individual player’s statistics. All the tests performed well with each having a percentage of above 99%. The average accuracy of the extraction procedure of NAXS based on the data taken from 3 months is 99.80%. In conclusion, NAXS proved to be almost as efficient as counting the data manually and the automation process is also much faster as compared to the manually counting process.