Meaning representation in natural language processing

In this report, the semantic information in parse selection is analyzed. A Python software model was used to carry out feature engineering on semantic parsing results by parsers. The data used was from the SemCor corpus and WeScience corpus. Different types of semantic features were generated usin...

Full description

Saved in:
Bibliographic Details
Main Author: Tripathi Surabhita
Other Authors: School of Computer Engineering
Format: Final Year Project
Language:English
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/10356/59189
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In this report, the semantic information in parse selection is analyzed. A Python software model was used to carry out feature engineering on semantic parsing results by parsers. The data used was from the SemCor corpus and WeScience corpus. Different types of semantic features were generated using the model and training and testing was conducted using a maximum entropy model TADM. Error analysis was performed on the entire SemCor and WeScience corpus by reproducing the old results. Generalized features provide better parse selection accuracy than more specific features. Further Machine learning was performed using ELM, Extreme Machine Learning technique, to compare the parse ranking results with TADM. The key fundamental task is to understand the meaning of a word in a sentence and semantic relations between words, resolving ambiguities by considering context.