Classifying biomedical citations without labeled training examples

In this paper we introduce a novel technique for classifying text citations without labeled training examples. We first utilize the search results of a general search engine as original training data. We then proposed a mutually reinforcing learning algorithm (MRL) to mine the classification knowled...

Full description

Saved in:

Bibliographic Details
Main Authors:	Li X., Joshi R., Ramachandaran S., Tze-Yun LEONG
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2004
Subjects:	Artificial Intelligence and Robotics Health Information Technology
Online Access:	https://ink.library.smu.edu.sg/sis_research/3005
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Description
Summary:	In this paper we introduce a novel technique for classifying text citations without labeled training examples. We first utilize the search results of a general search engine as original training data. We then proposed a mutually reinforcing learning algorithm (MRL) to mine the classification knowledge and to "clean" the training data. With the help of a set of established domain-specific ontological terms or keywords, the MRL mining step derives the relevant classification knowledge. The MRL cleaning step then builds a Naive Bayes classifier based on the mined classification knowledge and tries to clean the training set. The MRL algorithm is iteratively applied until a clean training set is obtained. We show the effectiveness of the proposed technique in the classification of biomedical citations from a large medical literature database. © 2004 IEEE.

Classifying biomedical citations without labeled training examples

Similar Items