Comparative study of probability models for compound similarity searching
The quality of a chemical retrieval system heavily depends on its molecular similarity function which returns a similarity measurement between the target compound and each molecule in the collection. Compounds are sorted according to their similarity values with the query and those with high ranks a...
Saved in:
Main Authors: | , |
---|---|
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2005
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/10086/1/NaomieSalim2005_ComparativeStudyofProbabilityModels.pdf http://eprints.utm.my/id/eprint/10086/ http://dblp2.uni-trier.de/rec/bibtex/conf/cita/SalimM05 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
Language: | English |
Summary: | The quality of a chemical retrieval system heavily depends on its molecular similarity function which returns a similarity measurement between the target compound and each molecule in the collection. Compounds are sorted according to their similarity values with the query and those with high ranks are returned to the users. Most current chemical retrieval systems use the vector space model for similarity calculation. In this paper, the use of probability of relevance for compound retrieval is explored. It reports on the effectiveness of the probability model for compound similarity searching by using Binary Independence Model and Binary Dependence Madel on two different databases. The result based on fusion of queries for both models is also discussed. The results show that in all cases, Binary Independence Retrieval model performed better than Binary Dependence model. It is also found that fusion does not give better results than the un-fused queries. |
---|