Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here for more information

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Journal of Biomolecular Screening
This Article
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
1087057105284334v1
11/2/138    most recent
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Web of Science (4)
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Fang, J.
Right arrow Articles by Georg, G. I.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Fang, J.
Right arrow Articles by Georg, G. I.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Support Vector Machines in HTS Data Mining: Type I MetAPs Inhibition Study

Jianwen Fang

Bioinformatics Core Facility, Information and Telecommunication Technology Center, University of Kansas, Lawrence

Yinghua Dong

Bioinformatics Core Facility, University of Kansas, Lawrence

Gerald H. Lushington

Bioinformatics Core Facility, Molecular Graphics and Modeling Laboratory, University of Kansas, Lawrence

Qi-Zhuang Ye

High Throughput Screening Laboratory, University of Kansas, Lawrence

Gunda I. Georg

High Throughput Screening Laboratory, University of Kansas, Lawrence

This article reports a successful application of support vector machines (SVMs) in mining high-throughput screening (HTS) data of a type I methionine aminopeptidases (MetAPs) inhibition study. A library with 43,736 small organic molecules was used in the study, and 1355 compounds in the library with 40% or higher inhibition activity were considered as active. The data set was randomly split into a training set and a test set (3:1 ratio). The authors were able to rank compounds in the test set using their decision values predicted by SVM models that were built on the training set. They defined a novel score PT50, the percentage of the test set needed to be screened to recover 50% of the actives, to measure the performance of the models. With carefully selected parameters, SVM models increased the hit rates significantly, and 50% of the active compounds could be recovered by screening just 7% of the test set. The authors found that the size of the training set played a significant role in the performance of the models. A training set with 10,000 member compounds is likely the minimum size required to build a model with reasonable predictive power.

Key Words: support vector machines • high-throughput screening • MetAP • machine learning

This version was published on March 1, 2006

Journal of Biomolecular Screening, Vol. 11, No. 2, 138-144 (2006)
DOI: 10.1177/1087057105284334


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?