Bioinformatics Advance Access published online on September 11, 2008
Bioinformatics, doi:10.1093/bioinformatics/btn481
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
MPI-LIT: A literature-curated dataset of microbial binary protein-protein interactions

1J Craig Venter Institute, Rockville, MD 20850, USA
2Indgen Life Technologies, Bangalore - 560 004, Karnataka, India
3Institute for Genetics, Forschungszentrum Karlsruhe, Karlsruhe, Germany
*To whom correspondence should be addressed. Dr. Seesandra Rajagopala, E-mail: raja{at}jcvi.org,rajgsv{at}gmail.com
| Abstract |
|---|
Prokaryotic protein-protein interactions are underrepresented in currently available databases. Here we describe a "gold standard" dataset (MPI-LIT) focusing on microbial binary protein-protein interactions and associated experimental evidence that we have manually curated from 813 abstracts and full texts that were selected from an initial set of 36,852 abstracts. The MPI-LIT dataset comprises 1,237 experimental descriptions that describe a non-redundant set of 746 interactions of which 659 (88%) are not reported in public databases. To estimate the curation quality, we compared our dataset with a union of microbial interaction data from IntAct, DIP, BIND and MINT. Among common abstracts, we achieve a sensitivity of up to 66% for interactions and 75% for PSI-MI annotations of experimental methods. Compared to other datasets, MPI-LIT has the lowest fraction of interaction experiments per abstract (0.9) and the highest coverage of strains (92) and scientific articles (813). We compared methods that evaluate functional interactions among proteins (such as genomic context or co-expression) which are implemented in the STRING database. Most of these methods discriminate well between functionally relevant protein interactions (MPI-LIT) and highthroughput data.
Availability: http://www.jcvi.org/mpidb/interaction.php?dbsource=MPILIT.
Contact: raja{at}jcvi.org
Associate Editor: Dr. Jonathan Wren
Current address: Crump Institute for Molecular Imaging, University of California, Los Angeles, California, United States of America
Received on July 2, 2008; revised on August 13, 2008; accepted on September 7, 2008
This article has been cited by other articles:
![]() |
J. Gu, Y. Wang, and T. Lilburn A Comparative Genomics, Network-Based Approach to Understanding Virulence in Vibrio cholerae J. Bacteriol., October 15, 2009; 191(20): 6262 - 6272. [Abstract] [Full Text] [PDF] |
||||
