Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (58)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Pavy, N.
Right arrow Articles by Rouzé, P.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pavy, N.
Right arrow Articles by Rouzé, P.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 15 no. 11 1999
Pages 887-899
© 1999 Oxford University Press

Evaluation of gene prediction software using a genomic data set: application to Arabidopsis thalianasequences

Nathalie Pavy 1, Stephane Rombauts 2, Patrice Déhais 2, Catherine Mathé 2, Davuluri V. V.Ramana 2, Philippe Leroy 1 and Pierre Rouzé 1

1 Laboratoire associéde l’INRA(France) ,
2 Department of Plant Genetics, Flanders Interuniversity Institute of Biotechnology, Universiteit Gent, K.L. Ledeganckstraat, 35, Belgium

Davuluri V. V. Ramana: On leave from Avesthagen Graine Technologies, Plant Genome Biology Laboratory, P.O. Box 5091, Cubbon Park GPO, Bangalore-560001, India. Present address: CSHL, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA.

Philippe Leroy: Present address: Station INRA d’Amélioration des Plantes - Domaine de Crouelle 234 avenue du Brézet, 63039 Clermont-Ferrand cedex 2, France.

Pierre Rouz é

Motivation: The annotation of the Arabidopsis thalianagenome remains a problem in terms of time and quality. To improve the annotation process, we want to choose the most appropriate tools to use inside a computer-assisted annotation platform. We therefore need evaluation of prediction programs with Arabidopsis sequences containing multiple genes.

Results: We have developed AraSet, a data set of contigs of validated genes, enabling the evaluation of multi-gene models for the Arabidopsis genome. Besides conventional metrics to evaluate gene prediction at the site and the exon levels, new measures were introduced for the prediction at the protein sequence level as well as for the evaluation of gene models. This evaluation method is of general interest and could apply to any new gene prediction software and to any eukaryotic genome. The GeneMark.hmm program appears to be the most accurate software at all three levels for the Arabidopsis genomic sequences. Gene modeling could be further improved by combination of prediction software.

Availability: The AraSet sequence set, the Perl programs and complementary results and notes are available at http://sphinx.rug.ac.be:8080/biocomp/napav/.

Contact: Pierre.Rouze{at}gengenp.rug.ac.be


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
A. Lomsadze, V. Ter-Hovhannisyan, Y. O. Chernoff, and M. Borodovsky
Gene identification in novel eukaryotic genomes by self-training algorithm
Nucleic Acids Res., November 28, 2005; 33(20): 6494 - 6506.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
S. Wu, M. A. Schoenbeck, B. T. Greenhagen, S. Takahashi, S. Lee, R. M. Coates, and J. Chappell
Surrogate Splicing for Functional Analysis of Sesquiterpene Synthase Genes
Plant Physiology, July 1, 2005; 138(3): 1322 - 1333.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. Degroeve, Y. Saeys, B. De Baets, P. Rouze, and Y. Van de Peer
SpliceMachine: predicting splice sites from high-dimensional local context representations
Bioinformatics, April 15, 2005; 21(8): 1332 - 1338.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. S. Katari, V. Balija, R. K. Wilson, R. A. Martienssen, and W. R. McCombie
Comparing low coverage random shotgun sequence data from Brassica oleracea and Oryza sativa genome sequence for their ability to add to the annotation of Arabidopsis thaliana
Genome Res., April 1, 2005; 15(4): 496 - 504.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Aubourg, V. Brunaud, C. Bruyere, M. Cock, R. Cooke, A. Cottet, A. Couloux, P. Dehais, G. Deleage, A. Duclert, et al.
GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D641 - D646.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. D. Schlueter, Q. Dong, and V. Brendel
GeneSeqer@PlantGDB: gene structure prediction in plant genomes
Nucleic Acids Res., July 1, 2003; 31(13): 3597 - 3600.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Foissac, P. Bardou, A. Moisan, M.-J. Cros, and T. Schiex
EUGENE'HOM: a generic similarity-based gene finder using multiple homologous sequences
Nucleic Acids Res., July 1, 2003; 31(13): 3742 - 3745.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
W. Zhu, S. D. Schlueter, and V. Brendel
Refined Annotation of the Arabidopsis Genome by Complete Expressed Sequence Tag Mapping
Plant Physiology, June 1, 2003; 132(2): 469 - 484.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. L. Crowe, C. Serizet, V. Thareau, S. Aubourg, P. Rouze, P. Hilson, J. Beynon, P. Weisbeek, P. van Hummelen, P. Reymond, et al.
CATMA: a complete Arabidopsis GST database
Nucleic Acids Res., January 1, 2003; 31(1): 156 - 158.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. T. Morishige, K. L. Childs, L. D. Moore, and J. E. Mullet
Targeted Analysis of Orthologous Phytochrome A Regions of the Sorghum, Maize, and Rice Genomes using Comparative Gene-Island Sequencing
Plant Physiology, December 1, 2002; 130(4): 1614 - 1625.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
K. Vandepoele, Y. Saeys, C. Simillion, J. Raes, and Y. Van de Peer
The Automatic Detection of Homologous Regions (ADHoRe) and Its Application to Microcolinearity Between Arabidopsis and Rice
Genome Res., November 1, 2002; 12(11): 1792 - 1801.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. Mathe, M.-F. Sagot, T. Schiex, and P. Rouze
Current methods of gene prediction, their strengths and weaknesses
Nucleic Acids Res., October 1, 2002; 30(19): 4103 - 4117.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
F. M. Ausubel
Summaries of National Science Foundation-Sponsored Arabidopsis 2010 Projects and National Science Foundation-Sponsored Plant Genome Projects That Are Generating Arabidopsis Resources for the Community
Plant Physiology, June 1, 2002; 129(2): 394 - 437.
[Full Text] [PDF]


Home page
Plant CellHome page
S. Bensmihen, S. Rippa, G. Lambert, D. Jublot, V. Pautot, F. Granier, J. Giraudat, and F. Parcy
The Homologous ABI5 and EEL Transcription Factors Function Antagonistically to Fine-Tune Gene Expression during Late Embryogenesis
PLANT CELL, June 1, 2002; 14(6): 1391 - 1403.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
K. Vandepoele, J. Raes, L. De Veylder, P. Rouze, S. Rombauts, and D. Inze
Genome-Wide Analysis of Core Cell Cycle Genes in Arabidopsis
PLANT CELL, April 1, 2002; 14(4): 903 - 916.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
G. C. MacIntosh, C. Wilkerson, and P. J. Green
Identification and Analysis of Arabidopsis Expressed Sequence Tags Characteristic of Non-Coding RNAs
Plant Physiology, November 1, 2001; 127(3): 765 - 776.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Ashburner
A Biologist's View of the Drosophila Genome Annotation Assessment Project
Genome Res., April 1, 2000; 10(4): 391 - 393.
[Full Text]


Home page
Genome ResHome page
A. Louis, E. Ollivier, J.-C. Aude, and J.-L. Risler
Massive Sequence Comparisons as a Help in Annotating Genomic Sequences
Genome Res., July 1, 2001; 11(7): 1296 - 1303.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
N. Boudet, S. Aubourg, C. Toffano-Nioche, M. Kreis, and A. Lecharny
Evolution of Intron/Exon Structure of DEAD Helicase Family Genes in Arabidopsis, Caenorhabditis, and Drosophila
Genome Res., December 1, 2001; 11(12): 2101 - 2114.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.