Skip Navigation


Bioinformatics Advance Access originally published online on April 8, 2004
Bioinformatics 2004 20(15):2479-2481; doi:10.1093/bioinformatics/bth261
This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow All Versions of this Article:
20/15/2479    most recent
bth261v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (45)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Frank, E.
Right arrow Articles by Witten, I. H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Frank, E.
Right arrow Articles by Witten, I. H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics 20(15) © Oxford University Press 2004; all rights reserved.

Applications Note

Data mining in bioinformatics using Weka

Eibe Frank 1,*, Mark Hall 1, Len Trigg 2, Geoffrey Holmes 1 and Ian H. Witten 1

1 Department of Computer Science, University of Waikato, Private Bag 3105, Hamilton, New Zealand and 2 Reel Two, PO Box 1538, Hamilton, New Zealand

Received on December 3, 2003; revised on February 3, 2004; accepted on February 26, 2004
Advance Access Publication April 8, 2004

Summary: The Weka machine learning workbench provides a general-purpose environment for automatic classification, regression, clustering and feature selection—common data mining problems in bioinformatics research. It contains an extensive collection of machine learning algorithms and data pre-processing methods complemented by graphical user interfaces for data exploration and the experimental comparison of different machine learning techniques on the same problem. Weka can process data given in the form of a single relational table. Its main objectives are to (a) assist users in extracting useful information from data and (b) enable them to easily identify a suitable algorithm for generating an accurate predictive model from it.

Availability: http://www.cs.waikato.ac.nz/ml/weka

Contact: eibe{at}cs.waikato.ac.nz

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
M. Masso and I. I. Vaisman
Accurate prediction of stability changes in protein mutants by combining machine learning with structure based computational mutagenesis
Bioinformatics, September 15, 2008; 24(18): 2002 - 2009.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
P. Sonego, A. Kocsor, and S. Pongor
ROC analysis: applications to the classification of biological sequences and 3D structures
Brief Bioinform, May 1, 2008; 9(3): 198 - 209.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
M. O. Hoque, M. S. Kim, K. L. Ostrow, J. Liu, G. B. A. Wisman, H. L. Park, M. L. Poeta, C. Jeronimo, R. Henrique, A. Lendvai, et al.
Genome-Wide Promoter Analysis Uncovers Portions of the Cancer Methylome
Cancer Res., April 15, 2008; 68(8): 2661 - 2670.
[Abstract] [Full Text] [PDF]


Home page
Cancer Res.Home page
A. Jimeno, A. C. Tan, J. Coffa, N.V. Rajeshkumar, P. Kulesza, B. Rubio-Viqueira, J. Wheelhouse, B. Diosdado, W. A. Messersmith, C. Iacobuzio-Donahue, et al.
Coordinated Epidermal Growth Factor Receptor Pathway Gene Overexpression Predicts Epidermal Growth Factor Receptor Inhibitor Sensitivity in Pancreatic Cancer
Cancer Res., April 15, 2008; 68(8): 2841 - 2849.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Masso and I. I. Vaisman
Accurate prediction of enzyme mutant activity based on a multibody statistical potential
Bioinformatics, December 1, 2007; 23(23): 3155 - 3161.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Microbiol.Home page
C. L. Moore, J. A. Smagala, C. B. Smith, E. D. Dawson, N. J. Cox, R. D. Kuchta, and K. L. Rowlen
Evaluation of MChip with Historic Subtype H1N1 Influenza A Viruses, Including the 1918 "Spanish Flu" Strain
J. Clin. Microbiol., November 1, 2007; 45(11): 3807 - 3810.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Gewehr, M. Szugat, and R. Zimmer
BioWeka extending the Weka framework for bioinformatics
Bioinformatics, March 1, 2007; 23(5): 651 - 653.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
M. Maziarz, C. Chung, D. J. Drucker, and A. Emili
Integrating Global Proteomic and Genomic Expression Profiles Generated from Islet {alpha} Cells: Opportunities and Challenges to Deriving Reliable Biological Inferences
Mol. Cell. Proteomics, April 1, 2005; 4(4): 458 - 474.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Balasubramanian, Y. Xia, E. Freinkman, and M. Gerstein
Sequence variation in G-protein-coupled receptors: analysis of single nucleotide polymorphisms
Nucleic Acids Res., March 22, 2005; 33(5): 1710 - 1721.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.