Bioinformatics Advance Access published online on March 7, 2006
Bioinformatics, doi:10.1093/bioinformatics/btl080
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 James Hogg iCAPTURE Centre for Cardiovascular and Pulmonary Research, St. Paul's Hospital, University of British Columbia, Vancouver, V6Z 1Y6, Canada
* To whom correspondence should be addressed.
Summary: MACGT is a Java application that clusters complex multi-dimensional vector data derived from single nucleotide polymorphism (SNP) genotyping experiments using mini-sequencing based microarray chemistries such as arrayed primer extension (APEX). Spot intensity output files from microarray experiments across multiple samples are imported into MACGT. The data sets can include four channels of intensity data for each spot, replica spots for each SNP probe, and multiple probe types (APEX and allele-specific [AS] APEX probes) on both DNA strands for each SNP. MACGT automatically clusters these multi-dimensionality data sets for each SNP across multiple samples. Incorporation of additional array data sets from known samples that have previously validated SNP genotype calls, allows unknown samples to be automatically assigned a genotype based on the clustering, along with numerical measures of confidence for each genotype call. Calling accuracy by MACGT exceeds 98% when applied to genotyping data from APEX microarrays, and can be increased to >99.5% by applying thresholds to the confidence measures. Availability: MACGT is open source and is freely available (under a GNU General Public License) from the iCAPTURE Centre web site, http://www.mrl.ubc.ca/who/who_bios_scott_tebbutt.shtml. Supplementary information: Additional information, including Supplementary Figure S1, test data and a user's manual, is available from the iCAPTURE Centre web site (see above).
Received January 13, 2006
Revised February 16, 2006
Accepted March 2, 2006
Applications note
MACGT: multi-dimensional automated clustering genotyping tool for analysis of microarray-based mini-sequencing data
David C. Walley 1,
Ben W. Tripp 1,
Young C. Song 1,
Keith R. Walley 1,
and
Scott J. Tebbutt 1 *
Scott J. Tebbutt, E-mail: stebbutt{at}mrl.ubc.ca
![]()
Abstract
Associate Editor: Martin Bishop
![]()
CiteULike
Connotea
Del.icio.us What's this?