Skip Navigation


Bioinformatics Advance Access originally published online on November 30, 2006
Bioinformatics 2007 23(4):496-497; doi:10.1093/bioinformatics/btl608
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/4/496    most recent
btl608v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Müller, A.
Right arrow Articles by Kestler, H. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Müller, A.
Right arrow Articles by Kestler, H. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Visualization of genomic aberrations using Affymetrix SNP arrays

André Müller 1, Karlheinz Holzmann 2 and Hans A. Kestler 1,3,*

1 Internal Medicine I Robert-Koch-Strasse 8
2 Chip Facility/ZKF, University of Ulm D-89069 Ulm, Germany
3 Neural Information Processing, University of Ulm D-89069 Ulm, Germany

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 BACKGROUND
 2 FUNCTIONALITY
 REFERENCES
 

Motivation: DNA copy number aberrations are frequently found in different types of cancer. Recent developments of microarray-based approaches have broadened the knowledge on number and structure of such aberrations. High-density single nucleotide polymorphism (SNP) microarrays provide an extremely high resolution with up to 500 000 SNPs per genome. Owing to the enormous amount of data the detection of common aberrations in large datasets is a great challenge. We describe a novel open source software tool—IdeogramBrowser—which was specifically designed for use with the Affymetrix SNP arrays. It provides an interactive karyotypic visualization of multiple aberration profiles and direct links to GeneCards. Visualization of consensus regions together with gene representation allows the explorative assessment of the data.

Availability: IdeogramBrowser and its source code are freely available under a creative commons license and can be obtained from http://www.informatik.uni-ulm.de/ni/staff/HKestler/ideo/. IdeogramBrowser is a platform independent Java application.

Contact: hans.kestler{at}uni-ulm.de


    1 BACKGROUND
 TOP
 ABSTRACT
 1 BACKGROUND
 2 FUNCTIONALITY
 REFERENCES
 
Genetic aberrations of the DNA copy number level, such as deletions, amplifications and unbalanced translocations are among the main pathogenetic mechanisms underlying human genetic disorders. Analysis of chromosomal aberrations is particularly important in cancer, where amplification of oncogenes, cell cycle regulators and/or deletion of tumor suppressor genes are involved in the multistep process of cancer development. Rapid and accurate identification of such genetic imbalances is therefore an important factor for the understanding of physiological and pathophysiological processes and may serve as a starting point for new diagnostic methods or therapeutic strategies.

Single nucleotide polymorphism (SNP) arrays from Affymetrix are a widely used tool for the analysis of allelic variations and, additionally, for the detection of copy number aberrations. Besides genotyping, the hybridization signal intensity provides information about the copy number of the chromosomal segments by comparing these intensities to a control set of 110 healthy, ethnically diverse individuals (Wang et al., 2004). For the analysis of these data Affymetrix provides the Chromosome Copy Number Analysis Tool (CNAT) [http://www.affymetrix.com; Slater et al. (2005)].

Owing to the high number of data points (up to 500 000), the high noise in the data and the small area to display this information on screen, a visual identification of such aberrations is only practical if they span several megabasepairs. Currently the only solution for identification is to scroll through the datafiles or visually scan the single chromosomal representations at a high zoom level, as only one chromosome from a single case is visualized. Such a procedure is not practical for a large number of cases.

According to Huang et al. (2004) and Slater et al. (2005) it is necessary to discard data points below a specific threshold for the copy number estimation and above a certain P-value (single point or genomic smoothed average value). In addition to these thresholds it is also advisable to screen only for recurrent aberrant regions, as Sebat et al. (2004) and Iafrate et al. (2004) first described the existence of so-called copy number polymorphisms (CNPs). According to Sebat et al. (2004) two otherwise healthy individuals differ in about 11 CNPs on average. Therefore singular findings may be invalidated by CNPs and consensus region identification is of major importance. Our software tool allows the filtering via these thresholds (e.g. a higher threshold might be useful if the DNA is slightly degraded) and therefore expedites the identification of biological relevant regions (Fig. 1).


Figure 1
View larger version (67K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1 Visualization of the REF-103, MCF-7, SKBR3 and ZR7530 datasets (freely available from Affymetrix) with group size 3 and minimum length 0 is shown. Gene markers are shown on the right-hand side of the karyogram whereas all SNP locations of the selected sample are shown on the left-hand side. Gains (red) and losses (green) are visualized in the usual color coding.

 
Other software tools in this field include VAMP (La Rosa et al., 2006), a large software framework which is not freely available (only tests on a predefined set of examples are allowed), SNP-VISTA (Shah et al., 2005) which is mainly used in the analysis of sequence data and provides no ideogram representation, and the web-based tool SNPscan (Ting et al., 2006), which requires an installed Apache web-server if run as a local application. In contrast to these approaches the proposed software tool—IdeogramBrowser—is open source, easy to use and specifically designed for the import of the CNT result files of the Affymetrix CNAT. In addition, a tab-separated text format allows the use of external copy-number detection tools. Further file formats can be implemented through the open structure of the software.


    2 FUNCTIONALITY
 TOP
 ABSTRACT
 1 BACKGROUND
 2 FUNCTIONALITY
 REFERENCES
 
IdeogramBrowser is a platform independent Java application providing an interactive karyotypic representation which shows gain/loss or LOH markers together with genes. The karyogram and the genes are directly generated from local copies of the two NCBI mapview database files ideogram and seq_gene.md (ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens/mapview/) that can be updated accordingly (currently BUILD 36.2). On selecting a marker, overlapping genes are shown in a list (Fig. 1) with direct hyperlinks to http://www.genecards.org for explorative assessment. The visualization is based on the different marker types: LOH score (loss of heterozygosity), log2ratio, single point copy number (SPA), or genomes-smoothed analysis of the copy number (GSA for >10k chips).

Individual global thresholds can be chosen for each marker type. Multiple CNT files can be merged into a single sample which is required e.g. for the 100k split files. Two alternatives for merging two or more profiles can be chosen: resolution enhancement for interlocked SNP chips or reliability enhancement (differential mode). As the number of aberrations (log2ratios) is often too high a marker reduction method was included: only contiguous regions (gain, loss or LOH) of a minimum group size (default three adjacent SNPs of the same polarity) and a minimum sequence length are shown. In addition for GSA and SPA values an upper bound for the corresponding P-value can be defined. Multiple profiles can be shown in the diagram. They may originate from different types of SNP chips (e.g. 100k or 50k) or another ASCII file containing aberration information. Consensus regions can be identified by specifying the minimum number of samples which need to be of a certain state (gain, loss or LOH) in the given locus interval. Karyograms can be exported as JPEG, SVG [World Wide Web Consortium (W3C)] (scalable vector graphics format, which is especially suited for highquality prints), or PDF. Multiple files can be imported at once.


    Acknowledgments
 
This work is supported by the German Science Foundation, SFB 518, Project C5 (AM, HAK), Project B19 (KH) and the Stifterverband für die Deutsche Wissenschaft (HAK).

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Martin Bishop

Received on October 9, 2006; revised on November 14, 2006; accepted on November 23, 2006

    REFERENCES
 TOP
 ABSTRACT
 1 BACKGROUND
 2 FUNCTIONALITY
 REFERENCES
 

    Huang, J., et al. (2004) Whole genome DNA copy number changes identified by high density oligonucleotide arrays. Hum. Genomics, 1, 287–299[Medline].

    Iafrate, A.J., et al. (2004) Detection of large-scale variation in the human genome. Nat. Genet, . 36, 949–951[CrossRef][Web of Science][Medline].

    La Rosa, P., et al. (2006) VAMP: visualization and analysis of array-CGH, transcriptome and other molecular profiles. Bioinformatics, 22, 2066–2073[Abstract/Free Full Text].

    Sebat, J., et al. (2004) Large-scale copy number polymorphism in the human genome. Science, 305, 525–528[Abstract/Free Full Text].

    Shah, N., et al. (2005) SNP-VISTA: an interactive SNP visualization tool. BMC Bioinformatics, 6, 292[CrossRef][Medline].

    Slater, H.R., et al. (2005) High-resolution identification of chromosomal abnormalities using oligonucleotide arrays containing 116,204 SNPs. Am. J. Hum. Genet, . 77, 709–726[CrossRef][Web of Science][Medline].

    Ting, J.C., et al. (2006) Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan. BMC Bioinformatics, 7, 25[CrossRef][Medline].

    Wang, Z.C., et al. (2004) Loss of heterozygosity and its correlation with expression profiles in subclasses of invasive breast cancers. Cancer Res, . 64, 64–71[Abstract/Free Full Text].

    World Wide Web Consortium (W3C). Scalable Vector Graphics (SVG) 1.1 Specification.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/4/496    most recent
btl608v1
Right arrow Comments: Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when Comments are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Müller, A.
Right arrow Articles by Kestler, H. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Müller, A.
Right arrow Articles by Kestler, H. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?