Bioinformatics Vol. 18 no. 2 2002
Pages 235-243
© 2002 Oxford University Press
A data-mining approach to spacer oligonucleotide typing of Mycobacterium tuberculosis
1 French West Indies and Guiana University,
TRIVIA, Department of Mathematics and Computer Science, Campus
Fouillole, 97159 Pointe-à-Pitre
Cedex, Guadeloupe
2 Unitéde la
Tuberculose et des Mycobactéries,
Institut Pasteur de Guadeloupe, BP 484, F-97165
Pointe-à-Pitre Cedex, Guadeloupe
Received on February 9, 2001
; revised on June 25, 2001
; accepted on October 9, 2001
Motivation: The Direct Repeat (DR) locus of Mycobacterium tuberculosis is a suitable model to study (i) molecular epidemiology and (ii) the evolutionary genetics of tuberculosis. This is achieved by a DNA analysis technique (genotyping), called sp acer oligo nucleotide typing (spoligotyping ). In this paper, we investigated data analysis methods to discover intelligible knowledge rules from spoligotyping, that has not yet been applied on such representation. This processing was achieved by applying the C4.5 induction algorithm and knowledge rules were produced. Finally, a Prototype Selection (PS) procedure was applied to eliminate noisy data. This both simplified decision rules, as well as the number of spacers to be tested to solve classification tasks. In the second part of this paper, the contribution of 25 new additional spacers and the knowledge rules inferred were studied from a machine learning point of view. From a statistical point of view, the correlations between spacers were analyzed and suggested that both negative and positive ones may be related to potential structural constraints within the DR locus that may shape its evolution directly or indirectly.
Results: By generating knowledge rules induced from decision trees, it was shown that not only the expert knowledge may be modeled but also improved and simplified to solve automatic classification tasks on unknown patterns. A practical consequence of this study may be a simplification of the spoligotyping technique, resulting in a reduction of the experimental constraints and an increase in the number of samples processed.
Contact: csola{at}pasteur.gp
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. L. Gibson, R. C. Huard, N. C. Gey van Pittius, L. C. O. Lazzarini, J. Driscoll, N. Kurepina, T. Zozio, C. Sola, S. M. Spindola, A. L. Kritski, et al. Application of Sensitive and Specific Molecular Methods To Uncover Global Dissemination of the Major RDRio Sublineage of the Latin American-Mediterranean Mycobacterium tuberculosis Spoligotype Family J. Clin. Microbiol., April 1, 2008; 46(4): 1259 - 1267. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Dubiley, E. Kirillov, A. Ignatova, V. Stepanshina, and I. Shemyakin Molecular Characteristics of the Mycobacterium tuberculosis LAM-RUS Family Prevalent in Central Russia J. Clin. Microbiol., December 1, 2007; 45(12): 4036 - 4038. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Streicher, T. C. Victor, G. van der Spuy, C. Sola, N. Rastogi, P. D. van Helden, and R. M. Warren Spoligotype Signatures in the Mycobacterium tuberculosis Complex J. Clin. Microbiol., January 1, 2007; 45(1): 237 - 240. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al. Machine learning in bioinformatics Brief Bioinform, March 1, 2006; 7(1): 86 - 112. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Filliol, A. S. Motiwala, M. Cavatore, W. Qi, M. H. Hazbon, M. Bobadilla del Valle, J. Fyfe, L. Garcia-Garcia, N. Rastogi, C. Sola, et al. Global Phylogeny of Mycobacterium tuberculosis Based on Single Nucleotide Polymorphism (SNP) Analysis: Insights into Tuberculosis Evolution, Phylogenetic Accuracy of Other DNA Fingerprinting Systems, and Recommendations for a Minimal Standard SNP Set J. Bacteriol., January 15, 2006; 188(2): 759 - 772. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Brudey, I. Filliol, S. Ferdinand, V. Guernier, P. Duval, B. Maubert, C. Sola, and N. Rastogi Long-Term Population-Based Genotyping Study of Mycobacterium tuberculosis Complex Isolates in the French Departments of the Americas J. Clin. Microbiol., January 1, 2006; 44(1): 183 - 191. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. M. Warren, T. C. Victor, E. M. Streicher, M. Richardson, G. D. van der Spuy, R. Johnson, V. N. Chihota, C. Locht, P. Supply, and P. D. van Helden Clonal Expansion of a Globally Disseminated Lineage of Mycobacterium tuberculosis with Low IS6110 Copy Numbers J. Clin. Microbiol., December 1, 2004; 42(12): 5774 - 5782. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. C. Victor, P. E. W. de Haas, A. M. Jordaan, G. D. van der Spuy, M. Richardson, D. van Soolingen, P. D. van Helden, and R. Warren Molecular Characteristics and Global Spread of Mycobacterium tuberculosis with a Western Cape F11 Genotype J. Clin. Microbiol., February 1, 2004; 42(2): 769 - 772. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. M. Streicher, R. M. Warren, C. Kewley, J. Simpson, N. Rastogi, C. Sola, G. D. van der Spuy, P. D. van Helden, and T. C. Victor Genotypic and Phenotypic Characterization of Drug-Resistant Mycobacterium tuberculosis Isolates from Rural Districts of the Western Cape Province of South Africa J. Clin. Microbiol., February 1, 2004; 42(2): 891 - 894. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Phyu, R. Jureen, T. Ti, U. R. Dahle, and H. M. S. Grewal Heterogeneity of Mycobacterium tuberculosis Isolates in Yangon, Myanmar J. Clin. Microbiol., October 1, 2003; 41(10): 4907 - 4908. [Full Text] [PDF] |
||||
![]() |
I. Filliol, J. R. Driscoll, D. van Soolingen, B. N. Kreiswirth, K. Kremer, G. Valetudie, D. D. Anh, R. Barlow, D. Banerjee, P. J. Bifani, et al. Snapshot of Moving and Expanding Clones of Mycobacterium tuberculosis and Their Global Distribution Assessed by Spoligotyping in an International Study J. Clin. Microbiol., May 1, 2003; 41(5): 1963 - 1970. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. W. Dale, H. Al-Ghusein, S. Al-Hashmi, P. Butcher, A. L. Dickens, F. Drobniewski, K. J. Forbes, S. H. Gillespie, D. Lamprecht, T. D. McHugh, et al. Evolutionary Relationships among Strains of Mycobacterium tuberculosis with Few Copies of IS6110 J. Bacteriol., April 15, 2003; 185(8): 2555 - 2562. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ferdinand, C. Sola, B. Verdol, E. Legrand, K. S. Goh, M. Berchel, A. Aubery, M. Timothee, P. Joseph, J. W. Pape, et al. Molecular Characterization and Drug Resistance Patterns of Strains of Mycobacterium tuberculosis Isolated from Patients in an AIDS Counseling Center in Port-au-Prince, Haiti: a 1-Year Study J. Clin. Microbiol., February 1, 2003; 41(2): 694 - 702. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Mokrousov, O. Narvskaya, T. Otten, E. Limeschenko, L. Steklova, and B. Vyshnevskiy High Prevalence of KatG Ser315Thr Substitution among Isoniazid-Resistant Mycobacterium tuberculosis Clinical Isolates from Northwestern Russia, 1996 to 2001 Antimicrob. Agents Chemother., May 1, 2002; 46(5): 1417 - 1424. [Abstract] [Full Text] [PDF] |
||||



