Skip Navigation


Bioinformatics Advance Access originally published online on June 22, 2007
Bioinformatics 2007 23(16):2188-2189; doi:10.1093/bioinformatics/btm315
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/16/2188    most recent
btm315v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rezvoy, C.
Right arrow Articles by Marais, G. A.B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rezvoy, C.
Right arrow Articles by Marais, G. A.B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

MareyMap: an R-based tool with graphical interface for estimating recombination rates

Clément Rezvoy , Delphine Charif , Laurent Guéguen and Gabriel A.B. Marais *

Université de Lyon; Université Lyon 1; Centre National de la Recherche Scientifique; UMR5558; Laboratoire de Biométrie et Biologie évolutive; Villeurbanne, F-69622 cedex, France

*To whom correspondence should be addressed.


    ABSTRACT
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 DESCRIPTION
 REFERENCES
 

Summary: Comparing genetic and physical maps (the so-called Marey map approach) is still the most widely used approach to estimate genome-wide recombination rates. Remarkably, there is no available bioinformatics tool specifically devoted to Marey map approach. Here, we developed such a tool called MareyMap based on GNU R and Tcl/Tk. MareyMap offers a user-friendly graphical interface and includes useful features, such as data cleaning process, sophisticated interpolation methods to estimate local rates, possibility of complex queries, various range of import–export files. Moreover, MareyMap comes with ready-to-use maps for human, Drosophila, Caenorhabditis elegans and Arabidopsis. MareyMap has been made so that it can be easily upgraded with new data and interpolation methods.

Availability: http://pbil.univ-lyon1.fr/software/mareymap/

Contact: marais{at}biomserv.univ-lyon1.fr


    1 INTRODUCTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 DESCRIPTION
 REFERENCES
 
Recent advances in genomics have shown that genome organization is influenced by meiotic recombination rates in Eukaryotes (see for instance Bartolomé et al., 2002; Carvalho and Clark, 1999; Marais et al., 2001; Pal and Hurst, 2003, Presgraves, 2005; Wright et al., 2003). The most striking example of this is probably the non-recombining Y chromosome that is very different (i.e. chromosome size, gene content) from recombining X chromosome although these two chromosomes have the same autosomal ancestry and were identical at the early stage of their evolution (see Charlesworth et al., 2005 for review). More generally, non-recombining regions of the genome have a different structure (i.e. gene density, GC content and evolutionary rates) compared to recombining ones.

In the vast majority of the genome-analysis papers cited above, estimates of recombination rate have been obtained using Marey map approach. The Marey map approach has been proposed by Chakravarti (1991). Marey was a French scientist who popularized in his book ‘La méthode graphique’ a method developed by a rail engineer called Ibry. This method compared the distance in kilometers between two stations with the time needed to travel by train from one station to another with the idea of improving global train circulation between Paris and Lyon (Marey, 1885). Chakravarti used this approach to compare physical and genetic maps. Historically, genetic maps were the first maps for chromosomes. They stem on the fact that recombination frequency between two genes increases with the physical distance between these two genes. But genetic maps can be used for another purpose: if one fits a curve to the plot of genetic versus physical distances for a given chromosome then the slope of that curve will give the local recombination rates along that particular chromosome.

Although other approaches can in principles give genome-wide estimates for recombination rate (see for instance Myers et al., 2005; Stumpf and McVean, 2003), Marey map is still considered the more convenient approach to get such estimates (Gaut et al., 2007). Indeed, Marey map is fast, unbiased and genetic and physical maps are available for a relatively large number of organisms. Intriguingly, there are almost no tools to automatically deal with the comparison of genetic and physical maps. As far as we know, there is only one tool comparing physical and genetic maps (cartographer, Voigt et al., 2004) and this tool has not been made to estimate recombination rates. It is actually a converter between the two types of distances (genetic and physical) used in biomedical research (Voigt et al., 2004).


    2 IMPLEMENTATION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 DESCRIPTION
 REFERENCES
 
Our work aimed at developing a tool to estimate recombination rates using the Marey map approach. We wanted that tool—we called it MareyMap—to be specifically devoted to estimating recombination rates (not as cartographer, see above), to have a user-friendly graphical interface (because Marey Map is a graphical method), to have sophisticated interpolation methods (to get slopes), to allow complex queries (to retrieve estimates at particular positions) and to have a good upgradeability (regarding data and interpolation methods). We chose GNU R to implement MareyMap because R includes many statistical functions that we needed for our interpolation methods (e.g. loess, cubic splines). We used Tcl/Tk to build our graphical interface along with the R tkrplot library. However, MareyMap can also work without interface using R command lines only, which can be useful for advanced use (in a pipeline for instance). MareyMap is formed by several packages: MareyMap (work on Marey maps), MareyMapGUI (graphical interface) and one package for each interpolation method. A screen shot of MareyMap graphical interface is shown in Figure 1.


Figure 1
View larger version (41K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Fig. 1. MareyMap graphical interface. A selection of maps from different organisms (human, Drosophila, Caenorhabditis elegans and Arabidopsis) is available in the menu bar (Map sets). The Maps window shows the different maps of the chromosomes of a given species. MareyMap has a cleaning data function because some maps may contain aberrant markers. Two central windows show the Marey map plot of the selected chromosome and the recombination estimates using various interpolation methods, which can be selected in the interpolation window. Several parameters need to be set for the three available methods (sliding window, loess and cubic spline). The query window is for getting estimates for given positions. A text file containing positions can be uploaded. Various possibilities of file import export can be found in menu bar (in particular pdf files for plots). See http://pbil.univ-lyon1.fr/software/mareymap/ for details about how to install and to run MareyMap.

 

    3 DESCRIPTION
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 DESCRIPTION
 REFERENCES
 
MareyMap has several advantages. It comes with physical and genetic maps from different organisms: one vertebrate (Homo sapiens, three sets of maps: male, female and sex average, from Kong et al., 2002), two invertebrates (Drosophila melanogaster, from Flybase at http://flybase.bio.indiana.edu/ see Marais et al., 2001; C.elegans, from Wormbase at http://www.wormbase.org/ see Marais et al., 2001) and one plant (Arabidopsis thaliana, from NASC at http://arabidopsis.info/ see Wright et al., 2003). Importantly, the user can include other sets of maps from the same or other species very easily. This can be done by uploading a text file with species name, chromosomes, marker names, physical positions and genetic distances (see http://pbil.univ-lyon1.fr/software/mareymap/ for more details). MareyMap includes a (facultative) data cleaning process. Maps may contain errors (discrepancies between genetic and physical maps). In this case, it is possible to exclude some chosen markers temporarily or definitively. One strong advantage of MareyMap is the choice of different interpolation methods. It includes sliding window, which is the simplest and most widely used method. The idea is just sliding a window along a chromosome and getting the local estimate with the slope of the best line fit to the data in the local window. Parameters are window size and shift. We also offer more sophisticated methods: Loess, which is a sophisticated sliding window (a two-degree polynomial is fitted in each window) and cubic splines, which is probably the best method to estimate recombination rate with Marey map approach (Berloff et al., 2002; Yu et al., 2001). More methods can be included in MareyMap by the user. Different estimators can be compared visually (in recombination rates plot) and to the original data (Marey map plot). Recombination rates for some positions in the genome can be recovered using a query window and it is also possible to upload files of preselected positions, which is very useful for large-scale genome analysis. Possible file import–export include export of pdf for plots, import of new sets of maps (see above) and export of recombination rates estimates.

Conflict of Interest: none declared.


    FOOTNOTES
 
Associate Editor: Keith Crandall

Received on March 6, 2007; revised on May 14, 2007; accepted on June 6, 2007

    REFERENCES
 TOP
 ABSTRACT
 1 INTRODUCTION
 2 IMPLEMENTATION
 3 DESCRIPTION
 REFERENCES
 

    Bartolomé C, et al. On the abundance and distribution of transposable elements in the genome of Drosophila melanogaster. Mol. Biol. Evol (2002) 19:926–937.[Abstract/Free Full Text]

    Berloff N, et al. Spline methods for the comparison of physical and genetic maps. J. Comput. Biol (2002) 9:465–475.[CrossRef][Web of Science][Medline]

    Carvalho AB, Clark AG. Intron size and natural selection. Nature (1999) 401:344.[CrossRef][Medline]

    Chakravarti A. A graphical representation of genetic and physical maps: the Marey map. Genomics (1991) 11:219–222.[CrossRef][Web of Science][Medline]

    Charlesworth D, et al. Steps in the evolution of heteromorphic sex chromosomes. Heredity (2005) 95:118–128.[CrossRef][Web of Science][Medline]

    Gaut BS, et al. Recombination: an underappreciated factor in the evolution of plant genomes. Nat. Rev. Genet (2007) 8:77–84.[CrossRef][Web of Science][Medline]

    Kong A, et al. A high-resolution recombination map of the human genome. Nat. Genet (2002) 31:241–247.[CrossRef][Web of Science][Medline]

    Marais G, et al. Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes. Proc. Natl Acad. Sci. USA (2001) 98:5688–5692.[Abstract/Free Full Text]

    Marey EJ. La méthode graphique. (1885) Paris.

    Myers C, et al. A fine-scale map of recombination rates and hotspots across the human genome. Science (2005) 310:321–324.[Abstract/Free Full Text]

    Pal C, Hurst LD. Evidence for co-evolution of gene order and recombination rate. Nat. Genet (2003) 33:392–395.[CrossRef][Web of Science][Medline]

    Presgraves DC. Recombination enhances protein adaptation in Drosophila melanogaster. Curr. Biol (2005) 15:1651–1656.[CrossRef][Web of Science][Medline]

    Stumpf MP, McVean GA. Estimating recombination rates from population-genetic data. Nat. Rev. Genet (2003) 4:959–968.[CrossRef][Web of Science][Medline]

    Voigt C, et al. Non-linear conversion between genetic and physical chromosomal distances. Bioinformatics (2004) 20:1966–1967.[Abstract/Free Full Text]

    Wright SI, et al. Effects of recombination rate and gene density on transposable element distributions in Arabidopsis thaliana. Genome Res (2003) 13:1897–1903.[Abstract/Free Full Text]

    Yu A, et al. Comparison of human genetic and sequence-based physical maps. Nature (2001) 409:951–953.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow All Versions of this Article:
23/16/2188    most recent
btm315v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rezvoy, C.
Right arrow Articles by Marais, G. A.B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rezvoy, C.
Right arrow Articles by Marais, G. A.B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?