<?xml version="1.0" encoding="ISO-8859-1"?>

<rdf:RDF
 xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
 xmlns="http://purl.org/rss/1.0/"
 xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/"
 xmlns:dc="http://purl.org/dc/elements/1.1/"
 xmlns:syn="http://purl.org/rss/1.0/modules/syndication/"
 xmlns:prism="http://purl.org/rss/1.0/modules/prism/"
 xmlns:admin="http://webns.net/mvcb/"
>

<channel rdf:about="http://bioinformatics.oxfordjournals.org">
<title>Bioinformatics - current issue</title>
<link>http://bioinformatics.oxfordjournals.org</link>
<description>Bioinformatics - RSS feed of current issue</description>
<prism:eIssn>1460-2059</prism:eIssn>
<prism:coverDisplayDate>15 July 2009</prism:coverDisplayDate>
<prism:publicationName>Bioinformatics</prism:publicationName>
<prism:issn>1367-4803</prism:issn>
<items>
 <rdf:Seq>
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1715?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1722?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1731?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1739?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1746?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1754?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1761?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1768?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1775?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1782?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1789?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1796?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1802?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1805?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1807?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1814?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1822?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1831?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1833?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1836?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1838?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1841?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1843?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1846?rss=1" />
 </rdf:Seq>
</items>
</channel>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1715?rss=1">
<title><![CDATA[Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1715?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Chromatin immunoprecipitation (ChIP) experiments followed by array hybridization, or ChIP-chip, is a powerful approach for identifying transcription factor binding sites (TFBS) and has been widely used. Recently, massively parallel sequencing coupled with ChIP experiments (ChIP-seq) has been increasingly used as an alternative to ChIP-chip, offering cost-effective genome-wide coverage and resolution up to a single base pair. For many well-studied TFs, both ChIP-seq and ChIP-chip experiments have been applied and their data are publicly available. Previous analyses have revealed substantial technology-specific binding signals despite strong correlation between the two sets of results. Therefore, it is of interest to see whether the two data sources can be combined to enhance the detection of TFBS.</p>
<p><b>Results:</b> In this work, hierarchical hidden Markov model (HHMM) is proposed for combining data from ChIP-seq and ChIP-chip. In HHMM, inference results from individual HMMs in ChIP-seq and ChIP-chip experiments are summarized by a higher level HMM. Simulation studies show the advantage of HHMM when data from both technologies co-exist. Analysis of two well-studied TFs, NRSF and CCCTC-binding factor (CTCF), also suggests that HHMM yields improved TFBS identification in comparison to analyses using individual data sources or a simple merger of the two.</p>
<p><b>Availability:</b> Source code for the software ChIPmeta is freely available for download at <inter-ref locator="http://www.umich.edu/~hwchoi/HHMMsoftware.zip" locator-type="url">http://www.umich.edu/~hwchoi/HHMMsoftware.zip</inter-ref>, implemented in C and supported on linux.</p>
<p><b>Contact:</b> <inter-ref locator="ghoshd@psu.edu" locator-type="email">ghoshd@psu.edu</inter-ref>; <inter-ref locator="qin@umich.edu" locator-type="email">qin@umich.edu</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp312/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Choi, H., Nesvizhskii, A. I., Ghosh, D., Qin, Z. S.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp312</dc:identifier>
<dc:title><![CDATA[Hierarchical hidden Markov model with application to joint analysis of ChIP-chip and ChIP-seq data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1721</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1715</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1722?rss=1">
<title><![CDATA[SOrt-ITEMS: Sequence orthology based approach for improved taxonomic estimation of metagenomic sequences]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1722?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b>One of the first steps in metagenomic analysis is the assignment of reads/contigs obtained from various sequencing technologies to their correct taxonomic bins. Similarity-based binning methods assign a read to a taxon/clade, based on the pattern of significant BLAST hits generated against sequence databases. Existing methods, which use bit-score as the sole parameter to ascertain the significance of BLAST hits, have limited specificity and accuracy of binning. A new binning algorithm, called SOrt-ITEMS is introduced, which addresses these limitations. The method uses alignment parameters besides the bit score to first identify an appropriate taxonomic level where the read can be assigned. An orthology-based approach is subsequently used by the method for the final assignment.</p>
<p><b>Results:</b>The performance of SOrt-ITEMS has been validated with reads simulating sequences from 454 and Sanger sequencing technologies. In addition, the taxonomic composition of the Sargasso Sea data set has been analyzed using SOrt-ITEMS. SOrt-ITEMS shows improved specificity and accuracy of assignments especially in simulated scenarios, wherein sequences corresponding to the source organism of the reads are absent in the reference database.</p>
<p><b>Availability:</b>SOrt-ITEMS software is available for download from: <inter-ref locator="http://metagenomics.atc.tcs.com/binning/SOrt-ITEMS" locator-type="url">http://metagenomics.atc.tcs.com/binning/SOrt-ITEMS</inter-ref>. No license is needed for academic and nonprofit use.</p>
<p><b>Contact:</b> <inter-ref locator="sharmila@atc.tcs.com" locator-type="email">sharmila@atc.tcs.com</inter-ref></p>
<p><b>Supplementary information:</b><inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp317/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Monzoorul Haque, M., Ghosh, T. S., Komanduri, D., Mande, S. S.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp317</dc:identifier>
<dc:title><![CDATA[SOrt-ITEMS: Sequence orthology based approach for improved taxonomic estimation of metagenomic sequences]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1730</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1722</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1731?rss=1">
<title><![CDATA[Data structures and compression algorithms for genomic sequence data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1731?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The continuing exponential accumulation of full genome data, including full diploid human genomes, creates new challenges not only for understanding genomic structure, function and evolution, but also for the storage, navigation and privacy of genomic data. Here, we develop data structures and algorithms for the efficient storage of genomic and other sequence data that may also facilitate querying and protecting the data.</p>
<p><b>Results:</b> The general idea is to encode only the differences between a genome sequence and a reference sequence, using absolute or relative coordinates for the location of the differences. These locations and the corresponding differential variants can be encoded into binary strings using various entropy coding methods, from fixed codes such as Golomb and Elias codes, to variables codes, such as Huffman codes. We demonstrate the approach and various tradeoffs using highly variables human mitochondrial genome sequences as a testbed. With only a partial level of optimization, 3615 genome sequences occupying 56 MB in GenBank are compressed down to only 167 KB, achieving a 345-fold compression rate, using the revised Cambridge Reference Sequence as the reference sequence. Using the consensus sequence as the reference sequence, the data can be stored using only 133 KB, corresponding to a 433-fold level of compression, roughly a 23% improvement. Extensions to nuclear genomes and high-throughput sequencing data are discussed.</p>
<p><b>Availability:</b> Data are publicly available from GenBank, the HapMap web site, and the MITOMAP database. Supplementary materials with additional results, statistics, and software implementations are available from <inter-ref locator="http://mammag.web.uci.edu/bin/view/Mitowiki/ProjectDNACompression" locator-type="url">http://mammag.web.uci.edu/bin/view/Mitowiki/ProjectDNACompression</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="pfbaldi@ics.uci.edu" locator-type="email">pfbaldi@ics.uci.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Brandon, M. C., Wallace, D. C., Baldi, P.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp319</dc:identifier>
<dc:title><![CDATA[Data structures and compression algorithms for genomic sequence data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1738</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1731</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1739?rss=1">
<title><![CDATA[ESG: extended similarity group method for automated protein function prediction]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1739?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Importance of accurate automatic protein function prediction is ever increasing in the face of a large number of newly sequenced genomes and proteomics data that are awaiting biological interpretation. Conventional methods have focused on high sequence similarity-based annotation transfer which relies on the concept of homology. However, many cases have been reported that simple transfer of function from top hits of a homology search causes erroneous annotation. New methods are required to handle the sequence similarity in a more robust way to combine together signals from strongly and weakly similar proteins for effectively predicting function for unknown proteins with high reliability.</p>
<p><b>Results:</b> We present the extended similarity group (ESG) method, which performs iterative sequence database searches and annotates a query sequence with Gene Ontology terms. Each annotation is assigned with probability based on its relative similarity score with the multiple-level neighbors in the protein similarity graph. We will depict how the statistical framework of ESG improves the prediction accuracy by iteratively taking into account the neighborhood of query protein in the sequence similarity space. ESG outperforms conventional PSI-BLAST and the protein function prediction (PFP) algorithm. It is found that the iterative search is effective in capturing multiple-domains in a query protein, enabling accurately predicting several functions which originate from different domains.</p>
<p><b>Availability:</b> ESG web server is available for automated protein function prediction at <inter-ref locator="http://dragon.bio.purdue.edu/ESG/" locator-type="url">http://dragon.bio.purdue.edu/ESG/</inter-ref></p>
<p><b>Contact:</b> <inter-ref locator="cspark@cau.ac.kr" locator-type="email">cspark@cau.ac.kr</inter-ref>; <inter-ref locator="dkihara@purdue.edu" locator-type="email">dkihara@purdue.edu</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp309/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Chitale, M., Hawkins, T., Park, C., Kihara, D.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp309</dc:identifier>
<dc:title><![CDATA[ESG: extended similarity group method for automated protein function prediction]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1745</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1739</prism:startingPage>
<prism:section>SEQUENCE ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1746?rss=1">
<title><![CDATA[Efficient computation of all perfect repeats in genomic sequences of up to half a gigabyte, with a case study on the human genome]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1746?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> There is a significant ongoing research to identify the number and types of repetitive DNA sequences. As more genomes are sequenced, efficiency and scalability in computational tools become mandatory. Existing tools fail to find distant repeats because they cannot accommodate whole chromosomes, but segments. Also, a quantitative framework for repetitive elements inside a genome or across genomes is still missing.</p>
<p><b>Results:</b> We present a new efficient algorithm and its implementation as a software tool to compute all perfect repeats in inputs of up to 500 million nucleotide bases, possibly containing many genomes. Our algorithm is based on a suffix array construction and a novel procedure to extract all perfect repeats in the entire input, that can be arbitrarily distant, and with no bound on the repeat length. We tested the software on the <I>Homo sapiens</I> DNA genome NCBI 36.49. We computed all perfect repeats of at least 40 bases occurring in any two chromosomes with exact matching. We found that each <I>H.sapiens</I> chromosome shares ~10% of its full sequence with every other human chromosome, distributed more or less evenly among the chromosome surfaces. We give statistics including a quantification of repeats by diversity, length and number of occurrences. We compared the computed repeats against all biological repeats currently obtainable from Ensembl enlarged with the output of the dust program and all elements identified by TRF and RepeatMasker (<inter-ref locator="ftp://ftp.ebi.ac.uk/pub/databases/ensembl/jherrero/.repeats/all_repeats.txt.bz2" locator-type="url">ftp://ftp.ebi.ac.uk/pub/databases/ensembl/jherrero/.repeats/all_repeats.txt.bz2</inter-ref>). We report novel repeats as well as new occurrences of repeats matching with known biological elements.</p>
<p><b>Availability:</b> The source code, results and visualization of some statistics are accessible from <inter-ref locator="http://kapow.dc.uba.ar/patterns/" locator-type="url">http://kapow.dc.uba.ar/patterns/</inter-ref></p>
<p><b>Contact:</b> <inter-ref locator="vbecher@dc.uba.ar" locator-type="email">vbecher@dc.uba.ar</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Becher, V., Deymonnaz, A., Heiber, P.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp321</dc:identifier>
<dc:title><![CDATA[Efficient computation of all perfect repeats in genomic sequences of up to half a gigabyte, with a case study on the human genome]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1753</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1746</prism:startingPage>
<prism:section>SEQUENCE ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1754?rss=1">
<title><![CDATA[Fast and accurate short read alignment with Burrows-Wheeler transform]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1754?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals.</p>
<p><b>Results:</b> We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows&ndash;Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ~10&ndash;20<FONT FACE="arial,helvetica">x</FONT> faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package.</p>
<p><b>Availability:</b> <inter-ref locator="http://maq.sourceforge.net" locator-type="url">http://maq.sourceforge.net</inter-ref></p>
<p><b>Contact:</b> <inter-ref locator="rd@sanger.ac.uk" locator-type="email">rd@sanger.ac.uk</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Li, H., Durbin, R.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp324</dc:identifier>
<dc:title><![CDATA[Fast and accurate short read alignment with Burrows-Wheeler transform]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1760</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1754</prism:startingPage>
<prism:section>SEQUENCE ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1761?rss=1">
<title><![CDATA[pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1761?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Generation of structural models and recognition of homologous relationships for unannotated protein sequences are fundamental problems in bioinformatics. Improving the sensitivity and selectivity of methods designed for these two tasks therefore has downstream benefits for many other bioinformatics applications.</p>
<p><b>Results:</b> We describe the latest implementation of the GenTHREADER method for structure prediction on a genomic scale. The method combines profile&ndash;profile alignments with secondary-structure specific gap-penalties, classic pair- and solvation potentials using a linear combination optimized with a regression SVM model. We find this combination significantly improves both detection of useful templates and accuracy of sequence-structure alignments relative to other competitive approaches. We further present a second implementation of the protocol designed for the task of discriminating superfamilies from one another. This method, pDomTHREADER, is the first to incorporate both sequence and structural data directly in this task and improves sensitivity and selectivity over the standard version of pGenTHREADER and three other standard methods for remote homology detection.</p>
<p><b>Contact:</b> <inter-ref locator="d.jones@cs.ucl.ac.uk" locator-type="email">d.jones@cs.ucl.ac.uk</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp302/DC1" locator-type="url">Supplementary data</inter-ref> are available at Bioinformatics online.</p>
]]></description>
<dc:creator><![CDATA[Lobley, A., Sadowski, M. I., Jones, D. T.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp302</dc:identifier>
<dc:title><![CDATA[pGenTHREADER and pDomTHREADER: new methods for improved protein fold recognition and superfamily discrimination]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1767</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1761</prism:startingPage>
<prism:section>STRUCTURAL BIOINFORMATICS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1768?rss=1">
<title><![CDATA[Literature-based priors for gene regulatory networks]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1768?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The use of prior knowledge to improve gene regulatory network modelling has often been proposed. In this article we present the first research on the massive incorporation of prior knowledge from literature for Bayesian network learning of gene networks. As the publication rate of scientific papers grows, updating online databases, which have been proposed as potential prior knowledge in past research, becomes increasingly challenging. The novelty of our approach lies in the use of gene-pair association scores that describe the overlap in the contexts in which the genes are mentioned, generated from a large database of scientific literature, harnessing the information contained in a huge number of documents into a simple, clear format.</p>
<p><b>Results:</b> We present a method to transform such literature-based gene association scores to network prior probabilities, and apply it to learn gene sub-networks for yeast, <I>Escherichia coli</I> and Human organisms. We also investigate the effect of weighting the influence of the prior knowledge. Our findings show that literature-based priors can improve both the number of true regulatory interactions present in the network and the accuracy of expression value prediction on genes, in comparison to a network learnt solely from expression data. Networks learnt with priors also show an improved biological interpretation, with identified subnetworks that coincide with known biological pathways.</p>
<p><b>Contact:</b> <inter-ref locator="emma.steele@brunel.ac.uk" locator-type="email">emma.steele@brunel.ac.uk</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp277/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Steele, E., Tucker, A., Hoen, P.A.C. t, Schuemie, M.J.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp277</dc:identifier>
<dc:title><![CDATA[Literature-based priors for gene regulatory networks]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1774</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1768</prism:startingPage>
<prism:section>GENE EXPRESSION</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1775?rss=1">
<title><![CDATA[Gradient lasso for Cox proportional hazards model]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1775?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> There has been an increasing interest in expressing a survival phenotype (e.g. time to cancer recurrence or death) or its distribution in terms of a subset of the expression data of a subset of genes. Due to high dimensionality of gene expression data, however, there is a serious problem of collinearity in fitting a prediction model, e.g. Cox's proportional hazards model. To avoid the collinearity problem, several methods based on penalized Cox proportional hazards models have been proposed. However, those methods suffer from severe computational problems, such as slow or even failed convergence, because of high-dimensional matrix inversions required for model fitting. We propose to implement the penalized Cox regression with a lasso penalty via the gradient lasso algorithm that yields faster convergence to the global optimum than do other algorithms. Moreover the gradient lasso algorithm is guaranteed to converge to the optimum under mild regularity conditions. Hence, our gradient lasso algorithm can be a useful tool in developing a prediction model based on high-dimensional covariates including gene expression data.</p>
<p><b>Results:</b> Results from simulation studies showed that the prediction model by gradient lasso recovers the prognostic genes. Also results from diffuse large B-cell lymphoma datasets and Norway/Stanford breast cancer dataset indicate that our method is very competitive compared with popular existing methods by Park and Hastie and Goeman in its computational time, prediction and selectivity.</p>
<p><b>Availability:</b> R package <ty>glcoxph</ty> is available at <inter-ref locator="http://datamining.dongguk.ac.kr/R/glcoxph" locator-type="url">http://datamining.dongguk.ac.kr/R/glcoxph</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="park463@uos.ac.kr" locator-type="email">park463@uos.ac.kr</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Sohn, I., Kim, J., Jung, S.-H., Park, C.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp322</dc:identifier>
<dc:title><![CDATA[Gradient lasso for Cox proportional hazards model]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1781</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1775</prism:startingPage>
<prism:section>GENE EXPRESSION</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1782?rss=1">
<title><![CDATA[Relating periodicity of nucleosome organization and gene regulation]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1782?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The relationship between nucleosome positioning and gene regulation is fundamental yet complex. Previous studies on genomic nucleosome positions have revealed a correlation between nucleosome occupancy on promoters and gene expression levels. Many of these studies focused on individual nucleosomes, especially those proximal to transcription start sites. To study the collective effect of multiple nucleosomes on the gene expression, we developed a mathematical approach based on autocorrelation to relate genomic nucleosome organization to gene regulation.</p>
<p><b>Results:</b> We found that nucleosome organization in gene promoters can be well described by autocorrelation transformation. Some promoters show obvious periods in their nucleosome organization, while others have no clear periodicity. The genes with periodic nucleosome organization in promoters tend to be lower expressed than the genes without periodic nucleosome organization. These suggest that regular organization of nucleosomes plays a critical role in gene regulation. To quantitatively associate nucleosome organization and gene expression, we predicted gene expression solely based on nucleosome status and found that nucleosome status accounts for ~25% of the observed gene expression variability. Furthermore, we explored the underlying forces that maintain the periodicity in nucleosome organization, namely intrinsic (i.e. DNA sequence) and extrinsic forces (i.e. chromatin remodeling factors). We found that the extrinsic factors play a critical role in maintaining the periodic nucleosome organization.</p>
<p><b>Contact:</b> <inter-ref locator="jiang.qian@jhmi.edu" locator-type="email">jiang.qian@jhmi.edu</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp323/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Wan, J., Lin, J., Zack, D. J., Qian, J.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp323</dc:identifier>
<dc:title><![CDATA[Relating periodicity of nucleosome organization and gene regulation]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1788</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1782</prism:startingPage>
<prism:section>GENE EXPRESSION</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1789?rss=1">
<title><![CDATA[Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1789?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> There is a growing interest in improving the cluster analysis of expression data by incorporating into it prior knowledge, such as the Gene Ontology (GO) annotations of genes, in order to improve the biological relevance of the clusters that are subjected to subsequent scrutiny. The structure of the GO is another source of background knowledge that can be exploited through the use of semantic similarity.</p>
<p><b>Results:</b> We propose here a novel algorithm that integrates semantic similarities (derived from the ontology structure) into the procedure of deriving clusters from the dendrogram constructed during expression-based hierarchical clustering. Our approach can handle the multiple annotations, from different levels of the GO hierarchy, which most genes have. Moreover, it treats annotated and unannotated genes in a uniform manner. Consequently, the clusters obtained by our algorithm are characterized by significantly enriched annotations. In both cross-validation tests and when using an external index such as protein&ndash;protein interactions, our algorithm performs better than previous approaches. When applied to human cancer expression data, our algorithm identifies, among others, clusters of genes related to immune response and glucose metabolism. These clusters are also supported by protein&ndash;protein interaction data.</p>
<p><b>Contact:</b> <inter-ref locator="dotna@cs.bgu.ac.il" locator-type="email">dotna@cs.bgu.ac.il</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp327/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Dotan-Cohen, D., Kasif, S., Melkman, A. A.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp327</dc:identifier>
<dc:title><![CDATA[Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1795</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1789</prism:startingPage>
<prism:section>GENE EXPRESSION</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1796?rss=1">
<title><![CDATA[On the inference of spatial structure from population genetics data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1796?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> In a series of recent papers, Tess, a computer program based on the concept of hidden Markov random field, has been proposed to infer the number and locations of panmictic population units from the genotypes and spatial locations of these individuals. The method seems to be of broad appeal as it is conceptually much simpler than other competing methods and it has been reported by its authors to be fast and accurate. However, this methodology is not grounded in a formal statistical inference method and seems to rely to a large extent on arbitrary choices regarding the parameters used. The present article is an investigation of the accuracy of this method and an attempt to assess whether recent results reported on the basis of this method are genuine features of the genetic process or artefacts of the method.</p>
<p><b>Method:</b> I analyse simulated data consisting of populations at Hardy&ndash;Weinberg and linkage equilibrium and also data simulated under a scenario of isolation-by-distance at mutation&ndash;migration&ndash;drift equilibrium. <I>Arabidopsis thaliana</I> data previously analysed with this method are also reconsidered.</p>
<p><b>Results:</b> Using the Tess program under the no-admixture model to analyse data consisting of several genuine HWLE populations with individuals of pure ancestries leads to highly inaccurate results; Using the Tess program under the admixture model to analyse data consisting of a continuous isolation-by-distance population leads to the inference of spurious HWLE populations whose number and features depend on the parameters used. Results previously reported about the <I>A.thaliana</I> using Tess seem to a large extent to be artefacts of the statistical methodology used. The findings go beyond population clustering models and can be an help to design more efficient algorithms based on graphs.</p>
<p><b>Availability:</b> The data analysed in the present article are available from <inter-ref locator="http://folk.uio.no/gillesg/Bioinformatics-HMRF" locator-type="url">http://folk.uio.no/gillesg/Bioinformatics-HMRF</inter-ref></p>
<p><b>Contact:</b> <inter-ref locator="gilles.guillot@bio.uio.no" locator-type="email">gilles.guillot@bio.uio.no</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp267/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Guillot, G.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp267</dc:identifier>
<dc:title><![CDATA[On the inference of spatial structure from population genetics data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1801</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1796</prism:startingPage>
<prism:section>GENETICS AND POPULATION ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1802?rss=1">
<title><![CDATA[Comment on 'On the inference of spatial structure from population genetics data']]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1802?rss=1</link>
<description><![CDATA[
<p><b>Contact:</b> <inter-ref locator="Olivier.francois@imag.fr" locator-type="email">Olivier.francois@imag.fr</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Durand, E., Chen, C., Francois, O.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp337</dc:identifier>
<dc:title><![CDATA[Comment on 'On the inference of spatial structure from population genetics data']]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1804</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1802</prism:startingPage>
<prism:section>GENETICS AND POPULATION ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1805?rss=1">
<title><![CDATA[Response to comment on 'On the inference of spatial structure from population genetics data']]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1805?rss=1</link>
<description><![CDATA[]]></description>
<dc:creator><![CDATA[Guillot, G.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp351</dc:identifier>
<dc:title><![CDATA[Response to comment on 'On the inference of spatial structure from population genetics data']]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1806</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1805</prism:startingPage>
<prism:section>GENETICS AND POPULATION ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1807?rss=1">
<title><![CDATA[Estimating the posterior probability that genome-wide association findings are true or false]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1807?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> A limitation of current methods used to declare significance in genome-wide association studies (GWAS) is that they do not provide clear information about the probability that GWAS findings are true of false. This lack of information increases the chance of false discoveries and may result in real effects being missed.</p>
<p><b>Results:</b> We propose a method to estimate the posterior probability that a marker has (no) effect given its test statistic value, also called the local false discovery rate (FDR), in the GWAS. A critical step involves the estimation the parameters of the distribution of the true alternative tests. For this, we derived and implemented the real maximum likelihood function, which turned out to provide us with significantly more accurate estimates than the widely used mixture model likelihood. Actual GWAS data are used to illustrate properties of the posterior probability estimates empirically. In addition to evaluating individual markers, a variety of applications are conceivable. For instance, posterior probability estimates can be used to control the FDR more precisely than Benjamini&ndash;Hochberg procedure.</p>
<p><b>Availability:</b> The codes are freely downloadable from the web site <inter-ref locator="http://www.people.vcu.edu/~jbukszar" locator-type="url">http://www.people.vcu.edu/~jbukszar</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="jbukszar@vcu.edu" locator-type="email">jbukszar@vcu.edu</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp305/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Bukszar, J., McClay, J. L., van den Oord, E. J. C. G.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp305</dc:identifier>
<dc:title><![CDATA[Estimating the posterior probability that genome-wide association findings are true or false]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1813</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1807</prism:startingPage>
<prism:section>GENETICS AND POPULATION ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1814?rss=1">
<title><![CDATA[Structure discovery in PPI networks using pattern-based network decomposition]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1814?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The large, complex networks of interactions between proteins provide a lens through which one can examine the structure and function of biological systems. Previous analyses of these continually growing networks have primarily followed either of two approaches: large-scale statistical analysis of holistic network properties, or small-scale analysis of local topological features. Meanwhile, investigation of meso-scale network structure (above that of individual functional modules, while maintaining the significance of individual proteins) has been hindered by the computational complexity of structural search in networks. Examining protein&ndash;protein interaction (PPI) networks at the meso-scale may provide insights into the presence and form of relationships between individual protein complexes and functional modules.</p>
<p><b>Results:</b> In this article, we present an efficient algorithm for performing sub-graph isomorphism queries on a network and show its computational advantage over previous methods. We also present a novel application of this form of topological search which permits analysis of a network's structure at a scale between that of individual functional modules and that of network-wide properties. This analysis provides support for the presence of hierarchical modularity in the PPI network of <I>Saccharomyces cerevisiae</I>.</p>
<p><b>Contact:</b> <inter-ref locator="ying.liu@utdallas.edu" locator-type="email">ying.liu@utdallas.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Bachman, P., Liu, Y.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp297</dc:identifier>
<dc:title><![CDATA[Structure discovery in PPI networks using pattern-based network decomposition]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1821</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1814</prism:startingPage>
<prism:section>SYSTEMS BIOLOGY</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1822?rss=1">
<title><![CDATA[Robust synthetic biology design: stochastic game theory approach]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1822?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Synthetic biology is to engineer artificial biological systems to investigate natural biological phenomena and for a variety of applications. However, the development of synthetic gene networks is still difficult and most newly created gene networks are non-functioning due to uncertain initial conditions and disturbances of extra-cellular environments on the host cell. At present, how to design a robust synthetic gene network to work properly under these uncertain factors is the most important topic of synthetic biology.</p>
<p><b>Results:</b> A robust regulation design is proposed for a stochastic synthetic gene network to achieve the prescribed steady states under these uncertain factors from the minimax regulation perspective. This minimax regulation design problem can be transformed to an equivalent stochastic game problem. Since it is not easy to solve the robust regulation design problem of synthetic gene networks by non-linear stochastic game method directly, the Takagi&ndash;Sugeno (T&ndash;S) fuzzy model is proposed to approximate the non-linear synthetic gene network via the linear matrix inequality (LMI) technique through the Robust Control Toolbox in Matlab. Finally, an <I>in silico</I> example is given to illustrate the design procedure and to confirm the efficiency and efficacy of the proposed robust gene design method.</p>
<p><b>Availability:</b> <inter-ref locator="http://www.ee.nthu.edu.tw/bschen/SyntheticBioDesign_supplement.pdf" locator-type="url">http://www.ee.nthu.edu.tw/bschen/SyntheticBioDesign_supplement.pdf</inter-ref></p>
<p><b>Contact:</b> <inter-ref locator="bschen@ee.nthu.edu.tw" locator-type="email">bschen@ee.nthu.edu.tw</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp310/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Chen, B.-S., Chang, C.-H., Lee, H.-C.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp310</dc:identifier>
<dc:title><![CDATA[Robust synthetic biology design: stochastic game theory approach]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1830</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1822</prism:startingPage>
<prism:section>SYSTEMS BIOLOGY</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1831?rss=1">
<title><![CDATA[Rahnuma: hypergraph-based tool for metabolic pathway prediction and network comparison]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1831?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b>We present a tool called Rahnuma for prediction and analysis of metabolic pathways and comparison of metabolic networks. Rahnuma represents metabolic networks as hypergraphs and computes all possible pathways between two or more metabolites. It provides an intuitive way to answer biological ques- tions focusing on differences between organisms or the evolution of different species by allowing pathway-based metabolic network comparisons at an organism as well as at a phylogenetic level.</p>
<p><b>Availability:</b> Rahnuma is available online at <inter-ref locator="http://portal.stats.ox.ac.uk:8080/rahnuma/" locator-type="url">http://portal.stats.ox.ac.uk:8080/rahnuma/</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="gail.preston@plants.ox.ac.uk" locator-type="email">gail.preston@plants.ox.ac.uk</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp269/DC1" locator-type="url">Supplementary data</inter-ref> are available at the <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Mithani, A., Preston, G. M., Hein, J.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp269</dc:identifier>
<dc:title><![CDATA[Rahnuma: hypergraph-based tool for metabolic pathway prediction and network comparison]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1832</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1831</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1833?rss=1">
<title><![CDATA[baobabLUNA: the solution space of sorting by reversals]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1833?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Computing the reversal distance and searching for an optimal sequence of reversals to transform a unichromosomal genome into another are useful algorithmic tools to analyse real evolutionary scenarios. Currently, these problems can be solved by at least two available softwares, the prominent of which are <ty>GRAPPA</ty> and <ty>GRIMM</ty>. However, the number of different optimal sequences is usually huge and taking only the distance and/or one example is often insufficient to do a proper analysis. Here, we offer an alternative and present <ty>baobabLUNA</ty>, a framework that contains an algorithm to give a compact representation of the whole space of solutions for the sorting by reversals problem.</p>
<p><b>Availability and Implementation:</b> Compiled code implemented in Java is freely available for download at <inter-ref locator="http://pbil.univ-lyon1.fr/software/luna/" locator-type="url">http://pbil.univ-lyon1.fr/software/luna/</inter-ref>. Documentation with methodological background, technical aspects, download and setup instructions, interface description and tutorial are available at <inter-ref locator="http://pbil.univ-lyon1.fr/software/luna/doc/luna-doc.pdf" locator-type="url">http://pbil.univ-lyon1.fr/software/luna/doc/luna-doc.pdf</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="mdvbraga@gmail.com" locator-type="email">mdvbraga@gmail.com</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp285/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Braga, M. D. V.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp285</dc:identifier>
<dc:title><![CDATA[baobabLUNA: the solution space of sorting by reversals]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1835</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1833</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1836?rss=1">
<title><![CDATA[Apollo: a community resource for genome annotation editing]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1836?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Apollo is a genome annotation-editing tool with an easy to use graphical interface. It is a component of the GMOD project, with ongoing development driven by the community. Recent additions to the software include support for the generic feature format version 3 (GFF3), continuous transcriptome data, a full Chado database interface, integration with remote services for on-the-fly BLAST and Primer BLAST analyses, graphical interfaces for configuring user preferences and full undo of all edit operations. Apollo's user community continues to grow, including its use as an educational tool for college and high-school students.</p>
<p><b>Availability:</b> Apollo is a Java application distributed under a free and open source license. Installers for Windows, Linux, Unix, Solaris and Mac OS X are available at <inter-ref locator="http://apollo.berkeleybop.org" locator-type="url">http://apollo.berkeleybop.org</inter-ref>, and the source code is available from the SourceForge CVS repository at <inter-ref locator="http://gmod.cvs.sourceforge.net/gmod/apollo" locator-type="url">http://gmod.cvs.sourceforge.net/gmod/apollo</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="elee@berkeleybop.org" locator-type="email">elee@berkeleybop.org</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Ed, L., Nomi, H., Mark, G., Raymond, C., Suzanna, L.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp314</dc:identifier>
<dc:title><![CDATA[Apollo: a community resource for genome annotation editing]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1837</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1836</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1838?rss=1">
<title><![CDATA[NTAP: for NimbleGen tiling array ChIP-chip data analysis]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1838?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b>NTAP is designed to analyze ChIP-chip data generated by the NimbleGen tiling array platform and to accomplish various pattern recognition tasks that are useful especially for epigenetic studies. The modular design of NTAP makes the data processing highly customizable. Users can either use NTAP to perform the full process of NimbleGen tiling array data analysis, or choose post-processing modules in NTAP to analyze pre-processed epigenetic data generated by other platforms. The output of NTAP can be saved in standard GFF format files and visualized in GBrowse.</p>
<p><b>Availability and Implementation:</b>The source code of NTAP is freely available at <inter-ref locator="http://ntap.cbi.pku.edu.cn/" locator-type="url">http://ntap.cbi.pku.edu.cn/</inter-ref>. It is implemented in Perl and R and can be used on Linux, Mac and Windows platforms.</p>
<p><b>Contact:</b> <inter-ref locator="ntap@mail.cbi.pku.edu.cn" locator-type="email">ntap@mail.cbi.pku.edu.cn</inter-ref>; <inter-ref locator="luojc@pku.edu.cn" locator-type="email">luojc@pku.edu.cn</inter-ref>; <inter-ref locator="hekun78@gmail.com" locator-type="email">hekun78@gmail.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[He, K., Li, X., Zhou, J., Deng, X.-W., Zhao, H., Luo, J.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp320</dc:identifier>
<dc:title><![CDATA[NTAP: for NimbleGen tiling array ChIP-chip data analysis]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1840</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1838</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1841?rss=1">
<title><![CDATA[rtracklayer: an R package for interfacing with genome browsers]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1841?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> The <I>rtracklayer</I> package supports the integration of existing genome browsers with experimental data analyses performed in R. The user may (i) transfer annotation tracks to and from a genome browser and (ii) create and manipulate browser views to focus on a particular set of annotations in a specific genomic region. Currently, the UCSC genome browser is supported.</p>
<p><b>Availability:</b> The package is freely available from <inter-ref locator="http://www.bioconductor.org/" locator-type="url">http://www.bioconductor.org/</inter-ref>. A quick-start vignette is included with the package.</p>
<p><b>Contact:</b> <inter-ref locator="mflawren@fhcrc.org" locator-type="email">mflawren@fhcrc.org</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Lawrence, M., Gentleman, R., Carey, V.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp328</dc:identifier>
<dc:title><![CDATA[rtracklayer: an R package for interfacing with genome browsers]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1842</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1841</prism:startingPage>
<prism:section>GENOME ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1843?rss=1">
<title><![CDATA[MetaTISA: Metagenomic Translation Initiation Site Annotator for improving gene start prediction]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1843?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> We proposed a tool named MetaTISA with an aim to improve TIS prediction of current gene-finders for metagenomes. The method employs a two-step strategy to predict translation initiation sites (TISs) by first clustering metagenomic fragments into phylogenetic groups and then predicting TISs independently for each group in an unsupervised manner. As evaluated on experimentally verified TISs, MetaTISA greatly improves the accuracies of TIS prediction of current gene-finders.</p>
<p><b>Availability:</b> The C++ source code is freely available under the GNU GPL license <I>via</I> <inter-ref locator="http://mech.ctb.pku.edu.cn/MetaTISA/" locator-type="url">http://mech.ctb.pku.edu.cn/MetaTISA/</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="hqzhu@pku.edu.cn" locator-type="email">hqzhu@pku.edu.cn</inter-ref></p>
<p><b>Supplementary information:</b> <inter-ref locator="http://bioinformatics.oxfordjournals.org/cgi/content/full/btp272/DC1" locator-type="url">Supplementary data</inter-ref> are available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Hu, G.-Q., Guo, J.-T., Liu, Y.-C., Zhu, H.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp272</dc:identifier>
<dc:title><![CDATA[MetaTISA: Metagenomic Translation Initiation Site Annotator for improving gene start prediction]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1845</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1843</prism:startingPage>
<prism:section>SEQUENCE ANALYSIS</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1846?rss=1">
<title><![CDATA[PESTAS: a web server for EST analysis and sequence mining]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/25/14/1846?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> We have developed a web server for the high-throughput annotation of expressed sequence tags (ESTs) called pipeline for EST analysis service (PESTAS). PESTAS processes entire datasets with an automated pipeline of 13 analytic services, then deposits the data into the MySQL database and transforms it into three kinds of reports: preprocessing, assembling and annotation. All annotated information is provided to the scientist and can be downloaded through a web browser. To get more relevant functional annotation results, a curation function was introduced with which biologists can easily change the best-hit annotation information. We included a gene chip module that detects gene expression differences between libraries by comparing accession number counts from BLAST search results. PESTAS also provides access to the pathway information of KEGG, which is useful for mapping the relationships among networks of annotated enzymes, and is especially valuable for those researchers interested in biological pathways.</p>
<p><b>Availability:</b> PESTAS is available at <inter-ref locator="http://pestas.kribb.re.kr/" locator-type="url">http://pestas.kribb.re.kr/</inter-ref></p>
<p><b>Supplementary information:</b> Supplementary data are available at <inter-ref locator="http://pestas.kribb.re.kr/pestas.jsp" locator-type="url">http://pestas.kribb.re.kr/pestas.jsp</inter-ref>.</p>
<p><b>Contact:</b> <inter-ref locator="odysseus@kribb.re.kr" locator-type="email">odysseus@kribb.re.kr</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Nam, S.-H., Kim, D.-W., Jung, T.-S., Choi, Y.-S., Kim, D.-W., Choi, H.-S., Choi, S.-H., Park, H.-S.]]></dc:creator>
<dc:date>2009-07-02</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btp293</dc:identifier>
<dc:title><![CDATA[PESTAS: a web server for EST analysis and sequence mining]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:number>14</prism:number>
<prism:volume>25</prism:volume>
<prism:endingPage>1848</prism:endingPage>
<prism:publicationDate>2009-07-15</prism:publicationDate>
<prism:startingPage>1846</prism:startingPage>
<prism:section>SEQUENCE ANALYSIS</prism:section>
</item>

</rdf:RDF>