<?xml version="1.0" encoding="ISO-8859-1"?>

<rdf:RDF
 xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
 xmlns="http://purl.org/rss/1.0/"
 xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/"
 xmlns:dc="http://purl.org/dc/elements/1.1/"
 xmlns:syn="http://purl.org/rss/1.0/modules/syndication/"
 xmlns:prism="http://purl.org/rss/1.0/modules/prism/"
 xmlns:admin="http://webns.net/mvcb/"
>

<channel rdf:about="http://bioinformatics.oxfordjournals.org">
<title>Bioinformatics - Advance Access</title>
<link>http://bioinformatics.oxfordjournals.org</link>
<description>Bioinformatics - RSS feed of articles</description>
<prism:eIssn>1460-2059</prism:eIssn>
<prism:publicationName>Bioinformatics</prism:publicationName>
<prism:issn>1367-4803</prism:issn>
<items>
 <rdf:Seq>
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn229v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn224v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn223v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn222v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn220v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn210v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn178v2?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn207v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn194v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn218v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn217v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn216v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn215v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn137v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn219v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn214v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn211v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn213v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn212v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn208v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn205v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn199v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn209v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn206v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn201v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn202v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn200v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn193v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn139v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn198v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn132v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn197v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn195v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn196v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn179v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn138v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn127v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn136v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn135v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn128v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn134v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn133v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn130v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn131v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn125v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn117v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn129v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn111v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn033v2?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btm634v1?rss=1" />
  <rdf:li rdf:resource="http://bioinformatics.oxfordjournals.org/cgi/content/short/btm094v2?rss=1" />
 </rdf:Seq>
</items>
</channel>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn229v1?rss=1">
<title><![CDATA[Evolutionary design principles of modules that control cellular differentiation: Consequences for hysteresis and multistationarity]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn229v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Gene regulatory networks govern cellular differentiation processes and enable construction of multi-cellular organisms from single cells. Although such networks are complex, there must be evolutionary design principles that shape the network to its present form, gaining complexity from simple modules.</p>
<p><b>Results:</b> To isolate particular design principles, we have computationally evolved random regulatory networks with a preference to result either in hysteresis (switching threshold depending on current state), or in multistationarity (having multiple steady states), two commonly observed dynamical features of gene regulatory networks related to differentiation processes. We have analyzed the resulting evolved networks and compared their structures and characteristics with real gene regulatory networks reported from experiments.</p>
<p><b>Conclusion:</b> We found that the artificially evolved networks have particular topologies and it was notable that these topologies share important features and similarities with the real gene regulatory networks, particularly in contrasting properties of positive and negative feedback loops. We conclude that the structures of real gene regulatory networks are consistent with selection to favor one or other of the dynamical features of multistationarity or hysteresis.</p>
<p><b>Contact:</b>  <inter-ref locator="ckh@kaist.ac.kr" locator-type="email">ckh@kaist.ac.kr</inter-ref></p>
<p><b>Supplementary Material:</b> Supplementary Material is available at <I>Bioinformatics</I> online</p>
]]></description>
<dc:creator><![CDATA[Kim, J., Kim, T.-G., Jung, S. H., Kim, J.-R., Park, T., Heslop-Harrison, P., Cho, K.-H.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn229</dc:identifier>
<dc:title><![CDATA[Evolutionary design principles of modules that control cellular differentiation: Consequences for hysteresis and multistationarity]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn224v1?rss=1">
<title><![CDATA[lumi: a pipeline for processing Illumina microarray]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn224v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Illumina microarray is becoming a popular microarray platform. The BeadArray technology from Illumina makes its preprocessing and quality control different from other microarray technologies. Unfortunately, most other analyses have not taken advantage of the unique properties of the BeadArray system, and have just incorporated preprocessing methods originally designed for Affymetrix microarrays. <I>lumi</I> is a Bioconductor package especially designed to process the Illumina microarray data. It includes data input, quality control, variance stabilization, normalization and gene annotation portions. In specific, the <I>lumi</I> package includes a variance-stabilizing transformation (VST) algorithm that takes advantage of the technical replicates available on every Illumina microarray. Different normalization method options and multiple quality control plots are provided in the package. To better annotate the Illumina data, a vendor independent nucleotide universal identifier (nuID) was devised to identify the probes of Illumina microarray. The nuID annotation packages and output of <I>lumi</I> processed results can be easily integrated with other Bioconductor packages to construct a statistical data analysis pipeline for Illumina data.</p>
<p><b>Availability:</b> The <I>lumi</I> Bioconductor package, <inter-ref locator="www.bioconductor.org" locator-type="url">www.bioconductor.org</inter-ref></p>
<p><b>Contact: </b>Pan Du (<inter-ref locator="dupan@northwestern.edu" locator-type="email">dupan@northwestern.edu</inter-ref>), Warren Kibbe (<inter-ref locator="wakibbe@northwestern.edu" locator-type="email">wakibbe@northwestern.edu</inter-ref>), Simon Lin (<inter-ref locator="s-lin2@northwestern.edu" locator-type="email">s-lin2@northwestern.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Du, P., Kibbe, W. A., Lin, S. M.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn224</dc:identifier>
<dc:title><![CDATA[lumi: a pipeline for processing Illumina microarray]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn223v1?rss=1">
<title><![CDATA[PatMaN: rapid alignment of short sequences to large databases]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn223v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> We present a tool suited for searching for many short nucleotide sequences in large databases, allowing for a pre-defined number of gaps and mismatches. The commandline-driven program implements a nondeterministic automata matching-algorithm on a keyword tree of the search strings. Both queries with and without ambiguity codes can be searched. Search time is short for perfect matches, and retrieval time rises exponentially with the number of edits allowed.</p>
<p><b>Availability:</b> The C++ source code for PatMaN is distributed under the GNU General Public License and has been tested on the GNU/Linux operating system. It is available from <inter-ref locator="http://bioinf.eva.mpg.de/patman" locator-type="url">http://bioinf.eva.mpg.de/patman</inter-ref>.</p>
<p><b>Contact: </b> <inter-ref locator="pruefer@eva.mpg.de" locator-type="email">pruefer@eva.mpg.de</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Prufer, K., Stenzel, U., Dannemann, M., Green, R. E., Lachmann, M., Kelso, J.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn223</dc:identifier>
<dc:title><![CDATA[PatMaN: rapid alignment of short sequences to large databases]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn222v1?rss=1">
<title><![CDATA[HSEpred: predict Half-Sphere Exposure from protein sequences]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn222v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Half-Sphere Exposure (HSE) is a newly developed two-dimensional solvent exposure measure. By conceptually separating an amino acid's sphere in a protein structure into two half spheres which represent its distinct spatial neighborhoods in the upward and downward directions, the HSE-up and HSE-down measures show superior performance compared with other measures such as accessible surface area, residue depth and contact number. However, currently there is no existing method for the prediction of HSE measures from sequence data.</p>
<p><b>Results:</b> In this article, we propose a novel approach to predict the HSE measures and infer residue contact numbers using the predicted HSE values, based on a well-prepared non-homologous protein structure dataset. In particular, we employ support vector regression to quantify the relationship between HSE measures and protein sequences and evaluate its prediction performance. We extensively explore five sequence encoding schemes to examine their effects on the prediction performance. Our method could achieve the correlation coefficients of 0.72 and 0.68 between the predicted and observed HSE-up and HSE-down measures, respectively. Moreover, contact number can be accurately predicted by the summation of the predicted HSE-up and HSE-down values, which has further enlarged the application of this method. The successful application of support vector regression approach in this study suggests that it should be more useful in quantifying the protein sequence-structure relationship and predicting the structural property profiles from protein sequences.</p>
<p><b>Availability:</b> The prediction webserver and supplementary materials are accessible at <inter-ref locator="http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/hse/" locator-type="url">http://sunflower.kuicr.kyoto-u.ac.jp/~sjn/hse/</inter-ref>.</p>
<p><b>Contact: </b> <inter-ref locator="sjn@kuicr.kyoto-u.ac.jp" locator-type="email">sjn@kuicr.kyoto-u.ac.jp</inter-ref>;  <inter-ref locator="takutsu@kuicr.kyoto-u.ac.jp" locator-type="email">takutsu@kuicr.kyoto-u.ac.jp</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Song, J., Tan, H., Takemoto, K., Akutsu, T.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn222</dc:identifier>
<dc:title><![CDATA[HSEpred: predict Half-Sphere Exposure from protein sequences]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn220v1?rss=1">
<title><![CDATA[Discerning static and causal interactions in genome-wide reverse engineering problems]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn220v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> In the past years devising methods for discovering gene regulatory mechanisms at a genome-wide level has become a fundamental topic in the field of systems biology. The aim is to infer gene-gene interactions in an increasingly sophisticated and reliable way through the continuous improvement of reverse engineering algorithms exploiting microarray data.</p>
<p><b>Results:</b> This work is inspired by the several studies suggesting that co-expression is mostly related to "static" stable binding relationships, like belonging to the same protein complex, rather than other types of interactions more of a "causal" and transient nature (e.g. transcription factor&ndash;binding site interactions). The aim of this work is to verify if direct or conditional network inference algorithms (e.g. Pearson correlation for the former, partial Pearson correlation for the latter) are indeed useful in discerning static from causal dependencies in artificial and real gene networks (derived from <I>E.coli</I> and <I>S.cerevisiae</I>).</p>
<p><b>Contact: </b> <inter-ref locator="altafini@sissa.it" locator-type="email">altafini@sissa.it</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Zampieri, M., Soranzo, N., Altafini, C.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn220</dc:identifier>
<dc:title><![CDATA[Discerning static and causal interactions in genome-wide reverse engineering problems]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn210v1?rss=1">
<title><![CDATA[Microbial Genotype-Phenotype Mapping by Class Association Rule Mining]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn210v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Microbial phenotypes are typically due to the concerted action of multiple gene functions, yet the presence of each gene may have only a weak correlation with the observed phenotype. Hence, it may be more appropriate to examine co-occurrence between sets of genes and a phenotype (multiple-to-one) instead of pairwise relations between a single gene and the phenotype. Here, we propose an efficient Class Association Rule mining algorithm, NETCAR, in order to extract sets of COGs (Clusters of Orthologous Groups of proteins) associated with a phenotype from COG phylogenetic profiles and a phenotype profile. NETCAR takes into account the phylogenetic cooccurrence graph between COGs to restrict hypothesis space, and uses mutual information to evaluate the biconditional relation.</p>
<p><b>Results:</b> We examined the mining capability of pairwise and multiple-toone association by using NETCAR to extract COGs relevant to six microbial phenotypes (aerobic, anaerobic, facultative, endospore, motility, and Gram negative) from 11,969 unique COG profiles across 155 prokaryotic organisms. With the same level of False Discovery Rate (FDR), multiple-to-one association can extract about 10 times more relevant COGs than one-to-one association. We also reveal various topologies of association networks among COGs (modules) from extracted multiple-to-one correlation rules relevant with the six phenotypes; including a well-connected network for motility, a startshaped network for aerobic, and intermediate topologies for the other phenotypes. NETCAR outperforms a standard Class Association Rule mining algorithm, CARAPRIORI, while requiring several orders of magnitude less computational time for extracting 3-COG sets.</p>
<p><b>Availability:</b> Source code of the Java implementation is available as Supplementary material at the Bioinformatics online website, or upon request to the author.</p>
<p><b>Contact: </b> <inter-ref locator="makio323@gmail.com" locator-type="email">makio323@gmail.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Tamura, M., D'haeseleer, P.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn210</dc:identifier>
<dc:title><![CDATA[Microbial Genotype-Phenotype Mapping by Class Association Rule Mining]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn178v2?rss=1">
<title><![CDATA[Annotation-Modules: A tool for finding significant combinations of multisource annotations for gene lists]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn178v2?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The ontological analysis of the gene lists obtained from DNA microarray experiments constitutes an important step in understanding the underlying biology of the analyzed system. Over the last years, many other high-throughput techniques emerged, covering now basically all "omics" fields. However, for some of these techniques the generally used functional ontologies might not be sufficient to describe the biological system represented by the derived gene lists. For a more complete and correct interpretation of these experiments, it is important to extend substantially the number of annotations, adapting the ontological analysis to the new emerging techniques.</p>
<p><b>Results:</b> We developed Annotation-Modules, which offers an improvement over the current tools which improves the current tools in two critical aspects. Firstly, the underlying annotation database implements features from many different fields like gene regulation and expression, sequence properties, evolution and conservation, genomic localization and functional categories - resulting in about 60 different annotation features. Secondly, it examines not only single annotations but also all the combinations, which is important to gain insight into the interplay of different mechanisms in the analyzed biological system.</p>
<p><b>Availability:</b> <inter-ref locator="http://web.bioinformatics.cicbiogune.es/AM/AnnotationModules.php" locator-type="url">http://web.bioinformatics.cicbiogune.es/AM/AnnotationModules.php</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Hackenberg, M., Matthiesen, R.]]></dc:creator>
<dc:date>2008-05-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn178</dc:identifier>
<dc:title><![CDATA[Annotation-Modules: A tool for finding significant combinations of multisource annotations for gene lists]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-08</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn207v1?rss=1">
<title><![CDATA[SYCAMORE - A SYstems biology Computational Analysis and MOdeling Research Environment]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn207v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> SYCAMORE is a browser-based application that facilitates construction, simulation and analysis of kinetic models in systems biology. Thus, it allows e.g. database supported modelling, basic model checking and the estimation of unknown kinetic parameters based on protein structures. In addition, it offers some guidance in order to allow non-expert users to perform basic computational modelling tasks.</p>
<p><b>Availability:</b> SYCAMORE is freely available for academic use at <inter-ref locator="http://sycamore.eml.org" locator-type="url">http://sycamore.eml.org</inter-ref>. Commercial users may acquire a license.</p>
<p><b>Contact: </b> <inter-ref locator="ursula.kummer@bioquant.uni-heidelberg.de" locator-type="email">ursula.kummer@bioquant.uni-heidelberg.de</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Weidemann, A., Richter, S., Stein, M., Sahle, S., Gauges, R., Gabdoulline, R., Surovtsova, I., Semmelrock, N., Besson, B., Rojas, I., Wade, R., Kummer, U.]]></dc:creator>
<dc:date>2008-05-07</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn207</dc:identifier>
<dc:title><![CDATA[SYCAMORE - A SYstems biology Computational Analysis and MOdeling Research Environment]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-07</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn194v1?rss=1">
<title><![CDATA[Identification of OBO Nonalignments and Its Implications for OBO Enrichment]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn194v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Existing projects that focus on the semiautomatic addition of links between existing terms in the Open Biomedical Ontologies can take advantage of reasoners that can make new inferences between terms that are based on the added formal definitions and that reflect nonalignments between the linked terms.  However, these projects require that these definitions be necessary and sufficient, a strong requirement that often does not hold.  If such definitions cannot be added, the reasoners cannot point to the nonalignments through the suggestion of new inferences.</p>
<p><b>Results:</b> We describe a methodology by which we have identified over 1,9800 instances of nonredundant nonalignments between terms from the GO biological-process (BP), cellular-component (CC), and molecular-function (MF) ontologies, ChEBI, and the Cell Type Ontology (CL).  Many of the 39.838.1% of these nonalignments whose object terms are more atomic than the subject terms are not currently examined in other ontology-enrichment projects due to the fact that the necessary and sufficient conditions required for the inferences are not currently examined.  Analysis of the ratios of nonalignments to assertions from which the nonalignments were identified suggests that BP-MF, BP-BP, BP-CL, and CC-CC, BP-BP, and BP-CL terms are relatively well-aligned, while BP-ChEBI-MF, BP-ChEBI, and CC-MF and ChEBI-MF terms are relatively not aligned well.  We propose four ways to resolve an identified nonalignment and recommend an analogous implementation of our methodology in ontology-enrichment tools to identify types of nonalignments that are currently not detected.</p>
<p><b>Availability:</b> The nonalignments discussed in this article may be viewed at  <inter-ref locator="http://compbio.uchsc.edu/Hunter_lab/Bada/" locator-type="url">http://compbio.uchsc.edu/Hunter_lab/Bada/</inter-ref> nonalignments_20087_03_0614.html.  Code for the generation of these nonalignments is available upon request.</p>
<p><b>Contact: </b> <inter-ref locator="mike.bada@uchsc.edu" locator-type="email">mike.bada@uchsc.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Bada, M., Hunter, L.]]></dc:creator>
<dc:date>2008-05-07</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn194</dc:identifier>
<dc:title><![CDATA[Identification of OBO Nonalignments and Its Implications for OBO Enrichment]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-07</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn218v1?rss=1">
<title><![CDATA[A Support Vector Machine model for the prediction of proteotypic peptides for accurate mass and time proteomics]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn218v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares these profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies.  It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic).</p>
<p><b>Results:</b> We present a Support Vector Machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity, and polarity for the quantitative prediction of proteotypic peptides.  Using three independently derived AMT databases (<I>Shewanella oneidensis</I>, <I>Salmonella typhimurium</I>, <I>Yersinia pestis</I>) for training and validation within and across species, the SVM resulted in an average accuracy measure of ~0.8 with a standard deviation of less than 0.025. Furthermore, we demonstrate that these results are achievable with a small set of 12 variables and can achieve high proteome coverage.</p>
<p><b>Availability:</b> <inter-ref locator="http://omics.pnl.gov/software/STEPP.php" locator-type="url">http://omics.pnl.gov/software/STEPP.php</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Webb-Robertson, B.-J. M., Cannon, W. R., Oehmen, C. S., Shah, A. R., Gurumoorthi, V., Lipton, M. S., Waters, K. M.]]></dc:creator>
<dc:date>2008-05-03</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn218</dc:identifier>
<dc:title><![CDATA[A Support Vector Machine model for the prediction of proteotypic peptides for accurate mass and time proteomics]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-03</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn217v1?rss=1">
<title><![CDATA[DAnTE: a statistical tool for quantitative analysis of -omics data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn217v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> DAnTE (Data Analysis Tool Extension) is a statistical tool designed to address challenges associated with quantitative bottom-up, shotgun proteomics data. This tool has also been demonstrated for microarray data and can easily be extended to other high-throughput data types. DAnTE features selected normalization methods, missing value imputation algorithms, peptide to protein rollup methods, an extensive array of plotting functions, and a comprehensive hypothesis testing scheme that can handle unbalanced data and random effects. The Graphical User Interface (GUI) is designed to be very intuitive and user friendly.</p>
<p><b>Availability:</b> DAnTE may be downloaded free of charge at <inter-ref locator="http://ncrr.pnl.gov/software/" locator-type="url">http://ncrr.pnl.gov/software/</inter-ref></p>
<p><b>Contact:</b>  <inter-ref locator="rds@pnl.gov" locator-type="email">rds@pnl.gov</inter-ref> or  <inter-ref locator="proteomics@pnl.gov" locator-type="email">proteomics@pnl.gov</inter-ref></p>
<p><b>Supplementary information:</b> An example dataset with instructions on how to perform a series of analysis steps is available at <inter-ref locator="http://ncrr.pnl.gov/software/" locator-type="url">http://ncrr.pnl.gov/software/</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Polpitiya, A. D., Qian, W.-J., Jaitly, N., Petyuk, V. A., Adkins, J. N., Camp, D. G., Anderson, G. A., Smith, R. D.]]></dc:creator>
<dc:date>2008-05-03</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn217</dc:identifier>
<dc:title><![CDATA[DAnTE: a statistical tool for quantitative analysis of -omics data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-03</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn216v1?rss=1">
<title><![CDATA[High-performance hardware implementation of a parallel database search engine for real-time peptide mass fingerprinting]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn216v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Peptide Mass Fingerprinting (PMF) is a method for protein identification in which a protein is fragmented by a defined cleavage protocol (usually proteolysis with trypsin), and the masses of these products constitute a &lsquo;fingerprint&rsquo; that can be searched against theoretical fingerprints of all known proteins. In the first stage of PMF, the raw mass spectrometric data are processed to generate a peptide mass list. In the second stage this protein fingerprint is used to search a database of known proteins for the best protein match. Although current software solutions can typically deliver a match in a relatively short time, a system that can find a match in real-time could change the way in which PMF is deployed and presented. In a paper published earlier (Bogdan <I>et al.</I>, 2007) we presented a hardware design of a raw mass spectra processor that, when implemented in FPGA hardware, achieves almost 170-fold speed gain relative to a conventional software implementation running on a dual processor server. In this paper we present a complementary hardware realisation of a parallel database search engine that, when running on a Xilinx Virtex 2 FPGA at 100MHz, delivers 1800-fold speed-up compared with an equivalent C software routine, running on a 3.06GHz Xeon workstation. The inherent scalability of the design means that processing speed can be multiplied by deploying the design on multiple FPGAs. The database search processor and the mass spectra processor, running on a reconfigurable computing platform, provide a complete real-time PMF protein identification solution.</p>
<p><b>Contact:</b>  <inter-ref locator="d.coca@sheffield.ac.uk" locator-type="email">d.coca@sheffield.ac.uk</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Bogdan, I., Rivers, J., Beynon, R. J, Coca, D.]]></dc:creator>
<dc:date>2008-05-03</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn216</dc:identifier>
<dc:title><![CDATA[High-performance hardware implementation of a parallel database search engine for real-time peptide mass fingerprinting]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-03</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn215v1?rss=1">
<title><![CDATA[An efficient method to identify differentially expressed genes in microarray experiments]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn215v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss.</p>
<p><b>Results:</b> We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a <I>Populus</I> nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes.</p>
<p><b>Availability:</b> The C++ code to implement the proposed method is available upon request for academic use.</p>
<p><b>Contact: </b> <inter-ref locator="shuzhang@mtu.edu" locator-type="email">shuzhang@mtu.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Qin, H., Feng, T., Harding, S. A., Tsai, C.-J., Zhang, S.]]></dc:creator>
<dc:date>2008-05-03</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn215</dc:identifier>
<dc:title><![CDATA[An efficient method to identify differentially expressed genes in microarray experiments]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-03</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn137v1?rss=1">
<title><![CDATA[Mireval: a web tool for simple microRNA prediction in genome sequences]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn137v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> We have developed an online tool called mirEval which can search sequences of up to 10,000nt for novel microRNAs in multiple organisms. It is a comprehensive tool, easy to use and very informative. It will allow users with no prior knowledge of in-silico detection of microRNAs to take advantage of the most successful approaches to investigate sequences of interest</p>
<p><b>Availability:</b> The mirEval web server is available at <inter-ref locator="http://tagc.univ-mrs.fr/mireval" locator-type="url">http://tagc.univ-mrs.fr/mireval</inter-ref></p>
<p><b>Contact:</b>  <inter-ref locator="W.Ritchie@centenary.org.au" locator-type="email">W.Ritchie@centenary.org.au</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Ritchie, W., Theodule, F.-X., Gautheret, D.]]></dc:creator>
<dc:date>2008-05-03</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn137</dc:identifier>
<dc:title><![CDATA[Mireval: a web tool for simple microRNA prediction in genome sequences]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-03</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn219v1?rss=1">
<title><![CDATA[quantiNEMO: an individual-based program to simulate quantitative traits with explicit genetic architecture in a dynamic metapopulation]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn219v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> quantiNEMO is an individual-based, genetically explicit stochastic simulation program. It was developed to investigate the effects of selection, mutation, recombination, and drift on quantitative traits with varying architectures in structured populations connected by migration and located in a heterogeneous habitat. QuantiNEMO is highly flexible at various levels: population, selection, trait(s) architecture, genetic map for QTL and/or markers, environment, demography, mating system, etc. QuantiNEMO is coded in C++ using an object oriented approach and runs on any computer platform.</p>
<p><b>Availability:</b> Executables for several platforms, user's manual, and source code are freely available under the GNU General Public License at <inter-ref locator="http://www2.unil.ch/popgen/softwares/quantinemo" locator-type="url">http://www2.unil.ch/popgen/softwares/quantinemo</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="samuel.neuenschwander@unil.ch" locator-type="email">samuel.neuenschwander@unil.ch</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Neuenschwander, S., Hospital, F., Guillaume, F., Goudet, J.]]></dc:creator>
<dc:date>2008-05-01</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn219</dc:identifier>
<dc:title><![CDATA[quantiNEMO: an individual-based program to simulate quantitative traits with explicit genetic architecture in a dynamic metapopulation]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-01</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn214v1?rss=1">
<title><![CDATA[Characterization and Prediction of Residues Determining Protein Functional Specificity]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn214v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Within a homologous protein family, proteins may be grouped into subtypes that share specific functions that are not common to the entire family. Often, the amino acids present in a small number of sequence positions determine each protein's particular functional specificity. Knowledge of these specificity determining positions (SDPs) aids in protein function prediction, drug design, and experimental analysis. A number of sequence-based computational methods have been introduced for identifying SDPs; however, their further development and evaluation have been hindered by the limited number of known experimentally-determined SDPs.</p>
<p><b>Results:</b> We combine several bioinformatics resources to automate a process, typically undertaken manually, to build a data set of SDPs. The resulting large data set, which consists of SDPs in enzymes, enables us to characterize SDPs in terms of their physicochemical and evolutionary properties. It also facilitates the large-scale evaluation of sequence-based SDP prediction methods. We present a simple sequence-based SDP prediction method, <I>GroupSim</I>, and show that, surprisingly, it is competitive with a representative set of current methods. We also describe <I>ConsWin</I>, a heuristic that considers sequence conservation of neighboring amino acids, and demonstrate that it improves the performance of all methods tested on our large data set of enzyme SDPs.</p>
<p><b>Availability:</b> Data sets and <I>GroupSim</I> code are available online at <inter-ref locator="http://compbio.cs.princeton.edu/specificity/" locator-type="url">http://compbio.cs.princeton.edu/specificity/</inter-ref>.</p>
<p><b>Contact: </b> <inter-ref locator="msingh@cs.princeton.edu" locator-type="email">msingh@cs.princeton.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Capra, J. A., Singh, M.]]></dc:creator>
<dc:date>2008-05-01</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn214</dc:identifier>
<dc:title><![CDATA[Characterization and Prediction of Residues Determining Protein Functional Specificity]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-01</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn211v1?rss=1">
<title><![CDATA[A correction for estimating error when using the Local Pooled Error Statistical Test]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn211v1?rss=1</link>
<description><![CDATA[
<p>Jain et al. (2003) introduced the Local Pooled Error statistical test designed for use with small sample size microarray gene expression data. Based on an asymptotic proof, the test multiplicatively adjusts the standard error for a test of differences between two classes of observations by /2 due to the use of medians rather than means as measures of central tendency. The adjustment is upwardly biased at small sample sizes, however, producing fewer than expected small p-values with a consequent loss of statistical power. We present an empirical correction to the adjustment factor which removes the bias and produces theoretically expected p-values when distributional assumptions are met. Our adjusted LPE measure should prove useful to ongoing methodological studies designed to improve the LPE's performance for microarray and proteomics applications and for future work for other high-throughput biotechnologies.</p>
<p><b>Availability:</b> The software is implemented in the R language and can be downloaded from the Bioconductor project website (<inter-ref locator="http://www.bioconductor.org" locator-type="url">http://www.bioconductor.org</inter-ref>).</p>
<p><b>Contact: </b> <inter-ref locator="robert.nadon@mcgill.ca" locator-type="email">robert.nadon@mcgill.ca</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Murie, C., Nadon, R.]]></dc:creator>
<dc:date>2008-05-01</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn211</dc:identifier>
<dc:title><![CDATA[A correction for estimating error when using the Local Pooled Error Statistical Test]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-05-01</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn213v1?rss=1">
<title><![CDATA[jSquid: a Java applet for graphical on-line network exploration]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn213v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> jSquid is a graph visualization tool for exploring graphs from protein-protein interaction or functional coupling networks. The tool was designed for the FunCoup web site, but can be used for any similar network exploring purpose. The program offers various visualization and graph manipulation techniques to increase the utility for the user.</p>
<p><b>Availability:</b> jSquid is available for direct usage and download at <inter-ref locator="http://jSquid.sbc.su.se" locator-type="url">http://jSquid.sbc.su.se</inter-ref> including source code under the GPLv3 license, and input examples. It requires Java version 5 or higher to run properly.</p>
<p><b>Contact:</b>  <inter-ref locator="erik.sonnhammer@sbc.su.se" locator-type="email">erik.sonnhammer@sbc.su.se</inter-ref></p>
<p><b>Supplementary Information:</b> available at <I>Bioinformatics</I> online</p>
]]></description>
<dc:creator><![CDATA[Klammer, M., Roopra, S., Sonnhammer, E. L. L.]]></dc:creator>
<dc:date>2008-04-29</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn213</dc:identifier>
<dc:title><![CDATA[jSquid: a Java applet for graphical on-line network exploration]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-29</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn212v1?rss=1">
<title><![CDATA[nuScore: a web-interface for nucleosome positioning predictions]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn212v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Sequence-directed mapping of nucleosome positions is of major biological interest. Here, we present a web-interface for estimation of the affinity of the histone core to DNA and prediction of nucleosome arrangement on a given sequence. Our approach is based on assessment of the energy cost of imposing the deformations required to wrap DNA around the histone surface. The interface allows the user to specify a number of options such as selecting from several structural templates for threading calculations and adding random sequences to the analysis.</p>
<p><b>Availability:</b> The nuScore interface is freely available for use at <inter-ref locator="http://compbio.med.harvard.edu/nuScore" locator-type="url">http://compbio.med.harvard.edu/nuScore</inter-ref>.</p>
<p><b>Supplementary information:</b> The site contains user manual, description of the methodology, and examples.</p>
<p><b>Contact:</b>  <inter-ref locator="peter_park@harvard.edu" locator-type="email">peter_park@harvard.edu</inter-ref>;  <inter-ref locator="tolstorukov@gmail.com" locator-type="email">tolstorukov@gmail.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Tolstorukov, M. Y., Choudhary, V., Olson, W. K., Zhurkin, V. B., Park, P. J.]]></dc:creator>
<dc:date>2008-04-29</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn212</dc:identifier>
<dc:title><![CDATA[nuScore: a web-interface for nucleosome positioning predictions]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-29</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn208v1?rss=1">
<title><![CDATA[Cytoscape ESP: simple search of complex biological networks]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn208v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Cytoscape ESP enables searching complex biological networks on multiple attribute fields using logical operators and wildcards. Queries use an intuitive syntax and simple search line interface. ESP is implemented as a Cytoscape plugin and complements existing search functions in the Cytoscape network visualization and analysis software, allowing users to easily identify nodes, edges and subgraphs of interest, even for very large networks.</p>
<p><b>Availability:</b> <inter-ref locator="http://conklinwolf.ucsf.edu/genmappwiki/Google_Summer_of_Code_2007/Maital" locator-type="url">http://conklinwolf.ucsf.edu/genmappwiki/Google_Summer_of_Code_2007/Maital</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="ashkenaz@agri.huji.ac.il" locator-type="email">ashkenaz@agri.huji.ac.il</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Ashkenazi, M., Bader, G. D., Kuchinsky, A., Moshelion, M., States, D. J.]]></dc:creator>
<dc:date>2008-04-28</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn208</dc:identifier>
<dc:title><![CDATA[Cytoscape ESP: simple search of complex biological networks]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-28</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn205v1?rss=1">
<title><![CDATA[Positive selection drives a correlation between nonsynonymous/ synonymous divergence and functional divergence]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn205v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Functional divergence among proteins is often assumed to be strongly influenced by natural selection, as inferred from the ratio of nonsynonymous nucleotide divergence (d<SUB>N</SUB>) to synonymous nucleotide divergence (d<SUB>S</SUB>). That is, the more a mutation changes protein function, the more likely it is to be either selected against or selectively favored, and because the d<SUB>N</SUB>/d<SUB>S</SUB> ratio is a measure of natural selection, this ratio can be used to predict the degree of functional divergence (d<SUB>F</SUB>). However, these hypotheses have rarely been experimentally tested.</p>
<p><b>Results:</b> I present a novel method to address this issue, and demonstrate that divergence in bacteria-killing activity among animal antimicrobial peptides is positively correlated with the log of the d<SUB>N</SUB>/d<SUB>S</SUB> ratio. The primary cause of this pattern appears to be that positively selected substitutions change protein function more than neutral substitutions do. Thus, the d<SUB>N</SUB>/d<SUB>S</SUB> ratio is an accurate estimator of adaptive functional divergence.</p>
<p><b>Contact:</b>  <inter-ref locator="tennessj@science.oregonstate.edu" locator-type="email">tennessj@science.oregonstate.edu</inter-ref></p>
<p><b>Supplementary information:</b> Supplementary data, including GenBank Accession numbers, are available at <I>Bioinformatics</I> Online</p>
]]></description>
<dc:creator><![CDATA[Tennessen, J. A.]]></dc:creator>
<dc:date>2008-04-28</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn205</dc:identifier>
<dc:title><![CDATA[Positive selection drives a correlation between nonsynonymous/ synonymous divergence and functional divergence]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-28</prism:publicationDate>
<prism:section>DISCOVERY NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn199v1?rss=1">
<title><![CDATA[PEPITO: Improved Discontinuous B-Cell Epitope Prediction Using Multiple Distance Thresholds and Half Sphere Exposure]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn199v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Accurate prediction of B-cell epitopes is an important goal of computational immunology. Up to 90% of B-cell epitopes are discontinuous in nature, yet most predictors focus on linear epitopes. Even whenith the tertiary structure of the antigen is available, the accurate prediction of B-cell epitopes remains challenging.</p>
<p><b>Results:</b> Our predictor, PEPITO, uses a combination of amino acid propensity scores and half sphere exposure values at multiple distances to achieve state-of-the-art performance. PEPITO achieves an Area Under the Curve (AUC) of 75.4 on the Discotope dataset. Additionally, we benchmark PEPITO as well as the Discotope predictor on the more recent Epitome datasaset, achieving AUCs of 68.3 and 66.0 respectively.</p>
<p><b>Availability: </b>PEPITO is available as part of the SCRATCH suite of protein structure predictors via <inter-ref locator="www.igb.uci.edu" locator-type="url">www.igb.uci.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Sweredoski, M. J., Baldi, P.]]></dc:creator>
<dc:date>2008-04-28</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn199</dc:identifier>
<dc:title><![CDATA[PEPITO: Improved Discontinuous B-Cell Epitope Prediction Using Multiple Distance Thresholds and Half Sphere Exposure]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-28</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn209v1?rss=1">
<title><![CDATA[fdrtool: a versatile R package for estimating local and tail area-based false discovery rates]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn209v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> False discovery rate (FDR) methodologies are essential in the study of high-dimensional genomic and proteomic data. The R package "fdrtool" facilitates such analyzes by offering a comprehensive set of procedures for FDR estimation. Its distinctive features include: i) many different types of test statistics are allowed as input data, such as <I>p</I>-values, <I>z</I>-scores, correlations, and <I>t</I>-scores; ii) simultaneously, both local FDR and tail area-based FDR values are estimated for all test statistics; iii) empirical null models are fit where possible, thereby taking account of potential over- or underdispersion of the theoretical null. In addition, "fdrtool" provides readily interpretable graphical output, and can be applied to very large scale (in the order of millions of hypotheses) multiple testing problems. Consequently, "fdrtool" implements a flexible FDR estimation scheme that is unified across different test statistics and variants of FDR.</p>
<p><b>Availability:</b> The program is freely available from the Comprehensive R Archive Network (<inter-ref locator="http://cran.r-project.org/" locator-type="url">http://cran.r-project.org/</inter-ref>) under the terms of the GNU General Public License (version 3 or later).</p>
<p><b>Contact:</b>  <inter-ref locator="strimmer@uni-leipzig.de" locator-type="email">strimmer@uni-leipzig.de</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Strimmer, K.]]></dc:creator>
<dc:date>2008-04-25</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn209</dc:identifier>
<dc:title><![CDATA[fdrtool: a versatile R package for estimating local and tail area-based false discovery rates]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-25</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn206v1?rss=1">
<title><![CDATA[gpDB: A database of GPCRs, G-proteins, Effectors and their interactions.]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn206v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> gpDB is a publicly accessible, relational database, containing information about G-proteins, GPCRs and effectors, as well as information concerning known interactions between these molecules. The sequences are classified according to a hierarchy of different classes, families and subfamilies based on literature search. The main innovation besides the classification of G-proteins, GPCRs and effectors is the relational model of the database, describing the known coupling specificity of GPCRs to their respective alpha subunits of G-proteins, and also the specific interaction between G-proteins and their effectors, a unique feature not available in any other database.</p>
<p><b>Availability: </b><inter-ref locator="http://bioinformatics.biol.uoa.gr/gpDB" locator-type="url">http://bioinformatics.biol.uoa.gr/gpDB</inter-ref></p>
<p><b>Contact:</b>  <inter-ref locator="shamodr@biol.uoa.gr" locator-type="email">shamodr@biol.uoa.gr</inter-ref></p>
<p><b>Supplementary information: </b>Supplementary data are available on <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Theodoropoulou, M. C, Bagos, P. G, Spyropoulos, I. C, Hamodrakas, S. J]]></dc:creator>
<dc:date>2008-04-25</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn206</dc:identifier>
<dc:title><![CDATA[gpDB: A database of GPCRs, G-proteins, Effectors and their interactions.]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-25</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn201v1?rss=1">
<title><![CDATA[MAMOT: Hidden MArkov MOdeling Tool]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn201v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Hidden Markov Models are probabilistic models that are well adapted to many tasks in bioinformatics, for example for predicting the occurrence of specific motifs in biological sequences. MAMOT is a command-line program for Unix-like operating systems, including MacOS X, that we developed to allow scientists to apply Hidden Markov Models more easily in their research.</p>
<p>One can define the architecture and initial parameters of the model in a text file and then use MAMOT for parameter optimization on example data, decoding (like predicting motif occurrence in sequences) and the production of stochastic sequences generated according to the probabilistic model. Two examples for which models are provided are coiled-coil domains in protein sequences and protein binding sites in DNA. A wealth of useful features include the use of pseudocounts, state tying and fixing of selected parameters in learning, and the inclusion of prior probabilities in decoding.</p>
<p><b>Availability:</b> MAMOT is implemented in C++, and is distributed under the GNU General Public Licence (GPL). The software, documentation, and example model files can be found at <inter-ref locator="http://bcf.isb-sib.ch/mamot" locator-type="url">http://bcf.isb-sib.ch/mamot</inter-ref>.</p>
<p><b>Contact:</b>  <inter-ref locator="Mauro.Delorenzi@isb-sib.ch" locator-type="email">Mauro.Delorenzi@isb-sib.ch</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Schutz, F., Delorenzi, M.]]></dc:creator>
<dc:date>2008-04-25</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn201</dc:identifier>
<dc:title><![CDATA[MAMOT: Hidden MArkov MOdeling Tool]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-25</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn202v1?rss=1">
<title><![CDATA[TOPDOM: database of domains and motifs with conservative location in transmembrane proteins]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn202v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> The TOPDOM database is a collection of domains and sequence motifs located consistently on the same side of the membrane in -helical transmembrane proteins. The database was created by scanning well annotated transmembrane protein sequences in the UniProt database by specific domain or motif detecting algorithms. The identified domains or motifs were added to the database if they were uniformly annotated on the same side of the membrane of the various proteins in the UniProt database. The information about the location of the collected domains and motifs can be incorporated into constrained topology prediction algorithms, like HMMTOP, increasing the prediction accuracy.</p>
<p><b>Availability:</b> The TOPDOM database and the constrained HMMTOP prediction server are available on the page <inter-ref locator="http://topdom.enzim.hu" locator-type="url">http://topdom.enzim.hu</inter-ref>.</p>
<p><b>Contact: </b> <inter-ref locator="tusi@enzim.hu" locator-type="email">tusi@enzim.hu</inter-ref>,  <inter-ref locator="lkamar@enzim.hu" locator-type="email">lkamar@enzim.hu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Tusnady, G. E., Kalmar, L., Hegyi, H., Tompa, P., Simon, I.]]></dc:creator>
<dc:date>2008-04-23</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn202</dc:identifier>
<dc:title><![CDATA[TOPDOM: database of domains and motifs with conservative location in transmembrane proteins]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-23</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn200v1?rss=1">
<title><![CDATA[A Global Pathway Crosstalk Network]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn200v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Given the complex nature of biological systems, pathways often need to function in a coordinated fashion in order to produce appropriate physiological responses to both internal and external stimuli (Hartwell et al., 1999). Therefore, understanding the interaction and crosstalk between pathways is important for understanding the function of both cells and more complex systems.</p>
<p><b>Results:</b> We have developed a computational approach to detect crosstalk among pathways based on protein interactions between the pathway components. We built a global mammalian pathway crosstalk network that includes 580 pathways (covering 4,753 genes) with 1,815 edges between pathways.  This crosstalk network follows a power-law distribution: P(k) ~ k<sup>-</sup>, = 1.45, where P(k) is the number of pathways with k neighbors, thus pathway interactions may exhibit the same scale-free phenomenon that has been documented for protein interaction networks. We further used this network to understand colorectal cancer progression to metastasis based on transcriptomic data.</p>
<p><b>Contact:</b>  <inter-ref locator="yong.2.li@gsk.com" locator-type="email">yong.2.li@gsk.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Li, Y., Agarwal, P., Rajagopalan, D.]]></dc:creator>
<dc:date>2008-04-23</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn200</dc:identifier>
<dc:title><![CDATA[A Global Pathway Crosstalk Network]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-23</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn193v1?rss=1">
<title><![CDATA[RNAplex: a fast tool for RNA-RNA interaction search]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn193v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Regulatory RNAs often unfold their action via RNA-RNA interaction. Transcriptional gene silencing by means of siRNAs and miRNA as well as snoRNA directed RNA editing rely on this mechanism. Additionally ncRNA regulation in bacteria is mainly based upon RNA duplex formation. Finding putative target sites for newly discovered ncRNAs is a lengthy task as tools for cofolding RNA molecules like RNAcofold and RNAup are too slow for genome-wide search. Tools like RNAhybrid that neglects intramolecular interactions have runtimes proportional to <f>$$\mathcal{O}$$</f> (<I>m n</I>), albeit with a large prefactor. Still in many cases the need for even faster methods exists.</p>
<p><b>Results:</b> We present a new program, RNAplex, especially designed to quickly find possible hybridization sites for a query RNA in large RNA databases. RNAplex uses a slightly different energy model which reduces the computational time by a factor 10-27 compared to RNAhybrid. In addition a length penalty allows to focus the target search on short highly stable interactions.</p>
<p><b>Availability:</b> RNAplex can be downloaded at <inter-ref locator="http://www.tbi.univie.ac.at/~htafer/" locator-type="url">http://www.tbi.univie.ac.at/~htafer/</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="ivo@tbi.univie.ac.at" locator-type="email">ivo@tbi.univie.ac.at</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Tafer, H., Hofacker, I. L.]]></dc:creator>
<dc:date>2008-04-23</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn193</dc:identifier>
<dc:title><![CDATA[RNAplex: a fast tool for RNA-RNA interaction search]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-23</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn139v1?rss=1">
<title><![CDATA[swissPIT: A novel approach for pipelined analysis of mass spectrometry data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn139v1?rss=1</link>
<description><![CDATA[
<p>The identification and characterisation of peptides from tandem mass spectrometry (MS/MS) data represents a critical aspect of proteomics. Today, tandem mass spectrometry analysis is often performed by only using a single identification program achieving identification rates between 10 &ndash; 50 % (Elias and Gygi, 2007). Be-side the development of new analysis tools, recent publications describe also the pipelining of different search programs to increase the identification rate (Keller et al., 2005, Hartler et al., 2007).</p>
<p>The swissPIT (Swiss Protein Identification Toolbox) follows this approach but goes a step further by providing the user an expand-able multi-tool platform capable of executing workflows to analyze Tandem MS-based data. One of the major problems in proteomics is the absent of standardized workflows to analyze the produced data. This includes the pre-processing part as well as the final identifica-tion of peptides and proteins. The main idea of swissPIT is not only the usage of different identification tool in parallel but also the mean-ingful concatenation of different identification strategies at the same time. The swissPIT is open source software but we also provide a user-friendly web platform, which demonstrates the capabilities of our software and which is available at <inter-ref locator="http://swisspit.cscs.ch" locator-type="url">http://swisspit.cscs.ch</inter-ref> upon request for account.</p>
]]></description>
<dc:creator><![CDATA[Quandt, A., Hernandez, P., Masselot, A., Hernandez, C., Maffioletti, S., Pautasso, C., Appel, R. D., Lisacek, F.]]></dc:creator>
<dc:date>2008-04-23</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn139</dc:identifier>
<dc:title><![CDATA[swissPIT: A novel approach for pipelined analysis of mass spectrometry data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-23</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn198v1?rss=1">
<title><![CDATA[Eukaryotic transcription factor binding sites - modeling and integrative search methods]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn198v1?rss=1</link>
<description><![CDATA[
<p>A comprehensive knowledge of transcription factor binding sites is important for a mechanistic understanding of transcriptional regulation as well as for inferring gene regulatory networks. Because the DNA motif recognized by a transcription factor is typically short and degenerate, computational approaches for identifying binding sites based only on the sequence motif inevitably suffer from high error rates. Current state-of-the-art techniques for improving computational identification of binding sites can be broadly categorized into two classes: (1) Approaches that aim to improve binding motif models by extracting maximal sequence information from experimentally determined binding sites and (2) Approaches that supplement binding motif models with additional genomic or other attributes (such as evolutionary conservation). In this review we will discuss recent attempts to improve computational identification of transcription factor binding sites through these two types of approaches and conclude with thoughts on future development.</p>
]]></description>
<dc:creator><![CDATA[Hannenhalli, S.]]></dc:creator>
<dc:date>2008-04-21</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn198</dc:identifier>
<dc:title><![CDATA[Eukaryotic transcription factor binding sites - modeling and integrative search methods]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-21</prism:publicationDate>
<prism:section>REVIEW</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn132v1?rss=1">
<title><![CDATA[OnD-CRF: predicting order and disorder in proteins using conditional random fields]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn132v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Order and Disorder prediction using Conditional Random Fields, OnD-CRF, is a new method for accurately predicting the transition between structured and mobile or disordered regions in proteins. OnD-CRF applies CRFs relying on features which are generated from the amino acids sequence and from secondary structure prediction. Benchmarking results based on CASP7 targets, and evaluation with respect to several CASP criteria, rank the OnD-CRF model highest among the fully automatic server group.</p>
<p><b>Availability:</b> <inter-ref locator="http://babel.ucmp.umu.se/ond-crf/" locator-type="url">http://babel.ucmp.umu.se/ond-crf/</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="Uwe.Sauer@ucmp.umu.se" locator-type="email">Uwe.Sauer@ucmp.umu.se</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Wang, L., Sauer, U. H.]]></dc:creator>
<dc:date>2008-04-21</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn132</dc:identifier>
<dc:title><![CDATA[OnD-CRF: predicting order and disorder in proteins using conditional random fields]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-21</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn197v1?rss=1">
<title><![CDATA[Comparative conservation analysis of the human mitotic phosphoproteome]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn197v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> A key challenge in phosphoproteomic studies is to distinguish functionally relevant phosphorylation sites from potentially "silent" phosphorylation. Considering that relevant phosphorylation sites are expected to be better conserved during evolution than overall Serine, Threonine, and Tyrosine (S/T/Y) residues, we asked whether this can be directly demonstrated through statistic analysis, using a large experimental dataset.</p>
<p><b>Results:</b> Analyzing phosphoproteomic data derived from the human mitotic spindle apparatus, we found that 95.2 % of 1744 phosphorylation sites are conserved in at least one of six other vertebrate species. Using a new score, termed CZ-Score, we demonstrate that phosphorylation sites are significantly better conserved than other S/T/Y sites, a conclusion validated from several kinase consensus motifs. Most importantly, phosphorylation sites with experimentally verified biological functions were significantly better conserved than other phosphorylation sites, indicating that analysis utilizing evolutionary conservation may constitute a powerful basis for the development of improved phosphorylation site predictors.</p>
<p><b>Contact:</b>  <inter-ref locator="malik@biochem.mpg.de" locator-type="email">malik@biochem.mpg.de</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Malik, R., Nigg, E. A., Korner, R.]]></dc:creator>
<dc:date>2008-04-20</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn197</dc:identifier>
<dc:title><![CDATA[Comparative conservation analysis of the human mitotic phosphoproteome]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-20</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn195v1?rss=1">
<title><![CDATA[Prediction of disordered regions in proteins based on the meta approach]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn195v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Intrinsically disordered regions in proteins have no unique stable structures without their partner molecules, thus these regions sometimes prevent high quality structure determination. Furthermore, proteins with disordered regions are often involved in important biological processes, and the disordered regions are considered to play important roles in molecular interactions. Therefore, identifying disordered regions is important to obtain high-resolution structural information and to understand the functional aspects of these proteins.</p>
<p><b>Results:</b> We developed a new prediction method for disordered regions in proteins based on the meta approach and implemented a web-server for this prediction method named "metaPrDOS".  The method predicts the disorder tendency of each residue using support vector machines from the prediction results of the seven independent predictors. Evaluation of the meta approach was performed using the CASP7 prediction targets to avoid an overestimation due to the inclusion of proteins used in the training set of some component predictors. As a result, the meta approach achieved higher prediction accuracy than all methods participating in CASP7.</p>
<p><b>Availability:</b> <inter-ref locator="http://prdos.hgc.jp/meta/" locator-type="url">http://prdos.hgc.jp/meta/</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="t-ishida@hgc.jp" locator-type="email">t-ishida@hgc.jp</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Ishida, T., Kinoshita, K.]]></dc:creator>
<dc:date>2008-04-20</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn195</dc:identifier>
<dc:title><![CDATA[Prediction of disordered regions in proteins based on the meta approach]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-20</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn196v1?rss=1">
<title><![CDATA[Fast grid layout algorithm for biological networks with sweep calculation]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn196v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Properly drawn biological networks are of great help in the comprehension of their characteristics. The quality of the layouts for retrieved biological networks is critical for pathway databases. However, since it is unrealistic to manually draw biological networks for every retrieval, automatic drawing algorithms are essential. Grid layout algorithms handle various biological properties such as aligning vertices having the same attributes and complicated positional constraints according to their subcellular localizations; thus, they succeed in providing biologically comprehensible layouts. However, existing grid layout algorithms are not suitable for real-time drawing, which is one of requisites for applications to pathway databases, due to their high computational cost. In addition, they do not consider edge directions and their resulting layouts lack traceability for biochemical reactions and gene regulations, which are the most important features in biological networks.</p>
<p><b>Results:</b> We devise a new calculation method termed sweep calculation and reduce the time complexity of the current grid layout algorithms through its encoding and decoding processes. We conduct practical experiments by using 95 pathway models of various sizes from TRANSPATH and show that our new grid layout algorithm is much faster than existing grid layout algorithms. For the cost function, we introduce a new component that penalizes undesirable edge directions to avoid the lack of traceability in pathways due to the differences in direction between in-edges and out-edges of each vertex.</p>
<p><b>Availability:</b> Java implementations of our layout algorithms are available in Cell Illustrator.</p>
<p><b>Contact: </b> <inter-ref locator="masao@ims.u-tokyo.ac.jp" locator-type="email">masao@ims.u-tokyo.ac.jp</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Kojima, K., Nagasaki, M., Miyano, S.]]></dc:creator>
<dc:date>2008-04-18</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn196</dc:identifier>
<dc:title><![CDATA[Fast grid layout algorithm for biological networks with sweep calculation]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-18</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn179v1?rss=1">
<title><![CDATA[Forward-time Simulations of Nonrandom Mating Populations using simuPOP]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn179v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Computer simulations play an important role in studies of nonrandom mating populations. Because of implementation difficulties, only very limited types of nonrandom mating schemes are provided in the currently available simulation programs. Starting with version 0.8.5, simuPOP provides a few mating schemes that can be used to simulate arbitrary nonrandom mating models. This paper describes the concepts and methods behind these mating schemes and demonstrates their uses in a few examples, including partial self-mating, positive assortative mating, nonrandom outbreeding, and simulation of overlapping generations in age-structured populations.</p>
<p><b>Availability:</b> simuPOP is freely available at <inter-ref locator="http://simupop.sourceforge.net" locator-type="url">http://simupop.sourceforge.net</inter-ref>, distributed under a GPL license. Cited examples are in the doc/cookbook directory of a simuPOP distribution.</p>
<p><b>Contact: </b> <inter-ref locator="bpeng@mdanderson.org" locator-type="email">bpeng@mdanderson.org</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Peng, B., Amos, C. I.]]></dc:creator>
<dc:date>2008-04-15</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn179</dc:identifier>
<dc:title><![CDATA[Forward-time Simulations of Nonrandom Mating Populations using simuPOP]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-15</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn138v1?rss=1">
<title><![CDATA[IDMap: facilitating the detection of potential leads with therapeutic targets]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn138v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Pharmaceutical industry has been striving to reduce the costs of drug development and increase productivity. Among the many different attempts, drug repositioning (retargeting existing drugs) comes into the spotlight because of its financial efficiency. We introduce IDMap which predicts novel relationships between targets and chemicals and thus is capable of repositioning the marketed drugs by using text mining and chemical structure information. Also capable of mapping commercial chemicals to possible drug targets and vice versa, IDMap creates conven-ient environments for identifying the potential lead and its targets, especially in the field of drug repositioning.</p>
<p><b>Availability:</b> IDMap executable and its user manual including color images are freely available to non-commercial users at <inter-ref locator="http://www.equispharm.com/idmap" locator-type="url">http://www.equispharm.com/idmap</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="idmap@equispharm.com" locator-type="email">idmap@equispharm.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Ha, S., Seo, Y.-J., Kwon, M.-S., Chang, B.-H., Han, C.-K., Yoon, J.-H.]]></dc:creator>
<dc:date>2008-04-15</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn138</dc:identifier>
<dc:title><![CDATA[IDMap: facilitating the detection of potential leads with therapeutic targets]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-15</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn127v1?rss=1">
<title><![CDATA[URL decay in MEDLINE - a 4-year follow-up study]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn127v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Internet-based electronic resources, as given by Uniform Resource Locators (URLs), are being increasingly used in scientific publications but are also becoming inaccessible in a time-dependant manner, a phenomenon documented across disciplines. Initial reports brought attention to the problem, spawning methods of effectively preserving URL content while some journals adopted policies regarding URL publication and begun storing supplementary information on journal websites. Thus, a re-examination of URL growth and decay in the literature is merited to see if the problem has grown or been mitigated by any of these changes</p>
<p><b>Results:</b> After the 2003 study, three follow-up studies were conducted in 2004, 2005 and 2007. Unfortunately, no significant change was found in the rate of URL decay among any of the studies. However, only 5% of URLs cited more than twice have decayed versus 20% of URLs cited once or twice. The most common types of lost content were computer programs (43%), followed by scholarly content (38%) and databases (19%). Compared to URLs still available, no lost content type was significantly over or under-represented. Searching for 30 of these websites using Google, 11 (37%) were found relocated to different URLs.</p>
<p><b>Conclusions:</b> URL decay continues unabated, but URLs published by organizations tend to be more stable. Repeated citation of URLs suggests calculation of an electronic impact factor (eIF) would be an objective, quantitative way to measure the impact of Internet-based resources on scientific research.</p>
<p><b>Contact: </b> <inter-ref locator="Jonathan-Wren@OMRF.org" locator-type="email">Jonathan-Wren@OMRF.org</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Wren, J. D.]]></dc:creator>
<dc:date>2008-04-15</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn127</dc:identifier>
<dc:title><![CDATA[URL decay in MEDLINE - a 4-year follow-up study]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-15</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn136v1?rss=1">
<title><![CDATA[Analysing georeferenced population genetics data with Geneland: a new algorithm to deal with null alleles and a friendly graphical user interface.]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn136v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> We introduce a new algorithm to account for the presence of null alleles in inferences of populations clusters from individual multilocus genetic data. We show by simulations that the presence of null alleles can affect the accuracy of inferences if not properly accounted for and that our algorithm improve signficantly their accuracy.</p>
<p><b>Availability:</b> This new algorithm is implemented in the program Geneland. It is freely available under GNU public license as an R package on the Comprehensive R Archive Network. It now includes a fully clickable graphical interface. Informations on how to get the software are available on <inter-ref locator="folk.uio.no/gillesg/Geneland.html" locator-type="url">folk.uio.no/gillesg/Geneland.html</inter-ref></p>
<p><b>Supplementary material: </b>Details on the simulation study are available from <inter-ref locator="folk.uio.no/gillesg/BioInformatics_Geneland" locator-type="url">folk.uio.no/gillesg/BioInformatics_Geneland</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="gilles.guillot@bio.uio.no" locator-type="email">gilles.guillot@bio.uio.no</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Guillot, G., Santos, F., Estoup, A.]]></dc:creator>
<dc:date>2008-04-14</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn136</dc:identifier>
<dc:title><![CDATA[Analysing georeferenced population genetics data with Geneland: a new algorithm to deal with null alleles and a friendly graphical user interface.]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-14</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn135v1?rss=1">
<title><![CDATA[Implementation of a regulatory gene network to simulate the TH1/2 differentiation in an agent-based model of hyper-sensitivity reactions]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn135v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> An unbalanced differentiation of T helper cells from precursor type TH0 to the TH1 or TH2 phenotype in immune responses often leads to a pathological condition. In general, immune reactions biased toward TH1 responses may result in auto-immune diseases, while enhanced TH2 responses may cause allergic reactions.</p>
<p>The aim of this work is to integrate a gene network of the TH differentiation in an agent-based model of the hyper-sensitivity reaction. The implementation of such a system introduces a second level of description beyond the mesoscopic level of the inter-cellular interaction of the agent-based model.</p>
<p>The intra-cellular level consists in the cell internal dynamics of gene activation and transcription. The gene regulatory network includes genes-related molecules that have been found to be involved in the differentiation process in TH cells.</p>
<p><b>Results:</b> The simulator reproduces the hallmarks of an IgEmediated hypersensitive reaction and provides an example of how to combine the mesoscopic level description of immune cells with the microscopic gene-level dynamics.</p>
<p><b>Availability:</b> The basic version of the simulator of the immune response can be downloaded here: <inter-ref locator="http://www.iac.cnr.it/~filippo/C-ImmSim.html" locator-type="url">http://www.iac.cnr.it/~filippo/C-ImmSim.html</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="f.castiglione@iac.cnr.it" locator-type="email">f.castiglione@iac.cnr.it</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Santoni, D., Pedicini, M., Castiglione, F.]]></dc:creator>
<dc:date>2008-04-14</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn135</dc:identifier>
<dc:title><![CDATA[Implementation of a regulatory gene network to simulate the TH1/2 differentiation in an agent-based model of hyper-sensitivity reactions]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-14</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn128v1?rss=1">
<title><![CDATA[Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers.]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn128v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Several accurate prediction systems have been developed for prediction of class I MHC:peptide binding. Most of these are trained on binding affinity data of primarily 9mer peptides. Here, we show how prediction methods trained on 9mer data can be used for accurate binding affinity prediction of peptides of length 8, 10 and 11. The method gives the opportunity to predict peptides with a different length than nine for MHC alleles where no such peptides have been measured. As validation, the performance of this approach is compared to predictors trained on peptides of the peptide length in question. In this validation, the approximation method has an accuracy that is comparable to or better than methods trained on a  peptide length identical to the predicted peptides.</p>
<p><b>Availability:</b> The algorithm has been implemented in the web-accessible servers NetMHC-3.0: <inter-ref locator="http://www.cbs.dtu.dk/services/NetMHC-3.0" locator-type="url">http://www.cbs.dtu.dk/services/NetMHC-3.0</inter-ref>, and NetMHCpan-1.1: <inter-ref locator="http://www.cbs.dtu.dk/services/NetMHCpan-1.1" locator-type="url">http://www.cbs.dtu.dk/services/NetMHCpan-1.1</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Lundegaard, C., Lund, O., Nielsen, M.]]></dc:creator>
<dc:date>2008-04-14</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn128</dc:identifier>
<dc:title><![CDATA[Accurate approximation method for prediction of class I MHC affinities for peptides of length 8, 10 and 11 using prediction tools trained on 9mers.]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-14</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn134v1?rss=1">
<title><![CDATA[Towards patterns tree of gene co-expression in eukaryotic species]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn134v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Cellular pathways behave coordinated regulation activity, and some reported works also have affirmed that genes in the same pathway have similar expression pattern. However, the complexity of biological systems regulation actually causes expression relationships between genes to display multiple patterns, such as linear, nonlinear, local, global, linear with time-delayed, nonlinear with time-delayed, monotonic and non-monotonic, which should be the explicit representation of cellular inner regulation mechanism in mRNA level. To investigate the relationship between different patterns, our work aims to systematically reveal gene expression relationship patterns in cellular pathways and to check for the existence of dominating gene expression pattern. By a large scale analysis of genes expression in three eukaryotic species,<I> Saccharomyces cerevisiae</I>,<I> Caenorhabditis elegans</I> and <I>Human, </I>we constructed gene co-expression patterns tree to systematically and hierarchically illustrate the different patterns and their interrelations.</p>
<p><b>Results:</b> The results show that the linear is the dominating expression pattern in the same pathway. The time-shifted pattern is another important relationship pattern. Many genes from the different pathway also present co-expression patterns. The nonlinear, non-monotonic and time-delayed relationship patterns reflect the remote interactions between the genes in cellular processes. Gene co-expression phenomena in the same pathways are diverse in different species. Genes in <I>Saccharomyces cerevisiae</I> and <I>Caenorhabditis elegans</I> present strong co-expression relationships, especially in <I>Caenorhabditis elegans,</I> co-expression is more universal and stronger due to its special array of genes. However in <I>Human,</I> gene co-expression is not apparent and the human genome involves more complicated functional relationships. In conclusion, different patterns corresponding to different coordinating behaviors coexist. The patterns trees of different species give us comprehensive insight and understanding of genes expression activity in the cellular society.</p>
<p><b>Contact:</b>  <inter-ref locator="whywhy_flying@163.com" locator-type="email">whywhy_flying@163.com</inter-ref>;  <inter-ref locator="wtq_flying@hotmail.com" locator-type="email">wtq_flying@hotmail.com</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Wang, H., Wang, Q., Li, X., Sheng, B., Ding, M., Shen, Z.]]></dc:creator>
<dc:date>2008-04-10</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn134</dc:identifier>
<dc:title><![CDATA[Towards patterns tree of gene co-expression in eukaryotic species]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-10</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn133v1?rss=1">
<title><![CDATA[Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: Detecting varying patterns in expression profiles]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn133v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Cluster analysis (of gene expression data) is a useful tool for identifying biologically relevant groups of genes that show similar expression patterns under multiple experimental conditions. Many clustering methods have been proposed for clustering geneexpression data. However most of these algorithms have several shortcomings for gene expression data clustering. In the present paper, we focus on several shortcomings of conventional clustering algorithms and propose a new clustering algorithm that is able to produce better clustering solution than that produced by some others.</p>
<p><b>Results:</b> We present the Divisive Correlation Clustering Algorithm (DCCA) that is suitable for finding a group of genes having similar pattern of variation in their expression values. To detect clusters with high correlation and biological significance, we use the correlation clustering concept introduced by Bansal's et al. (Bansal <I>et al</I>. (2004)). Our proposed algorithm DCCA produces a clustering solution without taking number of clusters to be created as an input. DCCA uses the correlation matrix in such a way that all genes in a cluster have highest average correlation with genes in that cluster. To test the performance of the DCCA, we have applied DCCA and some well-known conventional methods to an artificial data set, and nine gene expression datasets, and compared the performance of the algorithms. The clustering results of the DCCA are found to be more significantly relevant to the biological annotations than those of the other methods. All these facts show the superiority of the DCCA over some others for the clustering of gene-expression data.</p>
<p><b>Availability of the software: </b>The software has been developed using C and Visual Basic languages, and can be executed on the Microsoft Windows platforms. The software may be downloaded as a zip file from <inter-ref locator="http://www.isical.ac.in/~rajat" locator-type="url">http://www.isical.ac.in/~rajat</inter-ref>. Then it needs to be installed. Two word files (included in the zip file) need to be consulted before installation and execution of the software.</p>
<p><b>Contact: </b> <inter-ref locator="rajat@isical.ac.in" locator-type="email">rajat@isical.ac.in</inter-ref></p>
<p><b>Supplementary Material: </b>Supplementary Material has been uploaded as a separate file.</p>
]]></description>
<dc:creator><![CDATA[Bhattacharya, A., De, R. K.]]></dc:creator>
<dc:date>2008-04-10</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn133</dc:identifier>
<dc:title><![CDATA[Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: Detecting varying patterns in expression profiles]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-10</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn130v1?rss=1">
<title><![CDATA[Simple is beautiful: a straightforward approach to improve the delineation of true and false positives in PSI-BLAST searches]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn130v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The deluge of biological information from different genomic initiatives and the rapid advancement in biotechnologies have made bioinformatics tools an integral part of modern biology. Among the widely-used sequence alignment tools, BLAST and PSI-BLAST are arguably the most popular. PSI-BLAST, which uses an iterative profile (PSSM)-based search strategy, is more sensitive than BLAST in detecting weak homologies, thus making it suitable for remote homolog detection. Many refinements have been made to improve PSI-BLAST and its computational efficiency and high specificity have been much touted. Nevertheless, corruption of its profile via the incorporation of false positive sequences remains a major challenge.</p>
<p><b>Results:</b> We have developed a simple and elegant approach to resolve the problem of model corruption in PSI-BLAST searches. We hypothesized that combining results from the first (least-corrupted) profile with results from later (most sensitive) iterations of PSI-BLAST provides a better discriminator for true and false hits. Accordingly, we have derived a formula that utilizes the <I>E</I>-values from these two PSI-BLAST iterations to obtain a figure of merit for rank-ordering the hits. Our verification results based on a "gold-standard" test set indicate that this figure of merit does indeed delineate true positives from false positives better than PSI-BLAST <I>E</I>-values. Perhaps what is most notable about this strategy is that it is simple and straightforward to implement.</p>
]]></description>
<dc:creator><![CDATA[Lee, M. M., Chan, M. K., Bundschuh, R.]]></dc:creator>
<dc:date>2008-04-10</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn130</dc:identifier>
<dc:title><![CDATA[Simple is beautiful: a straightforward approach to improve the delineation of true and false positives in PSI-BLAST searches]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-10</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn131v1?rss=1">
<title><![CDATA[Fast Network Component Analysis (FastNCA) for gene regulatory network reconstruction from microarray data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn131v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Recently developed network component analysis (NCA) approach is promising for gene regulatory network reconstruction from microarray data. The existing NCA algorithm is an iterative method which has two potential limitations: computational instability and multiple local solutions. The subsequently developed NCA-r algorithm with Tikhonov regularization can help solve the first issue but cannot completely handle the second one. Here we develop a novel Fast Network Component Analysis (FastNCA) algorithm which has an analytical solution that is much faster and does not have the above limitations.</p>
<p><b>Results:</b> Firstly FastNCA is compared to NCA and NCA-r using synthetic data. The reconstruction of FastNCA is more accurate than that of NCA-r and comparable to that of properly converged NCA. FastNCA is not sensitive to the correlation among the input signals, while its performance does degrade a little but not as dramatically as that of NCA. Like NCA, FastNCA is not very sensitive to small inaccuracies in <I>a priori</I> information on the network topology. FastNCA is about several tens times faster than NCA and several hundreds times faster than NCA-r. Then, the method is applied to real yeast cell cycle microarray data. The activities of the estimated cell cycle regulators by FastNCA and NCA-r are compared to the semiquantitative results obtained independently by Lee et al. (2002). It is shown here that there is a greater agreement between the results of FastNCA and Lee's, which is represented by the ratio 23/33, than that between the results of NCA-r and Lee's, which is 14/33.</p>
<p><b>Availability:</b> Software and supplementary materials are available from <inter-ref locator="http://www.eee.hku.hk/~cqchang/FastNCA.htm" locator-type="url">http://www.eee.hku.hk/~cqchang/FastNCA.htm</inter-ref></p>
<p><b>Contact: </b> <inter-ref locator="cqchang@eee.hku.hk" locator-type="email">cqchang@eee.hku.hk</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Chang, C., Ding, Z., Hung, Y. S., Fung, P. C. W.]]></dc:creator>
<dc:date>2008-04-09</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn131</dc:identifier>
<dc:title><![CDATA[Fast Network Component Analysis (FastNCA) for gene regulatory network reconstruction from microarray data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-09</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn125v1?rss=1">
<title><![CDATA[EPO-KB: A searchable knowledge base of biomarker to protein links]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn125v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> The knowledge base EPO-KB (Empirical Proteomic Ontology Knowledge Base) is based on an OWL ontology that represents current knowledge linking Mass-To-Charge (<I>m/z</I>) ratios to proteins on multiple platforms including Matrix Assisted Laser/Desorption Ionization (MALDI) and Surface Enhanced Laser/Desorption Ionization (SELDI) - Time of Flight (TOF). At present, it contains information on <I>m/z</I> ratio to protein links that were extracted from 120 published research papers.  It has a web interface that allows researchers to query and retrieve putative proteins that correspond to a user-specified <I>m/z</I> ratio. EPO-KB also allows automated entry of additional <I>m/z</I> ratio to protein links and is expandable to the addition of gene to protein and protein to disease links.</p>
<p><b>Availability:</b> <inter-ref locator="http://www.dbmi.pitt.edu/EPO-KB" locator-type="url">http://www.dbmi.pitt.edu/EPO-KB</inter-ref></p>
<p><b>Contact:</b>  <inter-ref locator="JLL47@pitt.edu" locator-type="email">JLL47@pitt.edu</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Lustgarten, J. L., Kimmel, C., Ryberg, H., Hogan, W.]]></dc:creator>
<dc:date>2008-04-09</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn125</dc:identifier>
<dc:title><![CDATA[EPO-KB: A searchable knowledge base of biomarker to protein links]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-09</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn117v1?rss=1">
<title><![CDATA[MedEvi: Retrieving textual evidence of relations between biomedical concepts from Medline]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn117v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> Search engines running on MEDLINE abstracts have been widely used by biologists to find publications that are related to their research. The existing search engines such as PubMed, however, have limitations when applied for the task of seeking textual evidence of relations between given concepts. The limitations are mainly due to the problem that the search engines do not effectively deal with multi-term queries which may imply semantic relations between the terms. To address this problem, we present MedEvi, a novel search engine that imposes positional restriction on occurrences matching multi-term queries, based on the observation that terms with semantic relations which are explicitly stated in text are not found too far from each other. MedEvi further identifies additional keywords of biological and statistical significance from local context of matching occurrences in order to help users reformulate their queries for better results.</p>
<p><b>Availability:</b> <inter-ref locator="http://www.ebi.ac.uk/tc-test/textmining/medevi/" locator-type="url">http://www.ebi.ac.uk/tc-test/textmining/medevi/</inter-ref></p>
<p><b>Contact:</b>  <inter-ref locator="kim@ebi.ac.uk" locator-type="email">kim@ebi.ac.uk</inter-ref>,  <inter-ref locator="pezik@ebi.ac.uk" locator-type="email">pezik@ebi.ac.uk</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Kim, J.-j., Pezik, P., Rebholz-Schuhmann, D.]]></dc:creator>
<dc:date>2008-04-09</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn117</dc:identifier>
<dc:title><![CDATA[MedEvi: Retrieving textual evidence of relations between biomedical concepts from Medline]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-09</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn129v1?rss=1">
<title><![CDATA[adegenet: a R package for the multivariate analysis of genetic markers]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn129v1?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> The package <I>adegenet</I> for the R software is dedicated to the multivariate analysis of genetic markers. It extends the <I>ade4</I> package of multivariate methods by implementing formal classes and functions to manipulate and analyse genetic markers. Data can be imported from common population genetics software and exported to other software and R packages. <I>adegenet</I> also implements standard population genetics tools along with more original approaches for spatial genetics and hybridization.</p>
<p><b>Results:</b> </p>
<p><b>Availability:</b> Stable version is available from CRAN: <inter-ref locator="http://cran.rproject.org/mirrors.html" locator-type="url">http://cran.rproject.org/mirrors.html</inter-ref>. Development version is available from adegenet website: <inter-ref locator="http://adegenet.r-forge.r-project.org/" locator-type="url">http://adegenet.r-forge.r-project.org/</inter-ref>. Both versions can be installed directly from R. adegenet is distributed under the GNU General Public Licence (v.2).</p>
<p><b>Contact:</b>  <inter-ref locator="jombart@biomserv.univ-lyon1.fr" locator-type="email">jombart@biomserv.univ-lyon1.fr</inter-ref></p>
]]></description>
<dc:creator><![CDATA[Jombart, T.]]></dc:creator>
<dc:date>2008-04-08</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn129</dc:identifier>
<dc:title><![CDATA[adegenet: a R package for the multivariate analysis of genetic markers]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-08</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn111v1?rss=1">
<title><![CDATA[Expected Gene Order Distances and Model Selection in Bacteria]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn111v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> The evolutionary distance inferred from gene order comparisons of related bacteria is dependent on the model. Therefore, it is highly important to establish reliable assumptions before inferring its magnitude.</p>
<p><b>Results:</b> We investigate the patterns of dotplots between species of bacteria with the purpose of model selection in gene order problems. We find several categories of data which can be explained by carefully weighing the contributions of reversals, transpositions, symmetrical reversals, single gene transpositions, and single gene reversals. We also derive method of moments distance estimates for some previously uncomputed cases, such as symmetrical reversals, single gene reversals and their combinations, as well as the single gene transpositions edit distance.</p>
<p><b>Contact:</b>  <inter-ref locator="ner@math.chalmers.se" locator-type="email">ner@math.chalmers.se</inter-ref></p>
<p><b>Supplementary information: </b>Available at <I>Bioinformatics</I> online.</p>
]]></description>
<dc:creator><![CDATA[Dalevi, D., Eriksen, N.]]></dc:creator>
<dc:date>2008-04-01</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn111</dc:identifier>
<dc:title><![CDATA[Expected Gene Order Distances and Model Selection in Bacteria]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-04-01</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btn033v2?rss=1">
<title><![CDATA[MiSearch Adaptive PubMed Search Tool]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btn033v2?rss=1</link>
<description><![CDATA[
<p><b>Summary:</b> MiSearch is an adaptive biomedical literature search tool that ranks citations based on a statistical model for the likelihood that a user will choose to view them.  Citation selections are automatically acquired during browsing and used to dynamically update a likelihood model that includes authorship, journal and PubMed indexing information. The user can optionally elect to include or exclude specific features and vary the importance of timeliness in the ranking. </p>
<p><b>Availability:</b> <inter-ref locator="http://misearch.ncibi.org" locator-type="url">http://misearch.ncibi.org</inter-ref></p>
<p><b>Contact:</b> David J. States (<inter-ref locator="dstates@umich.edu" locator-type="email">dstates@umich.edu</inter-ref>).</p>
]]></description>
<dc:creator><![CDATA[States, D. J., Ade, A. S., Wright, Z. C., Bookvich, A. V., Athey, B. D.]]></dc:creator>
<dc:date>2008-03-11</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btn033</dc:identifier>
<dc:title><![CDATA[MiSearch Adaptive PubMed Search Tool]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-03-11</prism:publicationDate>
<prism:section>APPLICATIONS NOTE</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btm634v1?rss=1">
<title><![CDATA[Estimating Large Scale Signaling Networks through Nested Effect Models with Intervention Effects from Microarray Data]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btm634v1?rss=1</link>
<description><![CDATA[
<p><b>Motivation:</b> Targeted interventions using RNA interference in combination with the measurement of secondary effects with DNA microarrays can be used to computationally reverse engineer features of upstream non-transcriptional signaling cascades based on the nested structure of effects.</p>
<p><b>Results:</b> We extend previous work by Markowetz <I>et al</I>., who proposed a statistical framework to score different network hypotheses. Our extensions go in several directions: We show how prior assumptions on the network structure can be incorporated into the scoring scheme by defining appropriate prior distributions on the network structure as well as on hyperparameters. An approach called <I>module networks</I> is introduced to scale up the original approach, which is limited to around 5 genes, to infer large scale networks of more than 30 genes. Instead of the data discretization step needed in the original framework, we propose the usage of a beta-uniform mixture distribution on the <I>p</I>-value profile, resulting from differential gene expression calculation, to quantify effects. Extensive simulations on artificial data and application of our <I>module network</I> approach to infer the signaling network between 13 genes in the ER- pathway in human MCF-7 breast cancer cells show that our approach gives sensible results. Using a bootstrapping and a jackknife approach this reconstruction is found to be statistically stable.</p>
<p><b>Availability:</b> The proposed method is available within the Bioconductor <I>R</I>-package <I>nem</I>.</p>
<p><b>Contact: </b>h.froehlich@dkfz.de</p>
]]></description>
<dc:creator><![CDATA[Frohlich, H., Fellmann, M., Sultmann, H., Poustka, A., Beissbarth, T.]]></dc:creator>
<dc:date>2008-01-28</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btm634</dc:identifier>
<dc:title><![CDATA[Estimating Large Scale Signaling Networks through Nested Effect Models with Intervention Effects from Microarray Data]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2008-01-28</prism:publicationDate>
<prism:section>ORIGINAL PAPER</prism:section>
</item>

<item rdf:about="http://bioinformatics.oxfordjournals.org/cgi/content/short/btm094v2?rss=1">
<title><![CDATA[This paper was published in error before receiving the final version]]></title>
<link>http://bioinformatics.oxfordjournals.org/cgi/content/short/btm094v2?rss=1</link>
<description><![CDATA[]]></description>
<dc:creator><![CDATA[]]></dc:creator>
<dc:date>2007-07-30</dc:date>
<dc:identifier>info:doi/10.1093/bioinformatics/btm094</dc:identifier>
<dc:title><![CDATA[This paper was published in error before receiving the final version]]></dc:title>
<dc:publisher>Oxford University Press</dc:publisher>
<prism:publicationDate>2007-07-30</prism:publicationDate>
<prism:section>ARTICLE</prism:section>
</item>

</rdf:RDF>