Bioinformatics Advance Access originally published online on October 22, 2004
Bioinformatics 2005 21(5):617-623; doi:10.1093/bioinformatics/bti057
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PSORTb v.2.0: Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis
1 Department of Molecular Biology and Biochemistry, Simon Fraser University Burnaby, BC, Canada V5A 1S6
2 Department of Computing Science, Simon Fraser University Burnaby, BC, Canada V5A 1S6
*To whom correspondence should be addressed.
| Abstract |
|---|
|
|
|---|
Motivation: PSORTb v.1.1 is the most precise bacterial localization prediction tool available. However, the program's predictive coverage and recall are low and the method is only applicable to Gram-negative bacteria. The goals of the present work are as follows: increase PSORTb's coverage while maintaining the existing precision level, expand it to include Gram-positive bacteria and then carry out a comparative analysis of localization.
Results: An expanded database of proteins of known localization and new modules using frequent subsequence-based support vector machines was introduced into PSORTb v.2.0. The program attains a precision of 96% for Gram-positive and Gram-negative bacteria and predictive coverage comparable to other tools for whole proteome analysis. We show that the proportion of proteins at each localization is remarkably consistent across species, even in species with varying proteome size.
Availability: Web-based version: http://www.psort.org/psortb. Standalone version: Available through the website under GNU General Public License.
Contact: psort-mail{at}sfu.ca, brinkman{at}sfu.ca
Supplementary information: http://www.psort.org/psortb/supplementaryinfo.html
| INTRODUCTION |
|---|
|
|
|---|
Subcellular localization prediction allows researchers to make inferences regarding a protein's function, to annotate genomes, to design proteomics experiments andparticularly in the case of bacterial pathogen proteinsto identify potential diagnostic, drug and vaccine targets. The last year has seen the release of several localization prediction tools, including CELLO (Yu et al., 2004) and Proteome Analyst (Lu et al., 2004), the only new tools capable of analyzing bacterial proteins. CELLO utilizes an n-peptide composition-based support vector machine (SVM) approach in its analyses, while Proteome Analyst generates predictions using an annotation keyword-based system.
In 2003 we released PSORTb, an open-source tool for localization prediction in Gram-negative bacteria (Gardy et al., 2003). PSORTb uses the multiple classification method approach pioneered by PSORT I (Nakai and Kanehisa, 1991), in which several sequence features known to influence localization are analyzed using different computational techniques. By analyzing features including signal peptides, transmembrane helices, homology to proteins of known localization, amino acid composition and motifs, PSORTb v.1.1 attained a classification precision of 97%. However, the method did not extend to Gram-positive organisms and its predictive coverage when applied to whole proteomesthe number of proteins for which a prediction could be maderemained low, at 28%. The goal of the present work was to expand PSORTb's predictive scope by introducing additional classification methods applicable to both Gram-positive and Gram-negative bacteria, while maintaining the existing standard of high precision. SVM was investigated as a potential method for increasing coverage.
An SVM (Vapnik, 1995) is a kernel learning algorithm, in which all the data are mapped as vectors in n-dimensional feature space. Given training data from two classes (positive and negative), an SVM learns the optimal separating hyperplane that separates the two classes and maximizes their distance from the hyperplane. In the previous works on the applicability of SVMs to the localization classification problem, nucleotide or protein sequences have been modeled as vectors representing amino acid composition (Hua and Sun, 2001; Yu et al., 2004). We proposed, however, that the precision of an SVM could be improved by utilizing frequently occurring subsequences rather than overall amino acid composition. Such common patterns within a group of proteins may indicate the site of a common biochemical mechanism or a structural motif. In an earlier work, we examined the applicability of this method to the classification of outer membrane proteins (She et al., 2003); here, we show that it can be used for high-precision classification of all prokaryotic localization sites.
By the introduction of an SVM-based classifier and expansion of the SCL-BLAST and motif-based analyses, we have significantly improved PSORTb's predictive capacity relative to version 1.1. The program is now capable of generating predictions for Gram-positive bacteria, and is able to make predictions for 75% of a Gram-positive proteome and 57% of a Gram-negative proteome, comparable to the coverage attained by other methods. Five localization sites are predicted for Gram-negative bacteria (cytoplasm, cytoplasmic membrane, periplasm, outer membrane and extracellular) and four for Gram-positive bacteria (cytoplasm, cytoplasmic membrane, cell wall and extracellular), with the program also able to flag potentially multiply localized proteins. PSORTb remains the most precise tool for localization prediction available, with a measured classification precision of 96% for both Gram-negative and Gram-positive bacteria.
The improved coverage attained by PSORTb v.2.0 allowed us to compare the proportion of proteins resident at each localization site across multiple proteomes. We hypothesized that free-living organisms with more diverse environmental niches may contain more membrane proteins in order to facilitate uptake of a variety of materials. We found, however, that the proportion of proteins at a given localization site remains remarkably constant across species, regardless of lifestyle, environmental niche or proteome size.
| SYSTEM AND METHODS |
|---|
|
|
|---|
Dataset
PSORTb v.2.0 was trained and evaluated using an expanded version of the original PSORTdb dataset (Gardy et al., 2003). This updated dataset, the composition of which is shown in Table 1, includes 150 new Gram-negative proteins and 576 new Gram-positive proteins. Each protein's localization site has been experimentally verified and reported in the literature. The dataset is freely available at http://www.psort.org/dataset.
|
PSORTb v.2.0 organization
Similar to PSORTb v.1.1, PSORTb v.2.0 consists of a series of analytical modules, each capable of generating predictions for one or more localization sites. However, several significant changes have been made to the modules in the new version. In version 2.0, the SubLocC module has been replaced with a new SVM-based method, as described below. The signal peptide identification module has now been trained with Gram-positive data in addition to the Gram-negative data used in version 1.1 (Nielsen et al., http://www.cbs.dtu.dk/ftp/signalp). The SCL-BLAST and Motif modules have been expanded as described below. No changes were made to the HMMTOP transmembrane helix identification module (Tusnády and Simon, 2001) or the OMPMotif module.
As in version 1.1, the modules' predictions are weighted and integrated using a Bayesian network in order to generate the final prediction, which comes in the form of a score distribution. When a single localization site displays a score of 7.5 or greater, that site is returned as a final prediction. New to version 2.0 is the multiple localization flaggingif two sites return high scores, a flag of This protein may have multiple localization sites is appended to the final prediction. This flag is triggered when a site scores between 4.0 and 7.49 for Gram-negative proteins, and between 5.0 and 7.49 for Gram-positive proteins. If no site scores above 4.0 or 5.0, depending on the class, a localization site of Unknown is returned. PSORTb's emphasis is on precision, and returning a result of Unknown when not enough information is available to make a prediction avoids potential false positive results.
SCL-BLAST and SCL-BLASTe
PSORTb's SCL-BLAST module assigns putative localizations based on homology to a protein of known localization. Version 2.0 improves the recall associated with this module by implementing a BLASTP search (Altschul et al., 1990) against the expanded PSORTdb database. We have also introduced an exact match filter to detect if a user's query protein is already in the databaseif a query protein displays 100% identity to a protein in PSORTdb with a difference between query and subject length of not more than one character (to account for some users' removal of the initial f-methionine residue), the SCL-BLASTe subroutine returns the localization site associated with the subject protein. In cases where an exact match is identified, the query protein is not analyzed by subsequent modules, enabling a result to be returned faster. The SCL-BLAST module is able to generate predictions for each of the five Gram-negative and four Gram-positive localization sites.
Motifs and profiles
In PSORTb v.1.1, the Motif module scanned a query sequence for the presence of any 1 of 26 PROSITE motifs indicative of specific Gram-negative localization sites. In PSORTb v.2.0, the module has been expanded to include 44 Gram-negative motifs derived from PROSITE v.18 (Hulo et al., 2004), covering all but the cytoplasmic localization site, and 25 Gram-positive motifs covering all 4 localization sites. The complete list of motifs is available at http://www.psort.org/motifs. Each motif has been checked against PSORTdb to ensure that it produces no false positive results. Two motifs used in PSORTb v.1.1 were removed from v.2.0 due to the occurrence of false positives when examined against the expanded PSORTdb.
PSORTb v.2.0 also includes a Profile module, in which localization-specific profiles derived from PROSITE v.18 were selected to generate 100.0% precise predictions against PSORTdb. Each profile is similar to a motif but with position-specific weighting information included, such that more degenerate sequences can be retrieved than via the strict pattern matching of the Motif module. Six profiles were selected, four of which identify both Gram-negative and Gram-positive cytoplasmic proteins and cytoplasmic membrane proteins, and two of which are specific to the Gram-positive cell wall and extracellular sites. The profiles are also available at http://www.psort.org/motifs.
Frequent subsequence-based support vector machines
PSORTb v.2.0 contains a new series of modules utilizing SVMs for classification. Nine SVMs were developed, one for each Gram-negative and Gram-positive localization site. Training data for each SVM consists of a positive class comprising all proteins resident at a specific localization site and a negative class comprising all other proteins of the same Gram category.
Each of the nine positive class datasets was first mined for frequent subsequences using an implementation of the generalized suffix tree (Wang et al., 1994). A subsequence was defined as frequent if it occurred in at least X% of proteins in the positive class of training data, where X is a parameter called minimum support, or MinSup. Multiple values of the MinSup parameter were tested.
SVMLight (Joachims, 2002, http://svmlight.joachims.org/) was used to implement nine SVMs whose feature spaces consisted of the frequent subsequences characteristic of a specific localization site. For each localization site, different SVMs were tested using different combinations of MinSup (range: 0.813%) and kernel (linear, polynomial with degree = 2, radial basis function with
= 0.005). The MinSup/kernel combination giving the highest classification precision combined with a reasonable level of recall (>40%) was selected for inclusion in PSORTb v.2.0. Variations in the margin error penalization parameter C were not evaluated, as our earlier work on the subject showed a negligible effect on precision and recall values (She et al., 2003). The final SVMs implemented in PSORTb v.2.0 utilize LibSVM (Lin, 2003, http://www.csie.ntu.edu.tw/~cjlin/libsvm/).
Evaluation
All evaluations were carried out using 5-fold cross-validation, in each round of which four randomly generated folds of the data were used for training or construction of the module(s) in question and the fifth fold was reserved for testing. Where possible, we have included confusion matrices with our results to aid further evaluation of PSORTb's performance by the researchers using other definitions of accuracy. We have defined precision as TP/(TP + FP) and recall as TP/(TP + FN). In cases where a protein has dual localizations, we count a prediction of either of the two actual sites as a true positive.
| IMPLEMENTATION |
|---|
|
|
|---|
Expanded PSORTb database and SCL-BLAST
PSORTb's SCL-BLAST module predicts localization of a query sequence based on homology to a protein in the PSORTdb database of proteins of experimentally verified localization. It is therefore expected that a larger and more diverse database will lead to an increase in the program's recall. SCL-BLAST v.2.0 utilizes an updated version of the original PSORTdb databaseGram-positive queries are run against the subset of 576 new proteins of Gram-positive origin, and Gram-negative queries are run against the expanded set of Gram-negative proteins. Furthermore, we investigated whether subsets of the Gram-negative and Gram-positive database could be combined. For example, the cytoplasmic sites and cytoplasmic membrane sites were hypothesized to be functionally equivalent, such that a Gram-negative protein could be searched against a BLAST database containing both Gram-negative proteins and Gram-positive cytoplasmic proteins and cytoplasmic membrane proteins. We examined whether a larger database with such combinations of proteins would increase recall even further.
We tested several combined databases using 5-fold cross-validation and found that higher recall and comparable precision was indeed achieved. For Gram-positive results, a database including Gram-negative cytoplasmic proteins, cytoplasmic membrane proteins and extracellular proteins yielded the best predictions. For Gram-negative queries, optimal results were achieved when the queries were searched against a database that included both Gram-positive cytoplasmic proteins and cytoplasmic membrane proteinsincluding extracellular proteins in the database resulted in several periplasmic proteins being falsely predicted as extracellular. The results of 5-fold cross-validation testing of SCL-BLAST v.2.0 for each localization site are shown in Table 2. The Gram-negative version of the module retains the 96% precision exhibited in v.1.1, and improves the recall by 8%. The new Gram-positive version also displays a precision of 96% and a recall of 58%, the lower recall most probably due to the smaller Gram-positive database. It is important to note, however, that such recall values are not to be expected when SCL-BLAST is applied to datasets containing a large number of hypothetical proteins, due to their lack of similarity to proteins in the SCL-BLAST database.
|
Support vector machine-based classification
Our previous work on the applicability of frequent subsequence-based SVM to outer membrane protein prediction (She et al., 2003) led us to examine whether the method was applicable to proteins resident at all nine localization sites. We reasoned that frequent subsequences found in proteins resident at each site represented conserved functional and structural motifs that would yield higher precision classification than methods based on overall amino acid composition alone.
By mining frequent subsequences from each of the nine localization sites, again combining the Gram-positive and Gram-negative cytoplasmic sequences and cytoplasmic membrane sequences, we were able to develop nine SVMs, each capable of classifying a protein as likely being resident at a specific localization site or not. Varying numbers of frequent subsequences were tested, as were different kernel functions, and the combination of frequent subsequences and kernel yielding the highest precision as well as a reasonable level of recall were selected for use in PSORTb v.2.0. The performance and parameters associated with each of the nine SVMs is shown in Table 3.
|
By using a feature space comprising frequent subsequences rather than amino acid composition, we were able to attain high-precision classification across all localization sites. Although the precision values for the two cytoplasmic classifiers are the lowest of the nine values, the 84% precision achieved by the Gram-negative SVM represents a 5% increase relative to the cytoplasmic composition-based SVM SubLocC used in PSORTb v.1.1. We believe that the reduced precision associated with cytoplasmic proteins may be due to the extremely diverse nature of proteins found at this siteproteins found at other sites exhibit more functional and structural constraints, resulting in more unique and characteristic frequent subsequences. This is especially evident when classifying cytoplasmic membrane proteinsthe frequent subsequences mined from this structurally and environmentally constrained group of proteins results in high-precision classification.
We observed that as the MinSup value increased for each classifier, the number of frequent patterns decreased, as did precision; recall, however, remained comparatively stable (Chen et al., unpublished data). We also noted that the best performance is not achieved at the smallest MinSup valuewhen the number of frequent subsequences exceeds a certain level, the performance of the SVM is degraded.
PSORTb v.2.0 performance
The new SVM modules, as well as the updated motif, profile and signal peptide modules, were incorporated into PSORTb v.2.0. As in version 1.1, a Bayesian network was constructed in order to integrate the predictions of all modules to generate a final prediction. Multiple weighting values were tested, and the values yielding the highest precision were used in the final version of PSORTb v.2.0. Five-fold cross-validation was then used to evaluate the Gram-negative and Gram-positive versions of the complete program. The resulting confusion matrices are available as Supplementary Tables S1a and S1b. From the confusion matrices, we calculated the precision and recall values for each localization site for both proteins annotated as having a single localization site (Table 4) and those annotated as having dual localization sites (Table 5).
|
|
On single localization proteins, PSORTb v.2.0 attained precision values of 96% for both classes of organisms, and recall of 83 and 81% for Gram-negative and Gram-positive proteins, respectively. We observed that the precision values remained relatively constant across localization sites, while the recall was highest for membrane proteins, most probably due to their conserved structural motifs readily identifiable by the frequent subsequence-based SVMs, HMMTOP and OMPMotif modules. The Gram-negative version of PSORTb v.2.0 exhibits a 0.7% decrease in precision relative to PSORTb v.1.1; however, an 8% increase in recall is observed.
Performance of the program on proteins annotated as having dual localization sites is comparable to the performance for singly localized proteins with respect to Gram-negative organisms, with a precision of 95% and a recall of 84%. However, we noted that the overall precision for Gram-positive multiply localized proteins was only 75%. Upon inspection, we realized that this was due to six annotated cytoplasmic membrane/cell wall proteins being predicted as cytoplasmic proteins. Noting that singly localized cytoplasmic membrane and cell wall proteins were infrequently mispredicted as cytoplasmic proteins, we investigated these six proteins further. Although experimental evidence supporting a possible cytoplasmic localization was found for only one of the six protiensBacillus subtilis ComGG (Chung et al., 1998)the other five proteins include enzymes and heat-shock proteins, for which cytoplasmic- or peripheral membrane-associated localizations are not uncommon. It may be that rather than making mispredictions, PSORTb is detecting a more complex pattern of localization for certain proteins.
Proteome coverage
The measured recall of a program when evaluated using 5-fold cross-validation does not give an accurate reflection of the predictive coverage when the program is applied to the analysis of whole proteomes. Because the training and testing data consist of a number of well-characterized proteins, a large number of predictions is possible. However, hypothetical proteinswhich make up a notable proportion of a proteomeoften do not contain enough information for a prediction to be generated. We therefore set out to measure PSORTb v.2.0's performance when applied to whole proteomes, with the expectation that we would see an increase in the 28% average coverage of version 1.1.
A total of 106 Gram-negative and 45 Gram-positive proteome files from the NCBI's Microbial Genomes page were analyzed (one organism may have multiple proteome files, each representing a different chromosome), and a complete summary of the results can be found in Supplementary Tables S2. The average coverage when PSORTb v.2.0 is applied to Gram-negative proteomes is 56.7%, and a maximum coverage of 78.8% (Thermotoga maritima) was achieved. When applied to Gram-positive proteomes, the average coverage increases to 74.8%, with a maximum of 83.2% (Bacillus halodurans).
The Gram-positive version of the program displays higher predictive coverage than the Gram-negative version due to the higher recall associated with the Gram-positive cytoplasmic SVM. Cytoplasmic proteins represent the largest class of proteins within the cell, and an improved ability to identify these results in high overall coverage. The level of coverage for each proteome appears to be irrespective of phylogenetic grouping, with predictions being generated as readily for organisms such as spirochetes or mollicutes as for the proteobacteria.
Comparison to other methods
We next set out to compare PSORTb v.2.0's performance with that of other comprehensive web-based predictive tools. Proteome Analyst (Lu et al., 2004) is capable of generating predictions for five Gram-negative localization sites and three Gram-positive sitesit does not differentiate between cell wall and extracellular proteins. CELLO (Yu et al., 2004) generates predictions for the five Gram-negative localization sites only. SubLoc (Hua and Sun, 2001) was not evaluated as it does not predict membrane proteins, and a comparison between PSORT I (Nakai and Kanehisa, 1991) and PSORTb appears in our earlier work (Gardy et al., 2003).
Because both Proteome Analyst and CELLO were trained using the original PSORTb dataset of 1443 Gram-negative proteins, a fair method of assessment was to use proteins not included in these programs' training data. A total of 144 singly localized new Gram-negative proteins in the version of PSORTdb described here were submitted to the Proteome Analyst and CELLO web servers for analysis. For a comparable evaluation of PSORTb v.2.0, the predictions generated for these proteins during the earlier 5-fold cross-validation procedure were used, such that the new proteins were not included in the PSORTb training data. A comparison of the performance of the three programs is given in Table 6, and the associated confusion matrices are available as Supplementary Table S3. A Gram-positive comparison was not carried out, as we were unsure whether Proteome Analyst's Gram-positive training data and that used in PSORTb overlapped.
|
In terms of precision, PSORTb v.2.0 outperforms both Proteome Analyst and CELLO by 7.6 and 26.1%, respectively. The significant difference between PSORTb and CELLO is due to the fact that unlike the other two programs, CELLO forces predictions for each query protein. While this does lead to a prediction generated for every protein in a proteome, the cost in terms of reliability of these predictions is significant. This decreased precision may not be apparent when evaluations are reported using the accuracy measure, in which high recall is able to compensate for lower precision, and illustrates that reporting confusion matrices leads to, epigrammatically enough, the least confusion when comparing the performance of multiple programs.
We also wished to compare the predictive coverage of PSORTb v.2.0 to that of the other programs when applied to the analysis of whole proteomes. Because CELLO generates a prediction in every case, it was not included in the present analysis. In Lu et al. (2004), the authors of Proteome Analyst report predictive coverage for two proteomesone Gram-negative and one Gram-positive. Proteome Analyst displayed a coverage of 75.6% for the Gram-negative bacterium Pseudomonas aeruginosa and 67.2% for the Gram-positive bacterium B.subtilis. When PSORTb v.2.0 was used to analyze the same organisms, it attained a coverage of 68.1% for P.aeruginosa and 76.5% for B.subtilis.
An analysis based on these two proteomes suggests that while Proteome Analyst attains higher coverage on a Gram-negative organism, PSORTb v.2.0 generates more predictions for a Gram-positive proteome. Because Proteome Analyst relies on SWISS-PROT annotation keywords for classification, we believe that PSORTb's sequence feature-based method may yield higher coverage for organisms with little database annotation available.
Comparative proteome analysis
Using the data generated during our analysis of whole proteome predictive coverage (Supplementary Tables S2a and S2b), we investigated our hypothesis that free-living organisms might exhibit a higher than normal proportion of membrane proteins. The proportion of proteins at each localization site was determined, both as a fraction of the total predictions and as a fraction of total proteome size. Table 7 shows the average and standard deviation for both types of calculation, as well as the correlation coefficient between the number of proteins predicted at a given site and overall proteome size.
|
Cytoplasmic proteins and cytoplasmic membrane proteins represent the largest fractions of the proteome, and the large sample size yields a high correlation coefficient. This indicates that proteome size and not lifestyle or other factors is the primary determinant for the number of proteins at a given site. This correlation is evident for most other localization sites as well, with only the cell wall showing variable values. Because the cell wall represents a comparatively tiny fraction of the proteome, however, this variability may be attributed to a small sample size.
When the data are visualized as a scatter plot (Supplementary Figure S1), these constant proportions are more easily observable. Several points of interest also become obvious. Two Gram-negative organisms, T.maritima and Aquifex aeolicusorganisms that are found near the base of the tree of lifeappear to have unusually high proportions of cytoplasmic proteins. The mycoplasmas, noted for their smaller genomes and membrane protein variability, exhibit higher than normal proportions of outer membrane proteins.
| DISCUSSION |
|---|
|
|
|---|
We have developed PSORTb v.2.0, an updated version of the PSORTb tool for the prediction of bacterial protein subcellular localization. Version 2.0 improves significantly upon the original release of the program, with its predictive capability extended to include Gram-positive organisms and its predictive coverage increased. A flag indicating potentially multiply localized proteins has also been added. PSORTb's existing standard of high precision is maintained, and with a measured precision of 96%, the program continues to be the most precise tool for bacterial localization prediction available.
We attribute the 2-fold increase in predictive coverage primarily to the incorporation of nine frequent subsequence-based SVM modules. With a higher precision than the amino acid composition-based SVMs such as SubLoc and CELLO, our method allows for the classification of proteins based on characteristic patterns that might not have been detected through conventional methods, such as multiple sequence alignment. The SVMs have allowed us to address concerns raised with the first release of PSORTb, namely how to identify a larger number of cytoplasmic proteins, as well as cytoplasmic membrane proteins with three or fewer predicted helices. Cytoplasmic proteins in particular are a large and diverse group of proteins and represent the majority of proteins encoded for by a genome. The ability to identify these proteins is the key to attaining a high predictive coverage rate, and we are interested in pursuing ways of increasing our SVM's ability to detect these proteins, particularly in the case of Gram-negative organisms.
Comparison with other available predictive tools shows that PSORTb remains the most precise predictive method available. We believe this is the result of two aspects of the program. First, a recent review has highlighted the importance of utilizing multiple methods for localization prediction (Schneider and Fechner, 2004), and PSORTb is one of the few localization prediction methods to take such an approach. Second, PSORTb does not force predictionsif we are unable to generate a confident prediction, the program will return a result of Unknown. Instead of optimizing precision and recall, we have always chosen to emphasize precision in the development of PSORTb, reasoning that biologists are searching for correct results and, given the choice of an incorrect result or an Unknown result, prefer Unknown.
Comparative analysis also highlights the importance of publishing confusion matricesdifferent tools use different metrics in their reporting, and often the definition of a particular metric varies between groups. Without access to actual predictions, it can be difficult to objectively assess multiple predictive tools. Still, we show here that of the other localization tools currently available, Proteome Analyst offers an excellent complement to PSORTbdespite a slightly lower precision, Proteome Analyst's recall of Gram-negative proteins is especially good.
PSORTb's significant increase in predictive coverage allowed us to examine the distribution of proteins across localization sites on a proteome-wide scale. We found that with very few exceptions the proportions of proteins found at each localization site remained notably consistent across species, regardless of lifestyle, physiology or proteome size. This may reflect the nature of biological networks. When a new gene is introduced into an organism, its product will carry out certain new functions. In order for these new functions to be of any benefit to the organism, however, more new genes and gene products might be necessarythe proteins that will form a complete new functional pathway capable of interacting with other pathways in the cell. The new proteins that constitute the pathway will likely span different cellular compartments. For example, an organism taking up residence in a new environment may develop a series of membrane transporters to take up nutrients from and sense its surroundings; however, it must also develop the cytoplasmic components necessary for processing the incoming nutrients and signals.
PSORTb version 2.0 is available on the Web at http://www.psort.org/psortb. Users can submit one or multiple query sequences over the Web for analysis, selecting one of three possible output formats, or can download a standalone version of the program under the GNU General Public License. All of the completed microbial genomes listed on the NCBI have precomputed results available for download at http://www.psort.org/genomes/, and this Genomes page will be updated as new genomes are released. The PSORTdb dataset used in the development of the program is also available on the website, along with other resources for subcellular localization prediction, including links to other tools and datasets of interest. We believe that PSORTb v.2.0 and psort.org are valuable resources to the microbiology and localization prediction communities. We offer an open source, flexible and high coverage predictive tool, which is presently the most precise localization prediction method available. Utilizing the program for comparative proteome analysis has already generated interesting results regarding the proportion of proteins in different cellular compartments, and we are presently investigating this area further.
| Acknowledgments |
|---|
The authors would like to thank Cory Spencer for his work on the first version of PSORTb, as well as Rong She, Ke Wang and other members of the data mining group at Simon Fraser University's Department of Computing Science. J.L.G. and F.S.L.B. are a Michael Smith Foundation for Health Research Trainee and Scholar, respectively. Funding for this work was provided by the Natural Sciences and Engineering Research Council of Canada.
Received on July 5, 2004; revised on September 17, 2004; accepted on September 28, 2004
| REFERENCES |
|---|
|
|
|---|
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403410[CrossRef][Web of Science][Medline].
Chung, Y.S., Breidt, F., Dubnau, D. (1998) Cell surface localization and processing of the ComG proteins, required for DNA binding during transformation of Bacillus subtilis . Mol. Microbiol., 29, 905913[CrossRef][Web of Science][Medline].
Gardy, J.L., Spencer, C., Wang, K., Ester, M., Tusnády, G.E., Simon, I., Hua, S., deFays, K., Lambert, C., Nakai, K., Brinkman, F.S.L. (2003) PSORTb: improving protein subcellular localization prediction for Gram-negative bacteria. Nucleic Acids Res., 31, 36133617
Hua, S. and Sun, Z. (2001) Support vector machine approach for protein subcellular localization prediction. Bioinformatics, 17, 721728
Hulo, N., Sigrist, C.J.A., Le Saux, V., Langendijk-Genevaux, P.S., Bordoli, L., Gattiker, A., De Castro, E., Bucher, P., Bairoch, A. (2004) Recent improvements to the PROSITE database. Nucleic Acids Res., 32, D134D137
Lu, Z., Szafron, D., Greiner, R., Lu, P., Wishart, D.S., Poulin, B., Anvik, J., Macdonell, C., Eisner, R. (2004) Predicting subcellular localization of proteins using machine-learned classifiers. Bioinformatics, 20, 547556
Nakai, K. and Kanehisa, M. (1991) Expert system for predicting protein localization sites in Gram-negative bacteria. Proteins, 11, 95110[CrossRef][Web of Science][Medline].
Nielsen, H., Engelbrecht, J., Brunak, S., von Heijne, G. (1997) Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Eng., 10, 16
Schneider, G. and Fechner, U. (2004) Advances in the prediction of protein targeting signals. Proteomics, 4, 15711580[CrossRef][Web of Science][Medline].
She, R., Chen, F., Wang, K., Ester, M., Gardy, J.L., Brinkman, F.S.L. (2003) Frequent-subsequence-based prediction of outer membrane proteins. Proceedings of the Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, , NY ACM Press, pp. 436445.
Tusnády, G.E. and Simon, I. (2001) The HMMTOP transmembrane topology prediction server. Bioinformatics, 17, 849850
Vapnik, V. The Nature of Statistical Learning Theory, (1995) , NY Springer.
Wang, J.T., Chirn, G., Marr, T.G., Shapiro, B., Shasha, D., Zhang, K. (1994) Combinatorial pattern discovery for scientific data: some preliminary results. Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, , NY ACM Press, pp. 115125.
Yu, C., Lin, C., Hwang, J. (2004) Predicting subcellular localization of proteins for Gram-negative bacteria by support vector machines based on n-peptide compositions. Protein Sci., 13, 14021406[CrossRef][Web of Science][Medline].
This article has been cited by other articles:
![]() |
R. Belas, E. Horikawa, S.-I. Aizawa, and R. Suvanasuthi Genetic Determinants of Silicibacter sp. TM1040 Motility J. Bacteriol., July 15, 2009; 191(14): 4502 - 4512. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Mukhopadhyay, A. Akmal, A. C. Stewart, R.-c. Hsia, and T. D. Read Identification of Bacillus anthracis Spore Component Antigens Conserved across Diverse Bacillus cereus sensu lato Strains Mol. Cell. Proteomics, June 1, 2009; 8(6): 1174 - 1191. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Barabote, G. Xie, D. H. Leu, P. Normand, A. Necsulea, V. Daubin, C. Medigue, W. S. Adney, X. C. Xu, A. Lapidus, et al. Complete genome of the cellulolytic thermophile Acidothermus cellulolyticus 11B provides insights into its ecophysiological and evolutionary adaptations Genome Res., June 1, 2009; 19(6): 1033 - 1043. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Elias, A. Mukhopadhyay, M. P. Joachimiak, E. C. Drury, A. M. Redding, H.-C. B. Yen, M. W. Fields, T. C. Hazen, A. P. Arkin, J. D. Keasling, et al. Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation Nucleic Acids Res., May 1, 2009; 37(9): 2926 - 2939. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Ouyang, M. He, T. Oman, X. F. Yang, and M. V. Norgard A manganese transporter, BB0219 (BmtA), is required for virulence by the Lyme disease spirochete, Borrelia burgdorferi PNAS, March 3, 2009; 106(9): 3449 - 3454. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Rossier, J. Dao, and N. P. Cianciotto A type II secreted RNase of Legionella pneumophila facilitates optimal intracellular infection of Hartmannella vermiformis Microbiology, March 1, 2009; 155(3): 882 - 890. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. I. Hobb, J. A. Fields, C. M. Burns, and S. A. Thompson Evaluation of procedures for outer membrane isolation from Campylobacter jejuni Microbiology, March 1, 2009; 155(3): 979 - 988. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Wheeler Outer membrane proteomics of Pasteurella multocida isolates to identify putative host-specificity determinants Bioscience Horizons, March 1, 2009; 2(1): 1 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. P. Faucher, C. Forest, M. Beland, and F. Daigle A novel PhoP-regulated locus encoding the cytolysin ClyA and the secreted invasin TaiA of Salmonella enterica serovar Typhi is involved in virulence Microbiology, February 1, 2009; 155(2): 477 - 488. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pina, A. Olvera, A. Barcelo, and A. Bensaid Trimeric Autotransporters of Haemophilus parasuis: Generation of an Extensive Passenger Domain Repertoire Specific for Pathogenic Strains J. Bacteriol., January 15, 2009; 191(2): 576 - 587. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Winstanley, M. G.I. Langille, J. L. Fothergill, I. Kukavica-Ibrulj, C. Paradis-Bleau, F. Sanschagrin, N. R. Thomson, G. L. Winsor, M. A. Quail, N. Lennard, et al. Newly introduced genomic prophage islands are critical determinants of in vivo competitiveness in the Liverpool Epidemic Strain of Pseudomonas aeruginosa Genome Res., January 1, 2009; 19(1): 12 - 23. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Winsor, T. Van Rossum, R. Lo, B. Khaira, M. D. Whiteside, R. E. W. Hancock, and F. S. L. Brinkman Pseudomonas Genome Database: facilitating user-friendly, comprehensive comparisons of microbial genomes Nucleic Acids Res., January 1, 2009; 37(suppl_1): D483 - D488. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Hommais, C. Oger-Desfeux, F. Van Gijsegem, S. Castang, S. Ligori, D. Expert, W. Nasser, and S. Reverchon PecS Is a Global Regulator of the Symptomatic Phase in the Phytopathogenic Bacterium Erwinia chrysanthemi 3937 J. Bacteriol., November 15, 2008; 190(22): 7508 - 7522. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Bjorling and M. Uhlen Antibodypedia, a Portal for Sharing Antibody and Antigen Validation Data Mol. Cell. Proteomics, October 1, 2008; 7(10): 2028 - 2037. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Alves-Pereira, J. Canales, A. Cabezas, P. Martin Cordero, M. J. Costas, and J. C. Cameselle CDP-Alcohol Hydrolase, a Very Efficient Activity of the 5'-Nucleotidase/UDP-Sugar Hydrolase Encoded by the ushA Gene of Yersinia intermedia and Escherichia coli J. Bacteriol., September 15, 2008; 190(18): 6153 - 6161. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. J. Tudor, J. J. Davis, M. Panichella, and A. Zwolak Isolation of Predation-Deficient Mutants of Bdellovibrio bacteriovorus by Using Transposon Mutagenesis Appl. Envir. Microbiol., September 1, 2008; 74(17): 5436 - 5443. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. G. Barbour, A. Jasinskas, M. A. Kayala, D. H. Davies, A. C. Steere, P. Baldi, and P. L. Felgner A Genome-Wide Proteome Array Reveals a Limited Set of Immunogens in Natural Infections of Humans and White-Footed Mice with Borrelia burgdorferi Infect. Immun., August 1, 2008; 76(8): 3374 - 3389. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Jones, J. Deck, R. D. Edmondson, and M. E. Hart Relative Quantitative Comparisons of the Extracellular Protein Profiles of Staphylococcus aureus UAMS-1 and Its sarA, agr, and sarA agr Regulatory Mutants Using One-Dimensional Polyacrylamide Gel Electrophoresis and Nanocapillary Liquid Chromatography Coupled with Tandem Mass Spectrometry J. Bacteriol., August 1, 2008; 190(15): 5265 - 5278. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Dumetz, E. Duchaud, S. Claverol, N. Orieux, S. Papillon, D. Lapaillerie, and M. Le Henaff Analysis of the Flavobacterium psychrophilum outer-membrane subproteome and identification of new antigenic targets for vaccine by immunomics Microbiology, June 1, 2008; 154(6): 1793 - 1801. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-K. Jun, Y.-M. Kang, H.-R. Lee, S.-H. Lee, and B.-K. Choi Highly Conserved Surface Proteins of Oral Spirochetes as Adhesins and Potent Inducers of Proinflammatory and Osteoclastogenic Factors Infect. Immun., June 1, 2008; 76(6): 2428 - 2438. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Galka, S. N. Wai, H. Kusch, S. Engelmann, M. Hecker, B. Schmeck, S. Hippenstiel, B. E. Uhlin, and M. Steinert Proteomic Characterization of the Whole Secretome of Legionella pneumophila and Functional Analysis of Outer Membrane Vesicles Infect. Immun., May 1, 2008; 76(5): 1825 - 1836. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Onder, S. Turkarslan, D. Sun, and F. Daldal Overproduction or Absence of the Periplasmic Protease DegP Severely Compromises Bacterial Growth in the Absence of the Dithiol: Disulfide Oxidoreductase DsbA Mol. Cell. Proteomics, May 1, 2008; 7(5): 875 - 890. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Soderberg and N. P. Cianciotto A Legionella pneumophila Peptidyl-Prolyl cis-trans Isomerase Present in Culture Supernatants Is Necessary for Optimal Growth at Low Temperatures Appl. Envir. Microbiol., March 1, 2008; 74(5): 1634 - 1638. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. L. Obuchowski and C. Jacobs-Wagner PflI, a Protein Involved in Flagellar Positioning in Caulobacter crescentus J. Bacteriol., March 1, 2008; 190(5): 1718 - 1729. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Berlanda Scorza, F. Doro, M. J. Rodriguez-Ortega, M. Stella, S. Liberatori, A. R. Taddei, L. Serino, D. Gomes Moriel, B. Nesta, M. R. Fontana, et al. Proteomics Characterization of Outer Membrane Vesicles from the Extraintestinal Pathogenic Escherichia coli {Delta}tolR IHE3034 Mutant Mol. Cell. Proteomics, March 1, 2008; 7(3): 473 - 485. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. P. Bannantine, M. L. Paustian, W. R. Waters, J. R. Stabel, M. V. Palmer, L. Li, and V. Kapur Profiling Bovine Antibody Responses to Mycobacterium avium subsp. paratuberculosis Infection by Using Protein Arrays Infect. Immun., February 1, 2008; 76(2): 739 - 749. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Clock, P. J. Planet, B. A. Perez, and D. H. Figurski Outer Membrane Components of the Tad (Tight Adherence) Secreton of Aggregatibacter actinomycetemcomitans J. Bacteriol., February 1, 2008; 190(3): 980 - 990. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ibrahim, P. Nicolas, P. Bessieres, A. Bolotin, V. Monnet, and R. Gardan A genome-wide survey of short coding sequences in streptococci Microbiology, November 1, 2007; 153(11): 3631 - 3644. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Russell and K. C. Keiler Peptide Signals Encode Protein Localization J. Bacteriol., November 1, 2007; 189(21): 7581 - 7585. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Nelson, P. P. Glocka, S. Agarwal, D. P. Grimm, and M. J. McBride Flavobacterium johnsoniae SprA Is a Cell Surface Protein Involved in Gliding Motility J. Bacteriol., October 1, 2007; 189(19): 7145 - 7150. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Myers-Morales, C. Cowan, M. E. Gray, C. R. Wulff, C. E. Parker, C. H. Borchers, and S. C. Straley A Surface-Focused Biotinylation Procedure Identifies the Yersinia pestis Catalase KatY as a Membrane-Associated but Non-Surface-Located Protein Appl. Envir. Microbiol., September 15, 2007; 73(18): 5750 - 5759. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-K. Jun, H.-R. Lee, S.-H. Lee, and B.-K. Choi Mapping of the proinflammatory domains of MspTL of Treponema lecithinolyticum Microbiology, August 1, 2007; 153(8): 2386 - 2392. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Lopez-Serrano, F. Solano, and A. Sanchez-Amat Involvement of a novel copper chaperone in tyrosinase activity and melanin synthesis in Marinomonas mediterranea Microbiology, July 1, 2007; 153(7): 2241 - 2249. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Movahedi and D. J. Hampson Distribution of the clpX gene in Brachyspira species and reactivity of recombinant Brachyspira pilosicoli ClpX with sera from mice and humans J. Med. Microbiol., July 1, 2007; 56(7): 930 - 936. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Sundaresh, A. Randall, B. Unal, J. M. Petersen, J. T. Belisle, M. Gill Hartley, M. Duffield, R. W. Titball, D. H. Davies, P. L. Felgner, et al. From protein microarrays to diagnostic antigen discovery: a study of the pathogen Francisella tularensis Bioinformatics, July 1, 2007; 23(13): i508 - i518. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Xie, D. C. Bruce, J. F. Challacombe, O. Chertkov, J. C. Detter, P. Gilna, C. S. Han, S. Lucas, M. Misra, G. L. Myers, et al. Genome Sequence of the Cellulolytic Gliding Bacterium Cytophaga hutchinsonii Appl. Envir. Microbiol., June 1, 2007; 73(11): 3536 - 3546. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. K. Boonjakuakul, H. L. Gerns, Y.-T. Chen, L. D. Hicks, M. F. Minnick, S. E. Dixon, S. C. Hall, and J. E. Koehler Proteomic and Immunoblot Analyses of Bartonella quintana Total Membrane Proteins Identify Antigens Recognized by Sera from Infected Patients Infect. Immun., May 1, 2007; 75(5): 2548 - 2561. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Ruiz, S. Falcocchio, F. I. J. Pastor, L. Saso, and P. Diaz Helicobacter pylori EstV: Identification, Cloning, and Characterization of the First Lipase Isolated from an Epsilon-Proteobacterium Appl. Envir. Microbiol., April 15, 2007; 73(8): 2423 - 2431. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Barsky, J. L. Gardy, R. E. W. Hancock, and T. Munzner Cerebral: a Cytoscape plugin for layout of and interaction with biological networks using subcellular localization annotation Bioinformatics, April 15, 2007; 23(8): 1040 - 1042. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Y. Abajy, J. Kopec, K. Schiwon, M. Burzynski, M. Doring, C. Bohn, and E. Grohmann A Type IV-Secretion-Like System Is Required for Conjugative DNA Transport of Broad-Host-Range Plasmid pIP501 in Gram-Positive Bacteria J. Bacteriol., March 15, 2007; 189(6): 2487 - 2496. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. N. Williams, P. J. Skipp, H. E. Humphries, M. Christodoulides, C. D. O'Connor, and J. E. Heckels Proteomic Analysis of Outer Membranes and Vesicles from Wild-Type Serogroup B Neisseria meningitidis and a Lipopolysaccharide-Deficient Mutant Infect. Immun., March 1, 2007; 75(3): 1364 - 1372. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. M. Meier, P. Muschler, and B. E. Scharf Functional Analysis of Nine Putative Chemoreceptor Proteins in Sinorhizobium meliloti J. Bacteriol., March 1, 2007; 189(5): 1816 - 1826. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. F. da Silva Neto, T. Koide, S. L. Gomes, and M. V. Marques The Single Extracytoplasmic-Function Sigma Factor of Xylella fastidiosa Is Involved in the Heat Shock Response and Presents an Unusual Regulatory Mechanism J. Bacteriol., January 15, 2007; 189(2): 551 - 560. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. K. McNeil, C. Reich, R. K. Aziz, D. Bartels, M. Cohoon, T. Disz, R. A. Edwards, S. Gerdes, K. Hwang, M. Kubal, et al. The National Microbial Pathogen Database Resource (NMPDR): a genomics platform based on subsystem annotation Nucleic Acids Res., January 12, 2007; 35(suppl_1): D347 - D353. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. DebRoy, J. Dao, M. Soderberg, O. Rossier, and N. P. Cianciotto Legionella pneumophila type II secretome reveals unique exoproteins and a chitinase that promotes bacterial persistence in the lung PNAS, December 12, 2006; 103(50): 19146 - 19151. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Rohmer, M. Brittnacher, K. Svensson, D. Buckley, E. Haugen, Y. Zhou, J. Chang, R. Levy, H. Hayden, M. Forsman, et al. Potential Source of Francisella tularensis Live Vaccine Strain Attenuation Determined by Genome Comparison Infect. Immun., December 1, 2006; 74(12): 6895 - 6906. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Haveman, D. E. Holmes, Y.-H. R. Ding, J. E. Ward, R. J. DiDonato Jr., and D. R. Lovley c-Type Cytochromes in Pelobacter carbinolicus Appl. Envir. Microbiol., November 1, 2006; 72(11): 6980 - 6985. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Tomich, D. H. Fine, and D. H. Figurski The TadV Protein of Actinobacillus actinomycetemcomitans Is a Novel Aspartic Acid Prepilin Peptidase Required for Maturation of the Flp1 Pilin and TadE and TadF Pseudopilins. J. Bacteriol., October 1, 2006; 188(19): 6899 - 6914. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Marczak, A. Mazur, J. E. Krol, W. I. Gruszecki, and A. Skorupska Lipoprotein PssN of Rhizobium leguminosarum bv. trifolii: Subcellular Localization and Possible Involvement in Exopolysaccharide Export. J. Bacteriol., October 1, 2006; 188(19): 6943 - 6952. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Seers, N. Slakeski, P. D. Veith, T. Nikolof, Y.-Y. Chen, S. G. Dashper, and E. C. Reynolds The RgpB C-Terminal Domain Has a Role in Attachment of RgpB to the Outer Membrane and Belongs to a Novel C-Terminal-Domain Family Found in Porphyromonas gingivalis. J. Bacteriol., September 1, 2006; 188(17): 6376 - 6386. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Gil, G. J. Platz, C. A. Forestal, M. Monfett, C. S. Bakshi, T. J. Sellati, M. B. Furie, J. L. Benach, and D. G. Thanassi Deletion of TolC orthologs in Francisella tularensis identifies roles in multidrug resistance and virulence PNAS, August 22, 2006; 103(34): 12897 - 12902. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, V. Loux, R. Bossy, P. Nicolas, S. Chaillou, M. van de Guchte, S. Penaud, E. Maguin, M. Hoebeke, P. Bessieres, et al. AGMIAL: implementing an annotation strategy for prokaryote genomes as a distributed system Nucleic Acids Res., July 19, 2006; 34(12): 3533 - 3545. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. L. Nunn, S. A. Shaffer, A. Scherl, B. Gallis, M. Wu, S. I. Miller, and D. R. Goodlett Comparison of a Salmonella typhimurium proteome defined by shotgun proteomics directly on an LTQ-FT and by proteome pre-fractionation on an LCQ-DUO. Brief Funct Genomic Proteomic, June 1, 2006; 5(2): 154 - 168. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Alvarez, P. Secades, M. Prieto, M. J. McBride, and J. A. Guijarro A Mutation in Flavobacterium psychrophilum tlpB Inhibits Gliding Motility and Induces Biofilm Formation. Appl. Envir. Microbiol., June 1, 2006; 72(6): 4044 - 4053. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. E. Barrios-Llerena, P. K. Chong, C. S. Gan, A. P. L. Snijders, K. F. Reardon, and P. C. Wright Shotgun proteomics of cyanobacteria--applications of experimental and data-mining techniques Brief Funct Genomic Proteomic, June 1, 2006; 5(2): 121 - 132. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Paddick, S. R. Brailsford, S. Rao, R. F. Soares, E. A. M. Kidd, D. Beighton, and K. A. Homer Effect of Biofilm Growth on Expression of Surface Proteins of Actinomyces naeslundii Genospecies 2. Appl. Envir. Microbiol., May 1, 2006; 72(5): 3774 - 3779. [Abstract] [Full Text] [PDF] |
||||
![]() |
B.-C. Kim, X. Qian, C. Leang, M. V. Coppi, and D. R. Lovley Two Putative c-Type Multiheme Cytochromes Required for the Expression of OmcB, an Outer Membrane Protein Essential for Optimal Fe(III) Reduction in Geobacter sulfurreducens. J. Bacteriol., April 1, 2006; 188(8): 3138 - 3142. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. T. Reece, G. Ireton, R. Mohamath, J. Guderian, W. Goto, R. Gelber, N. Groathouse, J. Spencer, P. Brennan, and S. G. Reed ML0405 and ML2331 Are Antigens of Mycobacterium leprae with Potential for Diagnosis of Leprosy. Clin. Vaccine Immunol., March 1, 2006; 13(3): 333 - 340. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Mooney, N. D. O'Leary, and A. D. W. Dobson Cloning and Functional Characterization of the styE Gene, Involved in Styrene Transport in Pseudomonas putida CA-3 Appl. Envir. Microbiol., February 1, 2006; 72(2): 1302 - 1309. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. R. Couturier, E. Tasca, C. Montecucco, and M. Stein Interaction with CagF Is Required for Translocation of CagA into the Host via the Helicobacter pylori Type IV Secretion System Infect. Immun., January 1, 2006; 74(1): 273 - 281. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Glasner, M. Rusch, P. Liss, G. Plunkett III, E. L. Cabot, A. Darling, B. D. Anderson, P. Infield-Harm, M. C. Gilson, and N. T. Perna ASAP: a resource for annotating, curating, comparing, and disseminating genomic data Nucleic Acids Res., January 1, 2006; 34(suppl_1): D41 - D45. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Xie, A. Li, M. Wang, Z. Fan, and H. Feng LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST Nucleic Acids Res., July 1, 2005; 33(suppl_2): W105 - W110. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||












