Bioinformatics Advance Access originally published online on September 5, 2006
Bioinformatics 2006 22(22):2711-2714; doi:10.1093/bioinformatics/btl468
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
DUF283 domain of Dicer proteins has a double-stranded RNA-binding fold
Department of Microbiology, Montana State University Bozeman, MT 59717, USA
| ABSTRACT |
|---|
|
|
|---|
Two RNases, Dicer and Argonaute, are at the heart of the RNA interference (RNAi) molecular machinery responsible for gene silencing. Both RNases contain multiple domains, most of which have been characterized or have functions that can be predicted based on sequence comparisons. However, Dicers of higher eukaryotes contain the domain known as DUF283 which at present has no assigned role. Using sensitive profileprofile comparisons, we detected a divergent double-stranded RNA-binding domain coinciding with the DUF283 of Dicer. This finding has potential implications regarding the mechanistic role of Dicer in RNAi.
Contact: mdlakic{at}montana.edu
Supplementary information: Supplementary images are available at Bioinformatics online.
| 1 INTRODUCTION |
|---|
|
|
|---|
RNA interference (RNAi) is an evolutionarily conserved mechanism for gene silencing using double-stranded RNA (dsRNA) molecules known as small interfering RNAs (siRNAs). Two nucleases, an RNase III enzyme Dicer and RNase H enzyme Argonaute, are essential for the initiation and effector phases of RNAi, respectively (Bernstein et al., 2001; Hammond et al., 2001). Recent structural and biochemical studies provided insights into functions of both enzymes (reviewed in Collins and Cheng, 2005; Hammond, 2005), yet many questions remain regarding the mechanistic aspects of RNAi (Tomari and Zamore, 2005). For example, Dicers of higher eukaryotes contain multiple protein domains for which the general function is known or can be predicted with confidence (Bernstein et al., 2001), yet the function is unknown for the so-called Domain of Unknown Function 283 (DUF283). DUF283 is classified in PFAM (Bateman et al., 2004) as a distinct domain based on its sequence conservation among Dicer-like proteins. However, even the sensitive sequence profile tools, such as HMMer and PSI-BLAST, failed to assign any function to this domain as it lacks clear similarity to characterized proteins. We used recently developed methods for profileprofile comparisons to show that DUF283 has a fold similar to the double-stranded RNA-binding domain (dsRBD). This finding is discussed in the light of currently unanswered questions about the mechanistic role of Dicer proteins.
| 2 METHODS |
|---|
|
|
|---|
The consensus sequences of 8183 Pfam 19.0 families were extracted from their profile hidden Markov models (HMMs; Bateman et al., 2004). The PDB-derived collection (as of March, 2006) was clustered at 95% identity, and both groups of sequences were used as queries for the Target2K procedure (Karplus et al., 2005), which automatically searches the protein database and builds an alignment of identified homologs. Predicted secondary structure (Jones, 1999) was added to the alignments to generate profile HMMs (Soding, 2005). The principal difference between these databases and the ones distributed on the HHpred server (Soding et al., 2005) is that the alignments are, on average, more accurate yet include fewer sequences. Consensus sequence for DUF283 was also submitted to the 3D-Jury meta server (Ginalski et al., 2003).
Model building was done using MODELLER 8v2 (Eswar et al., 2003) and the side-chains were rebuilt using Dunbrack's backbone-dependent rotamer library (Canutescu et al., 2003). All models were evaluated by Prosa (Sippl, 1993), Verify3D (Luthy et al., 1992) and ProQ/MaxSub (Wallner and Elofsson, 2003).
| 3 RESULTS AND DISCUSSION |
|---|
|
|
|---|
The development of novel computational methods in recent years has stretched the limits of identifying meaningful biological similarities from remote protein homology (Petrey and Honig, 2005). To search for distant relatives of DUF283, we used one such method (Soding, 2005) that employs pairwise alignments of HMMs. At present, this approach offers arguably the best combination of sensitivity needed to detect subtle similarities and the speed required to scan large databases of protein families (Soding, 2005). The consensus sequence of DUF283 (Pfam code PF03368) was extracted from the HMM profile deposited in Pfam (Bateman et al., 2004) and was used to generate the profile HMM of DUF283 as described above. We compared this HMM to protein families in Pfam and the search yielded significant similarity to the dsRNA binding motif (PF00035; E = 4.9 x 107). This finding was confirmed when comparing DUF283 profile HMM to known protein structures in PDB, as >10 significant matches to dsRNA binding domains (dsRBDs) were detected with E-values between 9 x 107 and 1 x 103. The credibility of this prediction is strengthened by an excellent match between predicted secondary structure of DUF283 (Jones, 1999) and the experimentally determined secondary structure of dsRNA binding proteins (ss_pred and ss_dssp lines in Fig. 1, respectively). Finally, the consensus sequence of DUF283 was submitted to the 3D-Jury server (http://bioinfo.pl/Meta/), which returned significant matches (scores >50) to three dsRBD-containing proteins (Ginalski et al., 2003).
|
Due to low target-template sequence identity (<20%), we had to use a trial-and-error strategy for building 3D models of human and Drosophila's DUF283. After each cycle of model building and evaluation, the alignments were changed manually in regions that had unfavorable energy as judged by Prosa scores. The final model of Drosophila's DUF283 (Fig. 1) had the evaluation scores that are consistent with good to very good models: 4.57 for Prosa (Sippl, 1993); 19.49 for Verify3D (Luthy et al., 1992); 6.85 for ProQ and 0.72 for MaxSub (Wallner and Elofsson, 2003). In addition, the overall fold of the model is classified as correct based on the pG score of 0.91 (Sanchez and Sali, 1998).
dsRBD proteins adopt an
-ß-ß-ß-
topology that preferentially binds dsRNA over ssRNA or dsDNA, although without much sequence specificity (Ryter and Schultz, 1998). dsRBD is commonly found in many proteins involved in RNA processing (Doyle and Jantsch, 2002), including all three classes of RNase III enzymes (Hammond, 2005). Bacterial RNase III enzymes (Class I) contain one dsRBD and one RNase III domain and they form highly symmetric homodimers (Zhang et al., 2004; Gan et al., 2006). Dicers of higher eukaryotes, on the other hand, contain two joined RNase III domains that form an intramolecular dimer (Zhang et al., 2004), and have only one canonical dsRBD (Class III). Given the diversity of dsRNAs in eukaryotic cells, it is not clear how a single dsRBD could provide Dicers with distinct specificity for genuine siRNAs. Part of this functionality may be supplied by the PAZ domain (Lingel et al., 2004; Ma et al., 2004), or in trans by Dicer's binding partners that contain dsRBDs (Tabara et al., 2002). Our finding provides an additional avenue for future experiments aimed at better understanding of Dicer's binding specificity.
In a recent article, MacRae et al. (2006) presented the structure of Dicer from Giardia intestinalis. This seminal work provided the platform for understanding dsRNA processing that takes place during the initiation phase of RNAi. MacRae et al. (2006) suggested, based on low but consistent sequence similarity between the N-terminal platform domain (NTPD) of Giardia Dicer and DUF283 (shown in their Supplementary Figure 4), that these two domains have similar structures. However, we find no proof for this sequence similarity in terms of statistical significance. For example, profile HMM of DUF283 does not match any part of Giardia Dicer even when the E-value is raised up to 100. In retrospect, it appears that the homology inferred by MacRae et al. was derived, at least in part, from the global similarity between these two domains: the NTPD of Giardia Dicer has a ß-ß-ß-ß-
-fold, while DUF283 is predicted to adopt the canonical
-ß-ß-ß-
-fold found in all dsRBDs. Despite similar topologies, however, these two domains are structurally unrelated since their secondary structure elements have different spatial arrangements.
What could be the function of this divergent dsRBD coinciding with DUF283? In addition to their most common role in dsRNA binding, some dsRBDs have additional functions such as dimerization (Hitti et al., 2004) and nuclear localization (Eckmann et al., 2001). To gain insight into potential function of DUF283, we compared surface conservation of DUF283 and dsRBD proteins. The seed alignment of dsRBD from Pfam (Bateman et al., 2004) was used as an input for the ConSurf server (Landau et al., 2005). Residue conservation from this analysis was converted to colors and mapped onto the dsRBD structure from 1UIL. As expected, the RNA-binding face of dsRBD proteins contains the majority of highly and moderately conserved residues (colored in red and green, respectively, in Supplementary Figure 1A). Next we merged DUF283 and dsRBD alignments (Edgar and Sjolander, 2004), reasoning that this combined alignment would have a different pattern of sequence conservation if DUF283 homologs were unrelated to dsRBD proteins. The combined alignment was mapped onto the 3D model of DUF283 (Supplementary Figure 1B) and showed very similar pattern of surface conservation to dsRBD alone. This analysis provides a strong argument that the surface of DUF283 is similar to the majority of dsRBD proteins and could bind RNA. There are, however, several conserved residues in DUF283, e.g. the nearly invariant cysteines at positions 833 and 897 in Drosophila's Dicer, which have no equivalents in dsRBD proteins. These cysteines are mentioned as potential zinc ligands in Pfam annotation of DUF283, yet in our 3D model they are too far apart to be involved in metal coordination or disulphide bond formation (shown as sticks in Supplementary Figure 1B). These residues could be brought within interacting distance only if the packing of
-helices in DUF283 is significantly different from our model.
Dicers not only generate siRNAs, but also accompany them into the RNA-induced silencing complex (RISC) (Lee et al., 2004; Pham et al., 2004). It is thought that the polarity of Dicer processing may influence the subsequent strand selection, as only one of the two strands of the siRNA will be used by the RISC (reviewed in Sontheimer, 2005). Therefore, any asymmetric interaction(s) between Dicer and siRNAs could establish the order and directionality of processing steps in RISC. In particular, this may determine from which end the dsRNA is unwound, leading to the incorporation of the guide strand into RISC and the disposal of the passenger strand (reviewed in Sontheimer, 2005; Tomari and Zamore, 2005). It has been shown that the relative thermal stability of base pairing at each end of the siRNA duplex provides this asymmetric signal, or at least part of it (Khvorova et al., 2003; Schwarz et al., 2003). It is not clear, however, whether Dicer recognizes the asymmetry of its substrate alone or in combination with other dsRNA-binding proteins (reviewed in Tomari and Zamore, 2005). In a scenario where Dicer does not need a protein partner for siRNA processing and subsequent strand selection, subtle differences in double-stranded quality of siRNA ends could be recognized by DUF283 and dsRBD; this hypothesis can be tested by exchanging the positions of the two domains. If, however, Dicer needs a dsRNA-binding partner for this task, DUF283 could instead serve as a dimerization domain. It has been documented that Dicers of Drosophila and human form heterodimers with TRBP/Loquacious, both of which contain three dsRBDs and are needed for proper microRNA (miRNA) maturation and subsequent assembly into RISC (Chendrimada et al., 2005; Forstemann et al., 2005; Haase et al., 2005). In addition, Drosophila's Dicer-2 forms a heterodimer with dsRBD protein R2D2, which in turn orients the heterodimers on the siRNA duplex (Tomari et al., 2004).
RNase III domains of Dicer form an intramolecular dimer and likely cleave RNA similarly to intermolecular dimers of bacterial RNase III (Zhang et al., 2004). However, fine details of these two ribonucleoprotein complexes must be different because Dicers have no dsRBD at the C-terminus of the first RNase III domain. Instead, DUF283 is positioned at the N-terminus of the first RNase III domain, creating an arrangement that additionally contributes to the asymmetry of the resulting proteinRNA complex. We therefore propose that DUF283 is involved in siRNA/miRNA strand selection, be it by recognizing the asymmetry of RNA duplexes directly or by recruiting another dsRBD protein. It will be interesting to examine how the recognition of siRNA/miRNA duplexes is transmitted to the adjacent helicase domain to aid correct strand selection for the RISC assembly.
| Acknowledgments |
|---|
This work was supported in part by NIH Grant P20 RR16455-06 from the INBRE-BRIN Program of the National Center for Research Resources. Funding to pay the Open Access publication charges for this article was provided by NIH Grant P20 RR16455-06.
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
Associate Editor: Golan Yona
Received on June 14, 2006; revised on August 7, 2006; accepted on August 29, 2006
| REFERENCES |
|---|
|
|
|---|
Bateman, A., et al. (2004) The Pfam protein families database. Nucleic Acids Res, . 32, D138D141
Bernstein, E., et al. (2001) Role for a bidentate ribonuclease in the initiation step of RNA interference. Nature, 409, 363366[CrossRef][Medline].
Canutescu, A.A., et al. (2003) A graph-theory algorithm for rapid protein side-chain prediction. Protein Sci, . 12, 20012014
Chendrimada, T.P., et al. (2005) TRBP recruits the Dicer complex to Ago2 for microRNA processing and gene silencing. Nature, 436, 740744[CrossRef][Medline].
Collins, R.E. and Cheng, X. (2005) Structural domains in RNAi. FEBS Lett, . 579, 58415849[CrossRef][ISI][Medline].
Doyle, M. and Jantsch, M.F. (2002) New and old roles of the double-stranded RNA-binding domain. J. Struct. Biol, . 140, 147153[CrossRef][ISI][Medline].
Eckmann, C.R., et al. (2001) The human but not the Xenopus RNA-editing enzyme ADAR1 has an atypical nuclear localization signal and displays the characteristics of a shuttling protein. Mol. Biol. Cell, 12, 19111924
Edgar, R.C. and Sjolander, K. (2004) COACH: profileprofile alignment of protein families using hidden Markov models. Bioinformatics, 20, 13091318
Eswar, N., et al. (2003) Tools for comparative protein structure modeling and analysis. Nucleic Acids Res, . 31, 33753380
Forstemann, K., et al. (2005) Normal microRNA maturation and germ-line stem cell maintenance requires loquacious, a double-stranded RNA-binding domain protein. PLoS Biol, . 3, e236[CrossRef][Medline].
Gan, J., et al. (2006) Structural insight into the mechanism of double-stranded RNA processing by ribonuclease III. Cell, 124, 355366[CrossRef][ISI][Medline].
Ginalski, K., et al. (2003) 3D-Jury: a simple approach to improve protein structure predictions. Bioinformatics, 19, 10151018
Haase, A.D., et al. (2005) TRBP, a regulator of cellular PKR and HIV-1 virus expression, interacts with Dicer and functions in RNA silencing. EMBO Rep, . 6, 961967[Medline].
Hammond, S.M. (2005) Dicing and slicing: the core machinery of the RNA interference pathway. FEBS Lett, . 579, 58225829[CrossRef][ISI][Medline].
Hammond, S.M., et al. (2001) Argonaute2, a link between genetic and biochemical analyses of RNAi. Science, 293, 11461150
Hitti, E.G., et al. (2004) Oligomerization activity of a double-stranded RNA-binding domain. FEBS Lett, . 574, 2530[CrossRef][ISI][Medline].
Jones, D.T. (1999) Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol, . 292, 195202[CrossRef][ISI][Medline].
Karplus, K., et al. (2005) SAM-T04: what is new in protein-structure prediction for CASP6. Proteins, 61, Suppl. 7, 135142[CrossRef][ISI][Medline].
Khvorova, A., et al. (2003) Functional siRNAs and miRNAs exhibit strand bias. Cell, 115, 209216[CrossRef][ISI][Medline].
Landau, M., et al. (2005) ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures. Nucleic Acids Res, . 33, W299W302
Lee, Y.S., et al. (2004) Distinct roles for Drosophila Dicer-1 and Dicer-2 in the siRNA/miRNA silencing pathways. Cell, 117, 6981[CrossRef][ISI][Medline].
Lingel, A., et al. (2004) Nucleic acid 3'-end recognition by the Argonaute2 PAZ domain. Nat. Struct. Mol. Biol, . 11, 576577[CrossRef][ISI][Medline].
Luthy, R., et al. (1992) Assessment of protein models with three-dimensional profiles. Nature, 356, 8385[CrossRef][Medline].
Ma, J.B., et al. (2004) Structural basis for overhang-specific small interfering RNA recognition by the PAZ domain. Nature, 429, 318322[CrossRef][Medline].
MacRae, I.J., et al. (2006) Structural basis for double-stranded RNA processing by Dicer. Science, 311, 195198
Petrey, D. and Honig, B. (2005) Protein structure prediction: inroads to biology. Mol. Cell, 20, 811819[CrossRef][ISI][Medline].
Pham, J.W., et al. (2004) A Dicer-2-dependent 80S complex cleaves targeted mRNAs during RNAi in Drosophila. Cell, 117, 8394[CrossRef][ISI][Medline].
Ryter, J.M. and Schultz, S.C. (1998) Molecular basis of double-stranded RNAprotein interactions: structure of a dsRNA-binding domain complexed with dsRNA. EMBO J, . 17, 75057513[CrossRef][ISI][Medline].
Sanchez, R. and Sali, A. (1998) Large-scale protein structure modeling of the Saccharomyces cerevisiae genome. Proc. Natl Acad. Sci. USA, 95, 1359713602
Schwarz, D.S., et al. (2003) Asymmetry in the assembly of the RNAi enzyme complex. Cell, 115, 199208[CrossRef][ISI][Medline].
Sippl, M.J. (1993) Recognition of errors in three-dimensional structures of proteins. Proteins, 17, 355362[CrossRef][ISI][Medline].
Soding, J. (2005) Protein homology detection by HMMHMM comparison. Bioinformatics, 21, 951960
Soding, J., et al. (2005) The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res, . 33, W244248
Sontheimer, E.J. (2005) Assembly and function of RNA silencing complexes. Nat. Rev. Mol. Cell Biol, . 6, 127138[CrossRef][ISI][Medline].
Tabara, H., et al. (2002) The dsRNA binding protein RDE-4 interacts with RDE-1, DCR-1, and a DExH-box helicase to direct RNAi in C. elegans. Cell, 109, 861871[CrossRef][ISI][Medline].
Tomari, Y. and Zamore, P.D. (2005) Perspective: machines for RNAi. Genes Dev, . 19, 517529
Tomari, Y., et al. (2004) A protein sensor for siRNA asymmetry. Science, 306, 13771380
Wallner, B. and Elofsson, A. (2003) Can correct protein models be identified? Protein Sci, . 12, 10731086
Zhang, H., et al. (2004) Single processing center models for human Dicer and bacterial RNase III. Cell, 118, 5768[CrossRef][ISI][Medline].
This article has been cited by other articles:
![]() |
E. V. Dolgosheina, R. D. Morin, G. Aksay, S. C. Sahinalp, V. Magrini, E. R. Mardis, J. Mattsson, and P. J. Unrau Conifers have a unique small RNA silencing signature RNA, August 1, 2008; 14(8): 1508 - 1515. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Casas-Mollano, J. Rohr, E.-J. Kim, E. Balassa, K. van Dijk, and H. Cerutti Diversification of the Core RNA Interference Machinery in Chlamydomonas reinhardtii and the Role of DCL1 in Transposon Silencing Genetics, May 1, 2008; 179(1): 69 - 81. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Ye, Z. Paroo, and Q. Liu Functional Anatomy of the Drosophila MicroRNA-generating Enzyme J. Biol. Chem., September 28, 2007; 282(39): 28373 - 28378. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



