Bioinformatics Advance Access originally published online on February 29, 2008
Bioinformatics 2008 24(7):958-964; doi:10.1093/bioinformatics/btn064
Identifying trait clusters by linkage profiles: application in genetical genomics
1Department of Biostatistics, University of Washington and 2Statistical Center for HIV/AIDS Research and Prevention, Seattle, WA, USA
*To whom correspondence should be addressed.
| ABSTRACT |
|---|
|
|
|---|
Motivation: Genes often regulate multiple traits. Identifying clusters of traits influenced by a common group of genes helps elucidate regulatory networks and can improve linkage mapping.
Methods: We show that the Pearson correlation coefficient,
, between two LOD score profiles can, with high specificity and sensitivity, identify pairs of genes that have their transcription regulated by shared quantitative trait loci (QTL). Furthermore, using theoretical and/or empirical methods, we can approximate the distribution of
under the null hypothesis of no common QTL. Therefore, it is possible to calculate P-values and false discovery rates for testing whether two traits share common QTL. We then examine the properties of
through simulation and use
to cluster genes in a genetical genomics experiment examining Saccharomyces cerevisiae.
Results: Simulations show that
can have more power than the clustering methods currently used in genetical genomics. Combining experimental results with Gene Ontology (GO) annotations show that genes within a purported cluster often share similar function.
Software: R-code included in online Supplementary Material.
Contact: joshua.sampson{at}yale.edu
Supplementary information: Supplementary materials are available at Bioinformatics online.
| 1 INTRODUCTION |
|---|
|
|
|---|
Genetic linkage analysis, or linkage mapping, is a powerful tool for locating genes influencing quantitative, or continuously varying, traits. For linkage mapping, the trait of interest is measured on a group of related individuals and then the genotypes at a set of genetic markers (i.e. single nucleotide polymorphisms) are recorded for that same group. Markers that are strongly correlated with the trait are reported as quantitative trait loci (QTL).
When a single gene regulates two or more traits, an occurrence known as pleiotropy, the power to detect that gene and the precision of its estimated location are often improved by mapping all regulated traits simultaneously (Jiang and Zeng, 1995). Given a set of genetically correlated traits, several methods are available for joint linkage analysis. Maximum likelihood approaches can be applied to multivariate distributions (Chen, 2005; Korol et al., 1996). Haley–Knott regression is easily adapted to multiple traits by using multivariate regression and ANOVA (Knott and Haley, 2000). Principal components analysis can transform the traits into a set of orthogonal, canonical variables and each component can then be analyzed by standard, single-trait methods (Mangin et al., 1998; Weller et al., 1996). These methods have been used extensively in recent years, uncovering genes influencing milk production in cows, grain yield in wheat, and multi symptom illnesses in a variety of organisms (Kraft et al. 2003; Mangin, et al. 1998; Martin et al. 2003).
Before benefiting from such methods, we must first identify a set of genetically coregulated traits. We quantify genetic coregulation by averaging the percentage of influential genes common to both traits, C(·, ·). Usually traits have been clustered because of biological relationships or prior experiments. However, our knowledge may be limited to the data collected for the linkage studies. Therefore, using only the recorded trait values and marker genotypes, we want a method to determine if all, or a subgroup, of those traits are genetically coregulated. If the traits could share only a single gene, such a method would analyse each marker separately and mimic the previously listed joint mapping methods. However, the heritable traits still being studied are influenced by multiple genes, and two traits sharing one gene would likely share a set of genes.
In this article, we formalize a novel method for identifying groups, or clusters, of traits likely to share common QTL (Schadt et al., 2005). The need for such a method has dramatically increased with the emergence of genetical genomics. Genetics and genomics can be combined by measuring the expression levels of thousands of genes simultaneously in a group of related individuals and then treating each expression level as a quantitative trait for linkage mapping (Brem et al., 2002; Li and Burmeister, 2005; Segal et al., 2003). QTL controlling expression levels, eQTL, have been successfully identified in mice, yeast, wheat, and humans (Li and Burmeister, 2005; MacLaren and Sikela, 2005; Yvert et al., 2003). Here, we will offer a way to cluster genes regulated by common eQTL, thereby improving our estimates of QTL locations and identifying collections of genes participating in the same biological pathway.
Our clustering method is based on the correlation,
, between the LOD score profiles of two traits. Given a large group of traits, we form clusters so the value of
between any two members exceeds some threshold. We will introduce this method in the context of a single genetical genomics experiment. Until now, these analyses have generally formed clusters so all pairs within a cluster have highly correlated expression levels. We can therefore compare our method to this established standard. In other cases, such as clustering genes where the expression levels were measured on different populations, there is no alternative to clustering by
.
The remainder of the article is divided as follows. First, we introduce notation and briefly review LOD Scores. Second, we introduce
as an estimate for C(j1, j2). Third, we discuss
as a similarity measure, and show how combining this measure and an appropriate algorithm can identify clusters of genetically coregulated traits (Hastie et al., 2001). Fourth, we apply our clustering method to simulations and the experimental results from Yvert et al.'s study of Saccharomyces cerevisiae (Yvert et al., 2003).
| 2 METHODS |
|---|
|
|
|---|
2.1 Notation
Assume a cross between two strains, ST–1 and ST1, of yeast producing n individuals. For subject i, i
{1, ..., n}, let Yji be the expression level for gene j and let
i2. Recall, yeast are haploid and have only two possible genotypes with
To simplify the discussion, assume that all genes are located at markers. We say that gene t, marker t, or QTL t influences the expression of gene j if f(Yji | Gti = –1)
f(Yji | Gti = 1), ignoring the possibility of epistasis. Furthermore, let Rj
{tj1, ..., tjNj} be the positions of the Nj QTL influencing the expression of gene j. The LOD score,
, is the log10 of the likelihood ratio statistic testing whether t
Rj (see Supplementary Material for definition) and can be approximated by Haley–Knott regression:
. Here, SST and SSE are the total and residual sum of squares from solving equation 1 (Haley et al., 1994). The vector of LOD scores,
will be called the LOD score profile of gene j.
|
| (1) |
|
|
Let 1t
Rj = 1 if t
Rj, 0 otherwise. For trait j, j
{j1, j2},
is the proportion of QTL that are common to both traits. We measure the genetic coregulation by the geometric mean, C(j1, j2), of the two proportions.
|
| (2) |
2.2 Estimating C(j1, j2)
Define the LOD score correlation coefficient,
, for traits j1 and j2 by
|
| (3) |
|
| (4) |
A second approach for estimating C(j1, j2) would be to first estimate 1t
Rj1 and 1t
Rj2 for each potential QTL t. Let
if Xjt is a local maximum and exceeds some threshold >0.22, 0 otherwise. Then, calculating LOD scores by composite interval mapping (CIM) promises
|
| (5) |
{j1, j2}. We avoid this approximation because it fails horribly for real sample sizes. Let QTL t' affect traits j1 and j2. Even when n is large, the estimated locations for QTL t' will rarely coincide perfectly resulting in
Rj) or evidence of linkage instead of
Returning to our original focus, we calculate
for all possible pairs of traits and input those values in a proximity matrix, D. Let Dj1, j2 =
(j1, j2), where Dj1, j2 is the entry in the j1th row and j2th column of D. Given a proximity matrix, there are numerous methods available for finding groups, T1, T2, ..., Tg of traits such that
is large. If our estimates of
(j1, j2) are accurate, then the identified clusters will result in a large value of
.
2.3 Similarity measures
Finding clusters, T1, T2, ..., Tg, of traits resulting in a large value of
does not require estimating C(j1, j2). We can circumvent this step by identifying a statistic, or measure, D, that is highly correlated with C. Given such a measure, we can construct a proximity matrix, D, where Dj1, j2 = D(j1, j2). Applying an appropriate clustering method to D would result in groups of traits with a large value of
. Three candidate measures are the correlation between expression levels, the correlation between vectors
and
, and the correlation between LOD score profiles. We must avoid methods based on variance components because the genetic component is not identifiable for Yvert et al.'s yeast experiment or any experiment where the population is the progeny of a single cross.
The most prevalent method for finding genes linked to common eQTL is clustering by expression profile (Brem and Kruglyak, 2005; Eisen et al., 1998; Yvert et al., 2003). This method is equivalent to defining
, where
estimates the Pearson correlation coefficient,
, between
and
.
|
| (6) |
Tk, then
P, and therefore |
| (7) |
j1i and
j2i are independent and normally distributed with mean = 0 and variances = |
| (8) |
When there is only one QTL, the following, desirable, statement is true: |
P| > 0 if and only if the two traits are genetically coregulated. A discrete measure, C(j1, j2)
{0, 1} will never perfectly correlate with a continuous measure, D(j1, j2). However, if
is constant for all genes,
P, is an increasing function of the genetic effect sizes, βj1 and βj2. Therefore, as the influence of the shared QTL increases, the
will also increase, another highly desirable characteristic. Unfortunately, problems can arise when multiple genes influence each trait. Then, even the simple statement from above fails, as
P = 0 no longer implies the absence of genetic correlation. We give an example later where
P(j1, j2) = 0 and C(j1, j2) = 1.
A second possible measure is
, the Pearson correlation coefficient between
and
, where
is the least squares estimate of βj· in Equation (10). If we believe that two genes share a common function only when they share the same QTL and when those QTL have identical influences,
would be the preferred statistic. However, this second condition is superfluous if our ultimate aim is to group genes for QTL mapping, so we still prefer a measure correlated with C(j1, j2). Therefore,
has two drawbacks. The signs of
and
affect
and the size of
, not the evidence for linkage, affect
. The logical replacement for
, which addresses those flaws, is the F-statistic
. However, F = (n – 2)(10–2Xjt – 1) so we are lead back to a function based on the LOD scores for our correlate to C(j1, j2).
The third possible measure, and our focus for the remainder of the article is
. Unlike
,
incorporates both expression and genetic data. Furthermore,
can compare traits measured on distinct populations. Both traits could be expression levels or one trait could be a more general characteristic, such as size or life expectancy. Finally, there is a robust correlation between
and C even in small samples, which will be illustrated by later examples.
2.4 Asymptotic properties of ![]()
Let traits j1 and j2 have the distribution described by equation 10. To simplify calculations, we make the following three assumptions: (1) The entire genome is on a single chromosome. (2) The markers are evenly spaced at intervals of length d cM. (3) Haldane's mapping rule describes the recombination rate between loci. Under the absolute null hypothesis,
, we proved that
converges to a normal distribution as N, n
(Supplementary Material).
|
| (9) |
2.5 Clustering method
We propose a three-step method for identifying clusters of traits that share common QTL. (1) Calculate the truncated LOD score profile,
for each of the n traits, where Xtruncjt = min(Xjt,6). Without truncation, traits with an extreme LOD score at position t will be correlated to any trait j where Xjt is even modestly larger than E[Xjt]. Truncation also ensures that
is only large when traits share multiple QTL. Simulations suggested the threshold of six greatly reduced the type I error rate without noticably lowering power. (2) Form an nxn similarity matrix where the j1, j2th entry is
, calculated using the truncated LOD scores at all markers. (3) Use a heierarchical clustering method, (such as hclust, method=complete in R), to order the traits. Then, select groups of traits where
for all included trait pairs, j1 and j2, where c is a predefined threshold. These groups can be subsequently ranked by their size and/or mean value of
.
2.6 Simulations
We could simulate small groups of coregulated traits, and then examine whether the above method can correctly cluster those subgroups when applied to the union of all traits. However, these simulations would introduce multiple variables simultanously. Instead, we focus on groups of two coregulated traits, j1 and j2, and calculate the probability of correctly clustering those two traits together, or equivalently, calculating the
. We refer to
as the power of our clustering method, because in this two trait example, our clustering method is equivalent to a test that rejects the null hypothesis H0 : {R1}
{R2} = Ø when
. We define c
so the probability of clustering two unrelated traits together, the type I error rate, is
. In each of the scenarios described below, we generate 10 000 values of
under the null set-up and define the 95th percentile as c0.05. We then generate values under the alternative, and define the proportion exceeding c0.05 as the power. Full, as opposed to truncated, LOD scores could be used to calculate
because we avoided genes with extreme effects. Using identical methodology, we also calculate the power for the test rejecting H0 when
is large.
For each individual in a group of offspring, we simulated two phenotypes and a set of SNPs, spaced every 10 cM, along a single chromosome. The first marker was randomly assigned to be –1 or 1, indicating parental origin, and the remaining markers were generated according to Haldane's mapping function. In the first set of simulations, the phenotypes, Yj1 and Yj2 were generated according to equation 10 with Nj1 = Nj2 = 1. Data was simulated for 100 subjects when the trait heritability, H, was 0.05, 0.10 and 0.20 and for 1000 subjects when H was 0.010, 0.015 and 0.020. In this simple model,
. Simulations were repeated for genome lengths of 1000, 5000, and 10 000 cM. Under the null hypothesis, the genes influencing traits j1 and j2 were located at the 0.3Nth and 0.7Nth marker, respectively, and under the alternative, both genes were located at the 0.5Nth marker.
For the second set of simulations, we fixed the number of subjects (100), heritability of each QTL (i.e
, and genome length (5000 cM). The two phenotypes Yj1 and Yj2 were generated according to the following equation.
|
| (10) |
{j1, j2} and t
{1, ..., N}. Also, let var(
j1i) = var(
j2i) = 
. Each phenotype had two, four or six influential QTL
2.7 Experiment
Yvert and colleagues (2003) measured the expression of 6818 genes in laboratory (BY) and wild (RM) strains of S.cerevisiae and in 112 segregants from a cross between them. Including multiple replicates for each parent, expression was measured on 130 samples. In addition to this genomics data, their lab genotyped each member of the two generations at 3114 genetic markers. Because genotypes at adjacent markers were often nearly identical, only a subset of 1063 marker locations were chosen for calculating the LOD score correlation coefficient. A complete description of the experiment has been previously published (Yvert et al., 2003).
2.8 Composite interval mapping
In the analysis of both simulations and experimental results, we calculated the LOD score profile using CIM. In addition to the locus of interest, the nearest markers, on each side, at least 15 cM away were also included in the Haley–Knott regression. For the experimental results, the at least 15 cM requirement was replaced by differing genotypes for at least 15 subjects. All significant loci outside the surrounding interval were also included in the regression. The group of significant loci,
, was initially defined as t', where Xjt' = maxt(Xjt). Given a set
, the LOD score,
, for each position was then recalculated with all significant loci included in the regression. Until
or
contained eight loci, t' was added to
, where
. If
, and
, t'' was dropped from the set
. This abbreviated version of CIM was chosen to accommodate the large number of traits.
| 3 RESULTS |
|---|
|
|
|---|
3.1 Simulations
Fundamental characteristics of
|
In all scenarios, the power for rejecting H0 was far greater when using tests based on
Figure 1 shows that
increases with C(j1, j2) and HT. With only a 100 subjects, the population size of Yvert's; experiment, HT still affects
and
is a poor approximation of C(j1, j2). However, we see that even for sample sizes where
is not true, tests based on
still have power to identify correlated traits. Table 2 shows that these tests can be far more powerful than those based on
. Table 2 also shows the power tends to be slightly smaller when the signs of the elements of
alternate, even though the loci are, for practical purposes, independent of each other. In our simulations, whenever C(j1, j2) > 0,
implying that contrary to the marginal distributions, the joint distribution of
and
depends on the signs of
and
. To understand this phenomena, we focus on the 2 QTL example. Let t1 and t2 be the locations of the two influential genes. Although the
, the
. When this event occurs and βj1t2 = βj2t2, both Xj1t1 and Xj2t1 tend to increase. When
, both Xj1t1 and Xj2t1 tend to decrease. The values of Xj1t1 and Xj2t1 change together, or in unison. The same is true for the LOD scores at t2. In contrast, when βj1t2 = – βj2t2, Xj1t1 tends to be higher than E[Xj1t1] when Xj2t1 is lower than E[Xj2t1]. Here, the extra error is negatively correlated. In the Supplementary Materials, we provide the mathematical details and show that
–E[(Xj1t1 – E[Xj1t1])(
])] will be proportional to
. The superscript indicates whether S = 0 (–) or S = 1 (+) when comparing
and
. As with the single QTL examples, we again found that power for rejecting H0 was greater when using tests based on
, compared to tests based on
. Now, we see that when S = 0 or the signs of
alternate, the advantage of the former is even greater. As promised in section 2.3, in the examples with S = 0, the power to detect traits sharing common QTL using
is only equal to the alpha level, 0.05, even when C(j1, j2) = 1.
|
|
3.2 Experiment
We calculated the LOD score profiles for the 6818 genes measured by Yvert and colleagues (2003). Following, the steps outlined in section 2.5, we formed clusters of genes where the values for
From the actual, experimental values, over 34 854 pairs had a
. We then ranked all clusters by their average value of
and focused on the top 20 clusters with at least 10 genes (see Supplementary Material for the genes within each cluster). Genes within these clusters tended to have the same molecular functions and biological processes, as determined by Gene Ontology (GO) annotations (see Supplementary Material). The GO project creates a common vocabulary to describe genes and their products (www.geneontology.org). For example, the biological process of all annotated genes in the highest ranking cluster included DNA metabolic process and Organelle organization and biogenesis. The molecular function of all of these genes included Helicase activity. Therefore, we now have potential functions for the six genes in this group that previously had no annotation. Figure 2 illustrates that the 16 genes in that cluster share multiple QTL. At least 67% of all annotated genes in eight additional clusters shared a common molecular function or biological process. Additionally, 77% of the genes in cluster 16 participate in a metabolic process, but the annotations discrimate between RNA, DNA, and amino acid metabolic processes. The mean
was 0.6 for the four genes labelled as part of the amino acid metabolic process, suggesting that those share nearly identical QTL with each other, but only a portion of their QTL with genes involved in other metabolic processes.
|
The overall correlation between
10 genes) which ranked highest by | 4 DISCUSSION |
|---|
|
|
|---|
The LOD score correlation coefficient,
As more laboratories are focusing on genetical genomics, we need new methods to synthesize results. Investigators often focus on a specific subset of genes. The subset may be determined by their expression platform or by their experimental goals. Each manufacturer includes a different set of probes (i.e. different genes) on their microarrays (Verdugo and Medrano, 2006) and labs often limit their measurements to a specific type of tissue or a specific set of genes thought to be associated with a disease (MacLaren and Sikela, 2005; Yamashita et al., 2005). As an example of how
will immediately impact the field of genetical genomics, we offer WebQTL (http://www.genenetwork.org), a database that includes LOD score profiles for an expansive list of traits in mouse, rat and barley. This website was designed to find coregulated genes. Currently it assesses coregulation by
and is limited to searching traits measured on the same cross. Our introduction of
will now allow for previously impossible comparisons.
Clearly,
has limitations because it is a function of LOD scores. The quality of
is limited by the quality of the LOD score profiles, which are often very noisy. False positives will occur when two traits are influenced by linked, but not identical, genes. Moreover, there are other statistics which may better compare two LOD score profiles. In the future, we should explore statistics that account for the number of overlapping QTL and preprocess the LOD scores by smoothing their profiles before comparison. With enough smoothing, correlation will be based only on the largest peaks. Also, we might search for a method to better estimate P-values and FDR. Currently, our null distributions, from theory or permutation, both assume the absence of any genetic influence, and therefore are only close approximations to the desired null distributions.
At this stage, we propose the obvious two step procedure for improved QTL mapping. First, search for genetically coregulated traits. Then, perform joint linkage mapping on those traits. The P-values from standard joint linkage methods are no longer valid, as we have purposely grouped traits that appear to have similar LOD score profiles. In future research, we hope to combine clustering and linkage so we can assign a single, meaningful, significance level for each purported gene-trait pair. As we improve our methods and genetical genomics continues to gain popularity,
, will become increasingly important in identifying genetic coregulation.
| APPENDIX A |
|---|
|
|
|---|
When the following three assumptions hold and the number of subjects and markers are large,
ASSUMPTION 1. k1, k2 < <1
ASSUMPTION 2. The genes are independent of each other. For any two positions, t1 and t2, P(Gt1 = 1 | Gt2) = 0.5.
ASSUMPTION 3. Genetic effects are described by the linear model in Equation (10). with its accompanying restrictions of
and 
.
We show that violating these assumptions will have minimal effect in the Supplementary Material. Without loss of generality, let genes 1, ..., N1 be the influential QTL for trait j1. As Lander and Botstein (1989) point out, the expected score per progeny, or ELOD, can be described by
|
| (11) |
0.22 when
ELOD(j1, t') when |
| (12) |
|
| (13) |
|
| (14) |
|
| (15) |
|
| (16) |
G if F/G
n 1. Using these results and their counterparts for j2,
|
| (17) |
|
| (18) |
|
| (19) |
| ACKNOWLEDGEMENTS |
|---|
|
|
|---|
This work was funded by National Institute of Dental and Craniofacial Research (T32DE07132). We wish to thank John Storey and Elizabeth Thompson for their valuable comments, and also thank the anonymous reviewers for their extremely insightful suggestions.
Conflict of Interest: none declared.
| FOOTNOTES |
|---|
Associate Editor: John Quackenbush
Received on November 3, 2007; revised on January 11, 2008; accepted on February 17, 2008
| REFERENCES |
|---|
|
|
|---|
Brem R, Kruglyak L. The landscape of genetic complexity across, 5700 gene expression traits in yeast. PNAS (2005) 102:1572–1577.
Brem RB, et al. Genetic dissection of transcriptional regulation in budding yeast. Science (2002) 296:752–755.
Chen Z. The full em algorithm for the mles of qtl effects and positions and their estimated variances in multiple-interval mapping. Biometrics (2005) 61:474–480.[CrossRef][Web of Science][Medline]
Eisen MB, et al. Cluster analysis and display of genome-wide expression patterns. Proc. Natl Acad. Sci (1998) 95:14863–14868.
Haley CS, et al. Mapping quantitative trait loci in crosses between outbred lines using least squares. Genetics (1994) 136:1195–1207.[Abstract]
Hastie T, et al. The Elements of Statistical Learning (2001) Canada: 1st edn. Springer.
Jiang C, Zeng ZB. Multiple trait analysis of genetic mapping for quantitative trait loci. Genetics (1995) 140:1111–1127.[Abstract]
Knott SA, Haley CS. Multitrait least squares for quantitative trait loci detection. Genetics (2000) 156:899–911.
Korol AB, et al. Linkage between quantitative trait loci and marker loci: resolution power of three statistical approaches in single marker analysis. Biometrics (1996) 52:426–441.[CrossRef][Web of Science][Medline]
Kraft P, et al. Multivariate variance components analysis of longitudinal blood pressure measurements from the Framingham Heart Study. BMC Genetics (2003) 4(suppl. 1):S55–S61.[CrossRef]
Lander ES, Botstein D. Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps. Genetics (1989) 121:185–199.
Lehman EL, Casella G. Theory of Point Estimation (1998) 2nd. New York: Springer. Chapter 6. pp. 429–475.
Li J, Burmeister M. Genetical genomics: combining genetics with gene expression analysis. Hum. Mol. Genet (2005) 14(suppl. 2):R163–169.
MacLaren EJ, Sikela JM. Cerebellar gene expression profiling and eQTL analysis in inbred mouse strains selected for ethanol sensitivity. Alcohol Clin. Exp. Res (2005) 29:1568–1579.[CrossRef][Medline]
Mangin B, et al. Pleiotropic QTL analysis. Biometrics (1998) 54:88–99.[CrossRef][Web of Science]
Martin LJ, et al. Phenotypic, genetic, and genome-wide structure in the metabolic syndrome. BMC Genetics (2003) 4(suppl. 1):S95–S99.[CrossRef]
Schadt E, et al. An integrative genomics approach to infer causal associations between gene expression and disease. Nat. Genet (2005) 37:710–717.[CrossRef][Web of Science][Medline]
Segal E, et al. Genome-wide discovery of transcriptional modules from DNA sequence and gene expression. Bioinformatics (2003) 19(suppl. 1):i273–282.[Abstract]
Verdugo RA, Medrano JF. Comparison of gene coverage of mouse oligonucleotide microarray platforms. BMC Genomics (2006) 7:58.[CrossRef][Medline]
Weller J, et al. Application of a canonical transformation to detection of quantitative trait loci with the aid of genetic markers in multitrait experiments. Theor. App. Genet (1996) 92:998–1002.[CrossRef]
Yamashita S, et al. Expression quantitative trait loci analysis of 13 genes in the rat prostate. Genetics (2005) 171:1231–1238.
Yvert G, et al. Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factors. Nat. Genet (2003) 35:57–63.[Web of Science][Medline]
This article has been cited by other articles:
![]() |
T. M. Przytycka, M. Singh, and D. K. Slonim Toward the dynamic interactome: it's about time Brief Bioinform, January 8, 2010; (2010) bbp057v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Al-Ghazi, S. Bourot, T. Arioli, E. S. Dennis, and D. J. Llewellyn Transcript Profiling During Fiber Development Identifies Pathways in Secondary Metabolism and Cell Wall Structure That May Contribute to Cotton Fiber Quality Plant Cell Physiol., July 1, 2009; 50(7): 1364 - 1381. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||








