Laboratory of Respiratory Biology, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina
Correspondence and requests for reprints should be addressed to David M. Brass, Ph.D., National Institute of Environmental Health Sciences, Rall Building, Room C224, P.O. Box 12233 MD C2-15, 111 Alexander Drive, Research Triangle Park, NC 27709. E-mail: brassd{at}niehs.nih.gov
ABSTRACT
Idiopathic interstitial pneumonia represents a broad category of lung disorders characterized by scarring or fibrosis of the lung accompanied by varying degrees of inflammation. A number of important hypotheses based on clinical observations have substantially contributed to our understanding of the pathogenesis of the most insidious and devastating of the idiopathic interstitial pneumonias, idiopathic interstitial fibrosis (IIF). Patients with IIF usually present late in the course of their illness; thus, animal models of the early, preclinical stage of these diseases are needed. Although no model faithfully recapitulates the clinical course of disease or the histopathology observed in humans, all result in scarring of the lung and may therefore be used to understand the biological processes that contribute to this scarring. The purpose of this article is to summarize the application of mouse genetic and genomic tools to these models to advance our understanding of IIF and to describe emerging agnostic approaches to identifying genes important to the fibroproliferative component of IIF.
Key Words: fibrosis; QTL; genomics; microarray
Idiopathic interstitial fibrosis (IIF) is an insidious and devastating lung disorder characterized by scarring or fibrosis of the lung accompanied by varying degrees of inflammation (1, 2). In a recent historical perspective on the clinical aspects of human pulmonary fibrosis, Paul Noble describes how thinking about IIF has evolved from the "inflammation hypothesis," which focused on the observed inflammation as being critical to the end result of fibrosis, to the current view that there is an interaction between epithelial and mesenchymal cells that takes place that predisposes toward pulmonary fibrosis (3, 4). Growth factors were and still are considered to be pathogenically important soluble factors released by macrophages that play key roles in the evolution of pulmonary fibrosis (3, 4). These hypotheses (inflammation and growth factors) and many other more focused areas of investigation (e.g., injury/repair of the alveolar epithelia, alterations in fibrinolysis, and matrix homeostasis) have substantially contributed to our understanding of the pathogenesis of IIF. However, patients with IIF usually present late in the course of their illness, raising the possibility that the biological processes that are thought to be involved in disease pathogenesis may represent responses to the fibroproliferative process rather than primary or pathogenic events that cause IIF. To understand the primary events in IIF, animal models and strategies to identify patients at early, preclinical stages of their disease are needed.
Although there are a number of animal models of IIF, all are limited in being at least one step removed from the human condition. Moreover, no model faithfully recapitulates the clinical course of the disease or the histopathology observed in humans. In specific diseases (e.g., sarcoidosis) (5), there are no animal models, and even in forms of interstitial lung disease where the cause of the disease is known (e.g., hypersensitivity pneumonitis, silicosis, and asbestosis), the animal models of these disorders are suboptimal (6). These limitations include the required large concentrations of organic or inorganic material, the method of administration (intratracheal bolus or extensive inhalation of high concentrations of dust), and the relatively rapid onset and progression of disease (7). However, despite having little in common with the clinical course of IIF, these different exposures (silica, asbestos, thoracic radiation, and bleomycin) produce scar tissue in the lung. Thus, even in the absence of a perfect animal model, these approaches represent important tools that may be used to map and clone genes and to understand the biology that contributes to lung scarring and fibroproliferation.
Many methods can be used to map and clone genes. The most powerful of these rely heavily on identifying phenotypic differences between inbred mouse strains. These comparisons between inbred strains combined with subsequent mapping of regions of DNA that are closely linked with a phenotype by using recombinant inbred mouse strains can identify a quantitative trait locus (QTL), or a region of the genome that is associated with a particular phenotype (8). Theoretically, each QTL should contain at least one gene or regulatory element that is pathogenically associated with the disease being studied (9). However, the record for proceeding from identification of a QTL to the identification of single genes that are unambiguously important in contributing to a particular disease phenotype is staggeringly poor (9), and such analyses have only identified some dozen or so genes (9). This highlights the need to improve our genetic tools and to use other strategies to identify disease genes. As a strategy to improve our genetic tools, efforts are underway to fully genotype 16 strains of inbred mice (10) and selectively target or knock out every gene in the mouse genome and to make this resource publicly available (11).
In silico mapping is a genomic approach that uses single nucleotide polymorphisms (SNPs) between inbred strains of mice to identify novel loci associated with a particular phenotype (12, 13). In silico mapping uses patterns of SNPs that define regions of the genome (haplotypes) that are inherited as a block of DNA and serve to define the relationship between different strains of mice. Thus, in silico mapping uses a computer algorithm to search for association between these SNP haplotypes and a particular phenotype among multiple strains of mice. Given the large amount of genetic information contained in the mouse genome, one needs to phenotype between 10 and 20 strains of mice or more to identify a specific locus with a reasonable degree of certainty.
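The association search described above can be illustrated with a minimal sketch. It assumes that each haplotype block has already been reduced to one label per strain and that the phenotype is a single number per strain; the strain names, block names, and values are hypothetical, and the one-way ANOVA F statistic is used here only as a simple stand-in for whatever scoring a published in silico mapping algorithm applies.

```python
from collections import defaultdict


def f_statistic(groups):
    """One-way ANOVA F statistic for a list of groups of phenotype values."""
    values = [v for g in groups for v in g]
    n, k = len(values), len(groups)
    if k < 2 or n <= k:
        return 0.0
    grand_mean = sum(values) / n
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    if ss_within == 0:
        return float("inf") if ss_between > 0 else 0.0
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def scan_blocks(haplotypes, phenotype):
    """Rank haplotype blocks by how well the strain grouping explains the phenotype.

    haplotypes: {block_name: {strain: haplotype_label}}
    phenotype:  {strain: measured value}
    """
    scores = {}
    for block, calls in haplotypes.items():
        groups = defaultdict(list)
        for strain, label in calls.items():
            if strain in phenotype:
                groups[label].append(phenotype[strain])
        scores[block] = f_statistic(list(groups.values()))
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: six strains, two haplotype blocks.
phenotype = {"S1": 1.0, "S2": 1.2, "S3": 0.9, "S4": 3.1, "S5": 2.9, "S6": 3.3}
haplotypes = {
    "block_A": {"S1": "a", "S2": "a", "S3": "a", "S4": "b", "S5": "b", "S6": "b"},
    "block_B": {"S1": "a", "S2": "b", "S3": "a", "S4": "b", "S5": "a", "S6": "b"},
}
ranked = scan_blocks(haplotypes, phenotype)
```

In this toy example the block whose haplotype split matches the phenotype split ranks first. With only a handful of strains such a ranking is fragile, which is consistent with the need to phenotype many strains before a locus can be identified with reasonable certainty.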
Global transcriptional (microarray) analysis is an increasingly common methodology in which total RNA extracted from a biological sample is hybridized onto a collection of microscopic DNA spots attached to a solid surface, such as glass, plastic, or silicon, forming an array that allows the expression of tens of thousands of genes to be examined simultaneously. Bioinformatic tools are then used to mine the extensive data and relate the gene expression changes to various pathophysiologic responses. Studies involving global analysis of gene expression are most often focused on genes that are significantly elevated in response to the disease-causing agent. However, we and others have identified long lists of genes that have significantly reduced expression compared with control subjects, suggesting that there are negative regulators that remain to be studied. All the tools being brought to bear on studies of genes that are elevated can just as easily be used to analyze genes that show decreased expression. Thus, characterizing global transcriptional responses to fibrogenic agents by microarray in mouse models of IIF has identified, and will continue to identify, broad categories of genes that are or may be involved in the development and progression of lung fibrosis in samples obtained from mice or humans.
In combination, these agnostic approaches (QTL mapping, in silico mapping, and global transcriptional analysis) have the potential to identify regions of DNA that contain genes that are differentially expressed and may provide clues to the pathogenesis of IIF. The purpose of this article is to discuss genetic and genomic approaches using mouse models and to improve our understanding of the mechanisms associated with the initiation and progression of idiopathic interstitial fibrosis and to highlight some emerging and novel approaches to genomic analyses that will meet the current challenge in the use of this technology.
GENETIC APPROACHES TO UNDERSTANDING LUNG FIBROSIS
Most of the advances in our understanding of genetic diseases over the last century have come from the identification of structural variation in single, so-called major genes. However, many diseases, such as pulmonary fibrosis, are complex and are not attributable to a single gene defect but are likely due to many gene-gene and gene-environment interactions. Heroic work has been done in the identification of QTL that contribute to the development and progression of pulmonary fibrosis using the radiation and bleomycin mouse models (14-19), and a solid QTL has been mapped on mouse chromosome 17 closely linked to the major histocompatibility complex (14), a region of the genome intimately involved in the immune response to insult and injury. In two subsequent studies using recombinant congenic mice (20), these same authors confirmed the QTL on chromosome 17 and further identified a QTL on chromosome 1 (20) specifically linked to radiation-induced fibrosis. In a further study (21), a QTL on chromosome 11 was shown to contain bleomycin hydrolase, demonstrating the sensitivity of this type of analysis. Because it is unlikely that bleomycin hydrolase evolved in response to evolutionary pressure resulting from bleomycin exposure, the authors speculate that this protein may function as a major histocompatibility complex, epitope-processing protease (21). The QTL on chromosome 17 has been confirmed in an independent study (14), and this locus has appeared in other QTL analyses for other lung injury phenotypes, including ozone-induced inflammation (22), particle exposure (23), and asthma (24, 25), suggesting that this might be a genomic region that contains a common lung injury-response gene. Therefore, the investigation of other models of fibrosis and the comparison of these results to current findings should provide a useful tool in attempting to identify other candidate genes that might be essential mediators of the fibroproliferative process.
MICROARRAY STRATEGIES USED TO CHARACTERIZE MOUSE MODELS OF IIF
The initial application of microarray technology to characterize a mouse model of interstitial lung disease (26) compared the response in over 6,000 genes and expressed sequence tags from two inbred mouse strains that are susceptible to bleomycin-induced pulmonary fibrosis and compared these results to a dataset obtained from one of the strains (129) harboring a single gene deletion (β6 integrin) that they had previously shown protected these mice from the fibroproliferative response but not the inflammatory response to bleomycin (26). These investigators used a straightforward cluster analysis (27) to identify subsets of genes involved in the inflammatory and fibrotic components of the response to bleomycin in the wild-type mice (26). To evaluate the genes that distinguished responsive from unresponsive mice, these investigators compared wild-type mice with the β6-integrin-deficient mice as a means of further delineating the differences between the inflammatory and the fibrotic gene expression profiles. In this analysis, the gene historically most strongly associated with the development of pulmonary fibrosis, transforming growth factor (TGF)-β, was not significantly differentially expressed (26). However, the analysis was able to identify a pattern of gene expression attributable to TGF-β activation, confirming the importance of β6-integrin activation of TGF-β (28). The hope expressed by these authors was that comprehensive data on the development and progression of fibrosis in mice would lead to more effective strategies for intervention in human patients (26).
Another early study that used an array of just over 4,000 genes (29) confirmed the findings of Kaminski and colleagues (26) that there are distinct patterns of gene expression in the bleomycin model that are associated with an inflammatory phase or a fibrotic phase (29). This analysis compared exposed mice with control mice (29), whereas Kaminski compared susceptible mice with mice with a known single gene defect that confers protection from fibrosis even in the presence of an inflammatory response typically associated with bleomycin treatment (26). Nonetheless, this study confirmed the fundamental finding of Kaminski and colleagues, and these studies taken together validate the use of microarray as a means to understand the etiology of the fibroproliferative response. In addition, the findings of Kaminski directly addressed the inflammation hypothesis as put forward by Crystal and colleagues (30) and provided support to the growth factor hypothesis suggested by Bitterman and colleagues (31) inasmuch as the β6 integrin proved to be essential to TGF-β activation in this model (26).
The finding that TGF-β was not differentially expressed in the Kaminski study (26) highlights an important feature of microarray analyses, namely that if a gene is not regulated at the transcriptional level, the contribution of that gene can be overlooked unless deeper analyses of the data are undertaken. Kaminski and colleagues (26) knew what they were looking for and had reagents (β6-integrin-deficient mice) with which to address the expression profile of TGF-β-inducible genes.
There are some emerging methodologies for examining the effects of genes that are not regulated at the transcriptional level, such as genes regulated at the protein level (e.g., TGF-β and other proteins, such as transcription factors, whose activity is regulated by phosphorylation and dephosphorylation). To study a gene or genes whose general function is known, such as TGF-β, one can use a method such as that described by Sadlier and colleagues (32) in an analysis of tubulointerstitial fibrosis (TIF) in kidney. The approach as used by Sadlier and colleagues (32) identified novel genes involved in TIF by examining genes with expression profiles similar to those known to be involved in the disease process. These researchers used hierarchical clustering (27) of their dataset focused on mRNAs encoding matrix proteins followed by a secondary "baited"-global cluster analysis of gene expression (32). This two-step cluster analysis used the first-step cluster to identify patterns of extracellular matrix (ECM) gene expression (genes known to be involved) over time during the experiment, and the second round of clustering identified genes that exhibited the same pattern of expression during the experiment (novel genes). In this way, these investigators identified molecules and pathways already implicated in the pathogenesis of TIF (e.g., TGF-β1-connective tissue growth factor [CTGF]-fibronectin-1 pathway) and novel TIF-associated genes (32). This methodology has not been applied to pulmonary fibrosis, but, given the similarities between the disease models, this would be a viable approach to studying the late fibroproliferative phase and the early inflammatory phase in the bleomycin model or in another mouse model of pulmonary fibrosis.
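The idea behind a two-step, "baited" analysis can be sketched in a few lines. This is not the hierarchical clustering used in the cited study: as a simplifying assumption, the first step is reduced to averaging the time courses of the known (bait) genes, and the second step keeps any other gene whose time course correlates strongly with that average. The gene names, expression values, and the correlation threshold are all hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is flat)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0
    return cov / (sd_x * sd_y)


def baited_search(profiles, bait_genes, threshold=0.9):
    """Return non-bait genes whose time course tracks the mean bait profile."""
    n_time = len(next(iter(profiles.values())))
    # Step 1: summarize the known genes as one reference time course.
    bait = [
        sum(profiles[g][t] for g in bait_genes) / len(bait_genes)
        for t in range(n_time)
    ]
    # Step 2: score every other gene against that reference.
    scores = {
        gene: pearson(profile, bait)
        for gene, profile in profiles.items()
        if gene not in bait_genes
    }
    return sorted(gene for gene, r in scores.items() if r >= threshold)


# Hypothetical expression values at four time points.
profiles = {
    "ecm_1": [1, 2, 4, 8],
    "ecm_2": [1, 3, 5, 9],
    "gene_x": [2, 4, 8, 16],   # rises with the bait genes
    "gene_y": [8, 4, 2, 1],    # falls while the bait genes rise
    "gene_z": [3, 3, 3, 3],    # unchanged
}
novel = baited_search(profiles, bait_genes={"ecm_1", "ecm_2"})
```

Here only `gene_x` is retained. Lowering the threshold, or searching for strongly negative correlations, would be one way to look for genes that move opposite to the bait genes.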
Another perspective can be gained by recognizing that genes with similar expression profiles may be regulated by the same transcription factors (33). PRIMA (Promoter Integration of Microarray Analysis) and other algorithms of this type examine transcription factor binding site motifs in promoter regions and identify motifs that appear more frequently within a given group of genes (e.g., a cluster) than would be expected by chance. Burch and colleagues (34) recently used this approach to identify the ISRE (Interferon Stimulated Response Element) as an important transcriptional element in the response to inhaled LPS, showing for the first time that IFN-γ plays a previously undefined role in neutrophil recruitment in this model system. Figure 1 illustrates our analysis of a publicly available dataset, obtained from the GEO (Gene Expression Omnibus) website (series accession number, GSE485), in which we demonstrate that there are subsets of genes associated with an inflammatory and a fibrotic phase. Using PRIMA analysis on the cluster of genes associated with the inflammatory response to bleomycin (Figure 1), we find the AP-1 and c-Rel transcription factor binding site motifs overrepresented (p < 0.05). It has long been known that AP-1 is a downstream target of mitogen-activated protein (MAP) kinase signaling (35); thus, it is not surprising that AP-1 appears in this analysis. Likewise, c-Rel, as a member of the nuclear factor κB family of transcription factors, is likely important in the inflammatory phase of the response to bleomycin. There were no transcription factor binding motifs overrepresented in an analysis of the genes associated with the late fibroproliferative phase from this dataset. This suggests that, although the initial inflammatory phase might be more broadly regulated, the fibroproliferative phase might be regulated by a few key genes with few regulatory components in common.
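What "more frequently than would be expected by chance" means can be made concrete with a hypergeometric tail probability. The sketch below is a generic overrepresentation test and is not a description of PRIMA's internal statistics; the gene counts are hypothetical, and a real analysis would also need to correct for the number of motifs tested.

```python
from math import comb


def hypergeom_upper_tail(hits, cluster_size, motif_total, background_total):
    """P(X >= hits) when cluster_size genes are drawn without replacement
    from background_total genes, motif_total of which carry the motif."""
    upper = min(cluster_size, motif_total)
    total = comb(background_total, cluster_size)
    favourable = sum(
        comb(motif_total, i)
        * comb(background_total - motif_total, cluster_size - i)
        for i in range(hits, upper + 1)
    )
    return favourable / total


# Hypothetical numbers: 100 background genes, 20 with the motif,
# and a cluster of 10 genes.
p_enriched = hypergeom_upper_tail(8, 10, 20, 100)   # 8 of 10 carry the motif
p_expected = hypergeom_upper_tail(2, 10, 20, 100)   # 2 of 10, as expected by chance
```

With these toy counts, eight motif-bearing genes in a cluster of ten falls well below a 0.05 threshold, whereas two of ten (the number expected by chance) does not.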
I and V
2, IGF1 (40), tenascin C in human (41) and rat (42), Spp1 (osteopontin) (43), integrin αV (associated with β6) (28), TIMP-1 in human (44) and mouse (45), tissue plasminogen activator (46), and surfactant protein C in human (47) and mouse (48). Genes on the short list in Table 2 that have recently been characterized with respect to their role in a fibroproliferative phenotype include CXCL12 (49) and thrombospondin 2 (50). CXCL12 has recently been shown to play a role in trafficking circulating fibrocytes to the lung in response to bleomycin (49), and CXCR4, the receptor for CXCL12, has recently been demonstrated to be important in the development of fibroproliferative lesions in response to intratracheal bleomycin (I. V. Yang, unpublished data). This short list of genes, derived from mouse homologs of genes identified by microarray of human IPF samples, demonstrates that there remains much to be learned about the fibroproliferative process and identifies some genes that are worthy of further investigation in mouse model systems.
Although there is a substantial amount of information to be derived from directly modeling human IIF in mice, a hypothesis in our laboratory is that there are a few critical mediators of a fibroproliferative response regardless of the location in the lung and independent of the etiologic agent. Thus, an approach to understanding this generalized fibroproliferative response is to consider genes that are involved in other model systems. For example, there is increasing evidence that there is a fibroproliferative component to airway remodeling seen in reactive airway diseases such as asthma and chronic obstructive pulmonary disease (51, 52). Our laboratory has focused attention on environmental exposures known to produce asthma-like symptoms in agricultural workers. In mice, we have shown that repeated long-term exposure to inhaled LPS causes all of the classical features of asthma, including reversible airway obstruction, repeated episodes of inflammation, and airway remodeling, with a strong fibroproliferative component that persists over time and that is similar in many respects to human asthma (53-56). In C57BL/6 mice, we identified by microarray analysis over 600 genes that are significantly differentially expressed when compared with age-matched unexposed control mice. Because we have demonstrated that chronic LPS-induced airway remodeling is a fibroproliferative disorder (55-57), we have interrogated our gene list with a publicly available dataset in which there are 186 significantly differentially expressed genes at 14 d after bleomycin instillation in C57BL/6 mice. By interrogating our LPS-induced gene list with the bleomycin list, we have identified 49 genes that are significantly differentially expressed in common between these disparate model systems (Figure 3 and Table 3).
Because there is a profound inflammatory component associated with LPS-induced airway remodeling and bleomycin-induced fibrosis, it is not surprising to see such genes as Saa3 and many chemokines at the top of the list. Many of these genes have previously been reported to be involved in more traditional fibrosis models, and two, Tnc and Col3α1, appear on the list of mouse homologs of genes identified in human IPF discussed previously. Many of these genes have not been investigated in this context, however, providing us with a new starting point for further investigation of the relationship between inflammation and fibrosis that has yet to be resolved.
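Interrogating one gene list with another amounts to a set intersection on gene symbols. The sketch below assumes that both lists use comparable symbols and differ at most in letter case; apart from the three symbols named in the text (Saa3, Tnc, and Col3a1, the standard symbol for Col3α1), the entries are hypothetical placeholders rather than real results.

```python
def shared_genes(list_a, list_b):
    """Genes present in both lists, compared case-insensitively and
    reported with the spelling used in list_a."""
    symbols_b = {gene.upper() for gene in list_b}
    return sorted({gene for gene in list_a if gene.upper() in symbols_b})


# Toy lists: placeholder_* entries are invented for illustration only.
lps_list = ["Saa3", "Tnc", "Col3a1", "placeholder_1", "placeholder_2"]
bleomycin_list = ["SAA3", "TNC", "COL3A1", "placeholder_3"]

in_common = shared_genes(lps_list, bleomycin_list)
```

In practice the harder step is upstream of the intersection: probe identifiers from different array platforms have to be mapped to a common gene nomenclature before the lists can be compared.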
COMBINING QTL AND GENE EXPRESSION
QTL-specific microarray, such as that planned by Walters and colleagues (60), is a powerful approach that has previously been applied by this laboratory in an investigation of genes involved in the response to inhaled LPS (61). Combining these approaches serves as a powerful method to focus on genes within a QTL that might be pathogenically involved in the development of lung scarring. In a recent investigation of bleomycin-induced fibrosis in mice, QTL-specific microarray analysis (62) identified genes that are differentially expressed between susceptible and resistant mouse strains within previously identified QTL (21). This analysis identified a short list of genes that were under two identified QTLs and that showed unambiguous differential gene expression (62). Furthermore, using an NCBI database query for SNPs, the authors were able to identify a manageable number of nonsynonymous sequence variations that can be tested for their contribution to the development and progression of pulmonary fibrosis in this model system (62). This approach identified groups of genes associated with DNA damage and repair, the oxidative stress response, apoptosis, immune and proinflammatory pathways, and extracellular matrix deposition (62, 63). These broad categories of genes have been implicated in the development and progression of bleomycin-induced fibrosis in mice, and these studies have generated short lists of novel candidate genes that can be investigated.
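The filtering step at the heart of a QTL-specific expression analysis can be sketched as follows. The record layout, gene names, map positions, interval boundaries, and significance cutoff are all assumptions made for illustration and are not taken from the cited studies.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    symbol: str
    chrom: str
    position_mb: float   # map position in megabases
    p_value: float       # differential expression, susceptible vs. resistant


def candidates_in_qtl(genes, qtl_chrom, start_mb, end_mb, alpha=0.05):
    """Symbols of genes that lie inside the QTL interval and are
    differentially expressed at the chosen significance level."""
    return [
        g.symbol
        for g in genes
        if g.chrom == qtl_chrom
        and start_mb <= g.position_mb <= end_mb
        and g.p_value < alpha
    ]


# Hypothetical genes and a hypothetical interval on chromosome 17.
genes = [
    Gene("gene_1", "17", 34.0, 0.001),   # inside the interval, significant
    Gene("gene_2", "17", 36.5, 0.40),    # inside the interval, not significant
    Gene("gene_3", "17", 80.0, 0.001),   # significant, outside the interval
    Gene("gene_4", "11", 35.0, 0.001),   # significant, different chromosome
]
short_list = candidates_in_qtl(genes, "17", 30.0, 40.0)
```

Only `gene_1` survives both filters. The resulting short list would then be the natural input for a query of known sequence variants, as in the SNP search described above.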
An approach that has not been applied to studies of fibroproliferation in the lung was described in three recent reports (64-66). This approach uses gene expression as determined by microarray analysis as the phenotype on which QTL analysis is performed (67). This approach, referred to as "genetical genomics" or "expression genetics," allows the mapping of QTL that directly affect differential gene expression. Such an approach allows the identification of QTL that affect the abundance of transcripts from genes within the QTL (cis-acting QTL) and those that act at a distance (trans-acting QTL) (67). Applying such an approach to mouse models of fibroproliferative lung disorders would yield tremendous benefits in coming to understand the etiology of this disease.
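The cis/trans distinction can be expressed as a simple positional rule. In the sketch below, an expression QTL is called cis-acting when it maps to the same chromosome as the gene whose transcript it affects and lies within a chosen distance of it; the 10-Mb window is an arbitrary assumption, and the positions are hypothetical.

```python
def classify_eqtl(gene_chrom, gene_pos_mb, qtl_chrom, qtl_pos_mb, window_mb=10.0):
    """Label an expression QTL as 'cis' if it maps near the gene it affects,
    otherwise 'trans'. window_mb is an illustrative, adjustable cutoff."""
    same_chrom = gene_chrom == qtl_chrom
    if same_chrom and abs(gene_pos_mb - qtl_pos_mb) <= window_mb:
        return "cis"
    return "trans"


# Hypothetical positions (chromosome, megabases).
near = classify_eqtl("17", 34.0, "17", 36.0)       # same region
far = classify_eqtl("17", 34.0, "17", 90.0)        # same chromosome, distant
elsewhere = classify_eqtl("17", 34.0, "1", 34.0)   # different chromosome
```

The choice of window matters because QTL intervals in mouse crosses are often broad, so a narrow window may mislabel genuine cis-acting loci as trans-acting.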
CONCLUSIONS AND FUTURE DIRECTIONS
Although these genetic and genomic studies in mice are important, we constantly need to remind ourselves that gene lists are not an answer (rather, they pose a new set of questions that need to be asked) and that validation is needed in focused, hypothesis-driven studies. Moreover, we need to develop innovative approaches to understanding the importance of these genes in the development and progression of IIF in humans. In this way, we will identify novel pathways and critical regulatory genes that point toward novel therapeutic interventions for these diseases that are extraordinarily difficult to treat.
FOOTNOTES
Supported by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences, and NHLBI grant HL67467 (D.A.S.).
Conflict of Interest Statement: None of the authors has a financial relationship with a commercial entity that has an interest in the subject of this manuscript.
(Received in original form July 18, 2006; accepted in final form August 13, 2006)
REFERENCES
in lung neutrophil recruitment. Am J Physiol Lung Cell Mol Physiol 2006;9:9.