|
|
||||||||
Dorothy P. and Richard P. Simmons Center for Interstitial Lung Disease, Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh Medical Center, Pittsburgh, Pennsylvania
Correspondence and requests for reprints should be addressed to Naftali Kaminski, M.D., Dorothy P. and Richard P. Simmons Center for Interstitial Lung Disease, Pulmonary, Allergy, and Critical Care Medicine, University of Pittsburgh Medical Center, NW 628 MUH, 3459 5th Avenue, Pittsburgh, PA 15261. E-mail: kaminskin{at}upmc.edu
ABSTRACT
Expression microarrays that provide genome-level, transcriptional, high-resolution profiles have been applied successfully to multiple diseases. Although microarrays provide information regarding thousands of genes, many investigators prefer to focus on a single gene and validate its role, an approach often supported by grant and journal reviewers. Only a minority of investigators focus on global changes in gene expression. Here, we describe and contrast two general approaches to the use of microarray data: the reductionist "cherry picking" approach and the more global, quantitative "systems" approach. We describe microarray analysis experiments relevant to idiopathic pulmonary fibrosis (IPF) in the context of these two approaches. Although it seems that the cherry-picking approaches have been successful in identifying new relevant genes in IPF, we suggest that to fulfill the discovery potential of microarrays in IPF and to create a working model of IPF, unbiased integrative systems approaches are required.
Key Words: FIZZ1 matrix metalloprotease microarrays osteopontin systems biology
Since their introduction approximately 10 years ago (1, 2), high-throughput gene expression profiling technologies, such as microarrays, have become a mainstay of biomedical research. Combined with the abundant genomic information available and with the rapid introduction of computational approaches, these methods have greatly improved the ability to create high-resolution expression profiles of distinct disease states. Such detailed profiles improved the ability of investigators to identify key regulatory molecules and to dissect the gene and protein networks that underlie disease. In a classic example, Alizadeh and colleagues identified two classes of diffuse large B-cell lymphoma based on gene expression profiles (3) and zoomed in on a small number of genes that could be used for classification (4); and recently, they identified potential signaling pathways that underlie some of these differences (5). Numerous other examples of the usefulness of these technologies include classification, prediction of and understanding the mechanisms of metastasis in breast cancer (610), prognosis determination and mutation detection in lung cancer (11), and prognosis prediction in pediatric leukemias and lymphomas (1214).
The study of idiopathic pulmonary fibrosis (IPF), a relatively poorly understood progressive and mostly lethal interstitial lung disease (15), seems to be a field that would greatly benefit from application of gene expression profiling. Much of the classical hypotheses of the mechanisms of pulmonary fibrosis were generated in the bleomycin-induced lung injury and fibrosis model in animals. The need to generate and verify new hypotheses seems to be critical to the development of new diagnostic and therapeutic options. Despite this need, the number of articles using expression-profiling technologies to understand lung fibrosis is low. In this review, we introduce the two contrasting approaches to microarray experiments: the "cherry picking" approach and the "systems" (as in "systems biology") approach. The aim of this review is to describe the few works that applied microarrays to the study of pulmonary fibrosis and highlight the discovery potential in the datasets generated by these works within a general framework of approach to microarray experiments.
A CONCEPTUAL FRAMEWORK FOR ANALYSIS OF MICROARRAY EXPERIMENTS
In recent years, there has been a major improvement in our ability to analyze microarray data. Better approaches to normalization, statistical analysis, and pathway analysis greatly simplified the steps from experiment to hypothesis generation. Generally, analytic approaches and their results can be divided into reductionist approaches, where the investigator focuses on a single or a few genes, and global, holistic approaches that try to derive meaningful biological insights from the global changes in gene expression. Historically, the first microarray papers tried to derive global insights; however, very quickly, the reductionist cherry-picking approaches (Figure 1) took precedence, partly because single-gene stories are easier to package and mainly because such papers were better accepted by reviewers, and therefore they were easier to publish. In the cherry-picking approach, a single or a few genes are picked out of the gene expression data, and their relevance is usually validated. The selection is usually based on some statistical criterion and prior biological knowledge. The pathway to validation is outlined in Figure 1. The least important validation is the RNA level validation, which is usually a technical validation. The protein verification suggests that the change in the gene mRNA is meaningful. This can be done in a quantitative manner, verifying the direction of the protein change, and also in a more meaningful way, by demonstrating in which cell and region in the lung the protein is expressed. The biological validation is most important and aimed at answering whether the function of the protein encoded by the gene has a fibrosis relevant function in vivo or in vitro. A final validation is to demonstrate that genetic perturbation in the gene is associated with the disease. Many times it is beyond the ability of a single group or a single article to provide all the details in IPF research and in other fields. The majority of microarray articles published in the general biological literature are cherry-picking articles.
|
|
Comparing Normal Control with Diseased Lung
In 2002, two comparisons of gene expression patterns in IPF and normal lungs were published by two different groups, one a conference abstract (21) and the other a full article (17). In their titles, they highlight matrix metalloprotease (MMP) 7 as a potential target for IPF. These findings have previously been reviewed by us (22). Both of these experiments fall into the cherry-picking category. An interesting, statistically significant gene is identified and verified, and its relevance to fibrosis is determined. Our article (17) also contained our first naive experimentation of global analysis. Among the most informative increased genes, we identified genes encoding for proteins expressed in smooth muscle differentiation and muscle contractile machinery, genes that encoded extracellular matrix proteins, and a coordinated increase in the levels of several matrix metalloproteases (MMP1, MMP2, MMP7, and MMP9); however, this analysis was intuitive and not based on an unbiased assessment of enrichment of statically significant annotations in our datasets. For this review and for demonstration, we downloaded the list of genes increased in IPF lungs and analyzed them using NIH DAVID and Ease online tools for functional analysis (www.apps1.niaid.nih.gov/david) (23, 24). We looked for enrichment of GO annotations in the genes up-regulated in IPF (Figure 3). The biological processes profile of IPF (Figure 3A) provides an active picture: IPF lungs are characterized by an increased catabolic state that is also associated with an increase in glucose metabolism, potentially reflecting the active remodeling that occurs in IPF lungs. This is different from the passive view of IPF lungs as end-staged fibrosed organs. Analysis of cellular localization reveals enrichment for genes associated with protein biosynthesis and extracellular matrix as expected (Figure 3B). The most interesting profile is derived from the enrichment in molecular functions (Figure 3C). Although the expected active remodeling is reflected by enrichment for structural constituents and genes with endopeptidase activity, the functional themes contain some unexpected groups, such as antigen binding and hydrogen ion transporter activity. This simple analysis of data generated from five IPF lungs analyzed by the first microarray generation demonstrates the power of the global approach and its discovery potential. Although this dataset has been publicly available for 4 years, we and others have only explored several targets and not necessarily the most interesting ones. Cosgrove and colleagues compared gene expression in five IPF and five normal lungs and focused on angiogenic and angiostatic cytokines in IPF and especially on pigment epitheliumderived factor, a protein with angiostatic properties (25). Their work demonstrates the discovery potential of the cherry-picking approach. Pigment epitheliumderived factor is an excellent example of an unexpected target identified by microarrays. Recently, using an additional dataset of 11 control lungs and 13 IPF lungs, we have again engaged in cherry picking and focused on osteopontin, a phosphoprotein previously shown to be required for development of bleomycin-induced fibrosis in murine lungs (26, 27). Osteopontin was highly expressed in IPF lungs and was expressed mainly in alveolar epithelial cells, and it colocalized with MMP7 in alveolar epithelial cells (16).
|
Comparing Different Interstitial Lung Diseases
Interstitial lung diseases differ in their clinical course, histologic presentation, radiologic imaging, and prognosis and response to therapy. Therefore, it seems plausible that different interstitial lung diseases express distinct gene expression patterns. Selman and colleagues compared gene expression patterns between IPF and hypersensitivity pneumonitis (HP) (24). Their article is probably the most advanced example of an IPF systems article. Although the authors seem to engage in some cherry picking, this may have been a response to a request by a reviewer. The authors identify multiple genes that distinguish IPF from HP. One interesting aspect of this article is the feature selection. Instead of setting an arbitrary threshold for selecting genes, the authors selected a signature by its use of the highest number of genes that provides the lowest number of classification errorsin other words, by its classificatory efficiency. Using NIH David (http://apps1.niaid.nih.gov/david/), they provide a functional profile of IPF (genes overexpressed in IPF) and of HP (genes overexpressed in HP). One look at these profiles reveals the gene expression parallel to a highly known clinical truth: patients with IPF rarely respond to antiinflammatory treatment (30). The genes that characterize HP are enriched with multiple inflammatory annotations, including T-cell activation, immune response, and defense response, whereas the genes that characterize IPF are enriched with ectoderm development, metalloendopeptidase activity, and extracellular matrix. This view provides an immediate global and unbiased view of the mechanism that underlies the fibrotic lung phenotype in IPF. When these signatures were used to classify a nonspecific interstitial pneumonia, a histologic pattern that is often difficult to differentiate consistently from HP and IPF, we identified three cases that exhibited an IPF-like gene expression and one case that was classified as HP, suggesting that after these gene expression patterns are confirmed on larger cohorts, they could be used to classify disease and potentially to predict response to antiinflammatory therapies.
MICROARRAY ANALYSIS OF ANIMAL MODELS OF DISEASE
Bleomycin-induced Fibrosis
Bleomycin is a common model for induction of fibrosis. We have previously described the two articles that deal with gene expression in bleomycin-induced fibrosis (22). There are also at least two more datasets publicly available at the Gene Expression Omnibus, which is a public microarray database (Table 1). In our first microarray article (31), we applied an intuitive clustering-based approach to obtain global insights regarding the general mechanisms of bleomycin-induced fibrosis using 129 mice homozygous for a null mutation of the integrin ß6 subunit gene that develop inflammation, but not fibrosis, in response to bleomycin and wild-type 129 mice (32). Our analysis allowed us to dissect the transcriptional programs associated with inflammation or fibrosis in bleomycin-induced lung injury. Recently, Liu and colleagues analyzed lung gene expression profiles after bleomycin in the rat (33). They identified 628 genes that were changed by bleomycin. The most dramatic increase that they observed was in FIZZ1. They localized FIZZ1 expression to alveolar and airway epithelium. They also identified FIZZ1 in isolated type II alveolar epithelial cells but not in isolated lung fibroblasts. When they transfected FIZZ1 into fibroblasts or cocultured of FIZZ1-expressing type II cells with fibroblasts, the fibroblasts expressed
-smooth muscle actin and type I collagen expression independent of TGF-ß stimulation (33).
|
Drug Targets
The studies we described implemented microarrays in the study of human disease and animal models of disease. In most cases, authors have applied cherry-picking approaches and focused on the function of one or two genes and most often determined that these genes (e.g., MMP7, osteopontin, FIZZ1, and CCL7) were important in IPF (16, 17, 25, 28, 33). Although some of the articles made attempts at global observations (17, 32), these observations were mostly qualitative. Only recently have we applied more quantitative methods to better understand the gene expression patterns that characterize IPF (24). Although the relative success of cherry-picking approaches is encouraging, it is also concerning: If so many molecules are involved, how are we ever going to treat IPF? Is blocking any one of these molecules going to be sufficient to halt or reverse lung fibrosis in IPF? Although the results of bleomycin experiments in knockout animals suggest that this is the case, the biological logic does not support this notion because all of these and many other molecules are overexpressed in IPF lungs. An alternative explanation suggests that although each one of these molecules may have isolated profibrotic or antifibrotic effects in healthy animals subjected to a defined injury, in IPF lungs they serve as components of a profibrotic network that determined the lung phenotype in IPF. Therefore, we believe that global approaches that apply system approaches and incorporate multiple levels of information (including the results of cherry-picking experiments) into a robust quantitative model of IPF will allow us to understand and predict the molecular events that lead to the lung phenotype in IPF. Once this model of IPF is created, evaluating the relative role of a specific molecule within this model will allow more accurate predictions about its role as a potential target for therapeutic intervention.
Biomarkers
The mechanistic role of molecules elevated in IPF needs to be validated in complex experiments. The potential role of molecules expressed in IPF lungs as biomarkers seems to be intuitive. This has not been assessed systematically; however, our recent article (24) and the finding of osteopontin, a protein highly expressed in IPF lungs (16) in the peripheral blood of patients with interstitial lung disease (34), suggest that transcriptional profiles may be readily translated into clinical practice by planning studies that will assess their value as markers for disease. The availability of multiple human and mouse datasets (Table 1) should make such analyses possible.
CONCLUSIONS
This article describes and contrasts two general approaches to the use of microarray data: the reductionist cherry-picking approach and the more global, systems approach. We describe microarray experiments relevant to IPF in the context of these approaches and highlight their potential value in understanding the disease. We believe that the increased availability of multiple datasets and the maturation of analytic methods favor the application of systems approaches that will enable us to model and understand IPF in a fashion that will allow better development of new therapeutic and diagnostic modalities.
FOOTNOTES
Supported by NIH grant 1R01 HL073745-01 and by a generous donation from the Simmons family.
Conflict of Interest Statement: N.K. received $5,000 for serving on a Biogen IDEC advisory board in November 2005. I.O.R. does not have a financial relationship with a commercial entity that has an interest in the subject of this manuscript.
(Received in original form January 20, 2006; accepted in final form February 9, 2006)
REFERENCES
This article has been cited by other articles:
![]() |
A. U. Wells and C. M. Hogaboam Update in Diffuse Parenchymal Lung Disease 2007 Am. J. Respir. Crit. Care Med., March 15, 2008; 177(6): 580 - 584. [Full Text] [PDF] |
||||
![]() |
A. Tzouvelekis, V. Harokopos, T. Paparountas, N. Oikonomou, A. Chatziioannou, G. Vilaras, E. Tsiambas, A. Karameris, D. Bouros, and V. Aidinis Comparative Expression Profiling in Pulmonary Fibrosis Suggests a Role of Hypoxia-inducible Factor-1{alpha} in Disease Pathogenesis Am. J. Respir. Crit. Care Med., December 1, 2007; 176(11): 1108 - 1119. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Scotton and R. C. Chambers Molecular Targets in Pulmonary Fibrosis: The Myofibroblast in Focus Chest, October 1, 2007; 132(4): 1311 - 1321. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Ren, I. O. Rosas, S. D. MacDonald, H.-P. Wu, E. M. Billings, and B. R. Gochuico Impairment of Alveolar Macrophage Transcription in Idiopathic Pulmonary Fibrosis Am. J. Respir. Crit. Care Med., June 1, 2007; 175(11): 1151 - 1157. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. O. Rosas and N. Kaminski When It Comes to Genes--IPF or NSIP, Familial or Sporadic--They're All the Same Am. J. Respir. Crit. Care Med., January 1, 2007; 175(1): 5 - 6. [Full Text] [PDF] |
||||
![]() |
S. M. Studer and N. Kaminski Towards Systems Biology of Human Pulmonary Fibrosis Proceedings of the ATS, January 1, 2007; 4(1): 85 - 91. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |