|
|
||||||||
1 Department of Health Services, University of California, Los Angeles, Los Angeles, California; and 2 Departments of Medicine and Family and Preventive Medicine University of California, San Diego, San Diego, California
Correspondence and requests for reprints should be addressed to Robert M. Kaplan, Ph.D., Wasserman Professor, and Chair, Department of Health Services, University of California, 650 Charles E. Young Drive, Los Angeles, Los Angeles, CA 90095-1772. E-mail rmkaplan{at}ucla.edu
ABSTRACT
Patients with emphysema may experience reduced health-related quality of life (HRQOL). HRQOL measures have evolved from two different measurement traditions: psychometric theory and decision theory. Psychometric methods typically create a profile of outcomes, whereas decision theory methods offer a summary score on a continuum ranging from 0.0 (for death or worst possible health) to 1.0 (for best possible health). Decision theory methods are better suited for cost-effectiveness studies. Generic HRQOL measures can be applied to any disease population, whereas disease-targeted measures are tailored to a specific clinical condition. Disease-targeted measures are typically more sensitive to clinical change, but cannot offer a comparison basis for different clinical conditions. This article reviews the measurement of HRQOL in patients with emphysema. The National Emphysema Treatment Trial (NETT) offers an example of the application of both generic and disease-targeted, as well as profile and decision theory, methods. The NETT illustrates how HRQOL measures can be used to assess outcomes and estimate cost-effectiveness in a major clinical trial.
Key Words: health-related quality of life emphysema outcomes assessment quality-of-life measurement
Patients with emphysema experience significant limitations in daily activities and reduced quality of life. Evidence supports the value of medical management and pulmonary rehabilitation for emphysema. Despite important advances in medical management, the promise of medical therapy is limited. Lung volume reduction surgery (LVRS) offers the potential to enhance quality of life for selected patients who have completed pulmonary rehabilitation.
The goals of surgical and medical treatments for emphysema are to extend life expectancy and to improve quality of life. Medicines have side effects and surgery is associated with operative and perioperative risks. Recovery from surgery necessarily reduces quality of life and functioning in the short term. These effects are important, but often are difficult to quantify. Health-related quality of life (HRQOL) measures attempt to characterize the negative and positive effects of a disease as well as its effect on quality of life.
HRQOL measures have been developed over the past 30 years, and have been commonly used to study emphysema. A PubMed search cross-referencing chronic obstructive pulmonary disease (COPD) with quality of life (in June 2007) produced 1,704 references and the intersection of emphysema and quality of life yielded 313 references. The American Thoracic Society maintains a website that summarizes quality-of-life measures that can be used in outcomes research for lung disease (see www.atsqol.org). The site lists measures by disease and offers references on their use.
The purpose of this article is to review HRQOL measurement in emphysema studies with emphasis on measures chosen for the National Emphysema Treatment Trial (NETT). More comprehensive reviews of HRQOL in chronic lung disease are available elsewhere (1).
DISTINCTIONS BETWEEN MEASURES OF HRQOL
Measures of HRQOL are classified as either disease targeted or generic. Disease-targeted measures are used for patients with a particular diagnosis, whereas generic measures can be used in any population. There are two major approaches to assessment of HRQOL: psychometric and decision theory.
The psychometric approach provides a profile of measures, each summarizing a different dimension of HRQOL. The best-known example of the psychometric tradition is the Medical Outcomes Study 36-item Short Form (SF-36) (2), a generic measure of HRQOL.
The decision theory approach weights the different dimensions of HRQOL to provide a single expression of quality of life. Supporters of this approach argue that psychometric methods fail to consider that different health problems cause different levels of concern. A minor itch and coughing blood are both symptoms, but they are not equal in importance. Furthermore, a treatment may improve some aspects of quality of life while causing others to deteriorate. For example, a medication might reduce coughing, but increase skin problems or reduce energy. When components of outcome change in different directions, an overall subjective evaluation is required to integrate the components and offer a summary of whether the patient is better or worse off. The decision theory approach attempts to provide an overall measure of HRQOL that integrates subjective function states, preferences for these states, morbidity, and mortality.
Measures can also be distinguished by their possible uses. Most measures can be used to characterize populations and to study clinical changes. However, only generic decision theory–based measures can be used to evaluate cost-effectiveness.
DISEASE-TARGETED METHODS OF HRQOL
Commonly used measures of HRQOL in patients with lung diseases are described briefly below and summarized in Table 1.
|
St. George's Respiratory Questionnaire
The St. George's Respiratory Questionnaire (SGRQ) is a self-administered 50-item instrument with three separate scales (symptoms, activity, and impacts on daily life). A total score can also be calculated. The individual SGRQ questions are weighted differentially to produce an overall score. The questionnaire has been evaluated for reliability and validity in several chronic lung diseases of varying severity, particularly COPD and asthma (8). We offer more on the sensitivity of the SGRQ in a later section.
DYSPNEA
Despite the advantages of disease-specific and generic quality-of-life measures, pulmonologists have been concerned that these approaches may miss some of the most important patient outcomes. For example, several studies have observed low correlations between measures of generic quality of life and lung function. Dyspnea is the one common self-reported patient outcome that may have a profound effect on HRQOL (9). Several studies show that the correlation between generic quality of life and dyspnea is substantial. Schrier and colleagues found no correlation between lung function tests and scores on a generic profile measure known as the Sickness Impact Profile (SIP) (10). However, they did observe substantial correlations between symptoms of wheezing and dyspnea and SIP scores. Yet, the general measures miss many of the subtle characteristics or subtle aspects of the clinically important symptoms. These findings suggest that measures of shortness of breath or dyspnea may be of central importance for evaluating outcomes in emphysema.
We believe that HRQOL assessment in studies of patients with emphysema should be complemented by measures of shortness of breath. However, measurement of dyspnea presents several methodologic and technical challenges. A number of instruments are available to assess dyspnea, including structured interviews and self-report questionnaires that evaluate a patient's historical recall of breathlessness. These measures compare shortness of breath associated with daily activities or exercise to symptoms occurring at that specific time when the assessment is made (Table 2). A review of dyspnea measures by Eakin and colleagues offers more details about the measures and their properties (11).
|
GENERIC METHODS OF HRQOL
Generic HRQOL measures can evaluate outcomes for any population (as compared with disease-targeted measures that measure outcomes for people with a particular health condition). Some of the common generic approaches are summarized in Table 3. Readers interested in more detail should consult Shumaker and Berzon (13) or Walker and Rosser (14) and McDowell and Newell (15).
|
Some approaches to the measurement of HRQOL combine measures of morbidity and mortality to express health outcomes in units analogous to years of life. The years-of-life measure can be adjusted for diminished quality of life associated with diseases or disability (20). We consider three utility-based approaches.
Quality of Well-Being Scale
The Quality of Well-Being scale (QWB) combines preference-weighted values for symptoms and functioning. The preference weights were obtained by ratings of 856 people from the general population. These judges rated the desirability of health conditions to place each on the continuum between death (0.00) and optimum health (1.00). Symptoms are assessed by questions that ask about the presence or absence of different symptom complexes. Functioning is assessed by a series of questions designed to record functional limitations over the previous 3 days, within three separate domains (mobility, physical activity, and social activity). The four domain scores are combined into a total score that provides a numerical point-in-time expression of well-being that ranges from zero (0) for death to one (1.0) for asymptomatic optimum functioning.
The QWB has been used in numerous clinical trials and studies to evaluate medical and surgical therapies in conditions such as COPD (21) and several other conditions, (35–39). Furthermore, the method has been used for health resource allocation modeling and has served as the basis for an innovative experiment on rationing of health care by the state of Oregon (36, 37). A self-administered form of the QWB (QWB-SA) is available and it now the most commonly used form of the QWB.
EuroQol Group 5 Dimension
The approach most commonly used in the European community is the EuroQol Group 5 Dimension (EQ-5D). This method has been advanced by a collaborative group from Western Europe known as the EuroQol group (44). The original version of the EuroQol had 14 health states in six different domains. In addition, respondents placed their health on a continuum ranging from death (0.0) to perfect health (1.0). The method was validated in postal surveys in England, Sweden, and the Netherlands. More recent versions of the EuroQol, known as the EQ-5D, are now in use in a substantial number of clinical and population studies (40, 41). Although the EQ-5D is easy to use and comprehensive, there have been some concerns about ceiling effects. Substantial numbers of people obtain the highest possible score.
Health Utilities Index
The Health Utilities Index (HUI) is a family of health status and preference-based HRQOL measures suitable for use in clinical and population studies (42, 43). Each member of the family includes a health status classification system, a preference-based multiattribute utility function, data collection questionnaires, and algorithms for deriving HUI variables from questionnaire responses. The most commonly used versions are the HUI Mark 2 (HUI2) and the HUI Mark 3 (HUI3). HUI3 consists of eight attributes: vision, hearing, speech, ambulation, dexterity, emotion, cognition, and pain (43). There are five to six levels per attribute. HUI3 focuses on capacity rather than performance. Multiplicative multiattribute utility functions based on community preferences have been estimated for HUI2 (45) and HUI3 (42). Evidence on responsiveness and construct validity is provided by Torrance and coworkers (45). Evidence of construct validity in the 1990 Ontario Health Survey is provided in Grootendorst (46). HUI3 described the burden of morbidity for both stroke and arthritis. Problems were found in the attributes that had been expected to be affected.
QUALITY-ADJUSTED YEARS OF LIFE
The QWB, EQ-5D, and the HUI are important because they can be used to obtain quality-adjusted life-years (QALYs). QALYs combine measures of quality of life and years of life to adjust the years-of-life measure for the diminished quality of life associated with diseases or disabilities (20). Modern measures of health outcome consider future as well as current health status. Lung cancer, for example, may have very little impact on current functioning but may have a substantial impact on life expectancy and functioning in the future. Today, a person with a malignant tumor in a lung may be functioning very much like a person with a chest cold. However, the patients with cancer is more likely to remain dysfunctional or to die in the near future. Comprehensive expressions of health status need to incorporate estimates of future outcomes as well as measure current status (47). For example, surgical intervention for emphysema may have short-term consequences that will reduce HRQOL. However, both HRQOL and life expectancy gains may result from the surgery. QALY methods can express this comprehensive effect of the treatment.
Most of the methods for obtaining QALYs are similar (35). The most commonly used methods are the EQ-5D, the HUI, and the QWB. Methods for translating SF-36 scores into the QWB scores (so that the SF-36 scores can be used to calculate QALYs) have emerged (48). After considerable discussion, the NETT selected the QWB.
INTEGRATING COST WITH OUTCOME DATA
Although treatment programs provide health benefits, they also have costs. Not all health care interventions equally return benefit for the expended dollar. Resources are limited, and good policy requires that they be used wisely. Cost-effectiveness studies have gained in popularity because health care costs have grown so rapidly in recent years. Objective cost studies might guide policymakers toward an optimal and equitable distribution of scarce resources. Methodologies for estimating costs have now become standardized (49). From an administrative perspective, cost estimates include all treatment costs and any costs associated with treating side effects. Typically, economic discounting is applied to adjust for using current assets to achieve a future benefit. From a social perspective, costs are broader and may include indirect costs of family members missing work to provide care. Comparing treatment programs for populations with a given medical condition, cost-effectiveness is measured as the change in costs of care for the new intervention compared with the existing therapy or program, relative to the change in health measured in a standardized unit such as the QALY. The difference in costs over the difference in effectiveness is the incremental cost-effectiveness and is usually expressed as the cost/QALY. Because the objective of all programs is to produce QALYs, the cost–QALY ratio can be used to show the relative health benefits from investing in different programs (35). Several different governments have proposed allocating resources based on systematic data (37). For example, the Australian government now requires evidence of effectiveness, as do a variety of European governments. In Ontario, Canada, QALYs have been considered as a basis for formulary decisions (50). Similar proposals have been considered in the United Kingdom (51).
HRQOL ASSESSMENT IN THE NETT
The NETT was one of the first randomized clinical trials of emphysema treatment to prospectively integrate quality-of-life assessment. The NETT compared LVRS plus maximal medical therapy with medical therapy alone. Four quality-of-life measures were administered at baseline and again at 6, 12, 24, 36, 48, and 60 months after treatment assignment. The trial used two generic HRQOL measures (the QWB-SA and the SF-36) and two disease-targeted measures (the SGRQ and the UCSD-SOBQ).
HRQOL at Baseline
Pairwise correlations between HRQOL measures obtained at baseline (SOBQ total score, QWB-SA total score, SGRQ total score, SF-36 physical component score [PCS] and mental component score [MCS]) were all statistically significant and ranged from 0.31 to 0.70 except for the PCS with the MCS; the PCS and MCS scores were originally derived from factor analysis and are expected to be uncorrelated (52). On average, the NETT participants had lower scores on the QWB-SA compared with a comparable normative population. With the exception of the SF-36 scores for bodily pain and MCS, all other SF-36 measures show NETT participants to be well below the population norms. For the QWB-SA and the SF-36 subscales (other than the PCS and MCS), the normative sample was older adults selected from the general population in Beaver Dam, Wisconsin (mean age, 64.1 yr) (53). Because the Beaver Dam sample did not include PCS and MCS scores, SF-36 general population norms for the U.S. population in the age category of 55–64 years (54) were used to assess the PCS and MCS scores of the NETT participants. These summary measures suggest that the NETT participants were quite ill. All of the HRQOL measures were significantly associated with the number of hospital days in the 3 months before NETT enrollment (all P values < 0.001) (52).
HRQOL Changes after Rehabilitation
The NETT evaluated HRQOL before and after comprehensive pulmonary rehabilitation. There were significant improvements on all HRQOL measures after rehabilitation. The correlations between the changes in HRQOL measures and the change in the six-minute-walk distance between baseline and the completion of rehabilitation were all modest, but statistically significant. This suggests that those who improved in exercise also improved on HRQOL measures (52). The pre- and postrehabilitation means and the effect sizes for change are shown in Table 4. Effect size is used to describe the magnitude of a treatment effect. The effect size is an important index because it is independent of the sample size and not entirely dependent on statistical significance. There are a variety of indexes of effect size, but most consider the difference between treatment and control means divided by the standard deviation of the pooled groups.
|
HRQOL after Surgery
In May 2001, recruitment into the NETT trial was halted for patients with an FEV1
20% predicted and either homogeneous emphysema on computed tomography scan or DLCO
20% predicted. A total of 1,078 patients who did not have high-risk characteristics completed the trial (56). Patients randomly assigned to LVRS obtained lower (more favorable) scores on the UCSD-SOBQ and the total score on the SGRQ. For these measures, groups began to separate by the 6-month follow-up and the differences remained significant through 60 months. Similar trends were observed for the generic QWB-SA and the physical component of the SF-36.
Overall, the NETT trial suggests that LVRS leads to improvements in HRQOL for non–high-risk patients with emphysema. The benefits may be stronger for certain subgroups of patients. Patients randomly assigned to LVRS achieved better outcomes on the SGRQ, the UCSD-SOBQ, and the QWB-SA. These findings suggest that LVRS may result in enhancements in everyday functioning, particularly in activities associated with shortness of breath.
Cost-effectiveness Analysis
The outcomes on the QWB-SA were combined with mortality outcomes to express the benefits of LVRS in terms of QALYs. These measures are the central metric for cost-effectiveness analysis (see Ramsey and coworkers, pages 406–411, in this symposium [62]). The NETT suggested that, in comparison to maximal medical therapy, LVRS produces a QALY for about $190,000, considering a 3-year time horizon. If the time horizon is expanded to 10 years, the cost–QALY was estimated to be $53,000. The cost–QALY was even more attractive for some subgroups.
CONCLUSIONS
Quality-of-life measures evolve from two different measurement traditions: psychometric theory and decision theory. Psychometric methods, such as the SF-36, typically create a profile of outcomes, whereas decision theory methods attempt to portray an integrative summary judgment of health. Decision theory methods are better suited for cost-effectiveness studies.
A review of current outcomes research for emphysema shows that quality-of-life measures are now commonly used. There is substantial evidence for the validity of these measures for patients with chronic lung diseases. The NETT showed that, in comparison to maximal medical therapy alone, non–high-risk patients undergoing maximal medical therapy plus LVRS experience improved HRQOL. Data from the prerandomization phase of the NETT indicate that patients with emphysema who complete rehabilitation experience significant improvements on several indexes of life quality. The NETT also illustrated how quality-of-life data can be used to assess outcomes in major clinical trials and how they can be applied in cost-effectiveness analysis (see Ramsey and coworkers, pages 406–411, in this symposium [62]). Analyses from NETT suggest that the cost-effectiveness of LVRS is justifiable in comparison to other medical and surgical treatments and can be as low as $21,000 per QALY for patients with upper-lobe emphysema and low exercise capacity if a 10-year time horizon is used. We urge that quality-of-life measurement become a standard part of outcomes assessment in emphysema clinical research.
FOOTNOTES
The National Emphysema Treatment Trial (NETT) is supported by contracts with the National Heart, Lung, and Blood Institute (N01HR76101, N01HR76102, N01HR76103, N01HR76104, N01HR76105, N01HR76106, N01HR76107, N01HR76108, N01HR76109, N01HR76110, N01HR76111, N01HR76112, N01HR76113, N01HR76114, N01HR76115, N01HR76116, N01HR76118, and N01HR76119), the Centers for Medicare and Medicaid Services (CMS), and the Agency for Healthcare Research and Quality (AHRQ).
Conflict of Interest Statement: R.M.K. does not have a financial relationship with a commercial entity that has an interest in the subject of this manuscript. A.L.R. received $50,000 from Boehringer Ingelheim as research grants for participating in a multicenter clinical trial.
(Received in original form June 27, 2007; accepted in final form September 12, 2007)
REFERENCES
Related articles in Proceedings of the American Thoracic Society:
This article has been cited by other articles:
![]() |
E. Kozora, C. Emery, R. M. Kaplan, F. S. Wamboldt, L. Zhang, and B. J. Make Cognitive and Psychological Issues in Emphysema Proceedings of the ATS, May 1, 2008; 5(4): 556 - 560. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |