| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|
| ||||||||||||||||||||||||
RESEARCH ARTICLE |
University of Maryland, Baltimore County.
Address correspondence to Jessica A. Kelley-Moore, Sociology and Anthropology, University of Maryland Baltimore County, 1000 Hilltop Circle, Baltimore, MD 21250. E-Mail: jkm{at}umbc.edu
| Abstract |
|---|
|
|
|---|
Methods. By using data from a mixed-mode panel study, I compared estimates of changing health inequality for Black and White older adults collected from face-to-face and telephone interviews. I calculated trajectories of physical disability by using latent growth models across seven waves of data.
Results. Face-to-face interviews yielded consistently higher reports of disability relative to telephone interviews of the same persons. Black adults had significantly greater and increasing disability than did White adults for both interview modes. After adjusting for covariates, I found that Black and White older adults had parallel disability trajectories in face-to-face interviews but a widening gap in disability over time in telephone interviews.
Discussion. Researchers should judiciously consider whether estimates of racial health inequalityand change in disability more broadlymay be misleading because of interview-mode effects.
SOCIAL surveys have long been used in order to assess the existence and extent of inequality in society across a number of domains, including income, status attainment, and health. Researchers typically make use of respondents' answers to survey questions in order to document inequality by gender, race or ethnicity, or other ascriptive social categories. The assessment of racial health inequality on a wide range of indicators often relies on respondent reports in surveys. There is inconsistency, however, across studies regarding the extent and nature of this inequality, particularly over time where it is unclear whether the racial health gap is shrinking or remaining constant. One potential source of variation that may be overlooked is group differences in response behavior across interview modes. Given the growing use of telephone surveys, could the extent of the racial gap in various quality-of-life outcomes have been underestimated in recent years? If yes, then the underestimation would appear as social changefalsely suggesting the amelioration of racial inequality.
Some research has shown that where a discrepancy exists in responses between face-to-face and telephone interviews, the difference is greater in potentially vulnerable populations, such as older adults and racial minorities (Aquilino, 1994
; Pruchno & Hayden, 2000
). If this is the case, then what researchers know about the extent of racial health inequality, particularly in older adulthood, may be biased. Despite the importance of the issue, relatively few studies have addressed it.
The present research systematically compares responses of Black and White older adults who were interviewed in telephone and face-to-face interviews as part of a seven-wave panel study in order to determine if the estimates of racial inequality vary by interview mode. The analysis permits an examination of both initial level of disability and change in disability over time. In this article, I first consider recent estimates of racial health inequality among older adults and then turn to the potential consequences of interview mode for the accurate assessment of inequality.
Measuring Inequality Accurately: The Case of Racial Health Disparity
Research on health disparities is a crucial step in guiding efforts that will reduce and ultimately eliminate racial health inequality. By most measures, older Black Americans have poorer health (including higher mortality risk), more chronic and acute health conditions, and greater physical disability than do Whites (Martin & Soldo, 1997
; National Center for Health Statistics, 2003
). Studies have also shown that the Black/White discrepancy in life expectancy at age 65 has widened in the past half century (e.g., National Center for Health Statistics). Estimates of this disparity have been relatively stable across studies because information is often based on death records and census counts.
The measurement of the racial gap for other indicators of health status is more difficult, though, owing to differences in measurement, sampling, and analytic techniques. Widely varying estimates of the same health domain across studies make it difficult to test competing conceptual or theoretical explanations of health inequality. For instance, some longitudinal studies on physical disability have shown that the racial disability gap in later life continues to increase over time (Clark, 1997
; Liao, McGee, Cao, & Cooper, 1999
), consistent with the cumulative disadvantage perspective (Dannefer, 1987
). Other studies have argued that this inequality is holding steady or even diminishing (Kelley-Moore & Ferraro, 2004
; Mendes de Leon et al., 1997
), supporting persistent inequality or selective survival explanations of health disparities. These inconsistent results make it impossible to determine definitively the extent of racial disparities in older adulthood.
Careful inspection of many of the studies of the racial gap in disability reveals that some were conducted with a mixed interview-mode design (Kelley-Moore & Ferraro, 2004
; Mendes de Leon et al., 1997
), whereas others used data that were collected in face-to-face interviews only (Ferraro, Farmer, & Wybraniec, 1997
) or in telephone interviews only (Clark, 1997
). Is it possible that the interview mode itself might shape estimates of the extent of racial health inequality? This question is important because claims of growing convergence in Black/White health inequality could actually be an artifact of interview mode. I will draw from the literature on surveying vulnerable populations in order to address this question and gauge its import on other groups of respondents.
Response Behavior of Vulnerable Populations
Assessing racial health inequality requires having adequate numbers of disadvantaged persons in the sample in order to ensure statistical power in comparisons (e.g., BlackWhite comparisons). Oversampling and procedures that heighten response rates among disadvantaged groups (e.g., use of bilingual interviewers) have greatly helped social science research on inequality. Once sampling is complete, however, there remain considerations regarding the validity of the responses across interview modes.
Previous research has revealed that discrepancies in reports from survey research may be greatest across interview modes for vulnerable populations such as older adults (Pruchno & Hayden, 2000
; Weinberger et al., 1994
), poor persons (Sullivan et al., 1995
), and racial or ethnic minorities (Aquilino, 1994
). Most research has shown that face-to-face interviews yield greater estimates of negative health behaviors, functional limitations, and depression relative to telephone interviews of these same populations. Additionally, belonging to more than one socially disadvantaged group simultaneously may increase the discrepancy in responses (Nebot et al., 1994
).
When examining differences in responses across interview modes, particularly for vulnerable populations, one finds two distinct sources of variation. First, persons who are interviewed face-to-face may be different from persons who are interviewed over the telephone. A number of the studies that have compared responses across interview modes used samples that were not designed to investigate interview-mode effects. Although some surveys had planned mixed-mode designs (Fowler, Roman, & Di, 1998
; Taylor, Wilson, & Wakefield, 1998
), others resulted from samples with nonrandom interview-mode assignment. Such modifications were typically enacted to keep reluctant or hard-to-reach participants in the sample by engaging in face-to-face interviews in lieu of the assigned telephone mode (Nebot et al., 1994
; Pruchno & Hayden, 2000
; Sullivan et al., 1995
). The endogeneity of interview mode, however, may lead to comparisons that confound data collection mode with vulnerabilitypersons with lower incomes, less education, and poorer health might be more likely to be retained in the sample with an alternative mode of interview. Whatever the case, nonrandom selection into an interview mode may obscure differences across modes.
The second source of variation is that some respondents may provide incongruent self-reports of health across interview modes. In telephone interviews, older adults are more likely to report "don't know" or to require additional interviewer assistance, perhaps reflecting difficulty hearing or other vulnerabilities (Herzog & Rodgers, 1988
). When the data are collected in a face-to-face encounter, the interviewer is able to employ visual aids and respond to non-verbal cues. Additionally, in the age of telemarketing and scams, telephone surveyseven from legitimate sourcesmay not engender trust among respondents. Indeed, minority group members and older adults tend to be suspicious of telephone callers who ask sensitive or personal questions and thus may be more likely to censor or temper their responses over the telephone (Bell, 1984
). The physical presence of the interviewer may lend legitimacy to the study and create an atmosphere of trust that is not as easily achieved over the telephone.
There is also increasing concern that many measures that were once considered universal or global may not actually elicit comparable responses from different racial or ethnic groups (Ford, Havstad, & Hill, 2000
). Respondent answers to questions about health status may reflect cultural norms about health disclosures, historic mistrust of the medical system, or cohort differences in sharing private information (Gamble, 1997
). Knowing that there are Black/White differences in rates of participation and information disclosure, it is plausible that there may also be racial differences in response behavior across interview modes, potentially yielding inaccurate estimates of the amount of inequality in a population.
In light of these differences in response behavior, the aim of this study was to systematically examine whether interview mode influences estimates of physical disability for Black and White older adults. With data from a seven-wave panel study, the present research estimates the initial level and trajectory of disability over time. I have organized the analysis around three major questions. First, are there differences between Black and White older adults in the disability trajectory by interview mode? Second, if there are differences by interview mode, which mode reflects greater disability? Third, do these differences hold after adjusting the disability trajectories for key covariates? These three questions are important steps in answering the broader question of whether current estimates of racial health inequality may be misleading because of interview-mode effects.
| METHODS |
|---|
|
|
|---|
Measurement
Table 1 lists all of the variables used in these analyses along with descriptive statistics. The primary outcome of interest is disability. At each of the seven waves, researchers asked respondents to report their ability to perform seven activities of daily living (ADLs; Katz, Ford, Moskowitz, Jackson, & Jaffe, 1963
): walking, bathing, eating, dressing, grooming, using the toilet, and transferring from a bed to a chair. Respondents reported whether they needed no help (0), some help (1), or were unable to perform the task (2) for each domain. I summed all seven ADLs to create a continuous measure of physical disability ranging from needing no help with any task (0) to being unable to perform all seven tasks (14). I eliminated from the present analyses 86 respondents who were alive and interviewed but who did not answer the ADL questions at one or more waves. Reliability of the summary score is high (
=.89).
|
I measured socioeconomic status with education and income. Education ranged from completing fewer than eight years (1) to post college (6). Current total household income had 11 categories, from less than $1,000 per year (1) to greater than $40,000 per year (11). Of participants who were eligible for the final sample, 406 did not report their income at baseline. I estimated an algorithm that predicted values for these cases and imputed whole-number values. Demographic variables included female gender, Black, rural, currently married, currently widowed, and lives alone. All of these were binary variables where 1 = the name of the variable and 0 = all others. Age was measured in years from 64 to 105. Religious service attendance was an ordinal variable ranging from never attend (1) to attend more than once per week (6).
Analyses
Analyses for the present study proceeded in two main stages. First, I calculated unadjusted intercepts and slopes for disability by regressing disability on time. I estimated models for three groups: all seven interviews, face-to-face interviews only, and telephone interviews only. I plotted these slopes by survey wave for the total sample and then separately for Black and White older adults. In the second stage of analysis I adjusted the intercepts and slopes of disability for covariates in order to determine whether any racial disparities observed in the unadjusted disability trajectories persisted. By using latent growth models in structural equations, I calculated intercepts and slopes in three models: first for all seven waves together, and then separately for face-to-face interviews and telephone interviews (McArdle & Epstein, 1987
). This stage of analysis was important because failing to adjust estimates for known covariates and predictors of disability may have led to overestimates of racial disparities in physical function over time. The models predicting the intercept and slope of disability each had one instrumental variable in order to avoid over-identification. Given the skewed distribution of the disability measures, I estimated all structural equation models on an asymptotic matrix of polyserial and polychoric correlations.
Attrition (primarily due to death) was significant in this sample of older adults. I adjusted the models with covariates for potential selection bias due to nonrandom attrition. This two-stage process uses a hazard rate instrument based on the inverse Mills ratio expressing the likelihood of not remaining in the study for all seven waves (Heckman, 1979
). First, by using a probit equation, I estimated the likelihood of completing all seven waves of the study. Second, based on that likelihood, I calculated an inverse Mills ratio for each case so that high values indicated a strong likelihood of not completing the study. I entered this variable into the substantive model as a covariate (Berk, 1983
).
| RESULTS |
|---|
|
|
|---|
|
|
These differences in reports of disability varied by race, as well. Figure 2 illustrates three regression lines calculated separately for Black and White older adults: all seven waves, face-to-face interviews only, and telephone interviews only. Again, I extended the trajectory using telephone interviews to Waves 1 and 7 in order to compare across interview modes. The unadjusted intercepts were grouped by interview mode. Face-to-face interview intercepts for both groups were the highest of the six points, with Black older adults at.59 and White older adults at.53. These were followed by both of the intercepts for the lines using all interviews. The bottom two intercepts were the telephone interview intercepts: Black older adults were at.36 and White older adults were at.37. There was little difference across racial groups in the level initially reported by interview mode. In fact, there was no significant difference between Black and White older adults on the intercepts for the face-to-face interviews. Likewise, there was no significant difference by race for the intercepts on telephone interviews.
|
|
The next step in the analysis was to determine whether the intra-individual differences in the disability trajectory by interview mode remained after controlling for covariates. Table 4 presents the model fit statistics and parameter estimates for three latent growth curve models of disability over time: one for all interviews, one for face-to-face interviews only, and one for telephone interviews only. Measures of model fit indicated that all three models fit the data well. The goodness-of-fit index and incremental fit index were both 1.00 for nearly all models. The root mean square error of approximation was well below the recommended threshold of.05 for all three models, with the lowest at.004 for the face-to-face interviews model (Kelloway, 1998
).
|
Possible reasons for these differences between the unadjusted and adjusted slopes, as well as the consistency of the intercepts across models, can be seen in the estimates of the covariates displayed in Table 4. In the model predicting disability using all seven interviews, living alone and attending religious services more frequently predicted lower intercepts of disability. Persons with greater incontinence and those with more total prevalent morbidity had higher intercepts of disability. Individuals who were at greatest risk of attrition also had significantly greater disability intercepts. Older persons and those at highest risk of attrition had greater increases in disability over time.
In the model using only the face-to-face interviews, persons who attended religious services more frequently had lower overall intercepts of disability. Consistent with the model using all seven interviews, persons who had greater problems with continence and persons with greater morbidity had higher overall disability intercepts. Slopes of disability over time were steeper for older adults, individuals with more incontinence, and persons at highest risk of attrition. More frequent attendance at religious services predicted less steep disability trajectories. In the face-to-face interviews, two new covariates emerged as strong predictors of slower trajectories of disability over time: having higher income and being married. Neither of these was significant in the first model, but, after eliminating the telephone interviews, both covariates became significant. Finally, persons at highest risk of attrition had steeper disability slopes.
The third model estimated the intercept and slope of disability using only the telephone interviews. Consistent with the model using only face-to-face interviews, persons who attend religious services less often, individuals with greater incontinence, and persons with greater morbidity had higher disability intercepts. As for the slope, Black respondents and individuals who were older experienced the greatest increases in disability over time. Some of the same significant predictors appeared in this model as in the earlier models: religious service attendance, incontinence, and morbidity. For the model that estimates only telephone interviews, persons at highest risk of attrition had significantly higher disability intercepts and higher slopes over time.
In order to test for significantly different predictors across interview modes, I estimated a multi-group latent growth model by treating the face-to-face interviews as Group 1 and the telephone interviews as Group 2. I then reestimated the model with covariates. Each covariate that had been a significant predictor of the intercept or slope for at least one interview mode was individually constrained to be equal across models. Several of the constrained relationships produced a significantly higher chi-square value for one degree of freedom, indicating a worse fit and significantly different slope estimates across groups. Having lower incomes and not being married predicted significantly steeper disability trajectories over time in face-to-face interviews relative to telephone interviews. Religious service attendance was associated with lower disability over time in telephone interviews but not in face-to-face interviews. The influence of morbidity and incontinence on disability trajectories did not differ significantly by interview mode.
| DISCUSSION |
|---|
|
|
|---|
One goal of research on racial health disparities in older adulthood is to accumulate evidence across health domains that will allow tests of competing conceptual frameworks of health inequality over the life course. There is disagreement across these frameworks regarding the degree of BlackWhite health disparities in older adulthood, with some researchers hypothesizing a growing health gap (e.g., Dannefer, 1987
) and others arguing that the gap is remaining constant or even shrinking (Kelley-Moore & Ferraro, 2004
). The results of this study indicate that interview mode presents an often unrecognized source of variability in response to health questions, leading to quite different conclusions concerning racial health disparities. The conclusion from just the telephone surveys is that health disparities continue to increase among Black and White older adults, consistent with the cumulative disadvantage perspective. However, there was no such widening among just the face-to-face interviews. Conclusions from the latter interviews would support a shrinking disability gap or, at the very least, persistent health inequality in older adulthood.
A mixed-mode panel design further complicates the study of racial health disparities. With a face-to-face baseline interview followed by a telephone interview, it would be difficult to show an increase in inequality because the follow-up estimates would likely be lower than those projected from the face-to-face interview(s). Researchers should cautiously consider claims of shrinking disparities from mixed-mode interview studies. The Established Populations for Epidemiologic Studies of the Elderly provides consistent evidence within mode that disability is increasing. The BlackWhite disability gap increases in these data, but this generalization would be hard to support without face-to-face follow-ups integrated into the study design.
Although in the present study the estimates of disability were lower by telephone, the difference between face-to-face and telephone disability estimates narrowed somewhat over time, especially among the Black older adults. For White respondents, the two telephone interviews following the face-to-face interviews always led to the appearance of less disability. Not so for Black respondents. For Waves 13, the pattern was similar to that for White participants, but by Waves 5 and 6, Black respondents' disability reports were equal to or higher than the Wave 4 estimates gathered in the face-to-face interview. This intriguing finding shows that the responses to disability questions across interview modes became more concordant for Black older adults, perhaps due to the growing rapport that accompanied repeated interviewing. Generally, one of the keys to the inclusion and retention of minority participants in research, particularly Black older adults, is the development of trust. This, coupled with a familiarity with data collection procedures, may diminish the discrepancy in responses across interview modes (Nápolis-Springer et al., 2000
). Black participants may begin a study with greater concern about the use of information obtained, but once rapport is established in a panel study, telephone interviews may work as well as face-to-face interviews for assessing health problems among Black older adults. For White persons, however, there appears to be no substantial gain in rapport by repeated interviewing.
Survey interview mode has further implications for the measurement of physical function among older adults, independent of race. Considering just the first three interviews would give the appearance of a decline in disability. Separating the data by interview mode, however, shows that this apparent decline is actually due to the use of telephone interviews following a face-to-face interview. Nevertheless, it is difficult to discern real change from interview-mode effects in the absence of repeated measures from each mode. Although disability tends to be progressive over time, recovery of ADL functioning is not uncommon among older adults as a result of the healing of broken hips, physical therapy after a stroke, or long-term adjustment to an amputation. Sharp spikes in disability can also occur with the onset of new health conditions, a fall, or terminal drop. The question is not whether change occurs, but how much variation in these scores can be attributed to actual change in function versus an artifact of a mixed-mode study design.
Indeed, several covariates differed significantly across modes, indicating that the predictors of disability were not identical for face-to-face and telephone interviews. Having lower incomes and not being married led to greater disability over time, but just for face-to-face interviews. In telephone interviews, religious service attendance predicted significantly lower disability levels and trajectories over time relative to the in-home interviews. Thus, the greatest discrepancies in reports of disability were among persons who were most socially vulnerable (low income, not married, infrequent religious service attendance), which is consistent with other studies (Pruchno & Hayden, 2000
; Weinberger et al., 1994
). Individuals with a greater number of health problems or with incontinence reported similar disability levels across interview modes. In fact, older age no longer predicted significantly higher initial levels of disability in either interview mode once morbidity was included in the model. This may indicate that declining health, including physical function, is a salient concern that outweighs other potential sources of variation.
The differences between face-to-face and telephone interviews observed in this study may be more far reaching than appears on the surface due to planned mixed-mode designs of some studies. Consider two national, federally funded panel surveys, the Longitudinal Study on Aging and the Longitudinal Study on Aging II. In both instances, the baseline survey was a face-to-face interview, conducted in tandem with the National Health Interview Survey, but researchers administered all of the follow-ups by telephone. Based on the present findings, the only way one can support an assertion of disability decline is by mixingthen ignoringmodes of interviewing. To give another example, the panel study of Asset and Health Dynamics Among the Oldest Old switches from telephone to face-to-face interviews when the participant turns 80 years old. It may be judicious in that study to interview persons of advanced age in a face-to-face interview, but based on the findings of the present study, one should not be surprised to see an apparent increase in disability when switching from a telephone to a face-to-face interview.
Nonrandom missing data can affect the measurement of racial health disparities in any panel study, but it is particularly important for mixed-mode panel studies. Analyses that include only those who are healthy enough and willing to complete the panel study in the assigned interview mode may underestimate the extent of disparity over time. In order to counter this, I adjusted the estimates for two potential sources of selection bias: nonrandom attrition from the study and the requirement of an alternate interview mode (e.g., the need for a face-to-face interview instead of the assigned telephone interview). I found that respondents who participated in the assigned interview mode for all seven waves had lower initial levels and less steep trajectories of disability over time. Black respondents had a higher risk of both mortality and requiring an alternate interview mode, suggesting that the surviving and participating Black older adults were a more selectively healthy subsample. However, even after I had adjusted for selection bias, Black older adults continued to have significantly steeper trajectories of disability over time than did White older adults when examining just the telephone interviews. A more detailed examination of the impact of nonrandom attrition on disability appears in Kelley-Moore and Ferraro (2004)
.
Readers should consider two major limitations to this study when interpreting these results. First, the face-to-face and telephone interviews were not of equal length. Interviews for Waves 1, 4, and 7, which researchers conducted in the home, were longer and more in-depth than those for Waves 2, 3, 5, and 6, which investigators conducted by telephone. The differences in responses across waves could have been partially due to the respondents feeling more comfortable talking about physical function because there had been more questions leading up to that section of the interview. Future studies should confirm the present results using questionnaires that are of identical length and format across modes and waves. Second, the time lag between each wave was one year. Among older adults, especially the oldest participants, health can change drastically in one year. Two types of studies would be helpful in further examining the conclusions of this study derived from 1-year follow-ups. First, studies with shorter follow-up periods would be helpful because they would reduce the amount of change that would likely occur. Second, studies that ask about intervening events (e.g., illness or treatment episodes) could help identify changes that are plausibly related to the outcome rather than measurement procedures.
When measuring racial health inequality, researchers have often taken for granted the comparability of telephone and face-to-face interview modes. The present results show that mode of data collection can lead to variations not only in person-specific responses across modes of data collection but also in the calculation of entire trajectories over time, resulting in disparate conclusions about the persistence of inequality over time. Although racial health disparities grew over time within each interview mode for these data, analyses that ignore the effect of interview mode on subject responses would lead to an underestimate of racial inequality.
| Acknowledgments |
|---|
| Footnotes |
|---|
Received for publication June 27, 2005. Accepted for publication January 3, 2006.
| References |
|---|
|
|
|---|
| ||||||||||||||||||||||||
| HOME | ARCHIVE | SEARCH | TABLE OF CONTENTS |
|---|