# Community-Level Factors Associated With Racial And Ethnic Disparities In COVID-19 Rates In Massachusetts

The United States is now the epicenter of the global coronavirus disease 2019 (COVID-19) pandemic. Massachusetts had one of the highest case and death rates in the country for much of the spring,1 and early reports suggest that Black and Latino persons are being disproportionately affected, similar to patterns observed in other states.24 In response to the unequal burden of the disease, Massachusetts lawmakers recently passed Bill H.4672,5 which aims to understand the racial and ethnic health inequities that have emerged amid the pandemic.

Although Black and Latino communities in Massachusetts have shouldered a larger burden of COVID-19,2,4 evidence regarding the community-level factors that may be contributing to these disparities remains sparse. Furthermore, because most data collected by Departments of Health focus on only a few demographic factors such as age, sex, and race/ethnicity, there has been a paucity of objective data on whether COVID-19 cases are characterized by disparities along other important dimensions. There is growing concern, for example, that lower-income and Black and Latino persons may be at greater risk of exposure to COVID-19 because they are more likely to be essential workers, and also tend to live in densely-populated areas and multigenerational households.68 Among immigrant communities, there are reports of foreign born non-citizens avoiding care (including testing and advice regarding COVID-19-like symptoms) for fear of deportation or risking their future legal resident status based on new federal “public charge” regulations.9,10

Finally, long-standing historical inequities and structural racism, which have led to adverse outcomes including residential segregation and differences in access to health care, may also contribute to these disparities.1113 Therefore, an understanding of the factors leading to increased case rates among minority communities is urgently needed to inform strategies to mitigate the ongoing spread of the disease.

This study has two key objectives. First, we sought to characterize the association between the proportion of Black or Latino persons and COVID-19 cases in cities and towns across Massachusetts. Second, we evaluated what other demographic, occupational, and economic factors are associated with elevated community risk for COVID-19, and which of these factors—if any—are contributing to potential racial and ethnic disparities.

## Study Data And Methods

### Data Sources

The Massachusetts Department of Public Health database, which is publicly available on the Mass.gov website, was used to obtain the number of confirmed COVID-19 cases per 100,000 residents by city or town between January 1, 2020 to May 6, 2020.14 The state has released exact counts of confirmed cases for most towns, including those with 0 cases, but it censors values for towns with between 1 and 4 total cases and simply reports “<5 cases”; for these towns (representing 0.8% of our weighted sample), we imputed the absolute rate at the median of this range at 2.5, and divided by the population total to convert to a rate per 100,000.

The 2013–2018 American Community Survey was used to obtain demographic, economic, and occupational variables at the city and town level.15 The primary variables of interest were the proportions of people in one of four mutually-exclusive racial/ethnic groups (based on self-report): Hispanic or Latino ethnicity (referred to as Latino for brevity); Black non-Latino; White non-Latino; and Other non-Latino (approximately 70% of whom are Asian-American or Pacific Islander). Other demographic, economic, and occupational variables at the community-level were total population; average age; average household size; median household income; and the proportions of people who were age 60 years or older, male, were employed as essential workers, were foreign-born non-citizens, and completed less than a high school degree. Most foreign-born non-citizens in Massachusetts are from Latin America (44.9%), Asia (30.7%), Europe (13.9%), and Africa (7.5%).15

We classified employed residents into essential workers and non-essential workers based on the state’s classification in the governor’s March 2020 emergency order.16 The following occupations were defined as essential: health care practitioners, technical occupations, and support services; construction and extraction; installation, maintenance, and repair; material moving occupation; production services; transportation; building and grounds cleaning and maintenance; food preparation and serving-related occupation; personal care and service occupation; and protective services.

### Statistical Analysis

We categorized cities and towns in our sample into quartiles based on the proportion of Black residents, and then analyzed population-weighted descriptive statistics. We repeated this based on the proportion of Latino residents.

Next, we estimated linear regression models to assess the association between the proportions of each racial/ethnic group with the number of COVID-19 cases per 100,000 people in each city or town. Models were weighted by the population size of each town. We first performed a series of univariate analyses using each demographic, economic, and occupational variable as a predictor with COVID-19 case rates as the dependent variable. Analyses that focused on a specific race/ethnicity (e.g. proportion Black residents) as a predictor adjusted for the proportion of other racial/ethnic minority residents in a town (e.g. proportion Latino, proportion other non-Latino). Then, we performed multivariable linear regression to assess what factors were independently associated with differences in COVID19 case rates, and which of those factors—if any—attenuated the observed racial/ethnic disparities after adjustment. We present two versions of the multivariable model—first, using natural units for each covariate (e.g. proportion of people by race/ethnicity, foreign-born non-citizens, essential workers, over age 60 years old, and those with less than a high school education; number of people for household size; dollars for median income; total number of people in city or town); and second using normalized z-scores, which enable an apples-to-apples comparison of the strength of the associations for a one-standard deviation increase in each covariate.

Several sensitivity analyses were performed. Since the “essential occupations” designation aggregates a heterogeneous set of jobs that likely have different levels of risk, we tested a model that separately identified the three most common categories of essential work in our sample—healthcare-related, food services, and construction/extraction jobs.

To examine whether our results were being driven by a small number of large population areas (particularly Boston), we performed sensitivity analyses with multivariable regression models limited to the top 50 largest towns and cities, and separately, limited to the smaller towns (all other towns not in the top 50). All analyses were repeated using the logarithm of the COVID-19 rate as the outcome, and we also tested using the log of average household income as a covariate. We also tested whether excluding towns with censored COVID rates between 1 and 4 affected our results.

In addition, because data on deaths at the city/town level are not publicly available, we performed county-level analyses to assess the association between proportion of Black or Latino population and the number of confirmed COVID-19 cases and deaths per 100,000 population within a county. We also assessed the correlation between COVID-19 case rates and COVID-19 death rates within counties.14

The Harvard T.H. Chan School of Public Health Institutional Review Board deemed this study non-human-subjects research given the use of aggregated publicly available data. Analysis was performed using STATA Version 14.0.

### Limitations

There are important limitations to this study. First, in Massachusetts—as in most states and localities to our knowledge—individual-level data by race/ethnicity and other demographic, economic, and occupational variables are not currently publicly available. Since our analysis focused on community-level variables, these population-level findings may be subject to ecological fallacy. However, the broad pattern of our findings on race/ethnicity match closely the individual-level disparities evident in other patient-level data sources.17,18 Moreover, in the absence of individual-level demographic, economic, and occupational data, our evaluation of these factors at the level of the city of town provides the most granular level of analysis currently possible, and improves considerably over county or state-based analyses.

Second, although Massachusetts is a COVID-19 “hot-spot,” these findings may not be generalizable to other U.S. states. While our single-state study may limit its generalizability, there is wide variation across states in testing availability and guidelines, which may bias rates of confirmed COVID-19.19 Therefore, this analysis’s focus on a single state likely improved its internal validity to assess the predictors of COVID-19 case rates. In addition, Massachusetts has the fourth highest median income of any state and the highest level of health coverage in the nation,20,21 though it is unclear whether insurance coverage has any mitigating effect on rates of COVID-19 transmission and disparities.

Third, we focused on confirmed COVID-19 cases, since town and city COVID-19 death rates are not publicly available. Our county level analysis indicates that COVID-19 case rates and deaths rates were highly correlated, suggesting the pattern seen here for cases is likely similar for death rates. Unlike death rates, however, variation in cases could potentially be affected by differences in testing patterns across localities or other sociodemographic factors. Given known disparities in health care access and COVID-19 testing availability, the communities that we identified as being at higher risk for COVID-19 may—if anything—be less likely to receive adequate testing,19,22 which suggests that our results likely underestimate disparities in case rates. Finally, underlying differences in comorbidities across minority groups were not captured in our study; however, this would be more likely to affect disparities in death rates rather than case rates.

## Study Results

### Characteristics Of Towns And Cities

All 351 towns and cities in Massachusetts were included in the analysis. Towns with a higher proportion of Black residents and Latino residents had larger populations on average, fewer high school graduates, lower median incomes, younger residents, and more foreign-born non-citizens than towns with fewer minorities (online supplemental exhibit 1).23 Average household size was slightly smaller among high-Latino and high-Black communities compared to other communities (2.5 vs. 2.6). The proportion of people working in essential service occupations was higher among high-Black and high-Latino communities, including food and service preparation and building, grounds cleaning and maintenance services.

### Race/Ethnicity

Towns with a higher proportion of Black residents or Latino residents generally had a higher number of COVID-19 cases per 100,000 population (supplemental exhibits 2 and 3).23 Notably, the state’s 5 towns with the highest COVID-19 rates are all majority-minority, including Chelsea, which has a population that is two-thirds Latino and has the highest confirmed infection rate in the state—nearly 6 times higher than the state average.24

In the unadjusted analyses of race/ethnicity (with White non-Latino the omitted group), a 10 percentage point increase in the Black population was associated with an increase of 312.3 [95% CI 241.9 to 383.0,

$p<0.001$

] cases per 100,000 (supplemental exhibit 4).23 A 10 percentage point increase in the Latino population was associated with an increase of 258.2 [95% CI 217.0 to 300.3,

$p<0.001$

] cases per 100,000, while a 10 percentage point increase in the Other non-Latino population was associated with an increase of 86.5 [95% CI 7.2 to 169.9,

$p=0.03$

] cases per 100,000.

After multivariable adjustment, the association between the Black population and case rates persisted with a 10 percentage point higher population associated with an additional 307.2 cases [95% CI 219.6 to 394.7,

$p<0.001$

] per 100,000. In contrast, the association between the Latino population and case rates was substantially attenuated and no longer statistically significant (50.4, [95% CI -19.0 to 119.8],

$p=0.15$

). Meanwhile, after adjustment, the association between the proportion of Other non-Latino race and case rates became significantly negative (-216.4, [95% CI -321.4 to -111.3],

$p<0.001$

).

### Demographic, Economic, And Other Factors

Several factors were significantly associated with higher COVID-19 case rates across towns and cities in unadjusted models: higher average household size; and larger shares of essential workers, foreign-born non-citizens, and non-high school graduates. After multivariable adjustment, mean household size and proportion of foreign-born non-citizens were still independently associated with higher COVID-19 rates. The model using normalized values for these covariates showed that the largest absolute risk factor was the share of foreign-born non-citizens (310.4 per standard deviation, 95% CI 253.5 to 367.2) and then mean household size (236.4 per SD, 95% CI 131.9 to 340.9). Older age was also associated with additional cases while larger population size was associated with slightly fewer cases but both were weaker predictors.

Although the overall proportion of essential service workers was no longer associated with case rates in our multivariable analysis, an exploratory analysis of the three most common occupations found that employment in food service was significantly associated with higher case rates (71.4 per standard deviation, 95% CI 7.2 to 135.7) (appendix exhibit 1).23

### Sensitivity Analyses

We observed a similar association between race/ethnicity and COVID-19 case rates in the multivariable analysis when we stratified the sample into the largest 50 towns and cities and all other (smaller) towns (appendix exhibit 2).23 The proportion of foreign-born non-citizens remained the strongest predictor of case rates in the largest towns and cities, while in smaller towns, a larger mean household size was significantly associated with higher case rates. Results were generally similar using the logarithm of the COVID-19 case rate as an outcome, when excluding towns with censored values for cases between 1 and 4 individuals, when excluding Boston from the sample, and when measuring income using the logarithm of average household income (data not reported).

We also performed additional county-level analyses and found that a higher proportion of Black or Latino residents within a county was associated with a significant increase in county-level cases and deaths (appendix exhibits 3 and 4).23 There was also a strong positive correlation between the number of county-level cases and county-level deaths (Pearson’s coefficient, 0.82,

$p<0.001$

) (appendix exhibit 5).23

## Discussion

Across Massachusetts’ cities and towns, Latino and Black communities are experiencing much higher rates of COVID-19 cases. Several factors measured in our data (foreign-born non-citizen status, household size, and job type) appear to explain the higher COVID-19 case rates among Latino communities in Massachusetts. It appears that these factors may not be the primary reason for higher case rates in Black communities.

While the extent of racial and ethnic disparities has already been documented,2527 our study identifies important factors that are independently associated with higher COVID-19 case rates in the state. The proportion of foreign-born non-citizens was the strongest predictor of the burden of COVID-19 cases within a community, and in Massachusetts, this population includes sizable numbers of both Latin American (44.9%) and Asian individuals (30.7%).15 Furthermore, under the Trump Administration’s revised “Public Charge” Rule, which took effect in early 2020, lawfully present immigrants who use public benefits from local, state, or federal governments may be at risk of being denied permanent residency status. Although the U.S. Citizenship and Immigration Services website now encourages immigrants to seek care for COVID-19-like symptoms, enrollment in Medicaid at the time of COVID-19-related care may still be used in the Public Charge analysis.28 Recent studies suggest that immigrant families have strong incentives not to enroll in public health insurance like Medicaid and may avoid seeking medical care if they develop COVID-19-like symptoms and require testing.10 In the absence of a positive test, these individuals are less likely to isolate and quarantine, which may impede public health efforts to control the spread of COVID-19.29,30

These issues are likely only magnified by the fact that immigrants tend to live in larger households,31 which we also found to be an independent predictor of COVID-19 case rates. Policy approaches that reduce barriers to accessing medical care for immigrant populations and that address crowded housing—particularly when individuals have tested positive and need to be isolated—could be important avenues for reducing disparities and slowing the spread of infection.

Our work also sheds important insights into the factors that may be contributing to the higher rates of COVID-19 cases among Latino communities. As noted above, many immigrants in the Latino community may be deferring necessary care for fear of risking citizenship under the Public Charge rule or possible deportation. In addition, Latino persons tend to be disproportionately employed in essential services that are public facing, particularly in the food service industry, which we found to be associated with higher COVID-19 rates. As the state moves to cautiously re-open more businesses, greater worker protections to reduce potential exposures may be needed in these jobs. Many low-income minority workers also do not have the luxury of working from home,25 and less than half of Latino workers in the U.S. have jobs with paid sick leave.32 While Congress recently expanded the availability of paid sick leave under the CARES Act in March 2020, the law exempted workers in large firms and health care organizations, which may leave many essential workers unprotected if they contract COVID-19.33

Meanwhile, our results show large disparities in infection rates associated with Black communities, but these disparities do not appear to be primarily explained by the factors that we examined. Other factors not examined in this study may explain the disparate impact of COVID-19 in Black communities.12,13 Structural inequities, such as disproportionately high incarceration rates,34 residence in areas with a higher concentration of multiunit residential buildings,35 and de facto neighborhood segregation, which may lead to disparities in health care access and greater exposure to environmental hazards, may contribute to the spread of COVID-19 in these communities. Transportation use may also increase the risk of exposure to COVID-19, as Black workers are more likely to use public transit to commute to work.36

Evidence emerging from other U.S. cities and states has similarly described that Black and Latino populations are being disproportionately affected by COVID-19.3,25,26 In addition, preliminary work from Massachusetts suggests that these populations also have substantially higher death rates.37 Our study expands upon these findings in several ways. We identify important factors (proportion of foreign-born non-citizens in a community, household size, food service occupation) that are strongly associated with the risk of developing COVID-19. In addition, we characterize the extent to which these factors, among others, may be contributing to the higher number of COVID-19 cases in Black and Latino communities. Because we evaluate each of these unique populations separately, we find that some factors, such as occupation in an essential service field, may not affect Black and Latino communities in a similar manner. Our findings provide important insights that may inform and help tailor public health and policy strategies to address the ongoing COVID-19 pandemic.

## Conclusion

Across Massachusetts cities and towns, significant COVID-19 disparities are evident along multiple dimensions—particularly race/ethnicity, foreign-born non-citizen status, household size, and job type. Higher proportions of Black or Latino residents within a community was significantly associated with higher rates of COVID-19 cases. The factors examined in our study explained this relationship for Latino communities but did not appear to explain the higher rates among black communities. Further research into the social and economic factors underlying COVID-19-related disparities and new policies to address risk factors and institutional racism will be critical to controlling the epidemic and improving health equity.

## ACKNOWLEDGMENTS

This work was partially funded by the Robert Wood Johnson Foundation Policies for Action. Jose Figueroa reports receiving grants from the Commonwealth Fund and Robert Wood Johnson Foundation for other work unrelated to this project. Rishi Wadhera reported receiving research support from the National Heart, Lung, and Blood Institute (grant K23HL148525-1) and previously serving as a consultant for Regeneron. Robert Yeh reported receiving research support from the National Heart, Lung, and Blood Institute (grant R01HL136708); serving as a consultant to Biosense Webster; and serving as a consultant to and receiving grants from Abbott Vascular, AstraZeneca, Boston Scientific, and Medtronic. Benjamin Sommers reported receiving funding from the Commonwealth Fund, the Robert Wood Johnson Foundation, the National Institutes of Health (R01MD014970). [Published online August 27, 2020.]