Introduction

Distribute is a community-based, population-level public health information system for syndromic influenza-like illness (ILI) surveillance that displays aggregated, de-identified, public health surveillance data collected from emergency departments (EDs) by state and local health jurisdictions [1]. Distribute was first organized by the International Society for Disease Surveillance (ISDS) in 2006 as a proof of concept. With support from the Markle Foundation and the United States Centers for Disease Control and Prevention (CDC), Distribute grew from participation by 8 state and large metropolitan health jurisdictions, representing summarized data on 10% of all US emergency department (ED) visits, to a nation-wide system that currently receives data from 43 health jurisdictions that represents over 50% of the US population and summarizes more than 35% of all ED visits nationwide. Distribute currently has participation from all ten Health and Human Services (HHS) surveillance regions, includes data from over one million ED visits each week, and displays updated visualizations of ILI trends in the US on Public and Restricted access web sites.

Distribute serves as an example of a new paradigm in the collection and sharing of public health surveillance data [2]. The roots of automated syndromic surveillance systems began just prior to the 2001 Anthrax attacks [3] with systems which automatically classify clinic visits and other data according to loose "syndromic" criteria and present graphic and statistical views of summarized data based on counts of those visits, and their numbers in proportion to population and denominators derived from utilization of health care services. The literature describes both the early experience and growth of these systems [4] and their evolving design and implementation [5]. With the development of health information exchanges (HIEs), and of methods for the structuring of, and access to, regional data across multiple health care systems, public health gained access to larger sources of both visit level and summarized "syndromic" data [6]. In part, Distribute developed as a way for health departments to share and compare these summarized, syndromic data, regardless of whether those data were obtained from integration of data from individual providers or hospitals, or from a single large hospital network or HIE.

In the early development of the Distribute project, the apparent benefits of mandated standards for syndrome definitions were weighed against two often overlooked issues: barriers to entry and ability to compare data across jurisdictions. The need to adopt a mandated standard prior to joining the network created a potential technical barrier that could delay or prevent interested jurisdictions from participating. In addition, although mandated standard syndrome definitions could improve data comparison on average across the whole network, there was concern on the part of project participants that this practice might decrease accuracy and utility locally. That is, a local definition of a syndrome might best reflect local variations in coding or clinical practice that were reflected in the data, and might most accurately reflect the underlying disease being tracked. To address these issues, the Distribute project adopted the use of two separate syndromes: 1) a more narrow and specific definition, following a traditional clinical definition of ILI, and 2) another more sensitive definition, as a broad febrile, respiratory and influenza-like syndrome [7]. In a preliminary comparison, two Distribute participating sites shared local coding of their narrow and broad ILI syndrome definitions and applied each other's definitions to their own local data. The pilot findings suggested that data using locally applied syndromes were better correlated with population-level viral surveillance data [8, 9].

Utility and Usability

Utility and usability issues are related and often difficult to separate in the evaluation of information systems [10, 11]. Utility, or perceived usefulness, refers to the extent to which an information system or its output provides benefit or value [1012]. Usability, or perceived ease of use, refers to the degree of effort required to use an information system or its output [1012]. Many international standards for system design conflate usability and utility, incorporating aspects of utility and usability in a single definition [13]. Because this project is not an interaction study, we focus on the utility, or perceived usefulness, of the Distribute system and its data outputs while acknowledging that usability contributes to utility.

Current initiatives of the Distribute project place a high priority on improving the utility and usability of the information system and extending functionality to support public health decision-making and practice. Qualitative methods are important in the evaluation of health information systems [1416]. It is important to engage practitioners in a discussion of their needs and proposed system features to mitigate common informatics risk factors for failed system adoption [1719]. Following the idea that "the simplest way to assess usefulness is to ask those involved in public health practice"[20], we engaged epidemiologists and other public health practitioners in a pilot study to collect quality improvement feedback for the Distribute system. This pilot study was undertaken to inform the design of a larger quality improvement investigation by including participants who were current members of the Distribute community of practice and those interested in learning more about the Distribute system.

System Description

The objective of Distribute is to collect, analyze, and display ILI surveillance data from across the United States. Another objective is to provide ways to compare the progression of outbreaks of infectious disease between regions, and to enhance communication between health jurisdictions. Distribute displays summary level data from state and local health department ED surveillance systems in two views: Public and Restricted. The Public site provides public access graphs of weekly trends in ratios of ILI syndromes to all ED visits. The restricted site requires secure authentication for access and provides greater granularity in the time series data, primary counts as well as ratios, multiple syndromes, information about data upload history and transmission details, data timeliness, detailed visualizations based on user-specified queries and metadata that includes background information about each data provider and details of syndrome definitions. This secure view of information about all data-providing jurisdictions is available to each participating data provider on the restricted site. Distribute data providers upload data from existing surveillance systems, such as ESSENCE [21], EARS [22], RODS [23]and BioSense [24], at local and state health jurisdictions along with provider-specified syndrome definitions. Table 1 shows selected metadata elements available to data providers with restricted site access.

Table 1 Selected metadata elements from the Restricted site of the Distribute system

Methods

Setting

This study was conducted at the Washington State Public Health Association (WSPHA) Joint Conference on Health on October 11-12, 2010 in Yakima, WA USA. The conference is an annual meeting of public health practitioners that includes participants from the Washington State Department of Health and local health jurisdictions in the State of Washington. The study protocol received approval from the University of Washington Institutional Review Board.

Participants

A total of nine public health practitioners participated, representing state- (n = 5) and county- (n = 4) level public health organizations. Six participants attended a focus group and three participants engaged in brief interviews. Seven participants were current epidemiologists, health officers or other public health practitioners while two participants were former epidemiologists or health officers. Regarding prior use and/or familiarity with Distribute, four participants had never seen or used Distribute and of the five who were familiar with Distribute, only two participants had access to the Restricted site. All participants were familiar with how surveillance data are used for public health purposes. Table 2 shows a breakdown of study participants by role and data collection method.

Table 2 Study participants by role, agency level, familiarity with Distribute and method of data collection

Data Collection

We employed qualitative methods [2527] to capture participants' perceived usefulness of data, data visualizations and features of the Distribute system and solicit quality improvement feedback from participants: 1) a focus group discussion of the Distribute system during a scheduled presentation and 2) three brief, semi-structured interviews with conference attendees who fit the profile of the target user group. Following a brief project background, participants were shown different views of graphed data for US Department of Human and Health Services (HHS) Regions from the Public site and Restricted site comparisons of graphed data from data providers in the Pacific Northwest for periods of time covering the 2009-2010 H1N1 and influenza seasonal time periods back to March 2009. Participants were asked to respond about perceived usefulness of views of graphed data and to discuss how they use surveillance data in their work. Focus group and interview participants were asked the same types of questions. Those familiar with Distribute were asked questions about their perceptions of the system. Table 3 shows the types of questions asked of participants.

Table 3 Types of questions asked of focus group and interview participants

Figure 1 shows a comparison graph used during data collection. The data-providing jurisdictions in the graph are de-identified for publication. The graph shows ILI visit time-series ratios of syndromic ED surveillance data during March 2009 through April 2010 from two Northwest US jurisdictions (Sites A and B) participating in Distribute. The ratios presented are ILI syndrome visits over total visits using syndrome definitions that are jurisdiction-specific and not standardized. While the absolute ILI levels are not directly comparable, the timing and relative magnitude show ILI trends representing the emergence and early spring wave of the influenza A/H1N1 pandemic and its autumn 2009 return.

Figure 1
figure 1

De-identified jurisdiction comparison showing the emergence and return of pandemic A/H1N1 influenza, March 2009-April 2010.

Data Analysis

Notes taken by Distribute team members during the focus group and interviews were summarized and analyzed to identify patterns and themes [28, 29]. Study notes were stripped of identifying information before analysis. Conference attendees were referred to by role and an assigned study code. Specific data extracted from study notes pertained to opinions of conference attendees about usefulness, suggested features and other recommendations for improvement of the Distribute system. Themes were identified from focus group and interview notes by grouping similar responses and creating names and descriptions of the groupings [26, 28].

Results

Overall, participants were engaged and positive about Distribute as a community-based network and syndromic surveillance system. Data analysis resulted in the emergent themes, which are displayed in Table 4 along with brief descriptions of each theme.

Table 4 Themes that contribute to information system and data usefulness

With regard to standardization, participants recognized a need for a common influenza-like illness (ILI) syndrome definition in order to make comparisons between data sets more meaningful and relevant. For regional comparability, participants wanted to see views of different regions to aid in comparisons. In particular, they expressed a desire for views that support county-by-county comparisons of syndromic surveillance data, separate regional views of Western Washington and Idaho and views by preparedness regions as an alternative if representative views of HHS Regions were unavailable.

The completeness theme is described by participant desire for completeness in the data sets submitted from each data provider. Coverage refers to participant desire to know that data are representative of a given population in order to generalize findings across the population. In particular, participants expressed a need for data coverage of all Washington State. Participants noted that during the second wave of H1N1, the eastern side of the state, which includes one-third of the population, initially saw two-thirds of all cases (consistent with the graph of data for Eastern Washington in Distribute). One participant noted minimal use of Distribute due to lack of close neighbors for comparison. Context refers to the expressed need for metadata incorporated into views to facilitate understanding of graphed data. Participants suggested inclusion of the number of hospitals, emergency departments, patients, data providers and denominators for the total number of ED visits as contextual information in the graphed data views.

Participants expressed a need for trusted data, that is, confidence that data are valid and timely. They wanted to know that data were verified through defined quality assurance processes that are conducted on a regular basis. Participants reported that these data are useful for consistent events such as seasonal influenza and local data could be used to declare a local epidemic. Consistent, reliable data were cited as more useful to stand-down from an emergency than to issue an initial alert; participants hypothesized that during H1N1 the data in Distribute might have been more useful after confirmation of an actual event occurrence. Customization of available data and data processing capabilities to meet the surveillance needs of each local and state health jurisdiction was requested by participants. The ability to overlay graphs with other graphs and create labels on request was envisioned as a useful feature.

Participants acknowledged the value of the Public site as a tool to view national trends. One participant cited the need for a surveillance system with a low-impact training cost that anyone can use and that is largely automated to minimize maintenance. Requests for additional data viewed as overlays to graphs included: metadata already available elsewhere in the system, county school absenteeism rates and a state view that includes data from all clinics in the Group Health Cooperative health care system. An additional result of the focus group was the expressed desire to participate as a data contributor by one participant from a county-level health jurisdiction.

Limitations

The limitations of this pilot study include its restricted time frame for data collection and the regional population from which the sample is drawn.

Conclusion

Our results suggest themes that can be used to guide future evaluation and design iterations to improve support for public health surveillance. These results are important for improvements to syndromic surveillance of influenza-like-illness in the Distribute system but can also help improve syndromic surveillance efforts overall, regardless of the disease or surveillance system. For example, gastrointestinal (GI) indicators are currently being piloted in the Distribute system as a demonstration of system extensibility for surveillance of other diseases. Themes from this qualitative evaluation study can inform GI syndromic surveillance efforts as they are expanded within Distribute or any other surveillance system. These themes should be further explored by including public health practitioners in information design efforts. In addition, this study demonstrates that the application of qualitative methods in an "evaluation of opportunity" at a public health practice gathering can be a simple way to solicit feedback for the improvement of a working public health information system. Lastly, we found that efforts of this type can be useful in recruiting new users to participate in the system and expand the community of practice.

The community-based approach employed in the Distribute project focuses on data use and has resulted in convergence toward a recognized need for a common influenza-like illness (ILI) syndrome definition to compare data sets across jurisdictions among the Distribute community of practice. The findings of this pilot study are consistent with this trend. However, to maintain local utility of data, existing data providers need not, and should not, abandon prior syndrome definitions, but rather should submit an additional common definition while continuing to send data aggregated by existing syndrome definitions that have local meaning.

Syndromic surveillance data, if available, are used by public health practitioners as early indicators of influenza outbreaks within their own jurisdictions and adjacent health jurisdictions. These data are used in conjunction with other data sources, such as laboratory results, to triangulate disease prevalence. To aid decision-making for interventions that help contain outbreaks, improved data access and visualizations for syndromic surveillance data are needed. The context of how data are used for individual tasks is important [30, 31] and data quality cannot be assessed independent of the people who use them [32]. Information systems are part of the contexts of use for data; the utility and usability of these systems are factors in the utility and usability of data [13, 33, 34]. Three contexts of use for syndromic surveillance information systems - routine, anticipated threat and present threat - have been recognized as key inputs to tasks for analysis and characterization of syndromic surveillance data for decision-making [35]. In addition, our pilot results indicate contextual information - metadata related to hospitals, patients, data providers, ED visit counts, etc. - provides meaning of syndromic surveillance data to epidemiologists.

To better understand contexts of information system and data use, future efforts should identify the specific ways in which epidemiologists and others use metadata to discern meaning from data, the best ways to include annotations in data visualizations and different ways to display information for population health surveillance. Interviews with a larger number of participants will help refine the specific meanings of our themes, gauge reactions to anticipated results from common ILI syndrome definition efforts and explore specific needs around regionalization and other identified themes. Future work to engage a more geographically diverse population of participants will help validate these results outside the pilot region of Washington State. This future work will be informed by pilot results in two related areas: 1) usability studies to improve the design of Distribute as an information resource that an epidemiologist might check before making a phone call to a colleague in a different jurisdiction or region and 2) utility studies to assess the value of Distribute to participants, their organizations and community population health outcomes.