ROC Study of Four LCD Displays Under Typical Medical Center Lighting Conditions
- First Online:
- Cite this article as:
- Langer, S., Fetterly, K., Mandrekar, J. et al. J Digit Imaging (2006) 19: 30. doi:10.1007/s10278-005-8149-y
- 82 Downloads
Nine observers reviewed a previously assembled library of 320 chest computed radiography (CR) images. Observers participated in four sessions, reading a different 1/4 of the sample on each of four liquid crystal displays: a 2-megapixel (MP) consumer color display, a 2-MP business color display, a 2-MP medical-grade gray display, and a 3-MP gray display. Each display was calibrated according to the DICOM Part 14 standard. The viewing application required observer login, then randomized the order of the subsample seen on the display, and timed the responses of the observer to render a 1–5 judgment on the absence or presence of ILD on chest CRs. Selections of 1–2 were considered negative, 3 was indeterminate, and 4–5 were positive. The order of viewing sessions was also randomized for each observer. The experiment was conducted under controlled lighting, temperature, and sound conditions to mimic conditions typically found in a patient examination room. Lighting was indirect, and illuminance at the display face was 195 ± 8% lux and was monitored over the course of the experiment. The average observer sensitivity for the 2 MP color consumer, 2 MP business color, 2 MP gray, and 3 MP gray displays were 83.7%, 84.1%, 85.5%, and 86.7%, respectively. The only pairwise significant difference was between the 2-MP consumer color and the 2-MP gray (P = 0.05). Effect of order within a session was not signitfficant (P = 0.21): period 1 (84.3%), period 2 (86.2%), period 3 (85.4%), period 4 (84.1%). Observer specificity for the various displays was not statistically significant (P = 0.21). Finally, a timing analysis showed no significant difference between the displays for the user group (P = 0.13), ranging from 5.3 s (2 MP color business) to 5.9 s (3 MP Gray). There was, however, a reduction in time over the study that was significant (P <<< 0.001) for all users; the group average decreased from 6.5 to 4.7 s per image. Physical measurements of the resolution, contrast, and noise properties of the displays were acquired. Most notably, the noise of the displays varied by 3.5× between the lowest and highest noise displays. Differences in display noise were indicative of observer performance. However, the large difference in the magnitude of the noise was not predictive of the small difference (3%) in the observer sensitivity for various displays. This is likely because detection of interstitial lung disease is limited by “““““anatomical noise””” rather than display or x-ray image noise.