Establishing the pathological diagnosis of osteonecrosis of the femoral head (ONFH) has historically been given little importance. In fact, even the need to histologically examine femoral heads (that have been excised during routine hip arthroplasty) has been debated [1, 2]. Perhaps this lack of interest is because ONFH becomes significant only when it leads to secondary degenerative joint disease (DJD), and because the treatment of hip ONFH and DJD is the same. This leads to a lack of interest by the surgeon in the pathological diagnosis of femoral heads, because the diagnosis does not alter the subsequent management or outcome.

However, a few years ago, one of the authors was part of an advisory committee meeting at the FDA where a group of drugs was being investigated for a possible side effect of ONFH [3]. Although the final adjudication was based upon multiple factors, one problem that came to light during the discussion was the relative uncertainty in the baseline prevalence rates of ONFH. It became obvious that although the diagnosis of osteonecrosis may not matter to the individual patient or surgeon, it becomes important in developing prevalence data upon which future drug trials may perhaps depend.

Literature on the prevalence of ONFH is surprisingly sparse. A literature review (Medline, 1946 to present), using the key words prevalence and osteonecrosis or avascular necrosis yielded 272 articles; however, it turned out that most of these did not deal with ONFH in general patient populations. Instead, the vast majority of articles deal with specific patients such as those with kidney disease, collagen vascular diseases, immunosuppression, genetic conditions, transplantation or hematological problems amongst whom the prevalence of ONFH is expected to be higher than in the general public. It is clear that the prevalence of osteonecrosis depends on the patient mix [4]. After excluding articles that were limited to certain patient groups or diseases, there were only a handful of articles that dealt with general patient populations. These articles generally suggested that in the United States, ONFH affects about 10 % of all hips that are replaced [58]; however, some of these articles were not very clear about how the diagnoses were made (radiologically or pathologically).

In some countries, for example, Japan and South Korea, there has been an attempt to study the prevalence of ONFH at a national level [911]. In Japan, a questionnaire was sent to orthopedic departments, and in South Korea, insurance data were used. Both methods have the benefit of including patients who may not present for surgery, but the disadvantage that the use of a code for the diagnosis does not reveal whether the diagnosis was only clinically suspected or definitively made, or how it was made: radiologically versus pathologically. Studies from Asia also reveal a higher prevalence—as many as 50–60 % of arthroplasties are performed to treat ONFH [10, 12]. However, this may be the result of a lower prevalence of osteoarthritis rather than an increased prevalence of ON.

There are several problems involved in studying the prevalence of ONFH. First, the term prevalence is used by biostatisticians as a measure of the presence of a condition in a particular population. For ONFH this is not known because we only know of the presence of ONFH in the femoral heads of patients who present for surgery and not in the general population. It is possible that some patients with ONFH do not have surgery, perhaps because their symptoms are insufficiently severe or because of other comorbidities that might preclude surgery.

Another problem is encountered, because the natural history of osteonecrosis is to heal. We know from longitudinal radiological and histological studies of patients with various osteochondritidies and other forms of osteonecrosis that necrotic bone has the capacity to heal, and that this may occur with or without secondary osteoarthritis. The cases that come to surgery, therefore, are those that collapse and secondary DJD occurs before the healing. We do not know the numbers of people in whom the reverse occurs (i.e., where healing is completed before collapse). It appears plausible that the smaller the region of osteonecrosis, the more likely it is to heal. Therefore, it is quite likely that less severely affected individuals may not present for surgery [13, 14].

Ideally, therefore, the diagnosis of ONFH should be made by some non-invasive means and ideally this would have a perfect correlation with the gold standard, the pathological examination. This would also ideally be combined with a long-term follow-up of the population; however, we clearly have no such ideal modalities and it is not possible to subject the entire population to radiological studies at frequent intervals.

A third problem is that the available imaging modalities (X-ray, nuclear scans, MRI) have differing sensitivities and specificities and their interpretation may be subject to some inter- and intra-observer differences [1520]. Interestingly, in a recent study of radiological practice published in this journal, the authors also concluded that current radiological practice in diagnosing ONFH was non-uniform and unsatisfactory [21]. With new modalities being brought into practice, it becomes difficult to compare data across different time periods.

The problems with the pathological diagnosis also bear some scrutiny. This is supposed to be the “gold standard” by which other modalities are judged; however, problems in the pathological diagnosis have not been well explored. For example, unlike the radiology literature, where there are a number of studies on the reliability and precision of femoral head osteonecrosis [22, 23], we are not aware of any studies that have looked at intra- and inter-observer variability in the pathological diagnosis of osteonecrosis. Although osteonecrosis seems to be a fairly straightforward diagnosis, in the location of the femoral head there are no established quantitative criteria. This is important, because a few necrotic trabeculae often accompany osteoarthritis, but these should not be diagnosed as ONFH, as the size of the focus is not clinically significant [24, 25]. Usually, clinically significant ONFH is accompanied by bone marrow necrosis and signs of repair such as creeping substitution. Interestingly, it has also been postulated that in some cases, rapidly progressive osteoarthritis may secondarily cause osteonecrosis [26]. If this is true, then in these cases bone necrosis is an effect, not a cause, of the arthritis. It is therefore possible that the pathological diagnosis of ONFH suffers from a lack of standardization.

An additional problem that is inherent in pathology is that the correctness and frequency of a diagnosis often depends upon the intensity with which it is sought. In other words, if you don’t think of it, you won’t look for it and you won’t find it. A careful search for a focus of osteonecrosis ideally requires the femoral head to be serially sectioned and multiple suspicious areas to be examined. However, for femoral heads (as in other cases that are sent to the laboratory for documentation alone), there could be a tendency toward less careful sectioning (with the feeling that the diagnosis may not really matter). This could perhaps be compounded by the lack of interest in the surgical team. Pathologists often receive specimens for documentation alone and once documentation is over, the job is considered done. For example, an appendix that is removed incidentally might undergo one histological section, whereas if the appendix was removed for appendicitis it might undergo more (or even be totally sampled) to document the presence or absence of the clinical diagnosis.

This behavioral pattern is not limited to pathologists alone. All clinical specialties demonstrate the alteration in the intensity of looking for a diagnosis depending upon the clinical suspicion. For example, if a teenager comes to a doctor’s office stating that he or she is there because their parents told them to come for a check-up, they might undergo a physical examination, perhaps a blood test, but not much more. On the other hand, if the history provided is that of ankle swelling, fever, and pain, they might undergo a more involved work-up including X-rays and a more detailed blood test. Thus, the history provided matters in the intensity and direction of our work-up.

If this hypothesis is true, then femoral heads removed for degenerative joint disease might undergo less sectioning (perhaps one section from the articular cartilage to document osteoarthritis), but if removed for osteonecrosis they might undergo more sectioning to look for the lesion in question (and perhaps many more sections if the pathology does not correlate with the clinical impression). More importantly, if the history provided is incorrect or misleading, this might lead to missing the focus of osteonecrosis and perhaps an alternative diagnosis of degenerative joint disease (which invariably accompanies ONFH, especially in hips that present for arthroplasty). Again, perhaps this is not important in the individual case, but may significantly alter the statistics of the prevalence of ONFH.

In order to test this hypothesis we retrospectively reviewed (after IRB permission) the pathological diagnoses (pathology reports), pathology requisition forms (pathology request forms), radiological reports, and medical records of all the patients at our institution who underwent excision of at least one femoral head between January 2007 and December 2011. We found 389 femoral heads from arthroplasties performed on 357 patients. We found that the cases were fairly evenly distributed amongst our pathologists, with no discernible clustering around any specific pathologist. We were interested in the pathological diagnosis and specifically in whether previous knowledge of the clinical and/or radiological diagnosis influenced the extent of tissue sampling. We therefore reviewed the pathology requisition/request forms for information that had been provided to the assigned pathologist at the time of submission. We considered the possibility that the diagnosis that is entered on the pathology requisition/request forms might differ from the real “clinical diagnosis” made by the surgeon, as the requisition form is often filled out by the operating room staff, who may sometimes have an incomplete understanding of the case. For example, let us think of an alcoholic deep-sea diver who is on steroids for asthma and who presents with bilateral hip pain to a surgeon. The surgeon requests imaging studies, which confirm a clinical suspicion of bilateral hip osteonecrosis. This then is the surgeon’s clinical diagnosis and is documented as such in the clinical charts and medical records. Now the patient is taken to the operating room for an arthroplasty. The operating room nurse, who is filling out the pathology requisition form, writes “femoral head with DJD” because, in that hospital, most femoral heads are removed because of DJD, or because he or she fails to check the history with the surgeon. This becomes the diagnosis available to the pathologist (who is unaware of the actual surgeon’s diagnosis in the medical record).

For the study, we counted the number of tissue blocks that were sampled in each case of femoral head arthroplasty and matched this number against whether or not a diagnosis of ONFH had been provided on the pathology requisition forms. We recorded the pathologist’s diagnosis, but did not revise the pathological diagnoses (even if we felt they were incorrect) because of our purpose was to find behavioral differences related to the history and clinical information that was provided and not to determine the accuracy of the diagnosis. We found that if the requisition sheet gave a diagnosis of ONFH, then the mean number of paraffin blocks examined in these cases was 6.1 per case. Whereas, if the requisition sheet did not give any diagnosis at all, or gave a diagnosis other than ONFH, then the pathologist sampled 2.2 blocks per case. If a diagnosis of osteonecrosis was communicated to the pathologist (via the requisition sheet), but no osteonecrosis was present histologically, the mean number of blocks increased further to 11 blocks per femoral head (Tables 1 and 2). As our study spanned a period of 5 years and many different pathologists, the data are unlikely to be biased by an outlier pathologist. We therefore believe that that these data show that pathologists try to confirm a clinical diagnosis of ONFH if it is suggested.

Table 1 Blocks submitted for evaluation
Table 2 Blocks submitted for evaluation

These data support the hypothesis that sampling by the pathologist might be influenced by the clinical question that is asked. If other institutions follow similar practices, then current published baseline estimates of the prevalence of ONFH could be seriously flawed.

Interestingly, a review of the pathology and radiology reports together with the clinical charts revealed some differences in the prevalence of ONFH. Thus, despite using exactly the same patients, the prevalence differed according to the specialty that made the interpretation (Table 3).

Table 3 Diagnosis of ONFH by subspecialty

In summary, the true prevalence of ONFH is important because of the emergence of new therapeutic drugs and antibodies that might have ONFH as a side effect. However, currently available data on the prevalence of ONFH could be seriously flawed because of several reasons highlighted above. The term prevalence is incorrectly used, asymptomatic ONFH confounds the numbers, the size and other pathological criteria of ONFH are not standardized, and there are no good studies on inter- and intra-observer diagnoses amongst pathologists. The diagnosis of ONFH varies among specialties and the literature fails to mention, which methods were used for the diagnosis. We therefore advocate caution before applying historical data on ONFH to the evaluation of drugs that may either cause or prevent osteonecrosis. There is a need for a consensus definition of the histological features of ONFH and a careful study to improve our understanding of its prevalence.