Background

Retrieving health literature is the cornerstone of evidence-based practice. However, the sheer volume of available information presents a challenge to even the most skilled physicians and researchers. Many users lack knowledge of information sources, have difficulty formulating an optimal search strategy, and are short on time [1–3]. These obstacles may be even greater when dealing with an area of nephrology such as glomerular disease, which is particularly broad, multidisciplinary, and difficult to define. Indexing of articles is often inconsistent, with variable terminology used for similar clinical entities or histologic diagnoses [4]. In this case, searches need to be highly sensitive to ensure important evidence is not overlooked, while minimizing the retrieval of non-relevant articles to ensure efficiency.

Search filters are a logical way to deal with these barriers. Filters are pre-tested searches created by strategically combining individual search terms and combinations of terms to achieve optimal article retrieval for a given purpose. Many filters already exist, including those optimized to retrieve studies and systematic reviews of diagnosis, etiology, treatment, outcomes, adverse events, prognosis, and clinical prediction guides [5–14]. More recently, topic-based search filters have started to emerge [15–17]. Within the area of nephrology, search filters already exist to retrieve renal information and articles relevant to kidney transplantation [15, 18]. However, none of these filters were designed to enhance retrieval of articles relevant only to glomerular disease.

A search filter for glomerular disease would allow physicians to perform searches within a subset of articles in an online database that were preselected as relevant to this content area. For example, if a user wanted to determine the most effective immunosuppressive therapy for a case of membranous nephropathy, they could combine the terms ‘treatment membranous’ with the glomerular disease search filter to improve the precision of article retrieval. The search filter acts as an optimized substitute for topic-specific terms required to increase the sensitivity and specificity of the search and eliminates the need to enter these glomerular disease terms and synonyms in the search query (e.g., nephropathy, glomerulopathy, glomerulonephritis). This strategy, in theory, should maximize the retrieval of articles relevant to glomerular disease and minimize non-relevant articles, increasing the overall precision of each search.

We conducted this study to develop and test glomerular disease search filters for PubMed, Ovid Medline, and Embase. We then performed proof-of-concept searches to illustrate the potential effectiveness of these new filters with real physician searches in the PubMed database at large.

Methods

We used a diagnostic test assessment framework to develop and validate search filters for glomerular disease. For the purpose of this study, glomerular disease was defined as any disease in which the glomerulus of the kidney is affected, resulting in hyperplasia, atrophy, necrosis, scarring, or deposits in the glomeruli.

Sample of articles

We first established the reference standard by manual review of all full text articles published in 39 journals from 2004 to 2008. To develop this collection of journals, we adopted a similar strategy for article sampling as published in prior search filter studies. This approach has resulted in filters that generalize well over publication years and journal types [15, 19, 20]. We compiled a list of 466 journals from a list of journals that had published at least one article relevant to renal care from 1961 to 2005 [21]. We then ranked these journals according to the number of articles with relevant information and selected the top 20 journals. In addition to this, we selected 19 more journals at random from the remaining 446 journals. We then randomly divided these 39 journals into development and validation sets at a ratio of two to one respectively.
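The sampling scheme described above can be sketched as follows. This is a minimal illustration only: the journal names are placeholders, the random seed is arbitrary, and the assumption that a two-to-one split of 39 journals yields 26 development and 13 validation journals is ours, not a figure reported in the study.

```python
import random


def sample_journals(ranked_journals, n_top=20, n_random=19, seed=0):
    """Pick the top-ranked journals plus a random draw from the rest,
    then split the selection 2:1 into development and validation sets.

    ranked_journals: names sorted by number of relevant articles, descending.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    top = ranked_journals[:n_top]
    remainder = ranked_journals[n_top:]
    extra = rng.sample(remainder, n_random)  # random draw without replacement

    selected = top + extra
    rng.shuffle(selected)  # random assignment at the journal level
    n_development = round(len(selected) * 2 / 3)
    return selected[:n_development], selected[n_development:]


# Placeholder names standing in for the 466 candidate journals.
journals = [f"journal_{i:03d}" for i in range(466)]
development, validation = sample_journals(journals)
```

Splitting at the journal level, rather than the article level, keeps every article from a given journal in the same set.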

Article review

We manually reviewed all full text articles indexed in PubMed, Ovid Medline, and Embase from 2004 to 2008 for each journal in the development and validation set for relevance to glomerular disease (Additional file 1: Appendix A). These 22,992 articles included original investigations, reviews, letters, and editorials. We derived a standardized checklist of qualifications and terms to classify articles as relevant to glomerular disease from a review of nephrology textbooks and the MeSH thesaurus (Additional file 1: Appendix B). Three readers (AI, CL, AG) used this checklist to determine whether the full text of each article was relevant to glomerular disease. All reviewers were calibrated against a nephrologist (AG) in their application of checklist criteria using two test sets of 100 articles (agreement beyond chance, κ = 0.91).
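Agreement beyond chance between a reader and the reference nephrologist can be computed as sketched below. We assume here that κ denotes Cohen's kappa for two raters; the text does not name the statistic variant, and the ratings shown are toy values, not study data.

```python
def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two raters who labelled the same items."""
    n = len(ratings_a)
    if n == 0 or n != len(ratings_b):
        raise ValueError("both raters must label the same non-empty item list")

    # Proportion of items on which the two raters agree.
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n

    # Agreement expected by chance from each rater's marginal label rates.
    labels = set(ratings_a) | set(ratings_b)
    expected = sum(
        (ratings_a.count(label) / n) * (ratings_b.count(label) / n)
        for label in labels
    )
    if expected == 1.0:  # both raters used a single identical label throughout
        return 1.0
    return (observed - expected) / (1 - expected)


# Toy ratings (1 = relevant to glomerular disease, 0 = not relevant).
reader = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
nephrologist = [1, 1, 0, 0, 0, 0, 0, 0, 0, 1]
kappa = cohens_kappa(reader, nephrologist)
```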

Filters

We developed unique filters for PubMed, Ovid Medline, and Embase. We obtained the search terms used for filter development from the following sources: US National Library of Medicine (NLM) medical subject heading (MeSH) thesaurus using Medline MeSH browser [22], Medline permuted index [23], Emtree thesaurus [24], SNOMED clinical terms, nephrology textbooks [25], clinical practice guidelines [26, 27], systematic reviews [28–33], website glossaries, and clinician and librarian opinion. All terms considered potentially useful by any member of our team were included. Examples of terms used in the filters include ‘glomerulonephritis’, ‘proteinuria’, ‘nephrotic’, and ‘biopsy’. We used MeSH terms with or without major focus and with or without additional subheadings or explosion capability. Major focus refers to records in which an index term has been tagged as the major topic of the article. Entering the exploded MeSH term ‘glomerulonephritis’ means the following terms are also automatically included in the search: anti-glomerular basement membrane disease, IgA, membranoproliferative, membranous, focal segmental glomerulosclerosis, and lupus nephritis. We considered free text words as full and truncated terms and accounted for both American and British English spelling. The inclusion of multiple endings was achieved through the use of the $ symbol (for example, glomerulo$). Terms could appear anywhere in a citation, but not solely in the journal name. We repeated the same process for Embase using EMTREE index terms to replace the MeSH terms in PubMed and Ovid Medline.
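The effect of a truncated free text term can be approximated outside the database as sketched below. This is an illustration under our own assumptions, not the search engines' actual matching logic: the `$` symbol is emulated with a regular expression, the citation is a hypothetical record with made-up field names, and the journal field is skipped to mirror the rule that a term may not match solely in the journal name.

```python
import re


def truncation_to_regex(term):
    """Turn a term such as 'glomerulo$' into a case-insensitive regex.

    A trailing '$' allows any word ending; otherwise the whole word must match.
    """
    if term.endswith("$"):
        pattern = r"\b" + re.escape(term[:-1]) + r"\w*"
    else:
        pattern = r"\b" + re.escape(term) + r"\b"
    return re.compile(pattern, re.IGNORECASE)


def matches_citation(term, citation_fields):
    """True if the term occurs in any citation field other than the journal name."""
    regex = truncation_to_regex(term)
    return any(
        regex.search(text)
        for field, text in citation_fields.items()
        if field != "journal"
    )


# Hypothetical citation record; the field names are illustrative only.
citation = {
    "title": "Outcomes in membranous glomerulonephritis",
    "abstract": "We studied proteinuria and renal survival.",
    "journal": "Kidney International",
}
hit_truncated = matches_citation("glomerulo$", citation)
hit_journal_only = matches_citation("kidney", citation)
```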

We automated the process of combining and testing the filters by using a computer-implemented algorithm. We combined single term filters into multiple term filters by selectively using the Boolean operators “OR,” “AND,” and “NOT” to maximize sensitivity and specificity. We then compared the retrieval performance of various filters (made up of individual and combinations of search terms) with the reference standard from manual review in the development set.
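The Boolean combination of single term filters can be pictured as set operations on the article identifiers each term retrieves. The sketch below is not the algorithm used in the study; the article identifiers are invented, and the use of ‘transplant$’ as an exclusion term is a hypothetical example chosen only to show the NOT operator.

```python
def combine(operator, left, right):
    """Combine two sets of retrieved article IDs with a Boolean operator."""
    if operator == "OR":
        return left | right   # retrieved by either term
    if operator == "AND":
        return left & right   # retrieved by both terms
    if operator == "NOT":
        return left - right   # retrieved by the left term but not the right
    raise ValueError(f"unsupported operator: {operator}")


# Invented article IDs retrieved by each single term filter.
retrieved = {
    "glomerulonephritis": {1, 2, 3},
    "proteinuria": {3, 4, 7},
    "transplant$": {4, 9},  # hypothetical exclusion term
}

# (glomerulonephritis OR proteinuria) NOT transplant$
broad = combine("OR", retrieved["glomerulonephritis"], retrieved["proteinuria"])
combined = combine("NOT", broad, retrieved["transplant$"])
```

Each combined set can then be compared with the manually reviewed reference standard in the same way as a single term filter.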

Statistical analysis

For each filter, we constructed a two by two contingency table and assessed filter performance by calculating sensitivity, specificity, precision, and accuracy, similar to evaluation of a diagnostic test (Table 1). We then selected filters from the development phase that demonstrated high performance in either sensitivity or specificity without compromising precision and retested them in the validation set of articles.

Table 1 Two by two contingency table comparing filter to ‘reference standard’
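The four performance measures follow directly from the cells of the two by two table. The sketch below uses the standard diagnostic test definitions; the class name, the helper method, and the article counts are illustrative assumptions and not values from the study.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ContingencyTable:
    tp: int  # relevant articles retrieved by the filter
    fp: int  # non-relevant articles retrieved by the filter
    fn: int  # relevant articles missed by the filter
    tn: int  # non-relevant articles correctly not retrieved

    @classmethod
    def from_sets(cls, retrieved, relevant, all_articles):
        """Build the table from sets of article IDs.

        Assumes retrieved and relevant are both subsets of all_articles.
        """
        return cls(
            tp=len(retrieved & relevant),
            fp=len(retrieved - relevant),
            fn=len(relevant - retrieved),
            tn=len(all_articles - retrieved - relevant),
        )

    @property
    def sensitivity(self):
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self):
        return self.tn / (self.tn + self.fp)

    @property
    def precision(self):
        return self.tp / (self.tp + self.fp)

    @property
    def accuracy(self):
        return (self.tp + self.tn) / (self.tp + self.fp + self.fn + self.tn)


# Toy database of 1000 articles, 100 of them relevant per manual review.
all_articles = set(range(1000))
relevant = set(range(100))
retrieved = set(range(90)) | set(range(100, 130))
table = ContingencyTable.from_sets(retrieved, relevant, all_articles)
```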

Proof of concept searches

To illustrate the potential effectiveness of validated filters in PubMed, we selected six independent nephrologists from a directory of Canadian nephrologists provided by the Royal College of Physicians and Surgeons of Canada to execute a search for a unique predetermined clinical question. We formulated six clinical questions, each of which could be answered by a recent corresponding systematic review [28–33]. These systematic reviews were then used as a reference source for relevant articles on the given topic. For example, the question ‘What are the benefits and harms of different interventions for the treatment of renal vasculitis in adults?’ was framed to match a systematic review of thirteen articles on interventions for renal vasculitis in adults by Walters et al. [32]. We asked each nephrologist to formulate a search strategy for the given clinical question without knowledge of the search filter or database in use. We then applied these searches to the PubMed database with and without the validated filters developed as part of this study. Search dates were restricted to the date on which the review was updated. In each case, we noted the number of relevant articles identified in searches with and without the validated filters, compared with the reference standard, which in this case was the set of relevant articles as determined by each systematic review.
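The comparison of a physician search with and without a filter can be sketched as follows. This is a hedged illustration: the filter string is a placeholder rather than the published filter, the retrieved article identifiers are invented, and combining the two parts with AND in parentheses is our assumption about how the filter would be applied in a PubMed query.

```python
def apply_filter(user_query, filter_query):
    """Restrict a user's search to the subset of articles the filter retrieves."""
    return f"({user_query}) AND ({filter_query})"


def tally(retrieved_ids, review_ids):
    """Count retrieved articles against the systematic review's reference set."""
    relevant = len(retrieved_ids & review_ids)
    return {
        "relevant": relevant,
        "non_relevant": len(retrieved_ids) - relevant,
        "missed": len(review_ids) - relevant,
    }


# Placeholder only; the actual filter text is not reproduced here.
GLOMERULAR_FILTER = "<glomerular disease filter>"
query = apply_filter("treatment membranous", GLOMERULAR_FILTER)

# Invented result sets for one clinical question with three reference articles.
review_ids = {1, 2, 3}
without_filter = tally({1, 2, 3, 4, 5, 6, 7, 8}, review_ids)
with_filter = tally({1, 2, 3, 4}, review_ids)
```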

Results

Sample of articles

We used 22,992 full text articles from 39 journals (Additional file 1: Appendix A). In total, 21,300 articles contributed to the PubMed set, while 21,280 and 22,158 articles contributed to the Ovid Medline and Embase sets, respectively. We assigned 14,619 articles to the development set and 8,373 articles to the validation set.

Single term filters

We tested 261,255 single term filters. The single term filters with optimal balance of sensitivity and specificity in the development set were ‘Kidney Diseases[mh]’ for PubMed (90.2% sensitivity, 87.0% specificity), ‘exp Kidney Diseases/’ for Ovid Medline (90.2% sensitivity, 87.0% specificity), and ‘exp kidney disease/’ for Embase (95.3% sensitivity, 80.6% specificity).

Multiple term filters

We tested 736,043 multiple term filters. Our best performing filters for PubMed, Ovid Medline, and Embase are shown in Table 2, categorized by high-sensitivity and high-specificity. These filters used over 50 terms. All filters in the development set achieved 93.8-99.0% sensitivity, 95.2-98.6% specificity, 43.4-71.1% precision, and 95.3-98.5% accuracy. Filters optimized for sensitivity achieved 96.7-99.0% sensitivity in the development set and filters optimized for specificity achieved 98.4-98.6% specificity (Table 2).

Table 2 Glomerular Disease Search Filters for PubMed, Ovid Medline, and Embase Optimized for High Sensitivity and High Specificity

The performance of these filters was consistent in the validation set. All filters in the validation set achieved 91.1-96.4% sensitivity, 96.0-98.6% specificity, and 95.9-98.5% accuracy; however, precision dropped to 28.5-52.9%. Filters optimized for sensitivity achieved 94.8-96.4% sensitivity in the validation set and filters optimized for specificity achieved 98.5-98.6% specificity (Table 2).

Proof of concept searches

Selected systematic reviews included a range of 3 to 15 relevant articles. Search phrases determined and entered by the physicians included ‘(mycophenolate OR cyclophosphamide) lupus’, ‘treatment membranous’, ‘steroid HSP’, ‘renal vasculitis treatment limit to English, core journals, adults’, ‘low protein diet diabetes’, and ‘minimal change treatment’. In all proof of concept searches using the validated high-sensitivity filter, the number of non-relevant articles was reduced without compromising the retrieval of relevant articles (Table 3). In proof of concept searches using the validated high-specificity filter, there was a more dramatic reduction in non-relevant articles; however, in one case (the search ‘low protein diet diabetes’) one relevant article was not retrieved (Table 3).

Table 3 Proof-of-concept searches showing the number of relevant articles retrieved with and without glomerular disease filter

Discussion

Building on the same concepts our group has used to create novel high performance search filters for general nephrology and renal transplantation [15, 18], we have succeeded in developing and validating search filters for glomerular disease that are highly sensitive and specific. All filters achieved a balance of at least 93.8% sensitivity and specificity. Our best performing high-sensitivity filter was in Embase, achieving 99.0% sensitivity and 95.3% specificity. The best performing high-specificity filter was also in Embase, which reached 95.7% sensitivity and 98.6% specificity. In an illustrative example, physicians were able to retrieve articles with a higher degree of precision (fewer non-relevant articles) with use of these filters, without changing their PubMed search terms.

These filters are complex, often combining in excess of 50 terms with Boolean operators. Coding these filters into the PubMed and Ovid search engine interfaces will permit their easy use by anyone doing a search. In the meantime, we provide these filters at the following link: http://hiru.mcmaster.ca/hiru/hiru_hedges_nephrology_filters.aspx. As of September 2011, use of the high-sensitivity glomerular disease filter reduced the PubMed database from 21 million to 195,374 articles, and the high-specificity filter reduced this to 107,658 articles.

Depending on the search terms entered by the user, these filters may serve many purposes, which are best understood in the context of our illustrative proof of concept searches (Table 3). First, without changing the original search term(s), selecting a filter applies the search only to a subset of articles that are richer in glomerular disease content. The result is an increase in precision of the search, similar to the increase in positive predictive value of a screening test when applied to a high-risk population. This was demonstrated by the use of search terms ‘minimal change treatment’ in Table 3. Fewer non-relevant articles were retrieved with use of the filter (1236 versus 4662 articles), without impacting relevant article retrieval. Second, the filter acts as an optimized substitute for glomerular disease specific terms and synonyms, allowing users to simplify the search query. This avoids unnecessarily limiting the search due to indexing inconsistencies inherent in the terminology used to define glomerular disease. For example, if a user were searching for dietary recommendations in diabetic nephropathy, the search terms may be simplified to ‘low protein diet diabetes’, instead of searching for ‘low protein diet diabetes’ with selected terms such as ‘nephropathy’, ‘kidney disease’, or ‘glomerulosclerosis’ that may negatively impact relevant article retrieval. In this case, even without use of search terms pertaining to glomerular disease, precision of the results was enhanced (Table 3). Third, users may opt to exclude disease specific terms entirely and use the filter to address questions that potentially relate to all glomerular disease equally. An example of this may include entering ‘immunization’ when addressing the impact of vaccinations in patients with glomerular disease.

Our results also highlight that even with high performance validated search filters, a single search will rarely retrieve everything of relevance on a particular topic. There is simply too much variation in the quality of accompanying search terms entered by the user, completeness of the database, and quality and consistency of indexing. This explains why in some proof of concept searches, retrieval of relevant articles was incomplete both with and without use of the filter (Table 3). Also, the extent to which the search filter is generalizable depends upon the sample of journals selected for study and the method by which articles were defined as relevant. Our selection of journals was deliberately enriched with leading clinical nephrology journals. Although it also included a random sampling of other journals, this set of journals may not adequately represent the complete set of multi-disciplinary journals that feature glomerular disease content in PubMed. This may explain the significant drop in precision when the filter was applied to the validation set, which was a smaller database by design with a lower proportion of relevant articles. Our choice to divide articles into the development and validation sets at the journal level may have also contributed to the lower proportion of articles with glomerular disease content in the validation set. However, this approach provided insight as to what would occur if the search database were expanded to include the over 5000 journals indexed in PubMed.
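The dependence of precision on the proportion of relevant articles can be made concrete with the standard positive predictive value relationship. The sketch below uses hypothetical sensitivity, specificity, and prevalence values chosen only for illustration; they are not the proportions observed in the development or validation sets.

```python
def precision_from_prevalence(sensitivity, specificity, prevalence):
    """Expected precision (positive predictive value) of a filter.

    prevalence: proportion of articles in the database that are relevant.
    """
    true_positive_rate = sensitivity * prevalence
    false_positive_rate = (1 - specificity) * (1 - prevalence)
    return true_positive_rate / (true_positive_rate + false_positive_rate)


# Same hypothetical filter applied to two databases that differ only in
# the proportion of relevant articles.
richer = precision_from_prevalence(0.95, 0.98, prevalence=0.05)
sparser = precision_from_prevalence(0.95, 0.98, prevalence=0.02)
```

With sensitivity and specificity held fixed, the database with the lower proportion of relevant articles yields the lower precision, which is the pattern described above for the validation set.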

Proof of concept searches were used to illustrate the functionality of our best performing filters with real physician searches. In each case, the clinical questions formulated from recent systematic reviews were relevant to glomerular disease and physician searches appear typical for the average user. These examples show a gain in search strategy precision with use of the high-sensitivity and high-specificity filters through a dramatic reduction in non-relevant articles. This occurs without sacrificing retrieval of relevant articles in most cases. However, the method for defining the reference standard, based on articles used in systematic reviews of variable quality, is indirect and has not been compared with one derived from hand searching [34]. For this reason, the proof of concept searches should be viewed as illustrative examples, not as evidence of further filter validation.

These search filters for glomerular disease were designed to offer physicians and researchers a strategy to optimize results by sensitivity or specificity, depending on the level of article retrieval they deem manageable on a practical level. Filters that maximize sensitivity involve a compromise on the level of precision achieved, though this still may appeal to a researcher conducting a systematic review. For busy physicians at the point of care, we recommend starting with the high-specificity filter. To narrow results even further, physicians may prefer to use these search filters in conjunction with previously developed methods filters, such as the therapy filter for randomized controlled trials available via PubMed’s Clinical Queries section [5–14]. This approach has not been formally tested with the glomerular disease filters, but in a recent study has been shown to increase the efficiency of retrieval of articles relevant to renal care [35].

Conclusions

In conclusion, we have succeeded in developing and validating high performance search filters for glomerular disease that can be easily applied by the busy physician. We expect this will contribute to more efficient and effective evidence-based decision-making, education, and patient care. Future research is required to measure this impact, and to better understand the usefulness of these filters when used in combination with previously developed methods filters for physician searches.