Introduction

During training, pathologists are taught to recognize microscopic and macroscopic patterns that serve as established diagnostic features of disease and indicators of disease progression. Advancements in biomedical research have led to the discovery of cell biological and genetic drivers underlying pathology, the diagnostic features of which may not be quantifiable by routine visual inspection. In recent years, artificial intelligence and machine learning approaches have demonstrated an impressive ability to quantify and classify data derived from histologically stained tissue images, with improvements in speed, objectivity, and stratification compared with traditional diagnostic routines. Such tools include automated segmentation pipelines and diagnostic workflows designed by digital pathology companies such as PathAI, Visiopharm, and Aiforia1,2,3. These approaches may offer a bridge between laboratory findings and clinical practice4,5. To make this translation of computer-based approaches into clinical pathology settings a success, a platform is needed that allows computationally derived features to be interpreted within a biological framework6,7.

Existing computational image analysis tools rely heavily on convolutional neural network (CNN)-based approaches, which extract features from acquired images and enable continuous accretion of knowledge8,9. Whether or how the resulting image features are relevant to disease diagnosis, stage, or mechanism is often unclear when using a CNN. By contrast, hand-engineered image features selected to quantify size, shape, color, and texture characteristics (the latter defined as the spatial variation of pixel intensity levels within histological sub-compartments) can provide informative data for the evaluation of disease processes10. These features can be analyzed by advanced machine learning tools borrowed from areas other than image analysis, such as text data mining using recurrent neural networks and transformer models11,12. By selectively providing computational models with quantitative values describing histological sub-compartments of interest, the resulting models are more interpretable than CNNs that receive raw image data as input11,12,13,14,15. Making computational models accessible to pathologists through a set of interpretable features in an easy-to-use, plug-and-play format will further engage pathologists in computational research, integrating the most robust characteristics of both classical and modern diagnostic approaches and delivering a translational tool for research and clinical applications.

Toward that aim, we have developed HistoLens, an open-source graphical user interface tool to enhance histological image investigation and biomarker discovery. Using an extensive set of 448 hand-engineered features and their associated visualizations, the application enables users to interpret combinations of quantitative image features and to visualize the informative regions within images (Supplemental Table 1). Because these informative regions are determined using image analysis methods closely tied to the calculation of specific hand-engineered features, their consistency and robustness can be assured. This visualization of hand-engineered features greatly increases the interpretability of features that HistoLens identifies as valuable. Each feature is designed to summarize a particular quantitative characteristic of a sub-compartment within a single structure. We also provide an example for future developers of how to add their own features to HistoLens, whether for calculating different characteristics of Periodic Acid Schiff (PAS)-stained histology or for quantifying other common histological stains such as Silver (Supplemental Document 4).

The ability of HistoLens to identify and visualize differences among human glomeruli is demonstrated by distinguishing two distinct nodular glomerular diseases—diabetic nephropathy (DN) and amyloid nephropathy (AN)—from the histologically unremarkable glomeruli observed in minimal change disease (MCD). Additionally, we demonstrate the ability of HistoLens to characterize subtle glomerular differences in the Tg26 mouse model of HIV-associated nephropathy (HIVAN). In Supplemental Document 4, we provide an additional example of how HistoLens can be used to characterize Silver staining in subjects diagnosed with IgA nephropathy.

Methods

Human data collection followed a research protocol approved by the Institutional Review Boards (IRBs) at the University of Florida (Gainesville, FL), Johns Hopkins Medical Institutions (Baltimore, MD), and the University of Michigan (Ann Arbor, MI). Animal studies were performed in accordance with protocols approved in advance by the Institutional Animal Care and Use Committee at the National Institutes of Health (Bethesda, MD).

Software overview

HistoLens is a stand-alone desktop application for hand-engineered image feature analysis, quantification, classification, and visualization in digital pathology images (Fig. 1). HistoLens accepts as input digital histology whole-slide images (WSIs) along with WSI micro-compartmental annotations at varying scales (from cell nuclei to larger structures, such as glomeruli, in the context of kidney pathology) in Aperio ImageScope Extensible Markup Language (XML) or QuPath JavaScript Object Notation (GeoJSON/JSON) annotation format. Annotations can be generated manually or through deep learning models implemented in cloud platforms such as Histo-Cloud or FUSION (Functional Unit State Identification for WSIs)16,17. Software applications that support the recording of digital pathology annotations include open-source applications, such as QuPath and the Automated Slide Analysis Platform (ASAP), and commercial applications, including Pathomation (Antwerp, Belgium) and Aperio ImageScope (Leica Biosystems, Wetzlar, Germany), among others18,19,20. If no annotations are found in the same folder as the WSIs, users can use the annotation tool included in HistoLens to interactively annotate multiple different types of structures. The resulting annotations can then be saved in Aperio ImageScope XML format, GeoJSON format, or the Histomics format employed in Histo-Cloud.
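
For readers who want to work with these annotation formats outside of HistoLens, the sketch below shows one way to read polygon annotations from an Aperio ImageScope XML file. The tag and attribute names reflect the commonly used ImageScope layout and are assumptions for illustration; this is not a description of HistoLens' own parser.

```python
# Illustrative sketch: read Aperio ImageScope XML annotations into polygon lists.
# Assumes the common <Annotations><Annotation><Regions><Region><Vertices><Vertex X Y>
# layout; tag/attribute names may differ between exporters.
import xml.etree.ElementTree as ET

def read_aperio_annotations(xml_path):
    """Return a dict mapping annotation layer Id -> list of polygons [(x, y), ...]."""
    root = ET.parse(xml_path).getroot()
    layers = {}
    for annotation in root.findall("Annotation"):
        polygons = []
        for region in annotation.findall(".//Region"):
            vertices = [(float(v.get("X")), float(v.get("Y")))
                        for v in region.findall(".//Vertex")]
            if vertices:
                polygons.append(vertices)
        layers[annotation.get("Id")] = polygons
    return layers
```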

Figure 1

HistoLens workflow. A new experiment using HistoLens is initialized by providing the file paths to a directory of whole slide images (WSIs), together with their associated annotation files and slide metadata. User-provided file paths and a record of analyses are saved into an experiment file that can be quickly loaded, so as to pick up where a previous analysis ended.

Computational analysis in HistoLens includes interactive sub-compartment segmentation (extracting biologically relevant sub-compartments in each image, for example, nuclei, eosinophilic regions, and luminal space), color normalization, feature extraction, and feature ranking, all linked to patient-level or tissue-level labels21. Sub-compartment segmentation is performed using an interactive procedure that minimizes the number of user-supplied parameters and therefore simplifies the image analysis process. These parameters include colorspace, intensity threshold, minimum object size, and a splitting parameter applied only to overlapping nuclei. Users can also modify sub-compartment segmentation parameters for individual slides if staining or imaging conditions vary within their dataset. If a user is confident that staining characteristics are consistent across slides, they may instead bypass the remaining slides and project the current segmentation parameters onto subsequent slides prior to feature extraction. The resulting data (sub-compartmental segmentation parameters and their associated quantitative hand-engineered features) can then be analyzed in HistoLens for structure classification and interactive visualization for biological discovery.
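
As an illustration of this style of parameterized segmentation (not HistoLens' own implementation), the following Python sketch segments nuclei from an RGB patch using an intensity threshold, a minimum object size, and a watershed-based splitting step for overlapping nuclei. All parameter values are placeholders, and a fixed grayscale conversion stands in for the colorspace choice.

```python
# Illustrative nuclei segmentation mirroring the exposed parameters
# (colorspace, intensity threshold, minimum object size, nuclei splitting).
# Threshold and footprint values are placeholders, not HistoLens defaults.
import numpy as np
from scipy import ndimage as ndi
from skimage import color, morphology, segmentation, feature

def segment_nuclei(rgb_patch, threshold=0.5, min_size=50, split_footprint=7):
    """Return a labeled nuclei mask from an RGB patch with values in [0, 255]."""
    # Colorspace step: nuclei are dark, so threshold the inverted luminance.
    gray = color.rgb2gray(rgb_patch / 255.0)
    nuclei = (1.0 - gray) > threshold                       # intensity threshold
    nuclei = morphology.remove_small_objects(nuclei, min_size=min_size)
    # Split touching nuclei with a distance-transform watershed.
    distance = ndi.distance_transform_edt(nuclei)
    peaks = feature.peak_local_max(distance,
                                   footprint=np.ones((split_footprint,) * 2),
                                   labels=nuclei)
    peak_mask = np.zeros(distance.shape, dtype=bool)
    peak_mask[tuple(peaks.T)] = True
    markers, _ = ndi.label(peak_mask)
    return segmentation.watershed(-distance, markers, mask=nuclei)
```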

The main window of HistoLens consists of three major panels (Fig. 2). The first panel presents an annotated microanatomic structure pulled from the WSI, together with any hand-engineered feature visualizations, which are displayed as an overlaid heatmap contained within a rectangular region of interest (ROI). The annotated structure in this display can be changed in three ways: users can (1) select a different annotated image pulled from the WSI using the accompanying annotation file by selecting its label in the list below the displayed image, (2) click the “Next Image” or “Previous Image” buttons, or (3) tap the right or left arrow keys on the keyboard.

Figure 2

HistoLens main window with select interactive components highlighted. (A) The main image viewing area. The current microanatomic structure, pulled from the whole slide image (WSI) in accordance with the annotation in the provided annotation file, is displayed together with the feature visualization (with adjustable transparency) and the relative feature intensity region of interest (ROI). (B) The toolbox, including the feature distribution plot, statistical summaries, annotations, and modeling capabilities. (C) The selected structure’s feature value relative to the whole dataset is indicated in the Feature Distribution Plot. (D) Hand-engineered features are organized in a hierarchy by sub-compartment and feature type. Users can select individual features in the Specific Feature column for visualization (B), combine features by sub-compartment or feature type, or add selected features to a custom list (E). (E) Custom list where users can add features across sub-compartments and categories for specific analyses.

The second panel, located under the main visualization window, is a series of lists titled “Sub-compartment,” “Feature Type,” “Specific Feature,” and “Custom List.” By selecting an item in the “Sub-compartment” list, users can access the relevant sub-compartments of interest for quantifying associated hand-engineered features and feature visualizations. The “Feature Type” list contains the categories of features (e.g., size, morphology, texture, color) included in that sub-compartment. In the “Specific Feature” list, users can make specific image feature selections. Clicking the “Add to Custom” button, located beneath “Custom List,” adds the selected specific feature to the “Custom List.” Adding features to this list allows users to select combinations of hand-engineered features that are not in the same sub-compartment or feature type. Located next to the four lists are four buttons labeled “View Sub-Compartment Features,” “View Feature Type,” “View Specific Feature,” and “View Custom List.” Clicking these buttons controls which features are currently shown in the visualization panel, as discussed next.

The third panel is a toolbox which allows users to generate figures, build classification models, and generate image annotations for downstream analysis. The major tools are separated on tabs labeled, “Feature Distribution Plot,” “Statistical Measures,” “Classification Models,” “Relative Feature Intensity,” “Add Annotations,” and “Feature Definitions.” Details of these tabs are provided below.

Feature distribution plot This tab provides plots of the distribution of the current/selected hand-engineered features across a dataset. Labels on the plot show users the location of the selected image in the main display relative to the rest of the dataset. When a single feature is selected using the “View Specific Feature” button as part of the second panel, the data distribution for that feature is presented as a violin plot in which the points correspond to the annotated structures in the dataset.

When more than one feature is selected using the “View Feature Type,” “View Sub-Compartment Features,” or “View Custom List” buttons, scatterplots are used to show either the differences between two features or between the first and second principal components of the dimensionally reduced feature set. Included in this tab is a second set of tabs, located below the feature distribution axes, for selecting the data included in the plot. These tabs include the following five options: switching how each point is labeled, selecting regions in the feature space for making qualitative comparisons between sub-clusters displayed in the Feature Distribution Plot, labeling the data scatter according to slide metadata, adding additional labels from an external file or classification model, and removing outliers (with a table displaying the names of the removed images). Using these tools, trends in the current/selected features can be identified and evaluated more easily.

Statistical measures The second tab provides statistical comparisons, including a summary of descriptive statistics for the current feature (minimum, maximum, mean, median, etc.) grouped by the current labels in the feature distribution plot. When a single feature is viewed, a statistical test provides users with a quantitative measure of statistical significance; the test implemented depends on the number of classes in the current labels. For two classes, a Student’s t-test is performed, and for three or more classes, a one-way ANOVA is performed22,23. For multiple features, the silhouette index for each class is displayed to quantify data clustering relative to other classes24. Principal component analysis is also provided in the form of the percentage of variance explained by the first two principal components, together with the coefficients for each feature. To view the individual contributions of specific features to the principal components, users can select features for a biplot (a scatterplot that overlays observations and feature loadings) on the feature distribution plot25. Arrows indicate the direction and magnitude of the coefficients for each feature.
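
The following Python sketch mirrors the described test-selection logic and principal component summary; it assumes feature values in NumPy arrays with one row per structure and is illustrative rather than a transcription of HistoLens' code.

```python
# Illustrative sketch of the per-feature statistics described above: descriptive
# statistics per class, a Student's t-test for two classes, a one-way ANOVA for
# three or more, and the variance explained by the first two principal components.
import numpy as np
from scipy import stats
from sklearn.decomposition import PCA

def single_feature_statistics(values, labels):
    classes = np.unique(labels)
    groups = [values[labels == c] for c in classes]
    summary = {c: {"min": g.min(), "max": g.max(), "mean": g.mean(),
                   "median": np.median(g)} for c, g in zip(classes, groups)}
    # Test selection depends on the number of classes in the current labels.
    if len(groups) == 2:
        _, p_value = stats.ttest_ind(*groups)
    else:
        _, p_value = stats.f_oneway(*groups)
    return summary, p_value

def pca_variance_summary(feature_matrix):
    pca = PCA(n_components=2).fit(feature_matrix)
    explained_pct = 100 * pca.explained_variance_ratio_  # % variance in PC1 and PC2
    return explained_pct, pca.components_                # per-feature coefficients
```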

Classification models Using the features displayed in the feature distribution plot, a user can implement several classification models, including decision trees, discriminant analysis, a naïve Bayes classifier, and a simple neural network26,27,28,29. The performance of each of these models for a particular image set is displayed using a confusion matrix for categorical labels and a violin plot of error for continuous labels. A performance report is also generated that includes other performance metrics of the model, including accuracy, sensitivity, specificity, F1 score (defined as the harmonic mean between precision and recall, with values ranging from 0 to 1), and the number of structures in each slide that are predicted to be from a certain class30. Saving the model, by clicking the “Save Model” button, generates a new directory in the slide directory containing the model object and the performance report (‘.xlsx’ file). This report also includes the features used to generate the model and predicted labels for each member of the test set. Additional information on the Classification Models workflow is provided in Supplemental Document 1.
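
A minimal Python sketch of this workflow is shown below, using common scikit-learn implementations of the listed model families and reporting a confusion matrix plus accuracy, sensitivity (recall), and F1 score; the model settings and train/test split are illustrative assumptions rather than HistoLens' defaults.

```python
# Illustrative sketch of fitting the listed classifier families and collecting
# the metrics included in the performance report. Settings are placeholders.
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.naive_bayes import GaussianNB
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import confusion_matrix, classification_report

def evaluate_models(features, labels, test_size=0.2, seed=0):
    X_tr, X_te, y_tr, y_te = train_test_split(
        features, labels, test_size=test_size, random_state=seed, stratify=labels)
    models = {
        "decision_tree": DecisionTreeClassifier(random_state=seed),
        "discriminant": LinearDiscriminantAnalysis(),
        "naive_bayes": GaussianNB(),
        "neural_network": MLPClassifier(hidden_layer_sizes=(64,), max_iter=1000,
                                        random_state=seed),
    }
    reports = {}
    for name, model in models.items():
        y_pred = model.fit(X_tr, y_tr).predict(X_te)
        reports[name] = {
            "confusion_matrix": confusion_matrix(y_te, y_pred),
            # accuracy plus per-class precision, recall (sensitivity), and F1 score
            "report": classification_report(y_te, y_pred, output_dict=True),
        }
    return reports
```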

Relative feature intensity This tab provides users with a quantitative description of intensity, measured here as the area of the hand-engineered feature visualization present within the rectangular ROI shown in the first panel, for each feature. For individual features, the plot displays the distribution of intensities contained within the feature visualization for that image. For multiple features, the quantification instead shows the relative content for each feature in the enclosed region compared to the rest of the image. Additionally, the relative values displayed in the histogram are recorded, along with feature name and feature rank, in table format. Users can use this tab to locate the regions that are influential for highly ranked features, as determined using a chi-square test (the default) when new labels are imported through the “Add Labels” tab31.
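
The chi-square ranking described here can be sketched as follows, assuming a matrix of feature values and imported labels; the non-negative scaling step and function names are illustrative and not drawn from HistoLens' implementation.

```python
# Illustrative sketch: rank hand-engineered features against imported labels with
# a chi-square test. Features are min-max scaled because chi2 requires
# non-negative inputs.
import numpy as np
from sklearn.feature_selection import chi2
from sklearn.preprocessing import MinMaxScaler

def rank_features_chi2(feature_matrix, labels, feature_names):
    scaled = MinMaxScaler().fit_transform(feature_matrix)
    scores, p_values = chi2(scaled, labels)
    order = np.argsort(scores)[::-1]           # highest chi-square score first
    return [(feature_names[i], scores[i], p_values[i]) for i in order]
```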

Adding annotations In this tab, users can define custom classes and annotate more specific regions in the current image. These regions are saved locally as binary masks for each user-specified class, in which annotated pixels have a value of one and non-annotated, background pixels have a value of zero. The resulting folder structure separates annotated images by slide, with subfolders corresponding to the name of the annotated class in each mask. These binary masks can be used in downstream deep-learning pipelines for segmenting different types of lesions or specific areas within annotated compartments. This tab also allows users to transfer the current image annotation to another annotation layer in the original annotation file associated with the WSI. This feature can be used to assign structures to a separate group within the study or to create a new structure in the WSIs.
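
The sketch below illustrates one way the described mask output could be organized on disk; the exact folder naming used by HistoLens may differ.

```python
# Illustrative sketch of writing per-class binary masks to a
# <output_dir>/<slide_name>/<class_name>/<image_name>.png layout.
import os
import numpy as np
from skimage import io

def save_annotation_mask(mask, output_dir, slide_name, class_name, image_name):
    """mask: boolean array, True where the user annotated the given class."""
    class_dir = os.path.join(output_dir, slide_name, class_name)
    os.makedirs(class_dir, exist_ok=True)
    # Annotated pixels -> 1, background -> 0, stored as an 8-bit image.
    io.imsave(os.path.join(class_dir, image_name + ".png"),
              mask.astype(np.uint8), check_contrast=False)
```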

Feature definitions The last tab contains written definitions for the current feature or features under observation as well as a description of how each feature’s visualization was generated (see Supplemental Table 1). A key for interpreting the columns of Supplemental Table 1a is provided in Supplemental Table 1b.

Experimental design

We analyzed the ability of HistoLens to recognize biologically significant characteristics of histological structures in two distinct experiments: one using a human dataset consisting of DN, MCD, and AN cases, and the other using a murine dataset consisting of HIVAN and wild-type (WT) control mice.

Tissue sectioning, staining, & imaging Tissues were sectioned at 2–3 µm thickness for staining and imaging. The tissues were stained primarily with PAS and hematoxylin counterstain. The slides were scanned using a brightfield microscopy WSI scanner, NanoZoomer 2.0-HT (Hamamatsu, Shizuoka, Japan). The pixel resolutions of the images used were 0.26, 0.25, and 0.51 µm/pixel.

Human DN, MCD, and AN In the first experiment, we examined whether and how HistoLens distinguishes features of nodular sclerotic glomerular pathologies (DN and AN) from those of histologically normal glomeruli (MCD). This comparison was selected because the visual appearances of glomeruli in DN and AN are readily distinguishable from MCD using PAS staining visualized by light microscopy (Fig. 3)32,33. Additionally, a well-described set of distinct diagnostic features that identifies the presence of DN and AN already exists33. The primary feature of significance is the extent and quality of the glomerular mesangium. In glomeruli manifesting nodular diabetic glomerulosclerosis, nodular mesangial matrix expansion is apparent, in extreme cases forming Kimmelstiel–Wilson nodules33. In contrast, amyloid glomerulopathy, which also presents a nodular pattern, is characterized by deposition of non-matrix material in the mesangium and along capillary walls34. By light microscopy, amyloid is generally PAS-weak compared with the strong PAS-positivity of nodular diabetic glomerulopathy.

Figure 3

Nine example glomeruli each from cases of (A) MCD, (B) DN, and (C) AN. (D) Arrows highlight regions of nodular glomerulosclerosis in DN (PAS-strong) and AN (PAS-weak) that are not present in MCD. Scale bar indicates 50 µm.

In HistoLens, the distinguishing features among DN, AN, and MCD can be grouped into two categories: mesangial size features and mesangial color features. For each category, several metrics can be quantified to identify more detailed and objective differences among the samples. These features include measures of PAS + area (the number of pixels contained within segmented PAS + regions) and PAS + region color statistics (mean and standard deviation of red, green, and blue [RGB] values).
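
For concreteness, these two feature categories can be computed from a glomerulus image and its PAS + segmentation mask roughly as follows; this is an illustrative sketch rather than HistoLens' implementation.

```python
# Illustrative sketch of the mesangial size and color features named above:
# PAS+ area as a pixel count, and the mean/standard deviation of RGB intensities
# within the PAS+ mask.
import numpy as np

def pas_size_and_color_features(rgb_image, pas_mask):
    """rgb_image: H x W x 3 array; pas_mask: boolean mask of PAS+ pixels (non-empty)."""
    pas_pixels = rgb_image[pas_mask]                  # N x 3 array of PAS+ RGB values
    features = {"pas_area_pixels": int(pas_mask.sum())}
    for i, channel in enumerate(("red", "green", "blue")):
        features[f"pas_mean_{channel}"] = float(pas_pixels[:, i].mean())
        features[f"pas_std_{channel}"] = float(pas_pixels[:, i].std())
    return features
```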

Human lupus nephritis (LN) and IgA nephropathy To demonstrate the ability of HistoLens to extend to other diseases and stain types, we further examined two additional datasets; abridged analyses are included in Supplemental Documents 3 and 4. The first dataset contained PAS-stained slides from patients with lupus nephritis (LN) who exhibited different responses to treatment (Complete Response (CR) and No Response (NR)). The second dataset consisted of Silver-stained slides from patients with IgA nephropathy (IgAN), in which HistoLens was employed to quantify differences in silver deposition between patients.

Murine HIV-associated nephropathy (HIVAN) model Next, we used HistoLens to characterize differences between diseased and WT groups in a mouse study set. The mouse cohort was an HIVAN model: Tg26 mice on the FVB/N strain background carry a transgene containing a gag-pol-deleted HIV-1 genome and manifest a sclerosing glomerulopathy (Fig. 4)35. We sought to test whether quantitative assessment and visualization in HistoLens could achieve a sufficiently granular analysis of the effects of experimental factors on tissue compartments. See Supplemental Table 2 for dataset composition details.

Figure 4

Example glomeruli from the HIV-associated nephropathy (HIVAN) dataset. (A) Nine example glomeruli taken from wild-type mice. (B) Nine example glomeruli taken from HIVAN mice. (C) Example of global and focal glomerulosclerosis in HIVAN mice compared with a wild-type mouse glomerulus. Scale bar indicates 50 µm.

Sources for other quantitative methods applied in this work, including references and MATLAB implementation documentation, can be found in Supplemental Table 3.

Approval for animal experiments

All animal studies were performed in accordance with protocols approved by the Laboratory of Animal Science Section (LASS) at NIH NIDDK, and are consistent with federal guidelines and regulations and in accordance with recommendations of the American Veterinary Medical Association guidelines on euthanasia. This study is reported in accordance with ARRIVE guidelines.

Approval for human experiments

Human data collection followed a protocol approved prior to commencement by the Institutional Review Boards at Johns Hopkins, the University of Florida, and the University of Michigan. All methods were performed in accordance with the relevant federal guidelines and regulations. Participants were required to be over 18 years of age and to have a diagnosis of chronic kidney disease that required renal biopsy. Vulnerable populations, such as minors, pregnant women, neonates, prisoners, and cognitively impaired patients, were not included. All patients provided written informed consent.

Results

HistoLens performance was tested using human and murine model data; details of the experiments are discussed under Experimental Design in the Methods section. A description of the datasets used—including sources, disease pathology, tissue thickness, staining, and image acquisition—is summarized in Supplemental Table 2.

Differentiating human DN, MCD, and AN

We first examined variations in PAS-stained renal glomerular mesangial color intensity among three human diseases: MCD, DN, and AN. Hand-engineered features that measure the color properties of the mesangial area include the mean and standard deviation of RGB intensities assessed on PAS-stained sections. Across this dataset, we found the standard deviation of blue intensities to be the most significant PAS + color feature among the three groups (Fig. 5) (ANOVA, p = 5.74 × 10−27; MCD vs DN, p = 0.35; MCD vs AN, p = 3.47 × 10−23; and DN vs AN, p = 1.43 × 10−15). In general, glomeruli from AN cases had a lower standard deviation in blue intensities than those from MCD cases. A similar trend was also observed for red and green values. AN and DN glomeruli generally had a more homogeneous distribution of color intensities in mesangial pixels, a finding likely driven by increased mesangial area, consistent with the known acellular/pauci-cellular mesangial expansion in these entities33.

Figure 5

Standard deviation of PAS + blue intensities in MCD, DN, and AN. (A) Violin plot of the standard deviation of blue intensities in PAS + regions. * indicates p-value < 0.05. (B) Example images taken from each distribution. In all three groups, glomeruli with a larger standard deviation of PAS + blue values have more variation in glomerular tuft architecture, including more capillary lumens (DN 2) or more nuclei (AN 2) compared to glomeruli with a lower standard deviation of PAS + blue values. Scale bar indicates 50 µm.

We next examined features relevant to mesangial expansion, which HistoLens quantifies using the distance transform. The distance transform efficiently calculates, for each pixel, the distance to the nearest pixel of an adjacent sub-compartment (e.g., nuclei or luminal space). Instead of a single value for the total mesangial expansion present in a particular glomerulus, we thus obtain a value for each pixel within the PAS + sub-compartment quantifying expansion at that point. These PAS + distance transform values are then binned into counts of pixels whose distance transform values fall within certain ranges. Using these subdivisions, we quantitatively mapped the cut-off point at which a large difference emerges between MCD, DN, and AN glomeruli (Fig. 6). At smaller PAS + distance transform ranges, we anticipated that the glomerular area spanning those values would be similar among the three diseases, relative to the PAS + areas of each glomerulus. Across the slides in this dataset, only a few glomeruli contained greatly expanded mesangium; these glomeruli were found predominantly in the DN and AN groups. We observed that the statistical significance of PAS + distance transform features increased for larger values, indicating that these features become increasingly significant in the presence of higher degrees of mesangial expansion.
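
A sketch of this binned distance transform computation, assuming a boolean PAS + mask for a single glomerulus and illustrative bin edges, is shown below.

```python
# Illustrative sketch of the binned PAS+ distance transform described above: the
# distance from each PAS+ pixel to the nearest non-PAS+ pixel, histogrammed into
# pixel counts per distance range. Bin edges are placeholders.
import numpy as np
from scipy import ndimage as ndi

def pas_distance_transform_bins(pas_mask, bin_edges=(0, 2, 4, 6, 8, 10, 12, 14, 16)):
    """pas_mask: boolean mask of PAS+ pixels within a single glomerulus."""
    # Distance (in pixels) from each PAS+ pixel to the nearest background pixel.
    distance = ndi.distance_transform_edt(pas_mask)
    counts, _ = np.histogram(distance[pas_mask], bins=bin_edges)
    return counts  # number of PAS+ pixels whose expansion falls in each range
```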

Figure 6

PAS + sub-compartment distance transform values delineate relative mesangial expansion. (A) Plot indicating differences in the number of pixels with PAS + distance transform values within a certain range. Statistical significance is indicated on the right axis, where the log p-values decrease with increasing levels of PAS + thickness. (B) Example images of PAS + distance transforms for a glomerulus from each class. The color bar indicates the value of the distance transform at each point; distance transform values below 8 pixels are set to transparent. Using the quantitative measures output by HistoLens, determinations of relative mesangial expansion were made. Scale bar indicates 50 µm.

There were noticeable differences not only between DN and AN versus MCD (as expected) but also between DN and AN (two sclerosing diseases). DN samples generally had higher mesangial pixel counts than the other two diseases, with MCD having the lowest values. This relationship is illustrated in Fig. 6B, where PAS + distance transform values greater than eight pixels are delineated on respective example images from each histologic class. The areas on each image highlighted in red correspond to the greatest values of the PAS + distance transform, overlapping with areas of nodular mesangial sclerosis. In the sub-classification of DN, these large, paucicellular/acellular sclerotic nodules are referred to as Kimmelstiel–Wilson nodules and have been used to delineate the transition from Tervaert class IIb to class III33.

After confirming histopathologic features that are already known to be clinically relevant, we expanded our search to uncover digitally encoded image biomarkers that could assist in making classification decisions between one or multiple classes. These additional features include quantitative characteristics that may be difficult for a human to appreciate in an image but can be assessed with digital feature quantification. A prime example of such diagnostic features is texture.

With a computational aid like HistoLens, the measurement and tabulation of texture features can be accomplished with the click of a button. In this way, users can concentrate on interpretation when a newly derived digital image feature is found to be statistically significant. Texture features are especially relevant in the comparison between sclerotic glomerular pathologies, such as DN and AN, and histologically unremarkable MCD. To distinguish a region of sclerotic from healthy mesangial matrix, pathologists look for areas with an increased distance between cell nuclei, with the space occupied by uniformly PAS + material. Figure 7 shows an example quantification of PAS + energy among MCD, DN, and AN with all pairwise p values < 0.05 (ANOVA, p = 6.15 × 10−27; MCD vs DN, p = 5.80 × 10−24; MCD vs AN, p = 5.79 × 10−18; and DN vs AN, p = 1.20 × 10−3). In the context of texture quantification, energy refers to the sum of squared elements in the gray-level co-occurrence matrix and is also known as the angular second moment or uniformity. From a qualitative perspective, we observed that glomeruli with higher textural energy had a greater number of capillary lumens and generally lacked sclerotic nodules. Figure 7 highlights image regions where the textural energy is highest in each glomerulus. From this visualization, it can be determined that glomeruli from MCD cases have a generally higher, more uniformly distributed textural energy compared to glomeruli from DN or AN cases.
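
For reference, the sketch below computes a co-occurrence-based energy value for the PAS + region of a glomerulus image; the gray-level quantization, offsets, and masking strategy are illustrative simplifications rather than the exact HistoLens feature definition.

```python
# Illustrative sketch of PAS+ textural energy (angular second moment) from a
# gray-level co-occurrence matrix. Masking non-PAS+ pixels to level 0 is a
# simplification; quantization and offsets are placeholders.
import numpy as np
from skimage import color
from skimage.feature import graycomatrix, graycoprops

def pas_energy(rgb_image, pas_mask, levels=32):
    gray = (color.rgb2gray(rgb_image) * (levels - 1)).astype(np.uint8)
    gray[~pas_mask] = 0                        # suppress non-PAS+ pixels
    glcm = graycomatrix(gray, distances=[1], angles=[0, np.pi / 2],
                        levels=levels, symmetric=True, normed=True)
    # "ASM" is the sum of squared GLCM elements (what the text calls energy);
    # scikit-image's "energy" property is its square root.
    return graycoprops(glcm, "ASM").mean()
```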

Figure 7

PAS + energy is an objective measure of mesangial texture. (A) Violin plot of PAS + energy values. * indicates a p-value less than 0.05. (B) Example glomeruli from each distribution with corresponding feature visualization focusing on nodular sclerotic regions with a higher energy value than surrounding regions in each image. Using HistoLens, the quantitative distinction between MCD glomeruli and both DN and AN glomeruli is appreciable through statistical analysis, example image extraction, and feature visualization. Scale bar indicates 50 µm.

Interactive glomerular sub-classification

We next demonstrate a pathologist’s use of HistoLens to aid in glomerular sub-classification for specific lesions using established quantitative definitions. For this analysis, we focused on nodular glomerulosclerosis in the DN, MCD, and AN study set. Nodular glomerulosclerosis can be identified by localized regions of mesangial expansion, and exists in both DN and AN, among other diseases36,37,38. Without HistoLens, annotation of nodular glomerulosclerosis involves searching through all WSIs or narrowing down the search according to some other clinical metric. This process is error-prone and inefficient, especially when multiple annotators are involved, and requires significant coordination and consensus. HistoLens, in contrast, enables users to define specific quantitative criteria for annotation and facilitates comparisons between multiple users’ labels for agreement.

As a first step, we identified key histologic features of glomeruli with nodular sclerosis. Because the primary identifying feature of nodularity is mesangial expansion, we initially selected hand-engineered features relating to PAS + region size. For example, the distribution of the number of pixels with a PAS + distance transform value between 13 and 15 pixels showed clear separation between MCD glomeruli and those from patients with DN and AN (Fig. 8A). After reviewing 50 glomerular images, we had annotations for nodular, non-nodular, and globally sclerotic glomeruli (see the red box in Fig. 8A). All glomeruli from the DN and AN study sets found inside the red box were deemed to contain nodular sclerosis, with visual confirmation by our pathologist collaborator, who pulled the images using HistoLens (see Fig. 8B and C). All glomeruli from the MCD cohort were deemed non-nodular. Next, HistoLens was used to represent each glomerulus with a larger set of features, visualized using principal component analysis. The Feature Distribution Plot in HistoLens was then used to identify probable nodular glomeruli near already-labeled nodular glomeruli (Fig. 8D), while our pathologist actively confirmed the labels of these new glomeruli. The proximity of points in this dimensionally reduced feature space indicated similarity in appearance.

Figure 8

Glomerular sub-classification via active collaboration between HistoLens and an expert pathologist. (A) Initial feature selection and manual annotation of selected glomeruli. (B) Feature visualization showing the regions of the image included in calculating the feature value. (C) Screenshot of an interactive session between the pathologist and the first author. (D) Using the relative positioning of labeled examples for bulk annotation. (E) Performance of a multilayer perceptron classifier on a validation set of labeled images; the trained model was applied to unlabeled glomeruli for labeling. (F) Merging disease and nodularity labels to examine the differential appearance of nodular glomeruli in DN and AN. This study highlights how HistoLens facilitates moving from slide-level annotations to dataset-level observations and hypothesis testing. Scale bars indicate 50 µm.

With a larger set of labeled glomeruli, we designed and trained a simple multilayer perceptron in HistoLens using all hand-engineered features and achieved an F1-score of 0.94 and an accuracy of 0.93 on labeled samples in a holdout set of glomeruli (Fig. 8E)29. The holdout set in this case comprised a random sampling of 20% of the original glomeruli. This classifier was then used to label the remaining unlabeled glomeruli. Finally, with the resulting glomerular sub-classification labels achieved with the aid of HistoLens, we analyzed the distinctions between glomeruli presenting with nodular glomerulosclerosis across disease types. One feature we found to be distinctive between nodular glomeruli in DN and AN was the number of nuclei. The difference in nuclei number is related to the pathological mechanism behind nodularity in the two diseases: in AN, the nodular appearance is caused by amyloid protein deposits in the glomerular tuft, whereas in DN, the nodularity is caused by the increased cellularity of an aggressively expanding mesangium38,39. This feature distinction is shown in Fig. 8F, which depicts the histopathological differences between respective glomeruli.

HIV-associated nephropathy mouse model

In the previous use-case experiments, we primarily focused on capturing the pervasive, systemic, and diffuse characteristics of each disease type across all glomeruli from each slide. In the Tg26 HIVAN model, however, glomerular pathological changes are known to be focal, with only select glomeruli showing changes corresponding to the focal segmental glomerulosclerosis (FSGS) lesions of human HIVAN, on occasion forming collapsing lesions40,41. Because Tg26 mice can have many normal-appearing glomeruli, the presence of outliers is sufficient to denote a disease phenotype. In our assessment, we focused on the PAS + sub-compartment features, as these features appeared to exhibit the greatest degree of spread between the Tg26 and WT genotypes (Fig. 9). Clustering of Tg26 and WT glomeruli using only the PAS + features was quantified using the silhouette score and is reported in the “Statistical Measures” tab of HistoLens (WT = 0.522; Tg26 = − 0.451). The silhouette score, a metric that quantifies intra-cluster similarity and inter-cluster variability, ranges between − 1 and + 1, where a high positive value indicates that members of a specific cluster are both very similar to each other and very different from members of other clusters24. The large negative value calculated for Tg26 glomeruli is indicative of greater intra-class variability compared with WT glomeruli.
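
The per-genotype silhouette values reported here can be reproduced conceptually with a short sketch, assuming a matrix of PAS + features and genotype labels; this is illustrative and not the HistoLens implementation.

```python
# Illustrative sketch: mean silhouette index per genotype, computed on PAS+ features only.
import numpy as np
from sklearn.metrics import silhouette_samples

def per_class_silhouette(pas_features, genotype_labels):
    """pas_features: N x D array; genotype_labels: N labels (e.g., 'WT', 'Tg26')."""
    sil = silhouette_samples(pas_features, genotype_labels)
    return {g: float(sil[genotype_labels == g].mean())
            for g in np.unique(genotype_labels)}
```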

Figure 9

Scatter plot of principal component analysis (PCA) dimensionally reduced PAS + features to differentiate Tg26 HIV-associated nephropathy (HIVAN) vs wild-type (WT) glomeruli. Relative clustering between WT and HIVAN glomeruli indicates the high degree of variation present in HIVAN mice, with the greatest outliers manifesting the most severe focal lesions. Scale bars indicate 50 µm.

In this use-case study, the quantitative influence of differences in staining color and image brightness was particularly pronounced. Examining the principal component plot for all features related to PAS + color revealed strong clustering according to the slide from which each glomerulus originated. Without the level of interactivity available in HistoLens, such dependence on small deviations in stain coloration may be mistaken for differences in pathology. Color normalization, using the method described by Macenko et al.21, was used in the primary experiments to circumvent significant staining differences. However, we noticed that, after color normalization, images took on an artificially muted appearance that suppressed other observations of mesangial texture. This issue was also noted in the human MCD, DN, and AN dataset, where a color shift is biologically linked to the binding of proteoglycans to amyloid fibrils and is a primary distinguishing feature between DN and AN sclerotic nodules42,43. In this work, we utilize highly specific quantitative measures of color from both diseased and control samples; further exploration of how to handle color and stain variation in the presence of disease, for the purpose of quantitative feature extraction and comparison, is needed. Although, in general, we observed that Tg26 mice exhibit lighter PAS + and nuclear staining than WT mice, some slides exhibited globally abnormal staining intensity and would be considered outliers within both groups. Due to this dependence on slide-level staining properties, we focused our analysis of this dataset on morphological features.
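
For readers unfamiliar with the cited normalization, the following sketch outlines the Macenko et al. approach of estimating stain vectors from the optical-density distribution of each image and rescaling pixel stain concentrations to a reference. The reference stain matrix, percentiles, and thresholds are commonly quoted defaults and may differ from the implementation used in this work.

```python
# Minimal sketch of Macenko stain normalization. All constants are commonly used
# defaults and are illustrative assumptions.
import numpy as np

def macenko_normalize(rgb, Io=255, alpha=1.0, beta=0.15,
                      ref_stains=None, ref_max_conc=(1.9705, 1.0308)):
    h, w, _ = rgb.shape
    if ref_stains is None:  # reference H&E stain vectors (columns), a common default
        ref_stains = np.array([[0.5626, 0.2159],
                               [0.7201, 0.8012],
                               [0.4062, 0.5581]])
    # 1. Convert RGB to optical density and drop near-white (low-OD) pixels.
    od = -np.log((rgb.reshape(-1, 3).astype(float) + 1) / Io)
    od_valid = od[(od > beta).all(axis=1)]
    # 2. Project onto the plane of the two largest eigenvectors of the OD covariance.
    _, eigvecs = np.linalg.eigh(np.cov(od_valid.T))
    plane = eigvecs[:, 1:3]                    # eigenvectors of the two largest eigenvalues
    proj = od_valid @ plane
    # 3. Robust extreme directions of the projected cloud give the two stain vectors.
    angles = np.arctan2(proj[:, 1], proj[:, 0])
    v_min = plane @ np.array([np.cos(np.percentile(angles, alpha)),
                              np.sin(np.percentile(angles, alpha))])
    v_max = plane @ np.array([np.cos(np.percentile(angles, 100 - alpha)),
                              np.sin(np.percentile(angles, 100 - alpha))])
    stains = (np.array([v_min, v_max]).T if v_min[0] > v_max[0]
              else np.array([v_max, v_min]).T)
    # 4. Unmix concentrations, rescale to the reference maxima, and reconstruct RGB.
    conc = np.linalg.lstsq(stains, od.T, rcond=None)[0]
    conc *= (np.array(ref_max_conc) / np.percentile(conc, 99, axis=1))[:, None]
    out = Io * np.exp(-ref_stains @ conc)
    return np.clip(out.T.reshape(h, w, 3), 0, 255).astype(np.uint8)
```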

These findings suggest that HistoLens can aid in determining whether mouse models mimic human phenotypic changes for various disease types, while also informing users of the presence of batch artifacts and mitigating such issues by focusing quantitative analysis on morphological features during validation.

Comparison of HistoLens with existing platforms

A variety of open-source platforms have been developed in recent years to aid in the analysis of digital pathology WSIs. Among these, ImageJ (v.2.1.4.0), CellProfiler (v.4.2.5), and QuPath (v.0.4.3) are the most widely used6,7,8,9. Like HistoLens, each of these stand-alone desktop applications allows users with varying levels of computational expertise to apply advanced image analysis approaches to complex biological data. These applications facilitate slide annotation, feature extraction, experiment tracking, and some machine learning functionality.

HistoLens differs from the aforementioned applications in that it implements extensive high-dimensional data visualization, including feature distribution plots of hand-engineered image features. HistoLens also allows for classification of structures and clustering analysis with user-provided metadata and is optimized to identify disease-related changes in histological datasets. This focus on structural sub-categorization and slide-level classification allows researchers to obtain insights into their annotated datasets. QuPath includes many useful operations for efficiently annotating images as well as some built-in segmentation, but it does not allow users to extensively quantify sub-compartments within segmented regions without additional plugin development that is not accessible to a non-technical user. CellProfiler allows very specific quantification of nuclei within images; however, it is optimized for cell-specific segmentation rather than tissue structures as an aggregate. Furthermore, the hand-engineered features calculated in HistoLens describe more pathologically relevant properties of tissue sub-compartments without relying on highly granular measures of image texture.

Discussion

HistoLens is a desktop application that enables pathology investigators to efficiently obtain a finely detailed, quantitative assessment of the structural characteristics of large histologic datasets. In addition to providing standard tools for data visualization, HistoLens offers several features that are designed specifically for histologic WSI data. These include the ability to pull representative images directly from annotated regions within WSIs and provide visual context for each of the 448 features that can be measured in HistoLens. In this work, we demonstrate the ability of HistoLens to validate known discriminative features in human kidney biopsies. Quantitative features that are used in existing clinical diagnostics were assessed and novel features were identified using computational image analysis techniques. These novel features include quantitative measures of texture, packing distribution of certain sub-compartments within object boundaries, and measurements of width and color. Visual evidence for these features was provided using visualizations of hand-engineered features, delivering localization for complex quantitative features.

Through the use of these specific and objective measures of biological structures, existing clinical diagnostic criteria can be updated to incorporate modern computational advances. Although advanced modeling techniques such as convolutional neural networks have proven adept at tasks like image classification, interpreting what features these models identify as important is a difficult task that is vulnerable to factors such as training set biases and the presence of noise (biologically irrelevant information). With these issues in mind, HistoLens is presented as a connection between the existing knowledge of biological relationships and complex computational methods. The ability to return to the image and validate preliminary observations is a critical function in computational pathology and implementing this function was a key design consideration in HistoLens.

Although this manuscript focuses on the analysis of renal histology, the same analytical procedures may be applied to stained tissue from any organ. By selecting the type of stain applied to the WSIs while processing a dataset, users can interactively segment sub-compartments and design features that quantify characteristics of the structures they are interested in. In Supplemental Document 2, an example is given for a mouse study that uses an alternative nuclear stain to localize a biologically meaningful cell type, and in Supplemental Document 4, this process is extended to the design of custom features that quantify the distribution of Silver stain in glomeruli of IgA nephropathy patients44. In Supplemental Document 3, we performed a preliminary analysis of LN cases comparing patients who responded to treatment with those who did not. Further extension of this tool to include other histological stains and organ-specific workflows will be addressed in future releases. Additionally, we will investigate the feasibility of deploying HistoLens as a cloud-based web application to reach more users without requiring download of specific releases. Initial deployment of HistoLens is limited to stand-alone desktop applications (for Windows, Mac, and Linux operating systems) so that users can leverage local data without sharing it externally, which can be beneficial when sensitive protected health information is used.

HistoLens has limitations. The desktop application presumes that structures are already annotated prior to sub-compartment segmentation and feature extraction. Future iterations of the software may extend functionality to other tissues, particularly those manifesting progressive fibrotic diseases. Annotation on a WSI is a complex procedure from a tool-design perspective; as mentioned previously, several companies have already devoted significant resources to the design and deployment of annotation software, both in web browsers and in other stand-alone tools. To best serve pathologist users who have invested significant time annotating on a particular platform, as well as computational scientists whose algorithms output specific annotation formats, HistoLens is designed to accept commonly used annotation formats as a secondary tool focused on quantitative feature extraction and analysis. Therefore, some features included in other software may not be present in HistoLens’ own WSI annotation tool. Nevertheless, the included WSI annotation tool makes HistoLens an all-in-one tool for interactive analysis. More details on annotating WSIs in HistoLens can be found starting on page 6 of the included User Manual. Expansion of this functionality, as well as the integration of automated segmentation algorithms, will be revisited in future releases as more user feedback is collected.

Increasing the accessibility of quantitative techniques to individuals working in human and animal pathology is critical to fostering collaborative engagement among researchers from different subspecialties and backgrounds. HistoLens empowers clinicians and researchers to answer a variety of questions, facilitating more objective, efficient, and effective workflows in clinical and research pathology.