figure a

Introduction

The ocular fundus, like the face, varies from person to person, and the variation is great enough to be used for personal identification [1]. This implies that the fundus contains several pieces of information that can identify an individual. Artificial intelligence (AI), particularly deep learning AI, can estimate age, sex, systolic blood pressure, smoking habits, presence of cardiac complications [2], refraction [3], and axial length [4] using only fundus photographs. In particular, age can be estimated with a mean absolute error (MAE) within 3.26 years [2], which is surprising to ophthalmologists. However, it is impossible to identify which factors are important for estimating the age of individuals, even when using attention heat maps or quantifying a few factors. Similar problems occur with chess, Go, and Shogi, in which deep-learning AI can predict the best move or the result; however, it is unknown why the predicted move is superior [5], and professional chess players need to study the reasons for AI moves. Thus, it is difficult for professionals to clarify the reason for selections made by deep learning AI; this is also called black box AI [6]. Furthermore, the fundus is more complicated than what is in these games because the eyes are natural objects.

Conventional statistical methods such as multiple regression analysis using many factors have been proposed as possible explanatory AI to unravel this black box [6]. Like the face, the fundus is unique and has many different features. For example, there are large individual variations in the angle or trajectory of the retinal vessels [7,8,9,10,11], the location and shape of the optic disc [12, 13], and the color of the peripapillary area [14,15,16,17]. The authors have previously reported that regression with L2 regularization (ridge regression) using these fundus parameters could determine sex with an accuracy rate of 77.9% in young adults (persons in their 20 s) [18], 63.2% in 8.5-year-olds [19], and 80.4% in a study population (> 40 years old) [20]. Sex-specific differences were also found because multiple regression analysis was used. The optic disc was oval-shaped, the retinal vessels were closer to the fovea, and the peripapillary color of the fundus was greenish in females than in males [18,19,20].

Therefore, the authors hypothesized that this method could predict age from fundus parameters and reveal specific age-related changes in the fundus. Thus, this study aimed to predict the age of older individuals using the regression analyses of their color fundus photographs (CFPs). The CFPs of 1,646 eyes from individuals ≥ 40 years old who were participants in the Kumejima population study were investigated to accomplish this.

Methods

Study population

The study adhered to the principles outlined in the Declaration of Helsinki and followed Japanese regulations. The protocol received approval from the Ethics Board of the Regional Council, and written informed consent was obtained from all participants prior to their involvement.

Kumejima island spans an area of 63.2 square kilometers and is situated in the southwestern region of Japan. It lies west of the main island of Okinawa and has a population of around 9000, primarily consisting of individuals from Okinawa Prefecture. The study was conducted from May 2005 to August 2006, during which all residents aged 40 and above were informed about the study’s protocol and invited to participate. In 2005, the official household registration database indicated that Kumejima had 5249 residents aged 40 and above. After excluding 617 residents for various reasons, a total of 4632 individuals were eligible for the study [21, 22].

Examination and diagnosis

The screening process involved a structured interview, the specifics of which were previously documented in a published paper [21, 22]. Ocular examinations were conducted by skilled ophthalmologists and examiners. These examinations included assessments of best-corrected visual acuity, spherical equivalent (ARK-730, Topcon, Japan), intraocular pressure measured with a Goldmann applanation tonometer, and axial length (IOLMaster, Carl Zeiss Meditec, Germany). Additionally, slit-lamp biomicroscopy, gonioscopy, ophthalmoscopy, fundus photography, and perimetry were performed. Sequential stereoscopic color fundus photographs (CFPs) with a 45-degree angle were obtained using a nonmydriatic digital ocular fundus camera system (Image Net TRC-NW7, Topcon, Japan). Detailed analyses of the optic disc, fundus, visual field examinations, and diagnosis of ocular diseases have been extensively described elsewhere [21, 22].

Measurement of fundus parameters

Following the method of our prior investigation, we measured 42 fundus parameters [20] (Fig. 1). The specifics of these measurements have been previously documented. We determined the angles of the supratemporal (ST) and infratemporal (IT) major retinal arteries (RA), as well as the angles of the retinal veins (VA) against the temporal horizontal line. Additionally, we measured the ST-RA, IT-RA, ST-VA, and IT-VA angles [7, 8, 11]. The papillomacular position (PMP) was defined as the angle formed by a horizontal line intersecting with a line connecting the optic disc center to the fovea [12].

Fig. 1
figure 1

Method of quantifying retinal vessel angles and papillomacular position (PMP), ovality ratio, and red–green–blue intensity. Red double arrows indicate supratemporal and infratemporal retinal artery angles (ST-RA and IT-RA). Blue double arrows indicate the supratemporal and infratemporal retinal vein angles (ST-RV, IT-RV). The white double arrow indicates PMP. The ovality ratio was determined by dividing the maximum diameter by the minimum diameter of the optic disc. The red-green-blue intensities and the tesselation fundus index (TFI) were calculated for each of the eight locations around the optic disc and fovea

We calculated the ovality ratio by dividing the maximum disc diameter by the minimum disc diameter [13]. Furthermore, we computed the mean red, green, and blue intensities within each area for the eight foveal and peripapillary circles. The tessellation fundus index (TFI) was determined using the mean red (R), green (G), and blue (B) intensities at each of the nine locations with the formula: TFI = R/(R + G + B) [14, 15].

These measurements were conducted utilizing color fundus photography (CFP) images and the ImageJ software (version 1.47, National Institutes of Health, Bethesda, MD, USA; available at http://imagej.nih.gov/ij/). The macro function of ImageJ facilitated semi-automated calculation of these 42 CFP parameters once the locations of the fovea, optic nerve head edge, and crossing points of the supralateral and inferolateral retinal arteries or veins were determined.

Statistical analyses

As in our previous studies, least absolute shrinkage and selection operator (LASSO) regression [23, 24] with 42 variables, RGB intensities, and TFI in nine locations (ST-RA, IT-RA, ST-VA, IT-VA, PMP, and ovality ratio) was used to predict age [25, 26]. LASSO regression with leave-one-out cross-validation was used to overcome the overfitting problem in ordinal statistical regression models, such as multivariate linear regression or binomial logistic regression, by applying a penalty to coefficients (L1 regularization), and the sum of the absolute values of the regression coefficients was regularized [23,24,25,26]. More precisely, let x ∈ R^p denote the variables and let y ∈ R denote the response (please note x_ij are normalized and y has mean zero). The Lasso algorithm solves the following problem:

$$\underset{\left(\beta 0, \beta \right)\epsilon {R}^{p+1}}{{\text{min}}}[\frac{1}{2N}\sum_{i=1}^{N}(yi- \beta 0-{x}_{i}^{Y}\beta {)}^{2}+ \lambda {P}_{\alpha }(\beta )],$$
(1)
$${P}_{\alpha }\left(\beta \right)=\left(1-\alpha \right)\frac{1}{2} \left|\left|\beta \right|{|}_{{l}_{2}}^{2}+ \alpha \right|\left|\beta \right|{|}_{{l}_{1}}$$
(2)
$$=\sum\nolimits_{j=1}^{p}[\frac{1}{2} (1-\mathrm{\alpha }){\beta }_{j}^{2}+ \alpha |{\beta }_{j}|]$$
(3)

In the Eq. (1), \(\underset{\left(\beta 0, \beta \right)\epsilon {R}^{p+1}}{{\text{min}}}[\frac{1}{2N}\sum_{i=1}^{N}(yi- \beta 0-{x}_{i}^{Y}\beta {)}^{2}]\) is identical to OLSLR and \(\lambda {P}_{\alpha }(\beta )\) is the penalty term for the shrinkage [23, 24].

The advantage of this approach lies in its ability to directly observe the effects of parameters in the optimal model, akin to ordinal multiple linear/logistic regression, unlike deep learning. The diagnostic performance of the LASSO binomial logistic regression approach was assessed using the leave-one-out cross-validation method. Here, each eye from the original dataset served as validation data iteratively, while the remaining observations (1645 eyes) were designated as training data [20, 27]. This process was repeated for each eye, resulting in 1646 iterations. Diagnostic accuracy was evaluated using the area under the receiver operating characteristic curve (ROC AUC). The final optimal model was determined using all 1646 eyes.

Pearson’s correlation analysis was employed to ascertain the relationship between actual and predicted ages, as well as the relationships between age and fundus parameters. Given the relatively large study sample, the significance level was set at p < 0.001. All statistical analyses were conducted using SPSS Statistics 19 for Windows (SPSS Inc., IBM, Somers, New York, USA) and R (version 3.1.3; R Foundation for Statistical Computing, Vienna, Austria).

Results

Color fundus photographs (CFPs) of acceptable quality were obtained from 3762 participants, totaling 7524 eyes. However, photographs from 376 right eyes and 421 left eyes were excluded due to conditions such as cataract, corneal opacity, large pterygium, or a small pupil. Additionally, eyes with a history of intraocular surgery (453 right, 443 left) were excluded. Eyes with a refractive error exceeding − 8 or + 5 diopters (14 right eyes, 11 left eyes), or those with optic disc diseases including glaucoma or retinal/brain diseases in either eye (711 right eyes, 679 left eyes) were also excluded.

Ultimately, both eyes of 2208 individuals were deemed suitable for analysis. Further details regarding the demographics of these eyes were provided in a previous report. [20, 28, 29] Only the right eyes were utilized for measuring fundus parameters. Among the 2208 normal eyes, 562 were excluded due to difficulties in measuring fundus parameters arising from unclear peripheral fundus areas or unidentifiable foveal regions. Consequently, 1646 right eyes were included in the analyses. Table 1 presents demographic data for the right eyes of eligible participants. All participants were over 40 years old and exhibited no significant myopia.

Table 1 Participant data

The MAE for predicting the age of participants was 3.64 years (95% confidence interval: 3.48–3.80) (Table 2). The correlation coefficient between the actual and predicted age was 0.88 (p < 0.001) (Fig. 2). The optimal model for age obtained using LASSO regression and Pearson’s correlation coefficient between the actual age and fundus parameters is shown in Table 3. Older eyes tended to have (i) larger ST-RA; (ii) higher red intensity in all areas; (iii) higher green intensity except in the supratemporal, temporal, and infratemporal areas; (iv) lower blue intensity except in the infranasal, nasal, supranasal, superior, and fovea areas; and (v) higher TFI except in the supranasal and nasal areas.

Table 2 Prediction accuracy of this and previous studies
Fig. 2
figure 2

Scatter plot analysis between actual and predicted age. The line represents actual = predicted values

Table 3 Optimal model for the age obtained using the LASSO binomial logistic regression and Pearson’s correlation between actual age and fundus parameters

The typical color fundus images of (a) a younger eye (actual age: 40 years, predicted age: 44 years, axial length: 23.17 mm) and (b) an older eye (actual age: 80 years, predicted age: 78 years, axial length: 23.27 mm) are shown in Fig. 3.

Fig. 3
figure 3

Typical color fundus images of younger and older eyes. a a younger eye (actual age: 40 years old, predicted age: 44 years old, axial length: 23.17 mm) and (b) an older eye (actual age: 80 years old, predicted age: 78 years old, axial length: 23.27 mm)

Discussion

The accuracy of age prediction using LASSO regression in the Kumejima population study was similar to that in a previous report of deep learning AI, also a cross-sectional study, with participants > 40 years old. In general, the prediction accuracy of deep learning AI outperformed that of the regression analysis. For example, in sex determination, deep learning AI had a 97% accuracy rate compared to 80.4% for regression analysis [2, 20]. One reason regression analysis was as accurate as AI for age prediction was that the participants in the AI study included eyes with fundus diseases such as diabetic retinopathy. Thus, the pathological findings may have masked age-related changes. Another possibility is that the prediction may have been highly accurate because no other fundus photographic parameters significantly affected age-related changes. This can be explained by the parameters measured in this study, especially fundus color intensity.

Pearson’s correlation coefficient showed that the red intensity tended to become stronger in older age groups. The retina [30] and retinal pigment epithelial cells [31] become thinner with age, suggesting that the increased visibility of the choroid is responsible for the red intensity. Additionally, tessellation tends to be stronger in older individuals. It has been reported that the choroid [32] becomes thinner with age, and the thinner the choroid, the stronger the tessellation [33]. These findings are consistent with the results of the present study.

The green intensity tended to become stronger in older age except in the supratemporal, temporal, and infratemporal areas. A typical fundus photograph of an older person shows a greenish color in the periphery (Fig. 3b). This greenish appearance of the periphery may be due to the presence of cataracts. Since all cases with poor fundus photographs were excluded, there were no cases of strong cataracts. However, since the lens becomes cloudy with age, the effects of cataracts are inevitable. If this is the case, it is not a true fundus change; nonetheless, because cataracts tend to become stronger with age, it could be said that some age-related changes are visible in fundus photographs. However, no published evidence supports this hypothesis. Hence, further research is needed to determine what happens to the greenish areas in the periphery of fundus photographs before and after cataract surgery.

The blue intensity tended to be weaker in older age groups, except in the superior, supranasal, nasal, and infranasal areas. Conversely, the blue intensity at the posterior pole tended to be stronger in younger patients. Blue light is mainly reflected from the retinal surface, and the retinal light reflex weakens with age [34]. These results may have been caused by decreased retinal surface reflex.

Since the angle of ST-RA is wider in older persons, the retinal vessel angle is unlikely to widen with age. The Kumejima Study was a 2005 cross-sectional study; the older the patient, the shorter the axial length and the more hyperopic the refraction [28]. The correlation coefficient between age and axial length in this study was -0.219 (p < 0.001), and that between age and refractive error (spherical equivalent) was 0.472 (p < 0.001). The arterial vessel angle tends to be larger, with shorter axial length and more hyperopic refraction [7]. Since this was a cross-sectional AI study [2], the vessel changes may be generational differences (cohort effect) rather than the aging effect. Further studies using fundus photographs from long-term cohorts are needed to determine true age-related changes.

Red, green, and blue colors tended to become stronger in the fovea of older individuals. Red and green areas tended to increase with age, as indicated by the peripapillary area; however, the opposite trend was observed for blue areas. The cause of this trend toward a blueish fovea in older participants is unknown. The TFI at the fovea tended to be weaker in older patients. Although foveal TFI has not been reported, more extensive macular TFI tends to be stronger in older patients. The retinal pigment epithelial layer in the macula thins with age [35], but that in the fovea thickens with age [36]. These results may explain the opposite trends in TFI between the macula and fovea.

Because the fundus parameters interacted with each other, the results of the LASSO regression differed in part from Pearson’s correlation results. Blue and green intensities may be more important than red intensities and TFI in estimating age because both blue and green showed similar trends, but red and TFI showed opposite trends for some factors. Our previous reports on sex determination using ridge regression analysis [20] and attention heap map analysis in an AI study 2 revealed that comprehensive judgment using many factors could achieve high accuracy. It is difficult for humans to make comprehensive judgments using many factors; however, multiple regression analysis can help identify important factors related to age-related changes in the fundus.

This study had some limitations. First, this was a cross-sectional study, similar to previously reported AI studies. Hence, future longitudinal cohort studies are needed to investigate the true age-related changes in the fundus. Second, this study was an epidemiological study of persons aged ≥ 40 years and thus did not include young adults aged 20–40 years. Therefore, the results for young adults are unknown. In addition, because the study was conducted in Japan, the results may differ in other racial groups. Finally, the exclusion criteria employed reduced the number of participants and thus the applicability of the results. However, the exclusion criteria were deemed necessary to ensure the quality of the fundus photographs to allow for the measurement of various parameters.

In conclusion, LASSO regression analyses using parameters from CFPs can predict age in the Kumejima population to the same extent as deep learning AI. Specific age-related changes in the fundus tended to be reddish and greenish, with weaker blue intensity in the peripapillary area. These age-related changes in the fundus may aid the understanding of the mechanism of fundus diseases such as age-related macular degeneration.