Introduction

The contribution of an unhealthy diet, including excessive fatty food, to development of fatty liver has been described over nearly two centuries [1]. Non-alcoholic fatty liver disease (NAFLD) is the most common form of fatty liver disease (FLD), with an estimated global prevalence of up to 24% [2] and causes a substantial economic burden to society [3]. The influence of rising obesity prevalence [4, 5], sedentary lifestyle [6], and unhealthy dietary patterns [7] on development of fatty liver has previously been reported in adolescents and young adults. Furthermore, dietary fats are considered to affect the pathogenesis of fatty liver [8].

Although some observational studies [9, 10] and dietary intervention studies [11,12,13] have attempted to clarify the cross-sectional relationship between dietary fats and fatty liver, there is no longitudinal study investigating the relationship between dietary fat intake and fatty liver occurrence and development in adolescents and young adult populations. In the Raine Study, a prospective association between the high-fat “Western dietary pattern” and ultrasound detected NAFLD in adolescents was found [7]. A feature of the western dietary pattern were foods high in fat such as take away foods, processed meats and fried potatoes; therefore, we aimed to test the hypothesis that high intake of dietary fat is prospectively associated with fatty liver in adolescents and young adults.

The primary aim of this study was to examine the association between baseline dietary fatty acids intake during early adolescence and subsequent longitudinal fatty liver trajectories as measured by fatty liver index (FLI) from adolescence to young adulthood. A non-invasive diagnostic model, such as FLI (based on waist circumference, body mass index (BMI) triglycerides and gamma-glutamyl-transferase) is considered an accurate and validated method of determining NAFLD in population-based epidemiological studies [14]. To achieve our primary aim, we identified distinct FLI trajectories from 14 to 22 years in the Raine Study and tested the association of early dietary fats in adolescents with these trajectories.

Methods

Study population

We utilised data from the Raine Study, a longitudinal cohort study in Perth, Western Australia that started as a randomised controlled trial to study the effects of frequent and repeated ultrasound scans on pregnancy outcomes. The background and methods of the Raine Study have previously been described [7]. Briefly, the original cohort of pregnant study participants (Gen1) was recruited between 1989 and 1992, at between 16 and 20 weeks gestation, resulting in 2868 live births. Follow-up assessments of the offspring (Gen2) cohort have been conducted approximately every 3 years. Approximately 70% of the Gen2 participants remained actively involved in the study at the 22-year follow-up. Clinical, biochemical and questionnaire data were collected from serial assessments during antenatal/perinatal stages, infancy, childhood, adolescence and adulthood [15].

Laboratory examination and physical measurement data from the 14-, 17-, 20- and 22-year follow-ups of the cohort were used in this study (see Fig. 1 for detailed information on participant recruitment).

Fig. 1
figure 1

The Raine Study participant recruitment in this study detailed flowchart on participants in this study. Laboratory examination and physical measurement data from the 14, 17-, 20- and 22-year follow-up of the West Australian Pregnancy Cohort (Raine) Study are used to identify distinct FLI trajectories

Institutional ethics committee approval was obtained from the University of Western Australia Human Research Ethics Committee. Signed informed parental or primary carer consent during Gen1 pregnancy, Gen2 childhood and adolescence, and subsequently by Gen2 adolescents and young adults were obtained before participation in each assessment.

Dietary fat intake assessment

A dietary fat intake assessment at 14 years was obtained from a validated semi-quantitative food frequency questionnaire (FFQ-14) developed by the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Adelaide, Australia, as previously described [16] and evaluated [17]. The parents or primary carer completed the FFQ-14 as being representative of usual dietary and nutrient intake in the previous 12 months [18]. The study nurse checked FFQ-14 responses during the clinical follow-up to clarify responses. All data from the FFQ-14 were verified by CSIRO twice, and Australian food composition data were applied to obtain estimates of usual food and nutrient intakes [19]. These included estimates of macro and micronutrients, including specific dietary fats [18] (TFA, total fatty acids; SFA, saturated fatty acids; PUFA, polyunsaturated fatty acids; MUFA, monounsaturated fatty acids; total omega-3 fatty acids (including alpha-linolenic acid and long-chain n–3), n–3; total omega-6 fatty acids (including gamma-linolenic acid and linoleic acid), n–6; long chain omega-3 PUFA (Eicosapentaenoic Acid, EPA + Docosapentaenoic Acid, DPA + Docosahexaenoic Acid, DHA), n–3 LCPUFA; and the ratio of total n–6 to total n–3 fatty acids, n–6:n–3). The FFQ-14 did not collect dietary fish oil supplement intake.

Dietary misreporting

Dietary misreporting can contribute to measurement error in the analysis of diet–disease relationships [20]. Potential dietary misreporting in the Raine study was determined using the Goldberg method [21], which estimates the cut-offs for plausible reporting based on energy intake relative to basal metabolic rate. These cut-off values classify study participants as under-, plausible- or over-reporters of dietary intake. The method has been used widely to identify misreporting from dietary surveys and studies [22]. Because dietary underreporting may be strongly associated with the risk of overweight [23], we considered excluding under-reporters. However, excluding under-reporters removes participants at the highest risk, reducing our sample size considerably. We, therefore, created a categorical variable for misreporting where all participant data were included in the analysis. This categorical variable for misreporting was included as a covariate in our regression models. Similarly, when we summarise participants’ dietary fat intake at baseline, there is a significant difference in the proportion of dietary misreporting between trajectory groups. Therefore, we show the dietary data for plausible reporters only in Table 2 and data of the entire cohort with all categories of dietary reporting included in supplementary Table 3.

Anthropometric and laboratory measurements

At all years, a trained research assistant weighed and measured participants in light clothing for height and weight using a calibrated stadiometer and electronic chair scales. Body mass index (BMI = weight (kg)/square of the height (m2)) was calculated, and subjects were categorised as underweight, normal weight, overweight and obese using the International Obesity Task Force (IOTF) criteria at 14 years [24, 25]. Blood was taken by a phlebotomist at the home of the participants from an antecubital vein after an overnight fast. Laboratory assessments were performed in the PathWest Laboratories, Perth, for serum glucose, insulin, alanine transaminase (ALT), aspartate aminotransferase (AST), gamma-glutamyl transferase (GGT), triglycerides (TG), total cholesterol, high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C), ferritin, transferrin saturation, high-sensitivity C-reactive protein (hs-CRP), adiponectin, serum glucose, serum insulin and leptin.

Fatty liver index trajectories

The lack of an accurate, non-invasive, easily accessible and affordable diagnostic test is an important factor limiting research regarding fatty liver in the general population. The FLI is an algorithm based on WC, BMI, TG and GGT. The index was initially developed to detect fatty liver [26]. It is a relatively reliable tool for fatty liver research, recommended by a group of European Societies for diagnosis of fatty liver in epidemiological studies [14] and has been validated in the Raine Study cohort [27]. The FLI has been shown to have the best calibration performance compared to other prediction models attempting to identify fatty liver disease. This suggests that the FLI has a good ability to predict the risk of fatty liver among study participants at the individual level. We hypothesised that FLI is useful as a continuous variable in repeated measurements of longitudinal data to detect changes in fatty liver risk. To answer our research question, we applied a trajectory model to describe the natural history of fatty liver over eight years in Raine Study participants from adolescence (14 years) to young adulthood (22 years) using the FLI (Fig. 2).

Fig. 2
figure 2

A group-based trajectory analysis of fatty liver index from 14 to 22 years (985 participants) in the Raine Study. The main dots on each trajectory represent, from left to right, the follow-up nodes from baseline (14 years) to 17, 20 and 22 years. Dotted lines either side of the trajectories are 95%CI curves. SL stable low risk, (N = 782); LH low risk to high risk, (N = 132); SH stable high risk, (N = 71)

Covariate assessment

In the multivariate analyses of this study, there were three main types of model covariates. (1) Energy-adjusted variables, such as total energy and dietary misreporting (Model 1). (2) Demographic background variables of the cohort, such as sex, screen time and family income (Models 2 and 3). 3) Other dietary nutrient exposures and potential mediating variables included dietary patterns, total carbohydrate intake, total protein intake and homeostasis model assessment for insulin resistance (HOMA-IR) (Models 4, 5 and 6).

At each follow-up, medical status was reported, and participants with chronic diseases or use of potentially steatogenic medications (such as amiodarone and methotrexate) were excluded from the analysis. The primary caregivers of the participants were asked to report their annual family income (in Australian dollars) at the 14 year follow-up (2003–2006) and categorised as: < $ 30,000, $30,001–50,000, $50,001–78,000, and > $78,000. Parental education level and BMI were not assessed at our baseline follow-up (14 years). Also, at the 14-year follow-up, hours of physical activity per week were assessed through the International Physical Activity Questionnaire (IPAQ). Participants or their primary caregiver reported the time and frequency spent in exercising vigorously during physical education at school and outside school. Screen time, including time spent using a computer, was collected through a questionnaire with response options from ‘none at all’ to ‘4 h or more’ per day by participants’ primary caregivers. Considering that an analysis-adjusted sample using the physical activity level variable would lose much of the sample relative to the computer viewing (sedentary activity) variable, especially in some of the higher metabolic risk groups. We used the adolescent computer viewing (sedentary activity) variable for the analysis in order to obtain a larger sample with metabolic outcome. Therefore, screen time was used for the analysis to obtain a larger sample with more metabolic outcomes. The Raine Study has identified [16], evaluated [17] and applied [7] Western and healthy dietary patterns score in previous studies. In brief, these dietary pattern scores were developed using dietary data collected from the CSIRO FFQ at 14 years of age from 38 pre-defined food groups, with significant differences in fat and sugar intakes between the healthy and western dietary patterns [16]. Dietary pattern scores were adjusted as covariates in the regression models. In addition, other independent risk nutrients such as total carbohydrate (CHO) and total protein intake from the CSIRO FFQ-14 data were adjusted in the models. The HOMA-IR score was calculated as follows:

$$\begin{gathered} {\text{HOMA}} - {\text{IR score}} = {\text{ Fasting serum glucose }} \hfill \\ \,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,*{\text{ Fasting serum insulin}}/{22}.{5} \hfill \\ \end{gathered}$$

Statistical analysis

Continuous descriptive data with normal distributions are presented as means ± standard deviation (SD); non-normally distributed data are reported as medians and interquartile ranges (IQRs).

Trajectories were estimated with group-based trajectory modelling (GBTM) [28]. The trajectories were derived by modelling the FLI as a function over time (participants’ age at each follow-up measurement). Data requirements for participants in the model were limited to having FLI data at the 14-year follow-up and at least one other follow-up (17, 20, or 22 years).

The best-fit model was based on the Bayesian Information Criterion (BIC) and the presence of a minimum of 5% of participants per trajectory to ensure stable estimates [28, 29]. After setting the number of trajectories and non-significant quadratic and cubic terms were removed, the best optimal model was estimated by fit indices, pragmatic evaluation, and clinical relevance. Fit indices included BIC and Akaike's information criterion (AIC), with the lower the indices, the better the model fit [28] (Supplementary Table 1). After the model was built, we tested its accuracy with the conditions suggested by Nagin [28]: the average posterior probability of assignment of each group was 0.7 or higher; the odds of correct classification was 5.0 or higher; the proportion of samples allocated to a particular group was close to the proportion estimated by the model, and the 99% confidence interval for the estimated proportion was reasonably narrow (Supplementary Table 2).

Dietary fat exposures were converted to Z-scores in the log multinominal regression models to obtain appropriate clinical interpretations. The total fats and sex interaction effect was not significant in our study (P values for interaction effects: 0.080 in low–high FLD risk group to Stable-low FLD risk group regression and 0.084 in stable-high FLD risk group to Stable-low FLD risk group regression). However, sex interaction effects of MUFA and n–3 dietary variables were significant in regression models for the stable-high to stable low FLD risk group (Supplementary Table 8). The conclusions of our analysis are based on the combination of the entire dataset for males and females; however, results of sex subgroup analysis are given in the supplementary material (Supplementary Table 9–15, Supplementary Figs. 1–3). Log multinominal regression analyses were conducted to estimate relative risks (RRs) and 95% confidence intervals (CIs) with multiple attributes [30] of three distinct FLI trajectories (stable-high FLD risk SH; low–high FLD risk LH) using the “stable-low FLD risk, SL” trajectory groups as a reference. RR and 95% confidence intervals (CIs) were reported for each FLI trajectory as follows: model 1 adjusted for total energy intake plus dietary misreporting; model 2 adjusted for model 1 plus sex; model 3 adjusted for model 2 plus computer viewing, and family income. We used the standard multivariate model instead of nutrient density for energy adjustment. This is because if nutrient density is used, nutrient intake will be confounded by total energy intake (in the opposite direction) as it is divided by total energy when the disease is also associated with total energy intake. BMI was not in the model because the already adjusted covariates of total energy intake, dietary misreporting and BMI were highly correlated, and additional adjustment for BMI could not be fitted to the model.

All analyses were performed using Stata 15.0 for Windows (Stata Statistical Software: College Station, Tx, USA). P values < 0.05 were regarded as statistically significant.

Results

Our study population comprised 985 Gen2 adolescents aged 14 years. Participant characteristics at the 14-year follow-up are presented by different FLI trajectory groups in Table 1. The overall characteristics of the cohort showed 52% were males, 89% were of Caucasian maternal race, and 42% of the cohort had an annual family income over $70,000 at baseline.

Table 1 Characteristics of study population at baseline (14 years) by trajectory groups from 14 to 22 years in the Raine Study

From the 985 eligible participants, three different FLI trajectories were identified (Fig. 2) and labelled as “stable-low fatty liver risk, SL” (79.1%, N = 782, trajectory 1), “low-to-high fatty liver risk, LH” (13.9%, N = 132, trajectory 2) and “stable-high fatty liver risk, SH” (7%, N = 71, trajectory 3). The rising FLI trajectory indicates an increased risk of fatty liver for the individual in the SH group. The majority of participants (79%, N = 782) maintained a SL fatty liver risk from 14 to 22 years, while 7% (N = 71) and 14% (N = 132) were categorised in SH and LH fatty liver risk groups, respectively. For the demographic characteristics in the trajectory groups at baseline (Table 1), the SH group had the highest proportion of males (57.8%), Caucasian maternal race (93.0%) and annual family income less than $35,000 (50.7%). The SH group also had higher serum ALT, GGT, triglycerides, insulin and lower HDL-cholesterol, which may represent increased cardiometabolic risk. Lifestyle characteristics, such as dietary misreporting and BMI, were significantly different between the three trajectory groups. In the SH group, nearly 46% were under reporters as identified from the dietary misreporting analysis. The majority of subjects in the SL group (nearly 84%) were categorised as being within the healthy weight range. Progressively, the proportion of healthy weight versus overweight was similar in the LH group (40 vs 42%, respectively). Furthermore, 97% of participants in the SH group were categorised as overweight or obese. Among the laboratory biochemical indicators shown in Table 1, the SH group had the highest mean levels of all the biomarkers other than AST and HDL-C.

The baseline dietary fat intake characteristics are shown in Table 2 for plausible reporters only. Considering the group variations in energy requirements, the percentage of energy intake from specific TFA, SFA and MUFA categories of dietary fats are also reported. Energy intake from the fat categories n–3 LCPUFA, n–3 and n–6 are not shown because energy levels of these nutrients were very low. All indicators of fat intake characteristics (including absolute value and percentage of energy) were non-normally distributed. For most indicators, except for n–3 and n–3 LCPUFA, the SH risk group had the highest median intake of all dietary fats and percentage of energy compared to the SL and LH groups. For n–3 and n–3 LCPUFA, the LH group had the highest median intake level (total n–3 1.3 g/day and n–3 LCPUFA 288 mg/day). Except for SFA, the absolute value of most fat categories (TFA, PUFA and MUFA) showed the same trend as the percentage of energy. However, for SFA, the median absolute value of SFA in the LH group is moderate but accounts for the lowest median energy percentage (absolute value: 41.5 (g/day), energy percentage: 14.4%).

Table 2 FFQ dietary fat intake characteristics of participants at age 14 years presented as median and interquartile ranges (Q1, Q3) (Plausible reporters only) in the Raine Study

Table 3 shows the results from multinomial analysis of the correlation between baseline dietary fats and FLI trajectories with the SL group as the reference group for each dietary fats exposure. The RR of the multinomial analyses shows n–3 LCPUFA and N-3: N6 at baseline (14 years) were significantly associated with the FLI trajectories after adjusting for all covariates. N-3 LCPUFA in the LH group relative to the SL group has a RR of 1.27 (95% CI 1.07–1.51) in model 5. Compared to the SL group, N-3: N6 in the SH group has RRs of 1.37 (95% CI 1.10–1.70) in model 5.

Table 3 Log multinomial regression models for each dietary fat (Z-score) at 14 years and different fatty liver index trajectories in the Raine Study (stable low risk as the reference group)

Supplementary Table 3 shows that the intake of dietary fats was lower in the SH group compared to that of the LH and SL groups. This may be due to the 46.5% under-reporting in the SH group (Table 1). Nevertheless, this relationship is different in n–3 fatty acids and n3 LC-PUFA, with the intake of LH group the highest, while the intake of SH group is lower than that of LH group. Supplementary Table 4 shows the results of regression analyses for the DPA, EPA, DHA, ALA and LA. In a regression model that incorporated DPA, EPA and DHA, none of the fatty acids were significantly associated with the LH group (supplementary Table 5). Supplementary Table 6 indicates the component characteristics of n3 LCPUFA intake (plausible reporters only). DPA had the highest median level in the LH group among the three trajectory groups.

Sensitivity analyses

We considered the interaction of gender and the robustness of the resulting trajectory modelling (Supplementary Table 8) with the same criteria and process for males (N = 507) and females (N = 477) as a subgroup (Supplementary Tables 9 and 12). The results show that, similar to the whole population, within the subgroups, trajectories fall into three categories, i.e. a stable low-risk group, a low- to high-risk group, and a stable high risk group (Supplementary Fig. 1, 2 and 3). The relative risk of dietary exposure and trajectories were tested in the same way (Supplementary Table 11 and 15). Subgroup analyses suggest that the association between n–6 and N6: N3 as a population risk for the SH-SL group may be driven mostly by males (RR of n–6: 1.45 (1.02–2.06), RR of N6: N3: 1.16 (1.07–1.25)) (Supplementary Table 15). In addition, a risk association for monounsaturated fats in the SH-SL group was observed in the female subgroup analyses (RR of monounsaturated fats: 3.41(1.26–9.21)) (Supplementary Table 11). The n-3 LCPUFA as a risk for the LH group (compared to SL group) was predominantly driven by males. Furthermore, in the male and female SH groups (compared to SL group), n–3 LCPUFA showed contrasting effects (RR in male: 0.69 (0.48–0.97); RR in female: 1.46 (1.03–2.07)). In addition, we extended the regression analysis to other individual nutrients (e.g. sugars, dietary fibre and red meat) (Supplementary Table 16) in order to gain a more comprehensive understanding and identify potential interactions between different nutrients. After further adjustment for these nutrients the results were similar to the model results in the original manuscript, other than the effect of red meat on n–3 LCPUFA in the low- to high (LH) risk group (RR: 1.15 (0.97–1.38)). This may be related to the fact that red meat is a major source of n-3 LCPUFA.

Discussion

We identified three different fatty liver index trajectories from a young population in Western Australia and observed a dietary fat intake relationship with these trajectories. The majority of participants were in the SL group, with a sustained low FLI. The SH group had FLI at a relatively persistent high level, while the LH group changed from low- to high-risk for FLD during follow-up. After controlling for total energy, dietary misreporting, sex, computer viewing and family income, intake of n–3 LCPUFA, n–6 and N6: N3 at baseline were significantly associated with prospective and longitudinal FLI trajectories over eight years of follow-up.

Our study provides a unique insight into the relationship of baseline dietary fatty acid intake on the natural history of fatty liver as assessed by trajectories of FLI from adolescence to young adulthood. We show that fatty liver development follows specific trajectories over time in young populations. FLI, as a predictor of fatty liver, has been widely used in epidemiological studies as an alternative diagnostic tool for fatty liver. The FLI incorporates BMI and our data show that adiposity is the main driver of the FLI trajectories. The prevalence of fatty liver based on FLI in 17-year-old adolescents in the Raine Study was 11.6% [27]. Meta-analyses have shown that the prevalence of FLD or NAFLD was 11.3% in children 10–14 years [31], 17.3% in adolescents 15–19 years [32] and 24% in young adults 18–35 years [33]. Our FLI trajectory data show a similar fatty liver disease trend from adolescents to young adults. These findings are worthy of highlighting given the chronicity and severity of fat accumulation in the liver is associated with subsequent risk of type 2 diabetes and cirrhosis later in life [2]. Our findings also demonstrate the importance of the transition from adolescence to young adulthood as a critical period for health promotion and health intervention to reduce the impact of chronic liver disease and cardiometabolic risk.

Our main finding is that a diet high in n–6 fatty acids or a high N6: N3 ratio at 14 years of age associates with an increased risk of FLD from adolescence to young adulthood. This implicates n–6 fatty acids as a potential risk factor in the development of FLD although this association may be influenced by dietary patterns and the intake of carbohydrates and protein in the diet. Moreover, these findings are similar to previous studies in which cross-sectional associations between N6: N3 and obesity [34] and fatty liver risk [35] were observed. In a typical Western diet, the ratio of N6: N3 is usually 15–16:1, rather than the recommended 1–4:1, which is considered the healthy range [36].

The association between N6: N3 and risk of FLD was robust in our study. Nuts and seeds are rich in n–6, which is why large amounts of n–6 can be found in refined vegetable oils such as palm oil and sunflower oil and foods cooked with these vegetable oils. Vegetable oils are a widely used source of fatty acids in the food industry for processed snacks, fast foods, cakes and cured meats that are also rich in n–6 and high in the ratio of N6: N3 [37]. Our study did not support saturated fatty acids as a risk factor for FLD. In the LH group, the risk of FLD was reduced if total carbohydrate and total protein were replaced with the same amount of energy from saturated fat to provide energy (Model 5). Disordered physiologic states related to the metabolic syndrome are recognised risk factors for the development of fatty liver disease in children and adults, and insulin resistance is thought to be a key pathogenic mechanism in the development of FLD [38]. Results from analyses adjusted for HOMA-IR suggest that insulin resistance may not significantly influence the association between dietary fatty acid intake and FLI trajectory.

Another interesting finding suggested here is that there could be a positive association between n–3 LCPUFA and liver fat. Whilst interesting, these data do not accord with clinical trials and meta-analyses that suggest n–3 LCPUFA may benefit fatty liver [39]. We consider this may be related to the misclassification of n–3 LCPUFA using FFQ data. It was noted that when using FFQ data to identify individuals with low n–3 PUFA intake, the FFQ was able to better capture DHA and EPA intake as opposed to ALA and total n–3 when compared to red blood cell or plasma fatty acid levels [40, 41]. Furthermore, dietary fatty acid nutrient intake levels and their food sources are positively correlated with body weight or BMI levels, meaning that heavier individuals consume more nutrients to maintain a balanced energy metabolism. This makes the weight factor a possible mediator in the study of nutritional epidemiology and metabolism-related outcomes. As BMI was one of the main drivers of FLI trajectories in our study, the potential positive association of fatty acids and their food sources with BMI could influence the risk of developing FLD. On the other hand, when individual intake levels were examined using the FFQ, overweight or obese individuals were more likely to under-report nutrient intake (as determined from the data shown in Table 2 and Supplementary Table 3). Thus, individuals with more severe metabolic outcomes (typically higher BMI) appear to have lower dietary intakes. We identified three trajectory groups with an increasing distribution in terms of metabolic outcomes (SH more severe than LH and LH more severe than SL) and dietary misreporting was progressive when underreporting was considered (more severe for SH than LH and more severe for LH than SL). This in turn reinforced the relative risk for the LH group as misreporting was more severe in the SH group than in the LH group. These findings suggest that the effect of n-3 LCPUFA may be overestimated, as the LH group had the highest level of n-3 LCPUFA FFQ intake level regardless of whether dietary misreporting were taken into account (Table 2 and Supplementary table 3). In addition, sensitivity analyses suggested that this association, while persisting across the many adjusted nutrient models, was no longer significant after adjustment for red meat. The association between red meat consumption and NAFLD risk is increasingly being reported [42,43,44] although fish and other seafood are the main dietary sources of LCPUFA. However, the 1995 National Nutrition Survey from Australia showed that close to 43% of LCPUFA intake came from meat, poultry and game products and dishes [45], and among Australian teenagers this proportion increased further to 49%, surpassing that of seafood sources (37%) [46]. The latest data from the Australian Health Survey 2011–2012 showed a similar result with the proportion of Australian adolescents consuming LCPUFA from meat, poultry and game products and dishes (33%) slightly exceeding that of seafood sources (32%) [47]. Red meat as a major source of n-3 LCPUFA is one possibility that may mediate the association found in this study. It is also important to note that the association between n–3 LCPUFA and FLD risk was only present in the LH risk group, which suggests heterogeneity of the association between nutrients and FLD risk in the population.

Our study validates findings in nutritional epidemiology regarding dietary misreporting, which is often associated with BMI status, educational level, and age [48, 49] and can be especially vulnerable to bias when we evaluate dietary-metabolic disease associations. Differences in dietary characteristics between plausible (Table 2) and implausible (Supplementary Table 1) populations may result from overweight and obese people under-reporting their intake of dietary fats [50].

Strengths and limitations

To our knowledge, this is the first study to explore the association between dietary fat intake of adolescents with the longitudinal natural development of fatty liver into young adulthood. The strengths of our study include the use of data from a large cohort with prospectively collected dietary information minimising recall bias and representative of the general adolescent and young adult population in Australia. Our study focus is on the natural history of FLD risk from adolescence to young adulthood and uses an objective and validated fatty liver screening tool for longitudinal data analysis over 8 years.

Our study has several limitations. First, proper interpretation of the results of the study requires consideration of measurement errors in self-reported dietary data, unavoidable when using an FFQ [51]. We minimised this impact by applying a less biased instrument (3 day food record) and performing an internal calibration of the FFQ data [17], finding relative validity. Additionally, dietary misreporting was introduced into the analysis as a covariate factor to achieve better statistical modelling. The energy adjustment method was further applied to minimise the impact of FFQ measurement errors in statistical modelling. Second, we used FLI, a non-invasive, easily calculated but indirect marker that combines anthropometric indices and biochemical parameters to predict the risk of fatty liver disease. Although useful for large-scale epidemiological studies, it is not possible to distinguish whether any association between dietary intake and FLI is independent or mediated through body weight status (body mass index and/or waist circumference), which primarily drives FLI scores. Finally, dietary fatty acid exposure in our study was assessed only at baseline age 14 years and is strongly associated with other nutrients and beverage intake that were not assessed as potential confounders. Dietary habits during adolescence may change over time. The use of only one prospective assessment of dietary intake is a limitation of our study. We have also not accounted for degrees of physical activity.

Our study reveals that aspects of dietary fat intake are associated with the future risk of FLD in young populations. While dietary fat consumption at age 14 years is unlikely to independently lead to an increased risk of FLD 8 years later, dietary habits established during adolescence are likely to persist into adulthood, including into the reproductive years [52]. Interestingly, we examined changes in the trajectory of dietary patterns from adolescence to early adulthood (from 14, 17 to 20 years of age) in the Raine study Gen2 population [53] and found that 21% of men consuming mainly the Western Dietary Pattern Score had a stable, significant growth trajectory over time suggesting that dietary patterns established during adolescence are likely to persist into early adulthood, especially among males supporting the idea that specific populations can still benefit from early dietary interventions. Therefore, family diet education or intervention may be a viable way to prevent or reduce the rising prevalence of fatty liver among younger populations.

Future nutritional epidemiological studies should focus on the pathways by which n–6 fatty acids are associated with FLD, such as the mediation of inflammatory factors. Additionally, a focus on risk factors for the development of FLD is recommended, such as validating the relationships between EPA and metabolism-related outcomes in larger populations.

Conclusions

Our study shows that the FLI trajectory from adolescence to young adulthood is a critical time for the development or continuation of fatty liver. Certain aspects of fatty acid intake in early adolescence are associated with FLI trajectories from adolescence to young adulthood. We conclude that for those at high risk of fatty liver in early adolescence, there may be potential benefits from including a lower N6:N3 fatty acid diet and reducing foods high in N6 to improve the fatty acid balance in the diet. However, great caution should be used in extrapolating the data from this observational study to dietary advice due to the many limitations outlined above.