1 Introduction

The Internet has become an indispensable tool, particularly for young people. For those who have been surrounded by digital technology since birth, it is not only an information tool, but a major innovation that has conditioned how they spend their leisure time and engage in non-leisure activities. The Internet has been fundamental in society’s development; it created a new dimension for digital natives (Prensky, 2001) and has also made it possible to digitise social and paperwork structures that were traditionally “physical” by promoting “virtual mobility” and allowing certain daily activities to be carried out through telework, telehealth and e-learning (Mouratidis & Papagiannakis, 2021).

In this context, the Internet has not only enabled young people around the world to stay in touch with each other, but has also provided them with new learning opportunities, as it is an endless source of information that can speed up the educational process. It allows the creation of “educational environments” that complement the traditional system and fill educational gaps that may be left by face-to-face education. The Internet can even promote an active and independent type of learning adapted to students’ characteristics and their own learning pace (O’Flaherty & Phillips, 2015).

In addition to learning, the Internet is a means of entertainment and communication (Zhang et al., 2018) that has enabled young people around the world to develop and nurture relationships while strengthening their sense of community (Pendry & Salvatore, 2015) and social well-being (Castellacci & Tveito, 2018; Alivernini et al., 2019). Furthermore, using different mobile devices from an early age enables the acquisition of so-called digital skills, such as the ability to search for and evaluate information (Van Deursen & Van Dijk, 2008), which can be extremely useful when writing reports or completing assignments (Pagani et al., 2016).

The Internet has become indispensable in teenagers’ lives, but its excessive or inappropriate use also has undesirable consequences for young people, especially if we consider that they are more likely to develop a certain degree of Internet addiction than adults (Fineberg et al., 2018; Ko et al., 2012). Excessive use can lead to withdrawal and weaker social skills as well as mental health and family problems (O’Day & Heimberg, 2021; Song et al., 2019; Twenge, 2017). Besides, such dependence can negatively interfere with the educational process and consequently reduce academic performance (Azizi et al., 2019; Kates et al., 2018; Koca & Berk, 2019; Sengupta et al., 2018; Wammes et al., 2019).

Given the relevance of the Internet in our lives, it seems reasonable to question whether its use from an early age can negatively affect a person’s psycho-emotional educational and professional development and, specially, if we take into account that late childhood and adolescence are critical stages in human life, as teenagers are supposed not only to develop educational and career goals, but also to ask themselves who they are and who they want to be (Verhoeven et al., 2019).

Therefore, the aim of our study is to identify the profile of students who are able to maximise their academic performance in reading, mathematics, science and English given the different ways in which they can use the Internet. That said, we focus on Spain, where 91.4% of households had a fixed or mobile broadband Internet connection and 92.9% (INE, 2019) of children aged 10–15 use it in 2019, with the average time spent on the Internet being 3 or more hours per day in 2019 (Qustodio, 2019). Particularly, within Spain, we will use a recent database from the Spanish region of the Canary Islands which collected the census of primary school students in 6th grade in 2018–2019.

We have to bear in mind that, according to the latest Programme for International Student Assessment (PISA) report (2018), the Canary Islands are at the bottom of the Spanish educational ranking. For instance, Canarian students scored 19 points below the OECD average in science and 13 points below Spain (MEPF, 2018). In this sense, given the increase in both poor academic results and Internet use by the younger generations, in our study we seek to analyse the influence that Internet use can have on academic performance. To do so, we will make use of interval multiobjective programming, specifically the algorithm proposed in Henriques et al. (2019). This methodology has been used in applications to analyse workers’ well-being (Henriques et al., 2020, 2021) and in the educational context (Prieto-Latorre et al., 2021).

In short, this study aims to enrich the existing literature in, at least, three aspects. Firstly, it assesses how social networks (and additionally WhatsApp) influence on students’ academic performance in different cognitive domains in late childhood, while studies usually focus on secondary school and university students. Secondly, we provide up-to-date evidence for Spain, to the extent that previous literature is limited and the databases used are outdated. Thirdly, by using interval multiobjective programming, we offer a potentially useful tool in the design of educational policies and parental guidance.

The article is structured as follows. First, we provide a brief review of the relevant literature on the influence of Internet use on academic performance. Then, we present the main characteristics of the dataset. Sections 4 and 5 describe the methodology employed and the results obtained. Finally, we discuss and present the main findings, including implications for socio-economic policies.

2 Literature review

The digital dependency and the effect it may have on young people’s academic performance and personal development has sparked an increased interest among researchers. Specifically, some studies have shown how using the Internet can improve academic performance (Çebi & Güyer 2020; Chen et al., 2014; Hou et al., 2021; Gil, 2012; Naqshbandi et al., 2017; Zhu et al., 2011). On the one hand, Chen et al. (2014) analysed the relationship between Internet information seeking, academic performance and academic self-efficacy, with the latter being the mediator between the first two. The authors distinguished between educational and leisure-oriented Internet use, concluding that both types had a positive impact on twelfth-grade students’ academic self-efficacy, indirectly improving their academic performance. On the other hand, Hou et al. (2021) examined the impact that the Chinese social network, WeChat, may have on university students’ academic performance. The authors concluded that the impact of using WeChat was largely due to students’ self-control, with the effect of sharing information through the application being positive when students had high self-control.

Likewise, researchers have found a positive relationship between ICT use and academic performance (Mo et al., 2014; Cabras & Tena Horrillo, 2016; Gubbels et al., 2020; Lei et al., 2021). In particular, Cabras and Tena Horrillo (2016), using data provided by PISA 2012, found a causal effect of ICT use on Spanish students’ mathematics performance, with the effect being stronger for lower-income students. Recently, Gubbels et al. (2020) showed that moderate ICT use was positively related to the reading achievement of 15-year-old Dutch students, with a negative impact when ICT was overused. Similarly, Machin et al. (2007) highlight that the increase of ICT investment at schools in England caused a positive impact on reading competence and science, but it had no significant influence in mathematics, while Villafuerte and Romero (2017) found that watching videos and using social networks help to improve English skills, as motivation and engagement facilitate English learning, both in writing and listening skills. This result is opposed to most empirical evidence, which usually finds a negative effect of social networks on educational attainment (see meta-analysis by Liu et al., 2017).

However, far from reaching the same conclusion, other studies have found a negative relationship between Internet/ICT use and academic performance (Azizi et al., 2019; Chang et al., 2019; Hsiao et al., 2017; Junco, 2015; Karpinski et al., 2013; Kim et al., 2017; Koca & Berk, 2019; Michikyan et al., 2015; Sengupta et al., 2018; Vigdor et al., 2014). Particularly, Kim et al. (2017) considered confounding factors such as gender, drug use or parental education level, and found that using the Internet for general purposes was negatively correlated with higher school performance, in contrast to when the Internet was used for study.

Finally, some studies point to the lack of significant effect of Internet and ICT on educational outcomes (Cristia et al., 2017; Fairlie & Robinson, 2013; Leuven et al., 2007; Mbaeze et al., 2010; Raines, 2012; Spiezia, 2011; Woessmann & Fuchs, 2004).

The influence that the Internet has on the teaching–learning process may depend on the type of analysis conducted, the potential existence of selection bias (Bulman & Fairlie, 2016), how it is used and whether it is more or less academically oriented (Chang et al., 2019; Gil, 2012; Kim et al., 2017; Lau, 2017; Torres-Díaz et al., 2016). In particular, while online information seeking tools and word processing are associated with higher academic performance in 15-year-old students (Gil, 2012), video games or streaming entertainment hinder the educational process (Lopez-Agudo & Marcenaro-Gutierrez, 2020; Rideout et al., 2010).

In this regard, it seems that the problem lies in excessive or inappropriate use of the Internet (Zhou et al., 2020) resulting in situations where users are unable to control the time they spend on online activities and neglect their daily activities (Wąsiński & Tomczyk, 2015). This “addiction”, which some studies refer to as “Internet use disorder” (Peterka-Bonetta et al., 2019; Sha et al., 2019), can lead to socioemotional problems among young people (Pontes et al., 2015), as well as negatively affect their academic performance (Berte et al., 2021; Flisher, 2010; Siciliano et al., 2015) by reducing academic engagement and increasing disaffection with learning activities (Feng et al., 2019; Karpinski et al., 2013; Zhang et al., 2018).

Focusing on Spain, the studies that have analysed this issue are limited (Fernández-Gutiérrez et al., 2020; García-Martín & Cantón-Mayo, 2019; Gómez-Fernández & Mediavilla, 2021). For instance, García-Martín and Cantón-Mayo (2019) assessed how different types of Internet use might affect academic skills, concluding that each type of Internet use was associated with different cognitive domains. Fernández-Gutiérrez et al. (2020) employ PISA data from three waves (2009, 2012 and 2015) to evaluate the use of ICT at secondary school, finding significant effects in students’ outcomes in science, but not in reading and mathematics. Particularly relevant is the study carried out by Prieto-Latorre et al. (2021), in which they analysed the effect that Internet use may have on school grades (content-based knowledge) and test scores (competences) of a cohort of 8th grade students in 2011–2012.Footnote 1 The authors concluded that using the internet academically and for hobbies should be prioritised over continued use of social networks.

In any case, the evidence in Spain is still scarce and further up-to-date research is needed, as the use of the Internet and new technologies is increasing among young people.

3 Data and institutional background

In recent years, the Spanish education system has adopted a formative assessment model in line with European standards that aims to promote lifelong learning and to move away from an exam-centred educational culture. To implement this formative assessment scheme, in addition to a continuous assessment throughout the academic year, Spanish students take an individual assessment test at the end of each educational stage. Specifically, for primary education, Spanish students take the test at the end of 3rd (LOMCE, art. 20.3; BOE, 2013) and 6th grades (LOMCE, art. 21; BOE, 2013). These tests are designed to assess students’ numeracy skills, as well as their oral and written comprehension skills, in order to provide them with individualised attention according to their needs and to prevent them from failing at a later stage of their education.

In this study we have used the data collected by the Canarian Agency for University Quality and Educational Assessment. Specifically, as mentioned above, we have used data from the cohort of 6th grade students in 2018–2019 (in the Autonomous Community of the Canary Islands). In total, our sample collects data from 13,296 students who completed tests in reading, mathematics, science and English. In addition, to avoid losing observations, we have introduced a set of missing flag variables.

In the database under scrutiny, to facilitate comparison with other research studies and between different measures of educational performance, the variables regarding test scores in the different competences have been standardised (through statistical normalisation) to have mean 0 and standard deviation 1 and, consequently, the results can be interpreted as effect sizes. Along with educational competences, by having both a student questionnaire and a parent questionnaire, we have information about parents’ educational level, income and occupation, among other variables. Moreover, to capture the students’ socio-economic level, the Canarian Agency provides us with an indicator of socio-economic and cultural status (ESCS). This index, which is a continuous variable, has also been standardised to facilitate comparisons.

Since our goal is to analyse how Internet use influences academic performance, this analysis is mainly based on the information collected by the student questionnaire. Concretely, we have based our analysis on the following question:

“How often do you use the Internet for the following activities?

  1. a)

    Searching for information for your studies (Google, Wikipedia, etc.).

  2. b)

    Searching for information about games or playing games.

  3. c)

    Searching for information about sports.

  4. d)

    Searching for information about music or movies (YouTube, Spotify, etc.).

  5. e)

    Searching for information on other topics that interest you.

  6. f)

    Communicating with other people (WhatsApp, Telegram, Hangout, etc.).

  7. g)

    For using social networks (Facebook, Twitter, Instagram, Musically, etc.).

For these 7 Internet-related variables, the answer is one of the following options: “never or almost never”, “once or twice a month”, “once or twice a week”, “every day or almost every day”. Descriptive statistics regarding these variables are shown in Table 4 (Appendix). We can observe that 6th grade students mainly use the Internet to communicate with others through applications such as WhatsApp, with 40% of students using it every day. This appears to be consistent with new trends, as the younger generation represents the new wave of users of these applications which can be broadly considered as social networks. Similarly, 32.9% of students use applications such as YouTube or Spotify on a daily basis, while they use the Internet for other topics of interest and to play games once or twice a week (36.1% and 24.1%, respectively). This underlines the aim of our analysis, as younger students tend to use the Internet more for non-academic purposes.

Since WhatsApp or Telegram can be considered social networks, we only include the variables related to social network use as a category that encompasses not only Facebook or Instagram, but also other communication channels. This will help us to avoid multicollinearity problems. However, as we will show below, the estimates corresponding to the model including the variables related to WhatsApp use have been run as a robustness check.

4 Methodology

The main goal of our analysis is to explore the influence that Internet use during late childhood may have on students’ academic performance. In doing so, we aim to provide a student profile that combines both academic and non-academic Internet use to maximise academic achievement. To this end, we will combine econometric analysis with multi-objective programming techniques.

The econometric analysis is based on the estimation by ordinary least squares (OLS) of different regression models in which academic performance is regressed on a set of variables. In these variables we include both a set of control variables and those relating to Internet use. Therefore, our base model would be defined as:

$${Z}_{k}(i)={\widehat{\beta }}_{0}(i)+{\widehat{\beta }}_{1k}{x}_{1}(i)+{\widehat{\beta }}_{2k}{x}_{2}(i)+{\widehat{\beta }}_{3k}{x}_{3}(i)+\dots +{\widehat{\beta }}_{23k}{x}_{23}(i)+{\varepsilon }_{k}(i)$$
(1)

where \(Z\) is the standardised academic performance, \(i\) is the student, \(k=\mathrm{1,2},\mathrm{3,4}\) represents reading, mathematics, science and English, respectively; \({x}_{1}\) to \({x}_{4}\) represent student characteristics; \({x}_{5}\) is a school characteristic; \({x}_{6}\) to \({x}_{23}\) indicate Internet use (see Table 5, Appendix); \({\varepsilon }_{k}\) is the error term; \({\widehat{\beta }}_{j}\) represents estimated regression coefficients for the \(j=1,\dots , 23\) variables. Since this is a point estimate of the influence that each independent variable has on the dependent variable, there is a margin of error and, therefore, we have estimated the lower and upper bounds of the estimated coefficients at the 99% confidence level as follows:

$$\left[{\widehat{\beta }}^{L}, {\widehat{\beta }}^{U}\right]=[\widehat{\beta }-{t}_{n-k-1,\frac{\alpha }{2}}SE\left(\widehat{\beta }\right), \widehat{\beta }+{t}_{n-k-1,\frac{\alpha }{2}}SE\left(\widehat{\beta }\right)]$$
(2)

where \(\widehat{\beta }\) is the estimated average coefficient; \(SE\) is the standard error; \(n\) is the number of observations; \(k\) is the number of conditioning variables; \(\alpha\) is the significance level and \(t\) are the probability values of the t-distribution.

Table 1 shows the lower and upper bounds of estimated coefficients for each of the educational outcomes, i.e. reading, mathematics, science and English.Footnote 2 Those coefficients that are not statistically significant, at least at 10%, are shown as “0”.

Table 1 Regression results for the social networks model

The results obtained show that females perform better than males in reading and English, but perform worse in mathematics. Specifically, girls obtain on average 0.27 standard deviations (SD) and 0.19 SD more in reading and English, respectively, than boys, while their mathematics performance drops by 0.08 SD. Far from being surprising, these findings are consistent with the evidence collected in the literature. On the one hand, females tend to perform better in reading; the Progress in International Reading Literacy Study (PIRLS) 2016 showed that, in 48 of the 50 participating countries, female students’ reading achievement was higher than boys’ and this female dominance has persisted since this assessment was implemented (Mullis et al., 2016). This gender gap in favour of females can be the result of certain gender stereotypes attributing greater mathematical ability to males (Cvencek et al., 2011; Plante et al., 2013; Spencer et al., 1999, 2016). The corollary of this is that girls tend to have a higher linguistic self-concept than boys (Heyder et al., 2017; Jacobs et al., 2002; Wigfield et al., 1997) and to value reading highly, which favours their academic performance in this competence. On the other hand, females’ higher achievement in English is closely linked to reading skills (Grabe, 2010; Oxford, 2011), as developed skills in interpreting texts and processing information enable females to learn a foreign language more quickly (Wightman, 2020).

Similarly, we find that students’ socio-economic status is positively associated with their academic performance. This relationship has been strongly supported by the literature (Cedeño et al., 2016; Hanushek & Woessmann, 2011; Kim et al., 2019; Liu et al., 2020; Martins & Veiga, 2010; von Stumm, 2017) and is explained by the fact that students from disadvantaged backgrounds are constrained by their economic resources and cannot access tutoring and other educational resources that help to improve their educational outcomes (Crosnoe & Cooper, 2010; Lareau, 2011). In contrast, the proportion of poor students in school has a negative influence on academic performance. Children from poor socio-economic backgrounds are more likely to develop behavioural problems (Hendriks et al., 2020; Peverill et al., 2021; Piotrowska et al., 2015); this can negatively affect classroom climate and academic performance via peer effects (Busching & Krahé, 2020; Jerrim et al., 2021).

Regarding Internet use, we observe that using the Internet to study and complete school tasks positively correlates with academic performance in all four subjects. In contrast, using the Internet to play video games is negatively related to reading performance. This negative influence goes from 0.05 SD, on average, when the frequency of use is once or twice a month, to 0.10 SD, on average, when it is used every day. This pattern seems to be in line with previous literature (Lopez-Agudo & Marcenaro-Gutierrez, 2020).

Similarly, using the Internet to search for information on sports is negatively related to reading, mathematics, science and English performance, with the influence also being greater as the frequency of use increases. In contrast, using apps like YouTube or Spotify seems to have a positive influence on academic performance. However, the average influence in mathematics is lower than in the other disciplines. For example, daily use of these apps is positively related to mathematics performance (0.12 SD), while the association is 0.18 SD and 0.16 SD in science and English, respectively. Since using music with lyrics or certain genres can be more distracting (Avila et al., 2012; Perham & Currie, 2014) the level of concentration required for mathematics tasks may not be achieved.

Meanwhile, using social networks has a negative influence on academic performance, with the greater the frequency of use, the greater this correlation. In this sense, its overuse can cause students to adopt less efficient and more superficial study techniques, given a greater number of distractions (Alt, 2018).

Following the econometric analysis, our study is carried out on the basis of interval multiobjective programming. This methodology has been applied because, as we know, there are many factors involved in the educational process that are not always controllable and that can affect students’ academic performance. Consequently, results cannot be interpreted as causal effects, but rather as conditional associations. In this context, using interval multiobjective programming models is particularly useful. Interval multiobjective programming and, specifically, the algorithm proposed in Henriques et al. (2019) and applied in Henriques et al., (2020, 2021) will allow us to overcome the uncertainty inherent to the coefficients in multiobjective problems. This methodology, which solves Multiobjective Linear Problems (MOLP), combines the reference-point approach with the concept of interval programming (Oliveira & Antunes, 2009) and allows us to use objective functions that take into account the confidence intervals of the regression coefficients. Therefore, by using this methodological approach, we will be able to analyse the potential trade-offs between different students outcomes while obtaining robust results.

That said, we start from a maximisation problem of educational outcomes subject to constraints:

$$\begin{array}{c}MaxZ\left(x\right)=\left(Z_1\left(x\right),Z_2\left(x\right),Z_3\left(x\right),Z_4\left(x\right)\right)\\subject\;to\;x\in X\end{array}$$
(3)

where each \({Z}_{k}\) stands for:

$${Z}_{k}\left(\mathrm{x}\right)=\left[{\widehat{\beta }}_{k0}^{L}, {\widehat{\beta }}_{k0}^{U}\right]+\left[{\widehat{\beta }}_{k1}^{L}, {\widehat{\beta }}_{k1}^{U}\right]{x}_{1}+\left[{\widehat{\beta }}_{k2}^{L}, {\widehat{\beta }}_{k2}^{U}\right]{x}_{2}+\dots +\left[{\widehat{\beta }}_{k23}^{L}, {\widehat{\beta }}_{k23}^{U}\right]{x}_{23}$$
(4)

\({Z}_{k}\left(x\right)\) are the objective functions to be maximized; \({\varvec{x}}={\left({x}_{1},\dots ,{x}_{n}\right)}^{T}\) is the vector of decision variables; \({\varvec{x}}\boldsymbol{ }\in X\) is the feasible region; \({\widehat{\beta }}_{kj}^{L}\) and \({\widehat{\beta }}_{kj}^{U}\) are the lower and upper bounds of the estimated coefficients, respectively.

In this sense, the estimated coefficients will be given by the correlation coefficients in Table 1 and our objective functions will be defined as follows:

  1. 1.

    Competences in reading.

    $${Z}_{1}=\left[-0.502, -0.172\right]+\left[0.219, 0.33 \right]{x}_{1}+\left[0.155, 0.211\right]{x}_{2}+\dots +\left[-0.191, -0.02 \right]{x}_{23}$$
  2. 2.

    Competences in mathematics.

    $${Z}_{2}=\left[0, 0\right]+\left[-0.129, -0.03\right]{x}_{1}+\left[0.191, 0.246\right]{x}_{2}+\dots +\left[0, 0\right]{x}_{23}$$
  3. 3.

    Competences in science.

    $${Z}_{3}=\left[0, 0\right]+\left[0, 0\right]{x}_{1}+\left[0.207,0.263 \right]{x}_{2}+\dots +\left[-0.183, -0.017\right]{x}_{23}$$
  4. 4.

    Competences in English.

    $${Z}_{4}=\left[\mathrm{0,0}\right]+\left[0.145,0.241 \right]{x}_{1}+\left[0.289,0.341\right]{x}_{2}+\dots +\left[0,0 \right]{x}_{23}$$

In order to obtain realistic solutions, a set of technical constraints have been defined for Internet use variables. These constraints will guarantee that the solutions are not simultaneously 1 for all binary variables in a group:

  1. 1.

    Using the Internet to study (Google, Wikipedia, etc.).

    $${x}_{6}+{x}_{7}+{x}_{8}\le 1$$
    (C1)
  2. 2.

    Using the Internet to play games.

    $${x}_{9}+{x}_{10}+{x}_{11}\le 1$$
    (C2)
  3. 3.

    Using the Internet to search for information about sports.

    $${x}_{12}+{x}_{13}+{x}_{14}\le 1$$
    (C3)
  4. 4.

    Using the Internet to search for information about music or cinema (YouTube, Spotify, etc.).

    $${x}_{15}+{x}_{16}+{x}_{17}\le 1$$
    (C4)
  5. 5.

    Using the Internet to search for information about hobbies.

    $${x}_{18}+{x}_{19}+{x}_{20}\le 1$$
    (C5)
  6. 6.

    Using social networks (Facebook, Twitter, Instagram, Musically, etc.)

    $${x}_{21}+{x}_{22}+{x}_{23}\le 1$$
    (C6)

Besides the technical constraints, we define another set of constraints reflecting the relationships between those independent variables that have significantly stronger associations. To illustrate the creation of these restrictions, an example is given below using the variables “Proportion of poor students” (\({x}_{5}\)) and “Immigrant status” (\({x}_{3}\)):

Dependence between the two variables is defined as:

$${x}_{5}=a{x}_{3}+b$$
(5)

To incorporate this linear regression into the model, we use 99% confidence intervals for each parameter as follows:

$$a\in\left[a^L,a^U\right]=\lbrack0.011,0.064\rbrack,\mathrm{and}\;b\in\left[b^L,b^U\right]=\lbrack0.210,0.260\rbrack$$

which implies:

$${a}^{L}{x}_{3}+{b}^{L} \le {x}_{5}\le {a}^{U}{x}_{3} +{b}^{U}$$
(6)

This expression can be broken down into two inequalities:

$${a}^{L} {x}_{3}-{x}_{5}\le -{b}^{L}$$
(7)
$$-a^U\;x_3{+x}_5\;\leq\;b^U$$
(8)

By following this procedure, we can obtain the set of constraints:

  1. 7.

    Relationship between proportion of poor students (1st quartile ESCS) and students’ ESCS.

    $$-0.091{x}_{2}-{x}_{5}\le -0.224$$
    (C7)
    $$0.070{x}_{2}+{x}_{5}\le 0.260$$
    (C8)
  2. 8.

    Relationship between proportion of poor students (1st quartile ESCS) and immigrant status.

    $$0.011{x}_{3}-{x}_{5}\le -0.210$$
    (C9)
    $$-0.064{x}_{3}+{x}_{5}\le 0.257$$
    (C10)
  3. 9.

    Relationship between proportion of poor students (1st quartile ESCS) and repeater.

    $$0.072{x}_{4}-{x}_{5}\le -0.211$$
    (C11)
    $$-0.131{x}_{4}+{x}_{5}\le 0.257$$
    (C12)
  4. 10.

    Relationship between students’ ESCS and immigrant status.

    $$-0.261{x}_{3}-{x}_{2}\le 0.014$$
    (C13)
    $$0.0814{x}_{3}+{x}_{2}\le 0.149$$
    (C14)
  5. 11.

    Relationship between students’ ESCS and repeater.

    $$-0.953{x}_{4}-{x}_{2}\le -0.001$$
    (C15)
    $$0.705{x}_{4}+{x}_{2}\le 0.156$$
    (C16)
  6. 12.

    Relationship between students’ ESCS and using the Internet to find information for study (Google, Wikipedia, etc.).

    $$0.052{x}_{6}+0.001{x}_{7}-0.018{x}_{8}-{x}_{2}\le 0$$
    (C17)
    $$-0.337-0.233{x}_{7}-0.198{x}_{8}+{x}_{2}\le 0$$
    (C18)
  7. 13.

    Relationship between students’ ESCS and using the Internet to search for information about other topics.

    $$0.046{x}_{18}+0.085{x}_{19}+0.023{x}_{20}-{x}_{2}\le 0$$
    (C19)
    $$-0.220{x}_{18}-0.248{x}_{19}-0.181{x}_{20}+{x}_{2}\le 0$$
    (C20)
  8. 14.

    Relationship between students’ ESCS and using social networks (Facebook, Twitter, Instagram, Musically, etc.)

    $$-0.338{x}_{21}-0.246{x}_{22}-0.226{x}_{23}-{x}_{2}\le -0.071$$
    (C21)
    $$0.185{x}_{21}+0.102{x}_{22}+0.043{x}_{23}+{x}_{2}\le 0.247$$
    (C22)

Therefore, our multiobjective problem has 23 decision variables (binary and continuous), 4 objective functions and 22 constraints. The type of variables and their bounds (which are considered as constraints of the multiobjective problem) are specified in Table 5 (Appendix).

In order to solve our interval multiobjective problem we must first solve each one of the problems for the objective functions individually. In this way, we will obtain the individual optimal values (Chinneck & Ramadan, 2000) and we will be able to check whether there are trade-offs between the four educational competences.

$$\begin{array}{cc}\mathit{Max}{Z}_{k}^{U}(\mathbf{x})=\sum_{j}{\beta }_{kj}^{U} {x}_{j},& k=1,\dots ,p,\\ s.t.:\sum_{j}{a}_{cj}{x}_{j}\le {b}_{c},& c=1,\dots ,m,\\ {x}_{j}\ge 0& j=1,\dots ,n.\end{array}$$
(9)

and

$$\begin{array}{cc}\mathit{Max}{Z}_{k}^{L}(\mathbf{x})=\sum_{j}{\beta }_{kj}^{L} {x}_{j},& k=1,\dots ,p,\\ s.t.:\sum_{j}{a}_{cj}{x}_{j}\le {b}_{c},& c=1,\dots ,m,\\ {x}_{j}\ge 0& j=1,\dots ,n.\end{array}$$
(10)

where \(k\) is the number of objective functions; \(c\) is the number of constraints; \(j\) is the number of decision variables. Consequently, the optimal solution of our multiobjective problem will be somewhere between the ideal solutions of the upper and lower bounds \({Z}_{k}^{*}=[\) \({Z}_{k}^{U*},{Z}_{k}^{L*}] ,\) where \({Z}_{k}^{U*}={{Z}_{k}^{U}(\mathrm{x}}_{k}^{U*})\) and\({Z}_{k}^{L*}={{Z}_{k}^{L}(\mathrm{x}}_{k}^{L*})\).

Then, and to obtain the solution of the interval multiobjective problem, we use the following surrogate scalarizing problem proposed in Henriques et al. (2019):

$$\begin{array}{l}\min\;v+\rho\sum_k\left(\mu_k^L\left(Z_k^{L\ast}-\sum_j\beta_{kj}^Lx_j\right)+\mu_k^U\left(Z_k^{U\ast}-\sum_j\beta_{kj}^Ux_j\right)\right)\\\begin{array}{cc}\mathrm s.\mathrm t.\mu_k^L\left(Z_k^{L\ast}-\sum_j\beta_{kj}^Lx_j\right)+\mu_k^U\left(Z_k^{U\ast}-\sum_j\beta_{kj}^Ux_j\right)\leq v,&k=1,\dots p,\\\sum_ja_{cj}x_j\leq b_c,&c=1,\dots,m,\\x_j\geq0,&j=1,\dots n\end{array}\end{array}$$
(11)

where the term \(\rho >0\) is an augmentation coefficient that guarantees the uniqueness of the obtained solution; \(k\) is the number of objective functions; \(c\) is the number of constraints; \(j\) is the number of decision variables.

This approach considers the Tchebychev distance to the interval ideal values \({Z}_{k}^{L*}\) and \({Z}_{k}^{U*}\), as well as the relevance of each objective function to reach these values through the weights \({\mu }_{k}^{L},{\mu }_{k}^{U}>0\) for all \(k=1,\dots ,p\). In addition, it provides “possibly” efficient solutions, which implies that the solution will be efficient for a linear combination of the parameters \({\overline{\beta }}_{kj}\in \left[{\beta }_{kj}^{L}, {\beta }_{kj}^{U}\right]\).

On the other hand, the algorithm proposed in Henriques et al. (2019) assumes that \({\mu }_{k}^{L}+{\mu }_{k}^{U}=1\) for all \(k=1,\dots ,p\), which enables to assign different importance to upper or lower bounds and, therefore, the decision maker (DM) will be able to define the importance of achieving each objective function according to their preferences. However, in our analysis we have considered the same importance for reaching each corresponding ideal solution.

5 Results

5.1 Main results

First, we have obtained the individual optimal values for each function. To simplify the interpretation of the results, Table 2 shows the optimal values (\({Z}_{k}^{*}\)) for each function instead of the ideal solutions of the upper and lower bounds (\({Z}_{k}^{L*}\) and \({Z}_{k}^{U*}\)).

Table 2 Mono-objective problem. Ideal solutions and individual optimal values for the social networks model

The last row of Table 2 shows the optimal values in terms of standardised scores. In this sense, the optimal value for English is higher than for the rest of the subjects, with 0.644 SD, compared to 0.549 and 0.472 SD for science and reading, respectively. Meanwhile, the optimal value in mathematics is 0.339 SD.

Along with the individual optimal values, Table 2 shows the profile of the student that maximises their performance in each one of the competences. As we can see, there is a certain trade-off between the different objective functions. Firstly, female students are the highest achievers in reading and English. In contrast, male students are the ones who maximise their performance in science and mathematics. This is hardly surprising and is consistent with the existing literature, as these are male-dominated fields (Spencer et al., 1999, 2016).

Regarding students’ socio-economic level we observe that, in order to maximise students’ academic performance, the ESCS index should not be particularly high, although higher than the sample average value. Specifically, students’ socio-economic index should be 0.149 if they want to maximise their performance in reading, mathematics and science, and 0.068 when it comes to English. This is – to some extent – unexpected, as students’ socio-economic status is positively related to their academic performance. However, given that students interact with peers from diverse socio-economic backgrounds, it seems that academic achievement is maximised when students from low socio-economic backgrounds are surrounded by a higher proportion of poor students. This proportion of poor students will be 23.6% for reading and 22.11% for English performance. In mathematics and science this proportion will be slightly lower (21.11%).

On the other hand, performance in all four subjects is maximised for those students who are not repeaters, while immigrant students are more likely to maximise their performance in English, which may be explained by their language background.

In terms of the Internet use, we can observe a more marked trade-off between the different objective functions. Specifically, students maximise their mathematics performance by continuously using the Internet to study, while it must be limited for the other academic competences. In addition, Internet use for gaming should be limited to once or twice a month to maximise students’ performance in mathematics, science and English, and reduced as much as possible to maximise their reading achievement. In the same vein, using the Internet to search for information on sports should be reduced to zero if students want to maximise their academic performance.

Alike, using the Internet for listening to music or for other hobbies daily contributes to maximise their performance in reading, science and English. In contrast, they should reduce these uses to once or twice a week to perform well in mathematics.

Finally, regarding social networks, we see that students should keep their use to the barest minimum. Given that using social networks can lead to addiction, especially in late childhood and adolescence, students should practically not use them if they want to ensure their academic performance.

Once the individual optimal values have been obtained, we have run the algorithm considering equal importance to reach each objective function and obtain “possibly” efficient solutions, presenting these results in Table 3. This table shows that the range of variation of the achieved value in science is larger (between -0.009 and 0.922 points) than in the other subjects. In contrast, the range of variation in mathematics is the smallest (between -0.115 and 0.602 points), with the maximum value being much smaller than that achieved in the other 3 competences.

Table 3 Multiobjective interval problem. Possibly efficient solution achieved values for the social networks model

As we can see, the profile of the student who maximises her academic performance is female, non-immigrant and has not repeated before 6th grade. Moreover, as in the mono-objective problem, it appears that academic performance is maximised when students from a low socio-economic background are surrounded by a higher proportion of poor students, with students’ socio-economic index being 0.149 and the proportion of poor students being 21.13%.

As for Internet use we can observe that, in order to achieve higher academic performance, students should reduce their use of the Internet both for studying and for searching for information about sports or playing video games. In this regard, it is worth noting that, to maximise their academic performance, students should reduce their Internet use for studying to once or twice a month, which may be the result of multitasking (Feng et al., 2019; Junco, 2015). Since younger students may lack self-control, using the Internet for schoolwork or study may lead to a non-academic use. This could hinder the educational process and encourage procrastination from an early age (Aznar-Díaz et al., 2020).

In contrast, our results show that students can maximise their academic performance by using the Internet daily to listen to music or to search for information on other topics of interest. In this sense, listening to music can improve not only mood, but also arousal levels and, consequently, enhance the performance of certain simple cognitive tasks (Goltz & Sadakata, 2021).

Finally we can observe that, if students want to maximise their academic performance, social networks use should be non-existent. Adolescents and pre-adolescents are heavy users of social networks and their desire to be constantly interacting with others can lead them to misuse their time efficiently and neglect academic work (Qiaolei, 2014).

5.2 Robustness check

To check the robustness of our results, we have replicated our analysis using the variables that only refer to the use of applications such as Telegram or WhatsApp (see Tables 7,Footnote 3 9 and 10, Appendix).

In this sense, if we look at the results obtained for this new mono-objective problem (Table 9), we can observe that the optimal values for each of the subjects are pretty similar to those obtained in our main model. For example, in reading the optimal value decreases just by 0.09 points in terms of standardised scores. As for the student profile, Table 9 shows a very similar pattern to Table 2. In order to maximise their performance in each one of the subjects, students should reduce their use of the Internet to search for information about sports, as well as to use applications such as WhatsApp. These applications should be used minimally to enhance academic performance.

On the other hand, Table 10 shows the “possibly” efficient solutions. In this sense, we can observe that the student who manages to maximise her academic performance is an immigrant female who has not repeated a grade. Her socio-economic index should be around 0.067, while the proportion of poor students (in the school) should be 22.11%.

Regarding the frequency of Internet use, we observe a pattern of use very similar to the one shown in Table 3. Students should use the Internet as little as possible to search for information about sports, while they can listen to music and search for information about other topics of interest daily without harming their academic achievement. As for the use of applications such as WhatsApp or Telegram, this should be limited to once or twice a week.

In summary, as we have seen through the tables, the individual optimal values and the “possibly” efficient solution are pretty close to the ones obtained in our main model, which shows the consistency of our results.

6 Discussion and conclusions

Throughout this paper we have tried to analyse the influence that using the Internet may have on academic performance. In order to do so, using a database that provides us with information on a cohort of 6th grade students in 2018–2019, we have implemented interval multiobjective programming. With this methodology, we have tried to profile those students who are able to maximise their academic performance evenly – and simultaneously – among different subjects.

In detail, we first carried out an econometric analysis to subsequently proceed with the multiobjective programming. From these estimates we observed how, e.g., using social networks had a negative influence on reading, mathematics, science and English performance, with the negative influence being greater as the frequency of use increased. The results of the interval programming model allow to assert that the profile of the student who manages to maximise her academic performance in all four subjects is a female student who has not repeated, from a medium socio-economic level and who attends a school where the proportion of students from a low socio-economic background is 21.11% (i.e. slightly below the sample average).

In addition, the results show that using the Internet from an early age has its ups and downs. To maximise academic performance, students should use the Internet to study or search for information about sports once or twice a month. Similarly, they should use the Internet to play video games 1 or 2 times a week, while they can daily use applications such as Spotify. In contrast, the use of social networks should be practically zero.

Thus, while the Internet can be helpful for the teaching–learning process, its inappropriate use can prevent students from reaching their best balanced performance. Therefore and, given the possibility that constant use of the Internet and social networks may lead to “addiction” that may affect their academic performance, it is necessary to establish some control. Hence, it would be desirable for both the family and the school to offer students some guidance to encourage them to use the Internet appropriately. In addition to promoting awareness and self-control among students, other measures should also be implemented to control both the content and frequency of use by young people. This would prevent exposure to undesirable content, as well as procrastination, that can ultimately lead young people to adopt unhealthy study techniques.

Besides, schools could find more effective ways of integrating ICT into the educational process by providing students with learning environments that are in line with the needs and trends of the twenty-first century. In this sense, while getting students to reduce their use of the Internet to play video games or to practically stop using social networks may be beyond the school’s scope of action, balancing academic performance with the ICT used in the classroom can be an objective of the school. In recent years, students’ use of computers and tablets during lessons has become increasingly common. However, in order to make it compatible with good academic performance, it is essential that schools set guidelines and control measures such as e.g. applications that limit students’ access to non-academic content during class time, in order to avoid multitasking problems.

In any case, it should be noted that our study is not free of limitations. First, as mentioned above, we are working with correlational rather than causal econometric estimates, as far as we cannot control for all variables that may affect academic performance. However, by using interval multiobjective programming, we are able to deal with this state of uncertainty. Second, Internet variables are self-reported, which may lead to measurement errors in the model. Third, our results may not be extrapolated to the rest of Spain, to the extent that we are using students’ information from only one Spanish region.