1 Introduction

Before the COVID-19 pandemic (caused by the coronavirus SARS-CoV-2), tourism was considered one of the sectors with the greatest growth prospects worldwide and one of the greatest drivers of economic development in the regions (Daniel et al., 2017), supported by the increase in the number of tourists and the development of the infrastructure of the destinations (Saluveer et al., 2020; WTO, 2019). However, this sector is currently one of the most affected by the COVID-19 pandemic with repercussions for all actors in the supply chain due to the travel ban (Goodell, 2020; Nicola et al., 2020; Samson, 2020). This situation is expected to generate major changes in travel behavior and planning for post-pandemic tourism scenarios (Li et al., 2020). Therefore, the tourist trip design problem (TTDP) is presented as an alternative solution to the several restrictions that must be considered according to the real contexts of the destinations. The TTDP aims at planning tourist routes for a single tourist or a group of tourists by maximizing the benefit obtained from visiting multiple points of interest (POIs) without exceeding the budget (Gavalas et al., 2015b; Kotiloglu et al., 2017; Liao & Zheng, 2018; Rodríguez et al., 2012; Vansteenwegen & Van Oudheusden, 2007; Zheng & Liao, 2019).

Planning a tourist itinerary is a complex process that involves several variants (Kotiloglu et al., 2017; Liao & Zheng, 2018). One of these variants that tourists should consider when planning their itinerary is the selection of the mode of transport (Garcia et al., 2009). This process may be affected by the pandemic, guiding tourists to prioritize some types of transport over others (Li et al., 2020). Transport takes relevance from the definition of tourism. Indeed, tourism consists of the movement of people or groups of people around a set of different sectors and points of interest (POI) in order to satisfy the preferences of tourists in a destination that the region offers for rest, recreation, among others, within a tourist route (Leiper, 1979; Sedarati et al., 2019). Therefore, transport is an important process that ensures mobility and accessibility between POIs and can even determine the attractiveness of these POIs (Le-Klähn & Hall, 2015; Van Truong & Shimizu, 2017).

On the other hand, current conditions lead tourists to consider social distancing in case of traveling in a group with homogeneous or heterogeneous preferences (i.e. within the group tourists prefer to visit the same POIs or each tourist selects specific POIs). The trend in global tourism is the shift from mass travel to customize itineraries that include heterogeneous preferences, even in group travel (Zheng & Liao, 2019). This variant of the TTDP represents an efficient alternative to be able to distribute the tourists of the group, thus avoiding the total concentration of the number of tourists in each POI and even on the route. However, when constructing the group itinerary, benefits should be achieved for all tourists based on equity (equity in group profit) (Wang, Liu, & Innes, 2019). In this way, it is intended that the tourists in the group present an equitable benefit despite having heterogeneous preferences and visiting different POIs. These social dimensions remain a challenge in the development of optimization models and in resolving conflicts between the three dimensions of sustainability (environmental, social, and economic) (Fiack et al., 2021; Vega-Mejía et al., 2019).

In addition to the situation generated by COVID-19, there are other problems that can substantially affect tourism. Climate change is one of the issues that is expected to have a significant impact in the coming decades (Scott et al., 2012). This situation has been very little investigated, generating a barrier that avoids determining the consequences for the competitiveness and sustainability of tourism in the regions (Scott et al., 2016, 2019); it is important to emphasize that tourism is also a sector that generates large emissions into the environment (UNWTO, 2012). This drives the sector towards the development of sustainable tourism in accordance with the Sustainable Development Goals (SDGs) (Nguyen et al., 2019). This approach takes relevance in transport as a process that generates high CO2 emissions (Pradenas et al., 2013; Qian & Eglese, 2016; Sánchez et al., 2013), as do accommodation and other facilities (Sedarati et al., 2019). There are several perspectives to address the issue of environmental impact for tourism; however, within the tourist trips problems only the work of Susanty et al. (2018) refers to this area from CO2 emissions in transport. Therefore, there is a great research opportunity to work on this topic in tourism.

Considering the previous context, this paper addresses the problem of tourism trip planning with heterogeneous preferences of tourists, transport mode selection, and equity in group profit. The objectives of the model are associated with the benefit of each tourist, the equitable benefit for the tourist group, and the level of CO2 emissions. This problem is approached in two stages. The first stage presents a bi-objective model for tourism planning with heterogeneous preferences that considers the group equitable profit and transport mode selection. This problem also includes the time windows to access the different POIs. This approach is more realistic in contrast to previous works from the literature (i.e., Malucelli et al., 2015; Sylejmani et al., 2017; Zheng & Liao, 2019), in the sense that, on the route, the tourist can use different transport modes. This can be translated as a potential scheme to support sustainability (Tawfik & Limbourg, 2019). Then, the second stage presents the multi-objective model which, in addition to the considerations of the previous both objectives, the minimization of CO2 emissions is added as a third optimization objective. This approach is guided by the new trend of sustainable tourism that the World Tourism Organization and the SDGs are aiming (Nepal et al., 2019).

As pointed out later in the literature review, there is no evidence of a problem that considers these three aspects in an articulated manner (i.e., heterogeneous preferences, transport mode selection, and CO2 emissions). Therefore, this paper proposes novel mathematical models to approach the current conditions of tourism development in times of the pandemic and post-pandemic under these three aspects.

The remainder of this paper is structured as follows. Section 2 presents a literature review about the tourist trip design problem with heterogeneous preferences, transport mode selection and sustainability in the construction of tourist paths. Section 3 presents the proposed mathematical models, while their application is presented in Sect. 4 through an illustrative example. Section 5 presents the results of the experimental evaluation and comparison of the models. Finally, Sect. 6 summarizes the conclusions and outlines some opportunities for future research.

2 Literature review

Tourist trip planning requires information about potential places according to the tourist's preferences, available budget, time to visit each point of interest (POI), travel times, mode of transport, among others, that make it a complex process and difficult to solve mathematically (Kotiloglu et al., 2017). In the academic literature, this problem is called the Tourist Trip Design Problem (TTDP) (Brito et al., 2017; Vansteenwegen & Van Oudheusden, 2007), whose aim is to plan the tourist's or group's itinerary taking into account time or economic budget constraints (Gavalas et al., 2015b; Kotiloglu et al., 2017; Liao & Zheng, 2018; Rodríguez et al., 2012; Vansteenwegen & Van Oudheusden, 2007; Zheng & Liao, 2019).

The great interest of the TTDP has attracted the attention of researchers, as witnessed by the amount of literature reviews published since 2009 (Gavalas et al., 2014; Gunawan et al., 2016; Lim et al., 2019; Ruiz-Meza & Montoya-Torres, 2020; Shcherbina & Shembeleva, 2014; Souffriau & Vansteenwegen, 2010; Vansteenwegen et al., 2009). Throughout the literature, tourism planning problems have been modeled through Operations Management/Operational Research techniques under several approaches, such as the Cruise Itinerary Problem (Leong & Ladany, 2001), the One-Period Routing Bus Problem (BTP) (Deitch & Ladany, 2000, 2001), the City Bus Tour Problem (CTB) (Bagloee et al., 2017), the Generalized Maximum Coverage Problem (GMC) (Brilhante et al., 2013, 2015), the Optimal Tourist Problem (OTP) (Yu et al., 2015), and the Orienteering problem (OP) (Vansteenwegen & Van Oudheusden, 2007). This last together with the Team Orientering Problem (TOP) are the most emplyed modeling appraoches (Ruiz-Meza & Montoya-Torres, 2020; Shcherbina & Shembeleva, 2014; Vansteenwegen & Van Oudheusden, 2007).

The OP can be considered as a selective routing problem that aims to determine the order and set of nodes to be visited in order to maximize the benefit received without exceeding the time budget (Gunawan et al., 2016; Kara et al., 2016). When several routes are to be defined, a TOP is considered more efficient. The main objective of the TOP is to maximize profit by selecting and visiting a set of nodes, where the start and end nodes are fixed. The TOP selects several routes at once, so the OP can be considered as a single-route version (Vansteenwegen, 2009). In this sense, the TOP can be approached as a multi-level optimization problem, starting with a selection of the set of nodes to be visited. Then, the assignment of scores to each team member is made and finally, the construction of the routes is carried out (Chao et al., 1996). The Team Orienteering Problem with Time Window (TOPTW) is the most appropriate extension of the TOP to describe the TTDP because, generally, the POIs have an opening time window (Expósito et al., 2019a; Gavalas et al., 2014).

Several variants are considered when modeling real problems (Ruiz-Meza et al., 2020), such as time windows (Garcia et al., 2009; Souffiau et al., 2009), time-dependency (Garcia et al., 2010a, 2010b), arc score (Arc Orienteering Problem -AOP) (Verbeeck et al., 2014), arc and POI scores (Mixed OP-AOP) (Gavalas et al., , 2016, 2017; Malucelli et al., 2015), multi-commodity (Malucelli et al., 2015), multiple periods (Kotiloglu et al., 2017), time-based user interest (Lim et al., 2017) multiple constraints (Sylejmani et al., 2012), heterogeneous preferences (Malucelli et al., 2015; Zheng & Liao, 2019), uncertain travel times (Hasuike et al., 2013), previous tourist experiences (Zheng et al., 2017), time-dependent stochasticity (Liao & Zheng, 2018), hotel selection (Zheng et al., 2020), electric vehicles (EV) for tourism (Wang et al., 2018), scores and travel fuzzy times (Expósito et al., 2019a, 2019b), and Clustered POIs, when POIs of different types of tourism are identified and grouped according to their class (Expósito et al., 2019a).

In addition, another relevant but little-studied variant is the type of tourism associated with urban environments. This type of tourism is gaining relevance due to the growth of cities and their attractiveness as tourist sites for business activities, shopping, medical purposes, and visits to family and friends, among other activities (Yuan et al., 2018). The complexity of urban tourism is mainly generated in the urban logistics management associated with the problems arising from transportation in urban areas (Zheng & Liao, 2019; Zheng et al., 2020). Therefore, as in urban logistics, it is necessary to consider variants associated with travel time, vehicle congestion, and, above all, the reduction of CO2 emissions (De Marco et al., 2017). However, the TTDP, whether in urban or non-urban environments, focuses on the movement of people, while in urban logistics the urban transport of goods is the main focus (Morana, 2014).

Since the current paper studies a problem with heterogenous preferences, time windows and environmental impact (CO2 emissions), following subsections present a review of the variants that are included in such a problem. A summary of this literature is presented in Table 1.

Table 1 Previous works on the TTDP with heterogeneous preferences, transport mode selection, and consideration of CO2 emissions

2.1 Heterogeneous preferences

About TTPD with heterogeneous preferences, the literature is very limited. Malucelli et al. (2015) proposed a new combinatorial optimization problem called the Multi-Commodity Orienteering Problem with Network Design (MOP-ND). Their approach satisfies the individual preferences of a group of tourists with the same origin–destination, maximizing the profit of each tourist. These authors used real data from Trebon region in South Boemia for cycle tourist. Sylejmani et al. (2017) developed an extension of the TTDP that considers group planning with personalized interests and mutual relationship between different tourists. The model is based on the Multi Constraint TOPTW to consider the case of multiple trips with multiple tours (MCMTOPTW). The problem is solved by a taboo search algorithm with a neighborhood structure based on three unique operators: Separate, Join and Insert. Finally, Zheng and Liao (2019) developed a bi-objective model for planning group tourist trips considering heterogeneous preferences. The first objective maximizes the fairness of individual members, while the second objective maximizes the total group utility. The problem is solved by a non-dominated classification algorithm (NSACDE) that combines ant colony optimization (ACO) and the differential evolution algorithm (DEA). The algorithm was tested with real data from Kulangsu located in China.

2.2 Transport mode selection

Tourism is based on the resource, service, and infrastructure capacities offered by the territory, and in terms of infrastructure and networks, access must be guaranteed for the flow of people and goods (Millán, 2010). Therefore, transport is a process of great importance for the accessibility of the POIs and to define to some degree the attractiveness of these sites (Le-Klähn & Hall, 2015; Van Truong & Shimizu, 2017). Transport is one of the sectors responsible for the operation and development of tourism (Chaudhari & Thakkar, 2019). In addition, transport is included in the TTPD for the flow of tourists (Gavalas et al., 2015a) considering travel time, transport mode, among other aspects that can be associated with transport (Garcia et al., 2009).

Abbaspour and Samadzadegan (2009) applied a genetic algorithm to solve the problem of planning tourist trips considering the travel time in a multimodal transport network. For the model, monomodal (use of only one type of transport mode in the trip) and multimodal (combination of different transport modes) routes was considered. The test database was from Tehran, Iran. Subsequently, Abbaspour and Samadzadegan, (2011) applied two genetic algorithms to solve the tour planning problem with multimodal transport in urban environments. Garcia et al., (2010a, 2010b) develop a hybrid approach to solve the Time Dependent Team Orienteering Problem with Time Windows (TD-TOPTW) including public transport. Garcia et al., (2013) developed two solution algorithms for the TDTOPTW considering public transport as a decision-making tool in real time in the Personalized Electronic Tourist Guides (PETs). Gavalas et al. (2015a) include the eCOMPASS application that allows the calculation of tourist trips considering the multimodal and urban transport based on a TDTOPTW. To solve the problem an algorithm called SlackRoutes was developed. Gavalas et al. (2015b) developed The Time Dependent CSCRoutes (TD_CSCR) and The Time Dependent Slack CSCRoutes (TD_SℓCSCR) which are extensions of Cluster Search Cluster Ratio (CSCRatio) and Cluster Search Cluster Routes (CSCRoutes). The new algorithms allow managing time-dependent travel times between different POIs, to solve the problem of TDTOPTW with which the periodicity of transit services is not assumed. Wu et al. (2017) propose a tourist planning model to maximize the tourist utility denoted as the tourist experience. Tourist preferences for the attraction, the transport mode used between each POI, attraction attributes, time and cost budget were considered. Yu et al. (2017) propose a Multi-Modal Team Orienteering Problem with Time Window (MM-TOPTW) for the development of a tourist trip design application with several transport modes to be chosen by the tourist.

In addition, other time-dependent work has been developed on tourist itineraries with transport mode selection. Liao and Zheng (2018) develop a model based on a stochastic OP that aims at designing more realistic and personalized routes for each tourist. The problem considers spatial–temporal structures and mode selection between POIs (i.e. the tourist can select the mode of transport considering the time constraints, preferences, and availability). To solve the problem a hybrid heuristic algorithm based on random simulation (RSH2A) is applied.

2.3 CO2 emissions

Tourism is one of the major contributors to pollution problems (Susanty et al., 2018; UNWTO, 2012) as is the transport sector (Bai et al., 2017; Pradenas et al., 2013; Qian & Eglese, 2016; Sánchez et al., 2013). This is a serious concern worldwide because of the gradual increase of CO2 emissions and their environmental impact (Almouhanna et al., 2020; Juan et al., 2016; Muñoz-Villamizar et al., 2017, 2019). However, the application of environmental practices by companies in the tourism sector can generate improvements in operational performance, and minimize energy consumption (Perramon et al., 2014; Zeng et al., 2010). Therefore, many studies have been developed to minimize CO2 emissions in the routing process (Marrekchi et al., 2021). In the field of tourism and itinerary design, however, this problem is addressed only by Susanty et al. (2018), where a dynamic programming approach is applied to find the shortest routes in the tourist trip to minimize CO2 emissions.

For a sustainable tourism, sustainable transportation is necessary (Le-Klähn & Hall, 2015). Filimonau et al. (2014) suggest that tourists should avoid cars and air travel to minimize CO2 emissions and seek to use cleaner transportation systems such as public transport networks. However, public transport systems are not sustainable systems worldwide. Therefore, the use of intermodal transport can be considered as a contribution to sustainability for tourism (Tawfik & Limbourg, 2019). As pointed out before, this last is one of the contributions of the current paper.

3 Proposed models

The mathematical approach, which is divided into two stages, is detailed below. The first stage presents a TTDP based on a team orienteering problem with time windows (TOPTW) for heterogeneous preferences in tourist groups and transport mode selection between arcs. Subsequently, the model with CO2 minimization is detailed.

3.1 TTDP with time windows, heterogeneous preferences, and selection of transport mode

The mathematical model for tourist trip planning with heterogeneous preferences in tourist groups and selection of transport modes is detailed next.

Sets

\(I\) set of POIs \(\{ 1,2,3, \ldots ,i\}\)

\(K\) set of routes \(\{ 1,2,3, \ldots ,k\}\)

\(U\) set of tourists \(\{ 1,2,3, \ldots ,u\}\)

\(L\) set of transport modes \(\{ 1,2,3, \ldots ,l\}\)

\(I^{0}\) Depot node \(\{ 0\}\)

Parameters

\(P_{iu}\) profit in each POI for the tourist, \(\forall i \in I\backslash \{ 0\} ,\forall u \in U\)

\(v_{i}\) visiting time for each POI, \(\forall i \in I\backslash \{ 0\}\)

\(\left[ {a_{i} ,b_{i} } \right]\) time window in each POI, \(\forall i \in I\backslash \{ 0\}\)

\(d_{ij}\) travel distance between POIs, \(\forall ij \in I\)

\(Q_{k}\) maximum number of tourists on each route, \(\forall k \in K\)

\(Qm_{k}\) minimum number of tourists on each route, \(\forall k \in K\)

\(TMax_{u}\) maximum time budget for the tourist, \(\forall u \in U\)

\(Vel_{l}\) average speed for each transport mode, \(\forall l \in L\)

\(Vc_{ijl}\) variable cost for each transport mode, \(\forall ij \in I,\forall l \in L\)

\(Pi_{u}\) maximum transportation budget, \(\forall u \in U\)

\(t_{ijl}\) travel time for each transport mode, \(\forall ij \in I,\forall l \in L\)

Variables

\(X_{ijk} = 1\) if route \(k\) goes from \(i\) to \(j\), and \(X_{ijk} = 0\) otherwise, \(\forall ij \in I:i \ne j,\forall k \in K\)

\(Y_{ik} = 1\) if POI \(i\) is included in route \(k\), and \(Y_{ik} = 0\) otherwise, \(\forall i \in I\backslash \{ 0\} ,\forall k \in K\)

\(W_{ijku} = 1\) if tourist \(u\) goes from \(i\) to \(j\) in route \(k\), and \(W_{ijku} = 0\) otherwise, \(\forall ij \in I:i \ne j,\forall k \in K,\forall u \in U\)

\(\delta_{iku} = 1\) if POI \(i\) is included in route \(k\) for tourist \(u\), and \(\delta_{iku} = 0\) otherwise, \(\forall i \in I\backslash \{ 0\} ,\forall k \in K,\forall u \in U\)

\(Z_{ijkul} = 1\) if tourist \(u\) goes from \(i\) to \(j\) in route \(k\) with transport mode \(l\), and \(Z_{ijkul} = 0\) otherwise, \(\forall ij \in I:i \ne j,\forall k \in K,\forall u \in U,\forall l \in L\)

\(G_{k} = 1\) if route \(k\) is used, and \(G_{k} = 0\) otherwise, \(\forall k \in K\)

\(T_{iku}\): non-negative variable representing the time arrival in each POI \(i\) for tourist \(u\) in route \(k\), \(\forall i \in I\backslash \{ 0\} ,\forall k \in K,\forall u \in U\)

\(TR_{u}\) nonnegative variable; Travel time for the tourist \(u\), \(\forall u \in U\)

\(Max\): maximum profit

\(Min\): minimum profit

Bi-objective function

$$ {\text{Maximize}}\;Z = \alpha \sum\limits_{i = 1}^{I} {\sum\limits_{k}^{K} {\sum\limits_{u}^{U} {P_{iu} \delta_{iku} } } } + \beta ( - Max + Min) $$
(1)

Subject to:

$$ \sum\limits_{j = 1}^{J} {X_{ijk} = } \sum\limits_{j = 1}^{J} {X_{jik} } \;\forall i \in I^{0} ,\;\forall k \in K $$
(2)
$$ \sum\limits_{j = 1}^{J} {W_{ijku} = } \sum\limits_{j = 1}^{J} {W_{jiku} } \;\forall i \in I^{0} ,\;\forall k \in K,\forall u \in U $$
(3)
$$ \sum\limits_{j = 1}^{J} {X_{ijk} = } G_{k} \;\forall i \in I^{0} ,\forall k \in K $$
(4)
$$ \sum\limits_{k}^{K} {Y_{ik} \le 1} \;\forall i \in I\backslash \{ 0\} ,\forall k \in K $$
(5)
$$ \sum\limits_{i = 1,i \ne h}^{I} {X_{ihk} = } \sum\limits_{j = 1,j \ne h}^{J} {X_{hik} } = Y_{hk} \;\forall h \in I,\forall k \in K $$
(6)
$$ \sum\limits_{i = 1,i \ne h}^{I} {W_{ihku} = } \sum\limits_{j = 1,j \ne h}^{J} {W_{hiku} } = \delta_{hku} \;\forall h \in I,\forall k \in K,\forall u \in U $$
(7)
$$ \sum\limits_{u}^{U} {W_{ijku} \le X_{ijk} Q_{k} } \;\forall i \in I,\forall j \in I\backslash \{ 0\} :i \ne j,\forall k \in K $$
(8)
$$ \sum\limits_{u}^{U} {\delta_{iku} \le Y_{ik} Q_{k} } \;\forall i \in I\backslash \{ 0\} ,\forall k \in K $$
(9)
$$ \sum\limits_{j = 1}^{J} {\sum\limits_{k}^{K} {W_{ijku} } \le 1} \;\forall i \in I^{0} ,\forall u \in U $$
(10)
$$ \sum\limits_{u}^{U} {W_{ijku} } \ge Qm_{k} - M(1 - X_{ijk} )\forall i \in I^{0} ,\forall u \in U $$
(11)
$$ \sum\limits_{l}^{L} {Z_{ijkul} = W_{ijku} } \;\forall i \in I,\forall j \in I\backslash \{ 0\} :i \ne j,\forall k \in K $$
(12)
$$ \sum\limits_{l}^{L} {Z_{ijkul} \le 1} \;\forall ij \in I:i \ne j,\forall k \in K $$
(13)
$$ T_{jku} \ge \sum\limits_{l}^{L} {t_{ijl} Z_{ijkul} - M(1 - W_{ijkul} )} \;\forall i \in I^{0} ,\forall j \in I\backslash \{ 0\} ,\forall k \in K,\forall u \in U $$
(14)
$$ T_{iku} + v_{i} + \sum\limits_{l}^{L} {t_{ijl} Z_{ijkul} - T_{jku} \le M(1 - W_{ijkul} )} \;\forall ij \in I\backslash \{ 0\} :i \ne j,\forall k \in K,\forall u \in U $$
(15)
$$ TR_{u} \ge T_{iku} + v_{i} + \sum\limits_{l}^{L} {t_{ijl} Z_{ijkul} - M(1 - W_{ijkul} )} \;\forall i \in I,\forall j \in I\backslash \{ 0\} ,\forall k \in K,\forall u \in U $$
(16)
$$ TR_{u} \le TMax_{u} \;\forall u \in U $$
(17)
$$ a_{i} \delta_{iku} \le T_{iku} \le b_{i} \delta_{iku} \;\forall i \in I\backslash \{ 0\} ,\forall k \in K,\forall u \in U $$
(18)
$$ Max \ge \sum\limits_{i = 1}^{I} {\sum\limits_{k}^{K} {P_{iu} \delta_{iku} } } \ge Min\;\forall u \in U $$
(19)
$$ \sum\limits_{i \ne j}^{I} {\sum\limits_{j \ne i}^{J} {\sum\limits_{k}^{K} {\sum\limits_{l}^{L} {Z_{ijkul} Vc_{ijl} \le Pi_{u} } } } } \;\forall u \in U $$
(20)
$$ X_{ijk} ,W_{ijku} ,Y_{ik} ,\delta_{iku} ,Z_{ijkul} ,G_{k} \in \left\{ {1,0} \right\}\;\forall ij \in I,\forall k \in K,\forall u \in U,\forall l \in L $$
(21)
$$ T_{iku} ,TR_{u} ,Max,Min \ge 0\;\forall i \in I,\forall k \in K,\forall u \in U $$
(22)

Equation (1) models the bi-objective function, where α and β are the weights that decision-makers can assign to either objectives, as follows. The first function corresponds to the benefit of each tourist, while the second function ensures equitable profit for the tourism group. Constraints (2) and (3) ensure the start and end of each route at point 0. Constraints (4) determine the use of the routes. Constraints (5) ensure that a POI belongs to a specific route. Constraints (6) and (7) ensure the flow and continuity of the route. Constraints (8) and (9) ensure that the maximum capacity of tourists in an route is not violated. Constraints (10) ensure that each tourist can belong to only one route. Constraints (11) states that, if an arc is included in a route, at least one tourist must be assigned. Constraints (12) and (13) ensures that only one transport mode is used per tourist in each arc. Constraints (14) and (15) calculates the arrival time at each POI. Constraints (16) calculate the route time of each tourist. Constraints (17) ensure that each tourist's time budget is not exceeded. Compliance with time windows is guaranteed by the Constraints (18). Constraints (19) obtain maximum and minimum profits from tourists. Constraints (20) ensure that the travel cost budget is not exceeded. Finally, the domain of decision variables is summarized in Constraints (21) and (22).

3.2 Consideration of CO2 emissions

Based on the mathematical model in Sect. 3.1, the level of CO2 emissions generated by transportation is added. Equation (23) is added as a third objective function. Emissions costs per unit of kilometers traveled in arc \(ij\) on each route \(k\) by transport mode \(l\) are considered and minimized. The following parameters are added: CO2 emissions per distance traveled in each arc \(ij\) and mode of transport \(l\) is denoted as \(CO2_{ijl}\), the cost of emissions is denoted by \(ce\), while the profit-to-cost conversion parameter is denoted as \(\varphi\). Similarly, the variable \(\varpi_{ijkl}\) is added, where \(\varpi_{ijkl}\) is a integer variable denoting the number of tourists traveling through the arc \(ij\) of route \(k\) using transport mode \(l\), \(\forall ij \in I:i \ne j,\forall k \in K,\forall l \in L\).

$$ \sum\limits_{i}^{I} {\sum\limits_{j}^{J} {\sum\limits_{k}^{K} {\sum\limits_{l}^{L} {CO2_{ijl} ce\varpi_{ijkl} } } } } $$
(23)

The complete model is presented next:

Multi-objective function

$$ {\text{Maximize}}\;Z = \alpha \sum\limits_{i = 1}^{I} {\sum\limits_{k}^{K} {\sum\limits_{u}^{U} {P_{iu} \delta_{iku} } } } \varphi + \beta ( - \varphi Max + \varphi Min) + \gamma \left[ { - \sum\limits_{i}^{I} {\sum\limits_{j}^{J} {\sum\limits_{k}^{K} {\sum\limits_{l}^{L} {CO2_{ijl} ce\varpi_{ijkl} } } } } } \right] $$
(24)

Subject to:

Constraints (2)–(22)

$$ \sum\limits_{u}^{U} {Z_{ijkul} = \varpi_{ijkl} } \;\forall ij \in I:i \ne j,\forall k \in K,\forall l \in L $$
(25)
$$ \sum\limits_{i = 1}^{I} {\sum\limits_{k}^{K} {\sum\limits_{u}^{U} {W_{ijku} = U} } } \;\forall j \in I^{0} $$
(26)
$$ \varpi_{ijkl} \ge 0\;\forall ij \in I,\forall k \in K,\forall l \in L $$
(27)

The multi-objective function (24) aims to maximize individual tourist profits, equity in group profit, and minimize the costs of emissions. Parameter γ is added to represent the weight given by decision-maker to the computation of CO2 emissions in the objective function. The model requires the use of previously explained Constraints (2)–(22). Constraints (25) calculates the number of tourists using mode of transport l in arc ij. Constraints (26) ensure that all tourists are assigned to the open routes. Finally, Constraints (27) define the domain of the variable.

4 An illustrative example

To develop an example that illustrates the problem and the solution of the mathematical modeling, we built some random instances that will be detailed later. An example of an application is presented for which an instance was considered with 21 POIs, including the starting point, six tourists, three route options, and three modes of transport (car, bus, and walking). The data were applied for the first and second modeling. The results obtained are shown in Figs. 1 and 2.

Fig. 1
figure 1

Paths obtained for the first model with hptoptw-j21b instances

Fig. 2
figure 2

Paths obtained for the second model with hptoptw-j21b instances

Initially, we show the route planning for the first model that considers individual, group benefits and transport mode selection. The selected solution corresponds to the criteria weights \(\alpha\) = 0.7; \(\beta\) = 0.3, which present a total of 15 POIs visited by tourists, individual benefits that add up to a score of 587.5 and 75 points of difference between the maximum and minimum score obtained (group profit equity). As shown in Fig. 1, two routes were generated. The first route corresponds to the sequence 0-5-19-10-20-18-14-0 assigned to tourists 1, 4, and 6. However, in POI 10, tourist 6 returns to the starting point while tourists 1 and 4 continue the tour by visiting the remaining POIs in the sequence. In the first arc (0–5), two transport modes are used (car and bus). Only the first mode of transport is used for the rest of the arcs. The second route assigned to tourists 2, 3, and 5 presents the sequence 0-11-6-8-3-7-12-9-4-0. In POI 12, tourist 3 returns to the starting point. Similarly, in POI 9, tourist 2 returns to the starting point. In this route, the three transport modes are used.

The planning of the second model that considers the minimization of CO2 emissions, presents a total of 13 POIs visited through 3 routes. The criteria weights of the selected solution correspond \(\alpha\) = 0.7;\(\beta\) = 0.2; \(\gamma\) = 0.1 with accumulated individual benefits of 428, group difference of 99 points and costs of $856.66 per CO2 emitted. The first route made by tourists 4 and 5 has the sequence 0-5-3-7-14-0, with the use of the three modes of transport. The second route made by tourists 1 and 3 has the sequence 0-1-19-10-20-12-9-0. However, in POI 12, tourist 3 returns to the starting point while tourist 1 continues the route sequence. Only transport modes 1 and 2 (car and bus) are used. Finally, the third route with sequence 0-8-16-4-0 is made by tourists 2 and 6. In POI 8, tourist 2 returns to the starting point. Three modes of transport were used on the route.

5 Computational experiments

Proposed models were coded using GAMS (General Algebraic Modeling System) and solved using the solver CPLEX on a computer with 20 GB of RAM, an Intel Core i7-8565U CPU@ 1.8 GHz, a 1-TB hard drive and a 64-bit operating system. The model was tested with a set of small randomly generated instances and programming a maximum execution time of two hours.

5.1 Description of test data sets

Due to the lack of testing instances according to our review of models that consider tourist preferences, in addition to the lack of small-sized instances for accurate testing of team orienteering problem with time windows models, we built for this work a set of randomly generated data sets based on the instances for TOPTW-MV presented in (Lin & Yu, 2017). A total of 14 instances was built as shown in Table 2. The unit costs of transportation were established in $20, $10, and $3 and budgets associated with transportation were determined for each tourist at $5000, $5000, $5000, $6000, $7000, $4000, $5000, and $5000. The maximum time budget \(TMax_{u}\) available for each tourist corresponds to 900, 1100, 800, 850, 900, 500, 900, and 600 min. The instances are available at the following webpage https://jrmontoya.wordpress.com/research/instances/.

Table 2 Overview of the three groups of instances

5.2 Model with heterogeneous preferences and transport mode selection

For the first model, we performed a total of 28 initial tests to obtain general solutions for each proposed objective (i.e., weights of \(\alpha\) = 1;\(\beta\) = 0 and \(\alpha\) = 0;\(\beta\) = 1 for each instance). With the assignment of values \(\alpha\) = 0;\(\beta\) = 1 to the objectives, no results were obtained within the running time limit. The individual benefit corresponds to the maximization of the sum of the benefits obtained for each tourist. The equity in group profit corresponds to the minimization of the difference between the maximum and minimum profit obtained on the route. The results for weights \(\alpha\) = 1;\(\beta\) = 0 are shown in Table 3. These results show the optimal value of the instance and will be a reference for comparison with the solutions obtained with the variations of the weights of each objective.

Table 3 Overall results for maximizing tourist profits

5.3 Considering CO2 emissions in the model

In addition, the CO2 emissions by mode of transport and passenger for each kilometer traveled are as follows: taxi 0.14886 kg CO2/passenger-km, bus average 0.10391 kg CO2/passenger-km, motorcycle 0.11314 kg CO2/km, and car average 0.18014 kg CO2/km. CO2 values associated with public transport (taxi and bus) are calculated on a single passenger basis according to the Department for Business, Energy and Industrial Strategy (2019). The value of car emissions is obtained based on the average emissions of vehicles in different market segments (only CO2 emissions are considered in our study, without considering other gases such as CH4 and N2O). This choice is in line with the statements of Bektaş and Laporte (2011) and Susanty et al. (2018); this last is the only one that includes these emissions in tourism, to our knowledge. So, in developing the model, the decision-maker divides the car emissions by five average occupants. The calculation allows obtaining the emissions per passenger transported, considering that the number of passengers affects fuel consumption.

The average costs per kg of CO2 are €0.02241 (SENDECO2, 2020). For the initial tests of the model we assigned the weights \(\alpha\) = 1; \(\beta\) = 0; \(\gamma\) = 0, \(\alpha\) = 0; \(\beta\) = 1; \(\gamma\) = 0 y \(\alpha\) = 0; \(\beta\) = 0; \(\gamma\) = 1 generating a total of 42 tests. With the assignment of values \(\alpha\) = 0; \(\beta\) = 1; \(\gamma\) = 1 to the objectives, the solution time limit was exceeded in the instances hptoptw-j11e, hptoptw-j16 [a, b, d, and e], and hptoptw-j21 [a, b, and d] with a 100% gap. The instances hptoptw-j21 [a and c] did not generate a solution for the assignments \(\alpha\) = 1; \(\beta\) = 0; \(\gamma\) = 0, besides the instances hptoptw-j16e and hptoptw-j21d exceeded the assigned time with a gap of 6.89% and 25.46%, respectively. Which confirms the computational complexity. The rest of the solutions detailing the optimal value of the instances is shown in Table 4.

Table 4 Overall results for maximizing benefits and minimizing emissions

5.4 Comparison of results

The complete set of results is available at https://jrmontoya.wordpress.com/research/instances/. Within the set of instances created, hptoptw-j11e, hptoptw-j16e, and hptoptw-j21d are the most complex due to the increase in the number of routes and tourists. We selected these instances to show the graphical behavior of the solutions as a function of the Pareto frontier for the first and the second model (see Fig. 3). In the first model, an inverse behavior of the two objectives is generated. As the individual benefit of tourists increases, the value of the difference between the profit of each tourist increases (i.e., the gap between the highest and lowest profit obtained by a tourist increases). Therefore, the equity in the group profit decreases. Figure 4 shows a two-by-two comparison of the three objectives of the hptoptw-j21a instance to detail the behavior of the solutions and the Pareto frontiers. In addition to the behavior described between the first and second objectives, the Pareto frontier obtained in the second model denotes a directly proportional behavior between the first and third objectives. As the individual profit increases, the amount of emissions increases. Therefore, the objectives conflict because the model aims to maximize the former and minimize the latter. A total of 476 tests was performed corresponding to 13 tests (combinations of the weights of the objectives) for the first model and 21 tests for the second model, for each of the 14 instances.

Fig. 3
figure 3

Pareto frontier of hptoptw-j [11e, 16e, and 21d] instances for both models

Fig. 4
figure 4

Two-by-two objective comparison for hptoptw-j21d

Comparing the results of the models, an influence on the solutions can be seen by the inclusion of the third objective. This objective forces the model to decrease the POIs visited, generally increasing the number of routes and the computational response time. Consequently, the benefits obtained by tourists are diminished, showing that the three objectives conflict. The selection process of the solutions in each instance is performed by the decision-maker. The solutions must meet the Pareto dominance criteria (non-dominated solutions) (Villegas et al., 2006). In Addition, for the selection of solutions, it was necessary to analyze criteria of the behavior of the routes in terms of number of nodes visited, routes enabled, gap, solution time, and transport modes selected in contrast to the solutions given in Sects. 5.2 and 5.3. The selected solutions compared to the optimal solutions of the first objective are shown in Table 5.

Table 5 Comparison of optimal and selected solutions

Based on these criteria, in the first model, solutions with \(\beta \ge\) 0.5 are selected because solutions with a gap of 0% are generally obtained. Additionally, we considered solutions that had almost the same number of POIs as the optimal solution. The selected solutions of the second model have a \(\alpha \ge\) 0.5 which ensures that the model gives greater importance to the individual profit objective. This objective is directly correlated to the number of POIs that compose the routes. Therefore, a higher weight of \(\alpha\) generates the inclusion of a greater number of POIs. \(\gamma \le\) 0.2 is selected to ensure that as many POIs as possible are included in the route. The lower the CO2 emissions, the fewer POIs are included. The aim is to allow tourists to visit as many POIs as possible without neglecting equity and minimizing emissions. In addition, priority is given to the solutions that present the smallest gap. The analysis is individual for each instance.

The selected solutions are compared with the optimal result of the solved model only for the first objective (maximize profit). Similarly, the number of POIs included in both solutions (selected and optimal) is observed. The average difference in individual profits is 14.35%. The 40% of the selected solutions have the same number of POIs as the optimal route. Some even have a higher number of POIs. However, a higher number of POIs does not ensure a higher profit because it depends on the importance of those POIs for the visiting tourist (heterogeneous preferences). The solution times also vary depending on the combinations of the objective weights and the complexity of the instance. For the hptoptw-j21[c and d] instances, the selected solutions did not reach the 0% gap in the maximum time set for model execution (120 min).

6 Concluding remark

Tourism requires the application and development of different models and strategies that allow for assertive decision-making for the economic recovery generated by the current COVID-19 pandemic. Likewise, these models must contribute to the achievement of sustainable development objectives within the framework of sustainable tourism. Unlike the literature cited in Table 1, via the development of this paper, the following main contributions were made. In the first instance, two models of tourism itinerary planning were developed to maximize profit, considering two major current tourism problems. Initially, the first model considers the group travel restrictions generated by the pandemic and the selection of modes of transport, through the scheme of flexibility in the organization of routes for a tourist group based on individual preferences and equity in group profit. Therefore, the model allows for the definition of the number of tourists per group and route as well as the type of transport to be used for the trip. Later, in the second model, in addition to the aspects of the first model, the environmental implications generated by transport are considered and minimized. The second contribution of this paper concerns that the model was tested with a set of small theoretical instances that were designed because of the lack of instances for models involving individual preferences in a group. The results show that the objectives of the models are in conflict, generating direct repercussions on the number of POIs visited; in none of the solutions generated was it possible to visit all the POIs. Additionally, with the inclusion of the objective associated with the minimization of CO2, visits were substantially reduced compared to the results of the first model and the initial optimal results. Similarly, an increase in the computational solution time was observed, which leads to the need to apply approximate methods in future work to plan for scenarios that present more than 21 POIs, 7 tourists, the option of opening 4 routes, and 3 modes of transport. Thirdly, the results show that to carry out the analysis of the best solutions it is not enough to examine the Pareto frontier, but also to consider the number of POIs visited, the computational times, the gap, the number of routes, and transport modes, i.e., a generalized analysis of each group of solutions obtained in each instance. However, it was noted that the best solutions are those where the weight of the individual benefit criterion is greater than others. Fourthly, the model presents a more realistic approach that makes it computationally complex and can be contrasted by applying it to real-life tourism route design problems.

Our paper contributes to the management of more sustainable tourism itineraries from an environmental and social perspective (minimization of CO2 emissions and the contribution of social criteria based on the equity of group profit). These criteria suggest real scenarios where the tourist prefers to travel in groups but at the same time has individual preferences on some POIs and is aware of the environmental impact. Other future research lines that can be directed correspond to the inclusion of the types of tourism in the planning of the itinerary according to the typology of products and tourist activities that are offered. In the work of McKercher (2016) the tourist products are grouped in five families which are human endeavors, personal quest, nature, business, and pleasure. Similarly, it is possible to consider the construction of routes that have more than one starting point (i.e., multiple deposits). Finally, in terms of the development of approximate methods, the use of hybrid methods may represent an improvement approach for the solutions of this type of problems (Hapsari et al., 2019), in addition to being a novel contribution to the literature due to their scarce application in the TTDP (Ruiz-Meza & Montoya-Torres, 2020).