Introduction

Demand forecasting, pricing, and mechanism design are all important research topics in transportation planning. Developments in information and communication technology enable us to collect user travel behavior data in detail and design complex systems for transportation services. The empirical study of new pricing systems and mechanism design is needed to validate whether these systems perform efficiently. Analysis of travel behavior under a novel transportation system is also important because such behavior usually varies across users.

To achieve policy goals, which means efficient use of transportation system, without monitoring the private value that users placed upon their time or upon transportation services, the tradable credit scheme model has been proposed (Verhoef et al. 1997; Viegas 2001; Yang and Wang 2011; Wu et al. 2012; Nie and Yin 2013). According to this scheme, credits were distributed by the government and users were then required to pay a certain number of credits to use transportation services and facilities. The credits could be traded among users and the price was determined by the market through free trading. By deciding the initial credit distribution, the government could achieve its policy goal. The major advantage of the tradable credit scheme was the achievement of efficient allocation by free trading without requiring information about user preferences.

In this study, we implemented a tradable permit system for bicycle-sharing service to clarify the usefulness and practical issues of a tradable permit system empirically. In particular, permits for using the bicycle-sharing service were distributed to users, and we attempted to allocate them efficiently through free trading. Our objective is validating tradable credit/permit schemes, which had been validated only by theoretical approach, by a social pilot program empirically. If the social pilot program show some practical issues of these schemes, then we can improve the scheme practically.

Our key contributions are as follows:

  • We implemented a tradable permit system in Yokohama city as a social pilot program. It is a new attempt even from a global perspective. Although there are many existing studies of transportation service auction by theoretical approaches, there is no empirical study of transportation service auction. The empirical transaction behavior is valuable.

  • We collected both transaction behavior in the transportation service auction and travel behavior by GPS every day in the social pilot program. These data can clarify the relationship between transaction of tradable permit and day-to-day travel behavior.

  • We did not use real money but a “virtual money” in the social pilot program. However, we designed the incentive compatible social pilot program by the induced value theory of experimental economics.

  • We discovered that a tradable permit system for a transportation service cannot necessarily achieve the efficient allocation that the theoretical analysis forecasts though the sample size of our pilot program is small. In particular, the transaction behavior of tradable permits is a type of reservation behavior under uncertainty and, in reality, users postponed their decision-making and changed their schedule without prior warning. We showed empirically that this behavior is one of the causes that the tradable permit system in our pilot program did not work.

  • By using a dynamic discrete choice model, we modeled the decision-making of tradable permit transaction, and we empirically clarified the important elements about transaction and future decision-making.

The rest of this paper is organized as follows. “Related work” section reviews the reservation system, the pricing system, and the tradable credit/permit scheme for transportation service. In addition, we review the dynamic discrete choice model for describing dynamic decision-making. “Pilot program and Data” section describes the pilot program in Yokohama city in detail and summarizes our data. “Basic analysis” section presents the basic analysis to clarify the relationship between transaction behavior and travel behavior. “Dynamic discrete choice modeling for choice dynamics” section formulates the dynamic discrete choice model for choice dynamics. In “The estimation result of the dynamic discrete choice model and discussion” section, we estimate the parameters of the dynamic discrete choice model, and sensitivity analysis is described in “Sensitivity analysis” section. Finally, “Conclusions” section concludes the paper.

Related work

Let us begin by reviewing the reservation and tradable permit system of modes of transportation for efficient use. The transportation reservation system (for modes such as railway service and airlines) is not new. Akahane and Kuwahara (1996) analyzed the motorway reservation system using a stated preference survey. Wong (1997) described a reservation system for improving the road performance of highways. Teodorovic et al. (2005) and Edara and Teodorovic (2008) proposed a highway space inventory-control system. Yang et al. (2013) analyzed the reservation system for parking spaces for morning commute. Liu et al. (2015) analyzed the efficiency of highway reservation system from the point of user heterogeneity in values of travel time and schedule penalties.

On the other hand, there have been studies that aim to achieve efficient allocation by bidding an individual’s own value or trading his/her own rights and permits for a transportation service. One such a scheme is the tradable credit scheme (Verhoef et al. 1997; Viegas 2001; Yang and Wang 2011; Wu et al. 2012; Nie and Yin 2013) and another is the tradable permit system (Akamatsu 2007; Wada and Akamatsu 2013). In tradable mobility credit scheme, the government sets a predetermined congestion reduction goal and tries to achieve it by creating a market for mobility credits. And the government issues mobility credits to all eligible travelers and determines how many credits are to be charged on each road. In tradable permit scheme, the government issues the permits. The number of permits of each road link is determined by the road capacity of each link. The tradable permit scheme does not generate the road congestion because the number of permits is less than the road capacities. And users bid the tradable permits directly in the market. Although there is a subtle difference between two schemes, both these approaches aim to solve problems with user heterogeneity and information asymmetry through market mechanisms. Compared to a reservation system, these schemes can allocate infrastructure capacity efficiently and in detail, but they force users to understand the complex underlying mechanisms. Additionally, there have been no empirical studies to validate these system. This study clarifies the issues surrounding tradable permit system. Nie (2012) and He et al. (2013) both consider the impact of transaction costs on tradable permit schemes. They theoretically show that the consideration of transaction costs makes the outcomes of the system depend on the market arrangement, particularly the initial allocation of mobility credits. Shirmohammadi et al. (2013) consider the impact of uncertainty on tradable permit schemes theoretically. In this paper, we focus on the empirical study of the impact of both transaction cost and the uncertainty on tradable permit schemes.

Then, let us review the discrete choice model for dynamic decision-making because reservation behavior and bidding behavior for transportation services are future decision-making and the timing of the decision-making is also important. Discrete choice model has been used widely in travel demand analysis, marketing, and other areas, and it can explain the choice result on the basis of random utility theory from observed data. Discrete choice models have been generalized and extended. Although the original logit model (McFadden 1974) is easy to apply, it has problems with its error structure. Hence, other models, such as the Probit (Daganzo et al. 1977), Nested Logit (Ben-Akiva 1973), generalized extreme value (Vovsha 1997; Ben-Akiva and Bierlaire 1999; Wen and Koppelman 2001), and Mixed Logit (McFadden and Train 2000) models, have been developed. However, the typical discrete choice model is a static model that cannot describe the dynamics of choice behavior along the time axis. Although static discrete choice models are useful for learning the final choice result, they are inadequate for describing the timing of the choice.

Over the past few decades, a considerable number of studies have been made on the inter-temporal decision-making process in economics. These approaches are mainly based on a Markov decision process (MDP). MDP provides a broad framework for modeling sequential decision-making under uncertainty. In labor economics and industrial organization, dynamic discrete choice models have been used for empirical study; this is a dynamic discrete choice model that includes a Markov decision process. Early studies of dynamic discrete choice model were made by Gotz and McCall (1984), Miller (1984), Pakes (1986), Rust (1987), Wolpin (1984, 1987), Das (1992) and Berkovec and Stern (1991). The subjects of those studies included participation behavior in the army, updates of patents, bus engine-replacement, and childbirth decision-making. These studies had timespans ranging from a few months to a few years, and they modeled long-term decision-making. In addition, many early studies applied binary choice settings; Rust (1996) was the first to apply multinomial choice setting. There exist survey papers on dynamic discrete choice models, such as those proposed by Eckstein and Wolpin (1989), Puterman (1990), Rust (1994), Rust (1996), Aguirregabiria and Mira (2010).

The major difference between a static discrete choice model and a dynamic discrete choice model is the latent variable for introducing choice probability. Whereas static discrete choice models use the difference between utility functions of alternatives as the latent variable, dynamic discrete choice models use the difference between value functions as the latent variable. Value function is expressed by the sum of the present utility and the present discounted future utilities. Its expression is called the Bellman equation (Bellman 1957). For example, in a static discrete choice model, utility function of alternative i is \(U_i\). And we assume that each individual maximizes the utility function. In a dynamic discrete choice model, we assume that each individual maximizes value function \(v(\cdot )\) and it is expressed as \(v(s_{it}) = U(a_t, s_{it}) + \beta \int v(s_{i,t+1}) dF(s_{i,t+1}|a_t, s_{it})\), where \(v(s_{it})\) is the value function of state \(s_{it}\), \(U(a_t, s_{it})\) is the present utility of action \(a_t\) and state \(s_{it}\), \(a_t\) is the action in time period t, \(s_{it}\) is the state in time period t, \(F(\cdot )\) is the transition probability distribution, and \(\beta\) is the discount factor. Hence, the value function of a dynamic discrete choice model is the sum of the present utility (it is the utility function of a static discrete choice model) and discounted future utility. A static discrete choice model can only express the choice probability at a certain point, whereas a dynamic discrete choice model can express the choice probability of every time period and can describe individual’s decision-making considering future decision-making under the uncertainty such as a change of a plan.

For considering future decision-making, dynamic models need to solve dynamic programming problems to compute value functions, and thus have larger computational loads than static models. In travel behavior analysis, only a few attempts have so far been made at dynamic discrete choice modeling because of the computational loads. In this paper, we use a dynamic discrete choice model for describing reservation behavior and bidding behavior for transportation services as future decision-making. And we clarify the important elements of the decision-timing empirically.

Pilot program and data

Implementation of bicycle-sharing auction system

First, we briefly explain the outline of our pilot program. The bicycle-sharing auction system implemented for this study consists of 4 subsystem: (1) a probe person system; (2) a bicycle-sharing system; (3) a tradable permits system; and (4) an eco point system. As Fig. 1 shows, the 4 subsystems are mutually connected. Travel and activity data of research participants in this pilot program are collected by the probe person system. To use the bicycle-sharing system, participants need bicycle-sharing permits from the tradable permit system and to make a reservation at the bicycle-sharing system. In the payment system for the pilot program, we use eco-points as a virtual currency instead of real money. The participants are given eco-points for their daily eco-friendly travel behavior, such as public transport use or walking trips logged by the probe person system. When participants use their bicycle-sharing permits and make a reservation at the bicycle-sharing system, the bicycle-sharing system sends them a passcode for each bicycle. Hara and Hato (2010) analyzed the same data by basic analysis and static modeling.

Fig. 1
figure 1

Total system of this study

The probe person system (Asakura and Hato 2004; Hato and Kitamura 2008) is a method for obtaining travel diary and positioning data in detail using GPS mobile phone. Users operate the mobile phone when they depart and arrive. An application on the mobile phone records data about trip OD, travel mode, trip purpose, time of departure, time of mode change, time of arrival, and location data during the trip. Using a probe person survey, we can collect travel behavior data in greater detail than a paper-based survey.

Then, we introduce a transportation eco-point system. On the basis of the travel behavior of each participant, which is observed by the probe person system, we evaluate the eco-friendliness of each user’s behavior. If participants use public transport, walk long distances, or ride with other people, we give them eco-points. The participants can use eco-points for the bicycle-sharing and tradable permit system, as described previously. Transportation eco-points are not directly exchangeable for real money; however, all participants can obtain survey rewards depending on their collection of eco-points at the end of this survey. For example, the user who has the most eco-points at the end of the survey can obtain the highest survey reward. On the other hand, the user who has the least eco-points can get the least reward. This system is inspired by the induced value theory of experimental economics (Smith 1976). Therefore, our pilot program gives the participants the incentive to regard transportation eco-points as real money despite transportation eco-points is a “virtual money”.

Now let us explain our bicycle-sharing system. Our pilot program implemented two bicycle-sharing systems in the Minato Mirai 21 (MM) area, Yokohama city. MM area is a waterfront urban district in central Yokohama, which contains many offices and commercial facilities. This area is 5 km × 2 km area. One bicycle-sharing system is the permit system in which users need permits for each time slot. There are two time slots for bicycle-sharing: morning (9:30–13:00) and afternoon (13:30–19:00) of each day. Users can go anywhere by bicycle within the time slot. The other system is a first-come first served system in which users can use the bicycle-sharing system if there is no capacity shortage. Users need make a reservation by mobile phone and they can choose their usage time freely. The two bicycle-sharing systems have been implemented separately and the quality of bicycle is different; that is, the bicycles of the permit system are high quality and those of the first-come first-served system are of normal quality, for differentiation. In both bicycle-sharing systems, it is necessary for users to return the sharing-bicycle to the sharing port in MM area.

The tradable permit system is the main subject of our study. We implemented two transaction protocols, a single-auction protocol and a double-auction protocol, for analyzing the effect of auction protocols on user behavior. In single auction, a seller is the administrator of the bicycle-sharing system only and buyers are users (Fig. 2). Each item is a bicycle-sharing permit of a time slot in a day. For example, the system treats the permit of Dec 10th p.m. and the permit of Dec 11th p.m. as different goods. Users have the choice of bidding when they want. As the auction mechanism is set to a second price auction, the highest bidder wins the auction and pays the second-highest price. Vickrey (1961) proves that it is the dominant strategy for bidders to bid their true willingness to pay in the second price auction. In the single auction system, participants can bid the permit by their transportation eco-points. If the participant win the auction, then he/she get the permit and pay the price of the permit by eco-points. In this way, permits are distributed according to each user’s willingness to pay.

Fig. 2
figure 2

The image of a single auction system

Single auction achieves an efficient allocation of resources but it is not fair because the users who have little eco points cannot win the auction. Thus, some researchers have suggested a double-auction mechanism as an alternative (for example, Friedman and Rust 1993). A double auction is a process of buying and selling goods when potential buyers submit their bids and potential sellers simultaneously submit their ask prices to an auctioneer. In the permit trading market, all users are buyers and at the same time sellers. In a double auction (Fig. 3), all tradable permits are allocated to all users at random for achieving fairness. In the example of Fig. 3, the initial allocation of user A is {Nov 16th a.m., Nov 20th a.m., Nov 24th a.m., Nov 25th p.m., Dec 2nd p.m.}. Then, if users do not intend to use permits, they can offer them for sale. In the example of Fig. 3, user A offers the permits of Nov 16th a.m. and Nov 20th a.m. Users who want the permits can buy them in a tradable permit system and the permits are reallocated based on the outcome of the trade. For example, when user B wants the permit of Nov 20th a.m., he/she bids the permits in tradable permit market. And if he/she wins the auction, he/she gets the permit and pay the price to user A. In the double auction, auction mechanism is also the second price auction and the currency for trading is transportation eco-points. If no one buys the permits which are sold in the tradable permit market by the expiry date, then the permits are spoiled at the expiry date. In the example of Fig. 3, the permit of Nov 16th a.m. is spoiled. This is a phase for achieving efficient allocation. This pilot program implemented both single-auction and double-auction systems. This paper focuses on the user behavior in a double auction mainly.

Fig. 3
figure 3

The image of a double auction system

Data

The data in this study was collected from the “Yokohama Mobility Design Survey 2008” using the system noted above. Table 1 shows the overview of the data. The surveillance period was from 10th Nov. to 24th Dec., lasting 44 days. The activity and trips of participants were recorded by GPS mobile phone and web diary in detail. The number of participants was 118, and only 19 people participated in pilot program of the bicycle-sharing service. In this study, we focus on the data of these participants and double-auction period (19 respondents, 23 days). Although the sample size in pilot program is small, there is no study to implement the pilot program of transportation service auction and to observe the travel and transaction behavior for more than 1 month. From the observation and analysis, we can discover the problem of transportation service auction and the important factor that affect users’ transaction behavior. The male-to-female ratio in this data is 5:4 and the ratio of people in their 20, 30, and 40 s is 2:7:1.

Table 1 Overview of pilot program

The double auction took place from 14th Nov. to 2nd Dec. and single auction took place from 3rd Dec. to 24th Dec. The auction site is linked from each participant’s web diary page. At this auction site, participants could buy or sell their permits freely by presenting their sales information and purchase intention. In these transactions, they used transportation eco-points as virtual money. In the double auction, all tradable permits were randomly allocated to all users. In the period of the double auction, the number of total capacity of permits is 114. Each participants had 5 or 6 permits initially. To trade their permits and transportation eco-points, each participants used a permit auction website that operated like eBay.

Basic analysis

The results of the auction transaction and research question

First of all, we show the behavioral patterns in the bicycle-sharing permit auction. In the single auction setting, there are two classes of users. One class is the users who buy permits in auction. The other class is the users who do not buy it. Figure 4 shows about 20% of users participated in the auction. On the other hand, there are four classes of users in the double auction setting. they are “sold the permits they had and purchased permits they did not have”, “only purchased permits they did not have”, “only sold the permits they had”, and “did not participate at all during the period”. Figure 4 shows about 40% of users participated in the auction in any way and about 20% of users both bought and sold permits on the auction web site. In the surveillance period, there were 24 transactions in double-auction setting and 11 transactions in single-auction setting. Only 5 users, whose user IDs were ym205, ym209, ym210, ym215, and ym219, made 25 transactions, making up about 70% of all transactions. This result indicates that a few users were active and most users are inactive in auction transactions of this pilot program.

Fig. 4
figure 4

Ratio of auction participation

In the single-auction setting, there were six biddings, which were all closed. Four permits were holiday permits and had execution prices of over 500 eco-points. There was no bidding-up in any bidding. On the other hand, there were 18 selling intents and 6 buying intents but closed transaction in only one in the double-auction setting. Sellers intended to sell their permits at over 100 eco-points, and some sellers differentiated permit prices, for example, by setting weekday permits at 100 points, Saturday permits at 300 points, and Sunday permits at 500 points. This result indicates that participants recognized that the value of bicycle-sharing permits depended on the characteristics of the permits. The closed transaction was for the permit for the morning on a holiday (24th Nov) and the execution price was 100 points. The buyer showed the buying intent of the permit in the auction site and the seller coupled with the request. The permits of remaining five buying intents expired because the users with the permits neither use nor sell them. This is the problem of mismatching for permit allocation; one cause of the problem is that the number of participants in the pilot program was small and the auction market was thin. Another reason was the difficulty of future decision-making under uncertainty about a user’s future schedule. Judging from the above, we found that the bicycle-sharing auction mechanism could not achieve efficient allocation in practice, unlike in the case of the theoretical analysis, because of several realistic factors arising from the implementation of the pilot program.

Transaction analysis focused on decision-making timing

In this pilot program, all permits in the double-auction setting were randomly allocated to all participants. On the first day of pilot program, therefore, each participant has their own permits. The permits were valid on various days, with a maximum of 22 days and a minimum 4 days before permit expiry.

Fig. 5
figure 5

Result of total decision-making of each day

Figure 5 shows when decisions were made about each permit. First of all, this figure shows that almost 80% of the permits were wasted because the participants neither intended to sell them nor use them. In the case of using permits, almost all participants made the decision on the day before permit expiry, and the earliest participant made their decision 8 days before. The ratio of permit selling also increased as the permit expiry approached. Figure 5 indicates that it was difficult for participants to make decisions about their permit until the deadline was near.

Fig. 6
figure 6

The examples of decision-making timing. Left is user ID: ym205 and Right is user ID: ym210

As an illustrative example, Figure 6 shows the transaction behaviors of the users with the IDs ym205 and ym210. ym205 (left) made a decision about their permit more than a week in advance. On the other hand, ym210 (right) booked for use or sold their permits around the previous day. These results indicate that there is heterogeneity in both future schedule uncertainty and attitude toward bicycle-sharing permits, and that these effects are important to figure out for efficient allocation.

Analysis of the time of active decision-making

Now, we analyze the time of active decision-making to clarify the factors affecting future decision-making, such as reservation behavior and transaction behavior. Figure 7 shows the time of active decision-making in a tradable permit auction and there are two peaks: corresponding to lunch hour and evening. From probe person data, the places where the participants made their decisions were their homes (13 times) and work places (5 times). In the case of work place, the time of active decision-making was noon (4 times) and 14:00 (once). And this suggests that people actively decided what to do with their permits in their free time. This result is intuitive and indicates that active decision-making about tradable permits needs disposable time, which is the time remaining after the mandatory activity time is subtracted from the daytime.

Fig. 7
figure 7

Time of day of auction transaction

We hypothesize that disposable time affects future decision-making. Probe person data show the location of each individual and sojourn time at each stop, but they cannot show activity in detail. Therefore, we define days upon which users stay at home for over 10 hours as free-time days. Figure 8 shows the relationship between the day-to-day decision-making and number of free-time days; it indicates that a large number of free-time days decreases the number of decisions to “do nothing” and increase the number of decisions to “use” and “sell”. This result suggests that the availability of free time affected decision-making about future schedules and future travel behavior.

Fig. 8
figure 8

The relationship between the number of free-time days and decision-making

Fig. 9
figure 9

The relationship between the number of MM area visits and decision-making

Figure 9 shows the relationship between decision-making about permits and the cumulative number of MM area visits. When the cumulative number of MM area visits is 0, the ratios of “do nothing” and “sell” to “use” are high. This tendency indicates that participants with low numbers of MM area visits tended not to use their permits and think of trading their permits to other users from the very beginning. On the other hand, the participants whose number of MM area visits was more than 5 decided to “use”, indicating that the number of MM area visits affected the use of permits.

Figures 8 and 9 show that the decision-making of each participant depended not only on static elements such as the values of the permits and the socio-demographic characteristics but also dynamic elements such as day-to-day behavior and remaining days until permit expiry.

Fig. 10
figure 10

Time series of auction transactions and MM area visits

Analysis considering activity patterns

Now, we analyze the relationship between auction transactions and MM area visits. Since we implemented the tradable permit system of bicycle-sharing in the MM area in this pilot program, the users who had no scheduled visits to the MM area or could not visit the MM area because of time pressures tended to have an increased intention of selling permits. Figure 10 shows the relationship between transaction behavior and MM visits without work purpose. The days marked in yellow are weekends and holidays. The figure shows that the intention to sell permits increased on weekdays, and the intention of purchasing permits increased on weekends and holidays. The result shows that the activity patterns affect auction transactions.

Fig. 11
figure 11

The difference in activities between weekdays and weekends

Figure 11 shows the number of MM visits and the number of auction transactions per day. On holiday, the number of MM visits was over twice its weekday value, and there were many opportunities for bicycle-sharing on holidays. There is a slightly greater intention to sell permits on weekdays than on holidays; on the other hand, there is a significant intention to purchase permits oh holidays.

The average asking price of weekday permits in the double auction was 100 points, whereas the asking price of weekend permits was 240 points. On the other hand, the average price from a buyer was 100 points on both weekdays and weekends. As we set the threshold of the bidding price to 500 points in a single auction, the price of all permits that users purchased was 500 points.

The number of purchase intentions per day in the double-auction setting was 0.083 on weekdays and 0.714 on holidays. The number of biddings per day in the single-auction setting was 0.133 on weekdays and 0.571 on holidays. Though the tendency on weekdays is different from that on holidays, the type of auction did not affect demand for the permits because t-test reports no significant difference between the average demand of single auction and that of the double auction.

Fig. 12
figure 12

The relationship between MM area visits and distance from home

Analysis considering user home location and mode of transport

Next, we analyze the relationship between the number of MM area visits and the locations of par residences. Figure 12 shows that the number of MM visits decreased with the increasing distance from home, both for all-purpose visits and non-work-related visits.

Fig. 13
figure 13

The relationship between auction transaction and distance from home

In a similar way, Figure 13 shows that the relationship among auction transactions, bicycle use, and participants’ place of residence. It indicates that (1) neighborhood users used the bicycle-sharing system frequently and tended to sell the permits when they were not needed; (2) users who lived a moderate distance from the MM area tended to keep their permits and use bicycle-sharing most frequently out of the 3 groups; (3) users who lived a long distance from the MM area rarely used bicycle-sharing and tended to sell their permits.

We examined user tendency for bicycle-sharing on the basis of these results. As the neighborhood users lived near the MM area and had many opportunities to use bicycle-sharing but also had the option of walking or using private bicycles for visiting the MM area, bicycle-sharing was one of several options for them. As middle-distance users visited the MM area more frequently than long-distance users and the MM area was too far to go by private bicycle, they could benefit from bicycle-sharing. Long-distance users did not use bicycle-sharing frequently because they did not visit the MM area frequently. From the above discussion, we think that middle-distance users attached more value to bicycle-sharing.

Fig. 14
figure 14

The mode share sorted by distance between home and the MM area

This discussion can be supported by following fact. Figure 14 shows the shares of various modes of transportation to the MM area sorted by distance from home. It does not include neighborhood users because their home is within MM area. For both weekdays and weekends, public transportation such as trains and buses were used by the largest share of middle- and long-distance users. This indicates that middle- and long-distance users who visited the MM area by public transport lacked mobility in the MM area and had restriction to travel freely according to their needs, so bicycle-sharking could generate latent travel demand among these people.

Dynamic discrete choice modeling for choice dynamics

The formulation of the dynamic discrete choice model

In this section, we formulate the dynamic discrete choice model for describing choice dynamics. This study focuses on the transaction behavior of user own tradable permits. The objective of our model and analysis is to clarify the contributing factors to the choice result and choice timing.

We index individuals by i. Time is discrete and indexed by t. Each individual i can use a transport service only with a tradable permit (indexed by r). All permits are allocated to individuals at random. Each permit has a valid time slot for accessing the transport service and the permit is invalid outside of the valid time slot. In this situation, the choice set of each individual is \(J= \{use, sell, do nothing \}\). We notate the choice set \(J=\{1, 2, 3\}\). We assume that each individual i makes a decision about their own permits r every day between the day permits are distributed to the day until when their validity expires. The period in which individual i makes a decision about his/her permit r is \(t=\{1,2,...,T_{ir} \}\).

The state of the world at period t for individual i is determined by the state vectors \(\varvec{s}_{it}\) and the choice result \(a_{it}\). The choice result \(a_{it}\) indicates the choice of the individual i at time period t. On the other hand, the state vector \(\varvec{s}_{it}\) is the group of variables which indicate the state of the individual i. In particular, the state vector is consist of \(y_{it}\), \(\varepsilon _{it}\), \(r_{it}, c_{it}, m_{it}, and b_{it}\). \(y_{it}\) denotes the state of the permit of the individual i before the choice of time period t. \(\varepsilon _{it}\) denotes the unobserved error term. The set of state variable \(y_{it}\) is the same as J and the set of state variable \(a_{it}\) is also the same as J. As the state variables which describe the state of the individual i, \(r_{it}\) is the remaining number of permit-valid days at time t, \(c_{it}\) is the cumulative number of days on which the individual i has large disposable time, which is the time remaining after the mandatory activity time is subtracted from the daytime, \(m_{it}\) is the cumulative number of days on which the individual i visits the area where the transportation service is implemented, and \(b_{it}\) is the cumulative number of days on which the individual i uses their bicycle. Hence, \(\varvec{s}_{it} = (y_{it}, \varepsilon _{it}, r_{it}, c_{it}, m_{it}, b_{it})\). The choice set J can also change depending on the choice result of the day before. Once the individual i decides to use or sell the permit r, the choice set after the day is limited to the choice result. Hence, if the individual i postpones their decision about the permit, the individual will face the choice situation on the next day. This relationship can be represented by

$$\begin{aligned} J_{i,t+1}= {} \{1, 2, 3 \} \quad if \ \ a_{it} = 3 \\ J_{i,t+1}= {} \{j_{it}\} \qquad otherwise . \end{aligned}$$
(1)

The relationship between the state variable \(y_{it}\) and the choice result \(a_{it}\) is shown by Fig. 15.

Fig. 15
figure 15

The relationship between the state variable \(y_{it}\) and choice result \(a_{it}\)

Individuals’ preferences over possible sequences of states of the world can be represented by a utility function \(\sum _{j=t}^T \beta _i^j U(a_{i,t+j}, \varvec{s}_{i,t+j})\), where \(\beta \in (0,1)\) is the discount factor and \(U(a_{it}, \varvec{s}_{it})\) is the current utility function. As each individual faces uncertainty of future state variables, their belief in particular future state variables can be represented by the Markov transition distribution function \(F(s_{i,t+1}|a_{it}, \varvec{s}_{it})\). In every period t, the individual observes the vector of state variables \(\varvec{s}_{it}\) and chooses their action \(a_{it} \in J\) to maximize their expected utility

$$\begin{aligned} E \left( \sum _{j=0}^{T-t} \beta _i^j U(a_{i,t+j}, s_{i,t+j}|a_{it}, \varvec{s}_{it})\right) . \end{aligned}$$
(2)

This is the individual’s dynamic programming (DP) problem. Let \(V(s_{it})\) be the value function of the DP problem. By Bellman’s principle of optimality, the value function can be obtained using the recursive expression:

$$\begin{aligned} V(\varvec{s}_{it}) = \max _{a \in A} \left\{ U(a, \varvec{s}_{it}) + \beta _i \int V(\varvec{s}_{i,t+1}) dF(\varvec{s}_{i,t+1}|a, \varvec{s}_{it}) \right\} . \end{aligned}$$
(3)

For simplicity, we define that \({\varvec{z}}_{it} = (r_{it}, c_{it}, m_{it}, b_{it})\). Then we can distinguish the subsets of the state variable \(\varvec{s}_{it} = (y_{it}, \varepsilon _{it}, {\varvec{z}}_{it})\). We assume that the error term follows an independent and identically distributed (i.i.d.) extreme value distribution. When \(u(a_{it}, y_{it}, {\varvec{z}}_{it})\) is the utility function of each alternative \(a_{it}\), the value function \(v(a_{it}, y_{it}, {\varvec{z}}_{it})\) of each alternative \(a_{it}\) is as below:

$$\begin{aligned} v(a_{it}, y_{it}, {\varvec{z}}_{it})&= {} u(a_{it}, y_{it}, {\varvec{z}}_{it}) + \beta \sum _{y, z} \bar{V}(y_{i,t+1}, {\varvec{z}}_{i,t+1}) f_{y,z} (y_{i,t+1}, {\varvec{z}}_{i,t+1}|a_{it}, y_{it}, {\varvec{z}}_{it})\\&+\,\varepsilon _{it}, \end{aligned}$$
(4)

where \(\bar{V}(y_{it}, {\varvec{z}}_{it}) \equiv \int V(y_{it}, {\varvec{z}}_{it},\varepsilon _{it}) dG_{\varepsilon }(\varepsilon _{it})\).

Using the value function of each alternative, the choice probability of the dynamic discrete choice model follows the same form as the static discrete choice model:

$$\begin{aligned} P(a_{it}|y_{it}, {\varvec{z}}_{it}, \theta ) = \frac{\exp (v(a_{it}, y_{it}, {\varvec{z}}_{it}))}{\sum _{J} \exp (v(a^{'}_{it}, y_{it}, {\varvec{z}}_{it}))}. \end{aligned}$$
(5)

In addition, we assume conditional independence between the error and transition distributions:

$$\begin{aligned} P(\varepsilon _{i,t+1}, y_{i,t+1}, {\varvec{z}}_{i,t+1}|\varepsilon _{it}, y_{it}, {\varvec{z}}_{it})=&g(\varepsilon _{i,t+1}|x_{i,t+1}) \cdot f_y(y_{i,t+1}|a_{it}, y_{it}) \cdot \nonumber \\&f_r(r_{i,t+1}|r_{it}) \cdot f_c(c_{i,t+1}|c_{it}) \cdot f_m(m_{i,t+1}|m_{it}) \cdot f_b(b_{i,t+1}|b_{it}). \end{aligned}$$
(6)

By the conditional independence between the error term and transition probability, we can identify the solution of DP problem as the expectation of the value function over the error distribution \(\bar{V}(x_{it})\equiv \int V(x_{it},\varepsilon _{it}) dG_{\varepsilon } (\varepsilon _{it})\), as described above.

As state variables in this study, we define variables that do not change for time t by \(x_i\) and variables that shift for time t by \(r_{it}, c_{it}, m_{it}, b_{it}\). We assume the Markov transition function of these state variables as given below:

$$\begin{aligned} r_{i,t+1} = r_{it} - 1 \end{aligned}$$
(7)
$$\begin{aligned} c_{i,t+1} = f_c(c_{it}) \end{aligned}$$
(8)
$$\begin{aligned} m_{i,t+1} = f_m(m_{it}) \end{aligned}$$
(9)
$$\begin{aligned} b_{i,t+1} = f_b(b_{it}) \end{aligned}$$
(10)
$$\begin{aligned} r_{i1} = T_{ij},\ c_{i1}= & {} 0,\ m_{i1} = 0,\ b_{i1} = 0 , \end{aligned}$$
(11)

In this study, we assume the structure of the transition function \(f_c(c_{it})\), \(f_m(m_{it})\), and \(f_b(b_{it})\) as below:

$$\begin{aligned} f_c(c_{it}) \equiv \frac{\exp (V_c)}{1 + \exp (V_c)} + c_{it} \end{aligned}$$
(12)
$$\begin{aligned} f_m(m_{it}) \equiv \frac{\exp (V_m)}{1 + \exp (V_m)} + m_{it} \end{aligned}$$
(13)
$$\begin{aligned} f_b(b_{it}) \equiv & {} \frac{\exp (V_b)}{1 + \exp (V_b)} + b_{it}, \end{aligned}$$
(14)

where \(V_c, V_m, V_b\) denote the observed utility terms of each behavior. These equations indicates whether the individual i chooses each behavior at time t, as well as the total number of days of each behavior. Whether the individual i chooses each behavior at time t can be represented by a binary logit model.

The setting of utility function

Then, we estimate the parameters of value functions and transition functions from observed data. Let \(\theta\) be the parameters of utility functions. In accord with above analysis, we set the utility function of each alternative.

$$\begin{aligned} u_{1,t} = \theta _{holiday} x_{holiday_i} + \theta _{dist} \log (x_{dist_i}) + \theta _{mm} m_{it} \end{aligned}$$
(15)
$$\begin{aligned} u_{2,t} = \theta _{weekday} x_{weekday_i} + \theta _{am} x_{am_i} + \theta _{disptime} c_{it} + \theta _{bicycle} b_{it} \end{aligned}$$
(16)
$$\begin{aligned} u_{3,t} = \theta _{sch} \log (r_{it} + 1), \end{aligned}$$
(17)

where \(x_{holiday_i}\) is the dummy variable of whether the day of the permit-validity is a holiday, \(x_{weekday_i}\) is the dummy variable of whether the day of the permit-validity is a weekday, \(x_{dist_i}\) is the distance (km) from the participant’s home to the MM area and \(x_{am_i}\) is the dummy variable of whether the time of the permit-validity is in the morning. These variables are constant with regard to temporal transitions. The state variables \(m_{it}\), \(c_{it}\), \(m_{it}\), and \(r_{it}\) are the variables that shift for time t and are defined by Eqs. (7), (8), (9) and (10).

Next, we assume that each individual has his/her discount factor of future utility. The reason for different discount factors is that the discount factor is affected by individual attributes. Thus, we model the individual discount factor \(\beta _i\) as a logistic function as

$$\begin{aligned} \beta _i = \frac{\exp (\theta _{hm} x_{hm_i} + \theta _{40} x_{forty_i})}{1 + \exp (\theta _{hm} x_{hm_i} + \theta _{40} x_{forty_i})}, \end{aligned}$$
(18)

where \(x_{hm_i}\) is the dummy variable for a full-time homemaker and \(x_{forty_i}\) is the dummy variable for someone in their 40s.

We set the Markov transition probability of the state variables as the binary logit model given below:

$$\begin{aligned} u_c = \theta _{holiday} x_{holiday_{it}} + \theta _{worker} x_{worker_i} + ASC_c \end{aligned}$$
(19)
$$\begin{aligned} u_m = \theta _{homeMM} x_{homeMM_i} + \theta _{officeMM} x_{officeMM_i} + ASC_m \end{aligned}$$
(20)
$$\begin{aligned} u_b = \theta _{holiday} x_{holiday_{it}} + \theta _{hm} x_{hm_i} + ASC_b \end{aligned}$$
(21)
$$\begin{aligned} P(c_{i,t+1} = c_{it} + 1) = \frac{\exp (u_c)}{1 + \exp (u_c)}, \ P(c_{i,t+1} = c_{it}) = \frac{1}{1 + \exp (u_c)} \end{aligned}$$
(22)
$$\begin{aligned} P(m_{i,t+1}= & {} m_{it} + 1) = \frac{\exp (u_m)}{1 + \exp (u_m)}, \ P(m_{i,t+1} = m_{it}) = \frac{1}{1 + \exp (u_m)} \end{aligned}$$
(23)
$$\begin{aligned} P(b_{i,t+1} = b_{it} + 1) = \frac{\exp (u_b)}{1 + \exp (u_b)}, \ P(b_{i,t+1} = b_{it}) = \frac{1}{1 + \exp (u_b)}, \end{aligned}$$
(24)

where \(x_{holiday_{it}}\) is the dummy variable for whether the day t is holiday for the individual i, \(x_{worker_i}\) is the dummy variable for being a full-time worker, \(x_{homeMM_i}\) is the dummy variable for whether the individual i’s home is in the MM area, \(x_{officeMM_i}\) is the dummy variable for whether the individual i’s office is in the MM area and each ASC is an alternative specific constant. The initial values of state variables are \(m_{i0}=0\), \(b_{i0}=0\), \(c_{i0}=0\).

As we assume that the choice set can be limited by Eq. (1) and the error term follows i.i.d. extreme value distribution, the value function of each alternative at time t can be written as

$$\begin{aligned} v_{1,t} = u_{1,t} + \beta _i v_{1,t+1} \end{aligned}$$
(25)
$$\begin{aligned} v_{2,t} = u_{2,t} + \beta _i v_{2,t+1} \end{aligned}$$
(26)
$$\begin{aligned} v_{3,t} = u_{3,t} + \beta _i \log (\exp (v_{1,t+1}) + \exp (v_{2,t+1}) + \exp (v_{3,t+1})), \end{aligned}$$
(27)

because the expectation of value function of alternative 3 is the log-sum variable of the value functions at time \(t+1\). To estimate the parameters, the value functions at each time can be computed backward from last time \(T_{ir}\).

Estimation method

Let \(\theta = (\theta _u, \theta _f)\) be the vector of structural parameters. The parameters of the value function are \(\theta _u\) and the parameters of the transition function are \(\theta _f\). To estimate \(\theta\), we define the log likelihood function of individual i by

$$\begin{aligned} l_i (\theta ) = \log Pr \{ a_{it}, y_{it}, {\varvec{z}}_{it}; t=1,2,...,T_i | \theta \}. \end{aligned}$$
(28)

As we assume conditional independence between the error and transition distributions, we can split the probability function, \(a_{it}\) and the transition function, \(x_{it}\):

$$\begin{aligned} Pr (a_{it}, y_{it} | a_{i,t-1}, y_{i,t-1}, {\varvec{z}}_{i,t-1}) = P(a_{it}|y_{it}, {\varvec{z}}_{it})f_z ({\varvec{z}}_{it}|{\varvec{z}}_{i,t-1}). \end{aligned}$$
(29)

Using these relationships, we can rewrite the log likelihood function of the individual i as

$$\begin{aligned} l_i (\theta )= & {} \sum _{t=1}^{T_{ir}} \delta _{a_{it}} \cdot \log P(a_{it}|y_{it}, {\varvec{z}}_{it}, \theta _u ) + \sum _{t=1}^{T_{ir} -1} \delta _{z_{i,t+1}} \cdot \log f_z ({\varvec{z}}_{i,t+1}|a_{it}, {\varvec{z}}_{it}, \theta _f) \nonumber \\&+\,\delta _{z_{i1}} \cdot \log Pr(z_{i1}), \end{aligned}$$
(30)

where \(P(\cdot )\) is the choice probability at time t, \(f_z (\cdot )\) is the transition probability of state variables, and \(\delta _{\cdot }\) is the Kronecker delta, which equals 1 when the individual i choices and equal 0 otherwise. As Eq. (6) assumes conditional independence between the choice and transition probabilities, we can maximize each log likelihood function independently. Therefore, the log likelihood function for estimating parameters \(\theta\) can be represented by

$$\begin{aligned} L_i (\theta )= & {} \sum _{i} \sum _{t=1}^{T_{ir}} \delta _{a_{it}} \cdot \log P(a_{it}|y_{it}, {\varvec{z}}_{it}, \theta _u ) \nonumber \\&+\,\sum _{i} \sum _{t=1}^{T_{ir} -1} \delta _{z_{i,t+1}} \cdot \log f_z ({\varvec{z}}_{i,t+1}|a_{it}, {\varvec{z}}_{it}, \theta _f) \nonumber \\&+\,\sum _{i} \delta _{z_{i1}} \cdot \log Pr(z_{i1}). \end{aligned}$$
(31)

To estimate the parameters \(\theta\) by the maximum likelihood estimation method, we only have to maximize the log likelihood function and obtain the parameters \(\theta\).

Comparison models

To evaluate the effectiveness of our proposed model, we compare the dynamic discrete choice model with two static discrete choice model. The first static discrete choice model is the typical multinomial logit model, which does not include the dynamic factors. Because this model does not include the state variables that shift as the day goes on, the choice probability of each alternative is constant, regardless of the remaining days of permit-valid day.

The second static model has the structure as the first, but the state variables shift as the day goes on. Therefore, the dynamic model and the second static model have same utility function. However, the second static model does not consider the future utility under uncertainty, as does the value function. The parameters of these two static model can be estimated by a maximum likelihood estimation method.

The estimation result of the dynamic discrete choice model and discussion

Estimation results and discussion

First, we show the estimation result of the transition model of state variables \(c_{it}\), \(m_{it}\), \(b_{it}\) in Table 2. The transition probability of the state variables has a simple structure that uses the individual attributes and characteristics of the day only. However, the likelihood ratio of each model is sufficient. With regard to the cumulative number of free-days, office workers tend to have more free-days and participants have small disposable time on holidays. The location of the home and office greatly impact the number of visits to the MM area. Full-time homemakers tended to use bicycles regularly, and participants tended to use bicycle on holidays. Using these estimation results for transition probability, we compute the state variables at time t and compute the value function of the dynamic discrete choice model.

Table 2 The estimation result of transition probability

Table 3 shows the estimation result of the dynamic discrete choice model. Static behavior model 1 is the lowest adjusted likelihood ratio and static behavior model 2, which uses the state variables, has a higher adjusted likelihood ratio than the first model. The dynamic behavior model has the highest adjusted likelihood ratio among these models and the value is 0.614. This result indicates that the dynamic behavior model can express the sequential decision-making from the point of view of both the model structure of a dynamic discrete choice model and the variables that the model uses.

Table 3 The comparison of transaction behavior models

The estimated parameter indicates that the available value of permits is high when the day of the permit-validity is on a holiday and low when the distance from home to the MM area is large. The more frequently the individual visits the MM area, the more the utility increases. This is the rational result for using permits.

With regard to selling permits, the result shows that individuals wanted to sell their permits when the permit expiry was a weekday and the time slot of the permit was in the afternoon. The number of free-days increased the decision-making for selling. As the basic analysis shows, the number of free-day was the important factor for active decision-making. The parameter of the amount of bicycle usage is positive, meaning that individuals who have their own bicycles do not need to share.

The utility function for postponing decision-making is set to the log function of the remaining days, and means that the uncertainty of the day of permit-validity decreased as time passed. On the other hand, postponing of decision-making had an option value that did not impose a limitation of the future choice set. Therefore, the estimation result indicated that each individual compared the current utility to the option value and chose whether postponing decision-making or active decision-making.

Sensitivity analysis

Reproducibility of the dynamic model

We assume that the discount factor has heterogeneity for socio-demographic attributes. The estimation result indicates that older people and full-time homemakers tended to have a larger discount factors than younger people and other occupations. This shows that older people and full-time homemakers estimated the high option value of postponing decision-making. We interpret the result as meaning that full-time homemakers had more flexible schedule than office workers; hence, they had the high option value. The reason for high discount factor of people in their 40s was that the tradable permit system of bicycle-sharing was unfamiliar to them so they tended to postpone their decision-making.

Fig. 16
figure 16

The estimated choice probability sequence of each alternative. Left is user ID: ym205, permit expiry is 24th Nov. and right is user ID: ym215, permit expiry is 2nd Dec

We confirm of the reproducibility of the dynamic model. Figure 16 shows the example of actual choice result and the reproduced choice probability sequence by the above model. The individual (user ID: ym205) decided to use the permit 8 days before in the left figure and the individual (user ID: ym215) decided to sell the permit 8 days before in the right figure. On the other hand, our model reproduced the choice probability sequence for “use”, “sell” and “do nothing”. Thought the choice timing is not completely correct, our model can capture the tendency of each user’s decision-making.

Policy simulation

Now, we think of the case in which the permit value is higher than observed data. One policy simulation is to change the distance from a user’s home to the MM area. To verify the effect of the distance from their home, we compute the choice probability sequence under a distance change within 2 km. Figure 17 shows the choice probability in the observed setting by a solid line and the choice probability in the simulation setting by a dotted line. It is found from the result that a higher permit value makes a higher choice probability for using the permit. However, the choice timing in the temporal axis has the same tendency as the observed setting. Hence, the permit value affected the final choice result but did not affect the choice timing.

Fig. 17
figure 17

The sensitive analysis for the high permit value case

Fig. 18
figure 18

The sensitive analysis in the case where the time discount factor is low

Finally, we consider the case where the discount factor is lower than observed data. Figure 18 shows the result where the discount factor of all participants is 1/2. Because the final utility term is the same as the observed data in this setting, the final choice result and choice probability are the same. However, the result indicates that participants made decisions earlier because the option value generated by postponing decision-making was low.

The implication from these results is that a policy that improves the permit value or increases population of participants cannot solve the problem that participants made decisions about their permits at the last minute. As many participants performed such an action, the double auction in the tradable permit system could not achieve the efficient allocation by participants’ transactions only. This analysis and this pilot program indicate that efficient relocation of tradable permits need not only a market mechanism but also understanding participants’ schedule under uncertainty.

Conclusions

In transportation research area, little attention has been paid to auction-based transportation services except tradable credit scheme or tradable permit system. Tradable credit scheme and tradable permit system were analyzed theoretically but there is no empirical study of these schemes. The novelty of this research is that we implemented a tradable permit system as a new bicycle-sharing system in Yokohama city. And we designed our pilot program inspired by experimental economics. The currency of pilot program is not real money but a “virtual money”. However, it has the incentive compatible social pilot program by the induced value theory. In addition, we analyzed users’ travel behavior and transaction behavior empirically and clarified the issue of transportation service auctions. Although the number of participants in this pilot program was not large and transactions were not very active, our data showed the complex relationship between travel behavior and transaction behavior. It is one of the interesting results of our pilot program. The other contribution of our empirical study is that postponement of decision-making under uncertainty, which have received insufficient attention in previous studies, has an impact on the decision-making timing.

Using behavior and transaction data from social pilot program, we estimated transition probability and transaction behavior model by dynamic discrete choice model. From the estimation results, we showed the relationship among postponement of decision-making, the number of free-time days, and the number of MM area visits. As this basic analysis, we modeled the sequential decision-making for permits by a dynamic discrete choice model and clarified the contributing factors that affected the decision-making timing using parameter estimation. The estimation results showed that the option value of postponing decision-making caused the transactions to occur at the last minute, and that this effect blocked the liquidity of the permits trade. In addition, we showed the heterogeneity of the time discount factor, which indicated that initial allocation was important for efficient allocation. Finally, the sensitivity analysis of the dynamic discrete choice model could simulate the sequence of choice probabilities. These simulations showed the relationship between each parameter and choice timing.

For future work, there are some issues in our empirical study. As the issues of our pilot program, the number of samples and the number of active users were too small; hence, our results are limited in terms of their general implications. We could only show the interesting fact finding at this time. We need to implement a larger pilot program as the way to approach this issue. In addition, it is necessary to validate tradable credit/permit scheme empirically for not only a bicycle-sharing service but also road network. To analyze user behavior under a complex system such as a tradable credit/permit scheme, additional empirical studies are needed. As the issues of our empirical analysis, our approach needs to be modified with two estimation steps for transition probabilities and the dynamic discrete choice model. To estimate consistent parameters, simultaneous estimation is needed. And the assumption of transition probabilities is simple. We need to readjust the assumption. To model the decision-timing under the uncertainty, it is useful to use dynamic discrete choice model. The model structure of dynamic discrete choice model and the parameter estimation are important for describing future decision-making.