1 Introduction

Driver assistance systems (DAS) became very popular in recent years. The automobile industry and academic researchers spend much effort to develop such systems [16]. They can detect certain driving situations and provide car drivers with real-time information on different aspects such as travel time [1] and parking facilities. The parking process can involve several annoying tasks: finding a free parking space, manoeuvring into a narrow space, finding a ticket machine or filling in a prepaid ticket etc. As a result, there is growing interest in parking assistance systems.

There are several techniques, which provide drivers with real-time information on the occupancy of car parks and guide them to the next free parking space. Counting systems like traditional induction loops and video detection [7] as well as secured wireless networks are available in car parks to record entering and existing vehicles. The degree of occupancy (or number of free spaces) is displayed on variable message signs (VMS) and drivers are guided to available parking facilities with directional signs [15]. A recent contribution is “smart parking”, which has been implemented in many European and Japanese cities to improve the speed and efficiency of locating, reserving and paying for parking [18]. But even inside a car park assistance may be required to find a free space. There are vision-based systems [13] and wireless sensor networks (WSN), which have been developed and tested, e.g., by Seong-eun et al. [20], Jaeseok et al. [10], Tang et al. [23] and Gongjun et al. [8]. They inform drivers about free parking spaces and route them directly to a free place.

Many drivers would welcome analogous information services for on-street parking spaces. This task is more difficult, because entries, exits, and parking spaces are less well defined and inappropriate for counting. On-street parking vehicles can be detected with panoramic street images taken with different cameras. Hirahara and Ikeuchi [9] used a laser-ranger finder and a line-scan camera taking epipolar-plane images (EPIs) for this task. This method delivers only static snap-shots. Most attempts to deliver real-time information about parked vehicles on-street rely on roadside sensors. For example, Berenger Vianna et al. [3] as well as Basu and Little [2] developed a system with linked parking meters to find free spaces. Parking meters were equipped with infra-red sensors to identify free and occupied parking spaces; the parking meters were connected with each other via radio frequency transceivers. Free parking meters send their position to the network and passing cars can receive this information as well. BMW developed a similar system in the project ‘stadtinfoköln’ with about 140 connected parking meters in Cologne [14]. This information service is available for 19 German cities and can be used for free via internet.

Wired sensors networks are impractical to cover large areas that lack a suitable electrical infrastructure because of high deployment and maintenance costs [21]. An alternative option for detecting the occupancy of on-street parking spaces are wireless sensor networks with autonomous sensor nodes [7]. Most of these systems use ultrasonic sensors or induction loops. The devices communicate via radio frequency transmitters; the occupancy information can be accessed through mobile devices [12] or being displayed on VMS. Cenerario et al. [4] took a different path. They employed protocols that vehicles can use to share information by means of V2V communication; it could for instance be used to share information about available parking spaces. A similar community-based system is currently tested in Vienna [22]. It enables users to indicate (release) free on-street parking spaces and to find these spaces via a mobile app. This system would however require a high share of market penetration to provide a complete picture, which can only be achieved in the long run.

An accurate detection of on-street parking spaces relies invariably on area-wide infrastructure on street or within vehicles, which causes high costs for implementation and maintenance. Such systems are also controversially discussed, because they aim at navigating drivers as quickly as possible to the next free parking space, that way making car use more convenient. It is commonly accepted that car use should not be further encouraged, in particular not in the inner districts of large cities. This argues for an information system, which informs road users early about the current and forthcoming parking situation at the destination—preferably prior to departure, when there are still more options available to adapt the behaviour.

Many drivers are able to roughly estimate the availability of short-term parking spaces at the destination based on experience. However, situational influences such as holidays, building sites, and special events introduce variability to the parking pressure, what encourages people to try their luck. As a result, parking search traffic amounts to 25 % of total traffic in the inner districts of Vienna [6]. Rye et al. [19] emphasises that managing information about parking availability should be part of demand management. A more reliable forecast of the occupancy could support the use of alternative options (public transport, Park & Ride, garage at the destination) and reduce parking search traffic at the destination. The self-regulating effect could be reinforced by variable parking fees depending on the demand. Such schemes are already in operation, e.g. in Amsterdam [17].

The city of Vienna offers an SMS payment service for short-term parking fees, which is called “HANDY Parken”. More than 40 % of the short-term parking tickets are paid by this means. The success of HANDY Parken in Vienna opens the way for a solution, which is independent from on-street sensors. Each electronic ticket generates information about the duration of the parking process and the approximate location of the vehicle via cell location. The approach is based on the assumption that these data provide a probabilistic indicator for the number of parked vehicles, thereby enabling an estimation of the total occupancy. A real-time readout procedure of position data of HANDY Parken does not exist yet, but has to be developed.

2 Objectives

The objective of the research project this article reports on was to develop a real-time model for the occupancy of short-term parking zones, which works without roadside infrastructure, but only uses existing real-time data sources (Fig. 1). Three different kinds of data were tested for their predictive power:

Fig. 1
figure 1

Data flow in the real-time occupancy model

  • position data of short term parking customers, who buy an electronic parking ticket on their mobile phone via the HANDY Parken service;

  • counts of short-term parking customers in car parks in or close to the target area;

  • traffic flow volumes derived from around 300 automatic counting sites in Vienna.

We focussed on these data sources, because they are already available or could be gained at low cost. Moreover, this article is only about (i) testing for the predictive power of these data sources, (ii) developing a suitable model specification, and (iii) evaluating the performance of the model. The development of a physical model is not part of this article. The resulting model shall not exactly map the occupancy of single parking lots, but provide a sufficiently accurate description and reliable prediction of the parking pressure in the target area.

Vienna provides optimal conditions for this attempt due to the high share of mobile phone parkers. Parking meters are not available in Vienna and sending an SMS is apparently considered much more convenient than filling in a paper ticket. As a result, more than 40 % of parking tickets is paid by this service, what corresponds to more than a million SMS tickets every month. This makes HANDY Parken the most commonly used mobile service in Austria.

The mobile phone data are generated only during the validity period of the short-term parking zone and arrive just in time, which limits their usability for a forecast. Therefore, two further real-time data sources were tested in this study, which are available all time and may increase the reliability of forecasts. Both data sources are state of the art, but their use as real-time indicator of the occupancy of short-term parking zones remains to be proven.

The real-time occupancy model was developed and calibrated with data of an empirical parking space survey in two test areas. The most challenging tasks were to find out, (i) if the extraction of position data of HANDY Parken customers is technically and legally feasible and (ii) if the three aforementioned indicators can predict the real-time occupancy of the short term parking zones. The expected result is an online prototype, equipped with a visualisation on Google Maps and a data-interface for integration into existing travel information services. Preliminary results were presented at the ITS World Congress 2012 in Vienna.

3 Real-Time Data Sources

The mobile phone parking data constitute the most promising indicator of the occupancy model. The HANDY Parken service has an agreement with several mobile network operators (MNOs), the largest of which participated in the research project. Reading out all SMS-tickets booked in a given time interval together with the permitted parking duration is technically and legally feasible. But, we also need to determine the position of the mobile phone from which the SMS was sent. It serves as proxy for the position of the car and is allocated to a particular street section. We aimed to perform this task by determining the coordinates of the active network cell. This procedure is less precise than GPS location, but can be universally applied, not only on smart phones. Due to the high density of network cells in the inner districts of Vienna, we expect a spatial accuracy of 100−200 m. Given that drives roam areas of a similar size when looking for a parking space, a more precise location is presumably not necessary for an information service about the occupancy of on-street parking spaces.

The Austrian Data Protection law under the rule of the European Data Protection law (EU-Dir. 95/46/EC and EU Dir. 2002/58/EC) [5] as well as the Austrian Telecommunication Act (§99 TKG2003) raise several cautious questions regarding privacy concerns: What kind of data retrieval is permitted for the MNO? At which stage of data processing will the aggregation take place? What level of aggregation is sufficient? These questions could not be answered in advance, but had to be discussed during the project. The data protection officers of the MNO finally decided against a data transfer despite an initial agreement. As a result, the HANDY Parken data could be made available for calibrating the model, but will not be available for operation.

The counts of short-term parkers in the car parks were easier to get, because all car parks in Vienna dispose of automatic counting systems. The counts are periodically retrieved and fed into the Viennese parking guidance and information system. However, for the development of an occupancy model it was necessary to distinguish between customers with long term contracts and short term customers. Since the parking information system requires only the total number of free parking spaces for its operation, this distinction is not common state of practice. We achieved an agreement with five of the six car park operators in the test areas. All of them were able to distinguish between long term and short term parkers and provided the counts during the parking space survey.

Traffic flow data could be collected without any difficulty. They are already collected at a central point, where they feed into a transport model, which is in service at www.AnachB.at. A data exchange protocol has been established, so that we could retrieve the data at any time.

4 Parking Space Survey

The parking space survey conducted in the research project served two purposes: (i) to deliver the ‘ground truth’ for the calibration and validation of the occupancy model; and (ii) to deliver the day curves of those vehicles, which have a permanent parking permission, any other special permission, or no permission at all (fee dodgers). These parkers leave no electronic trace for real-time indication. The survey was conducted in two test areas as shown in Fig. 3 with a total of 3,000 parking spaces. The areas vary in terms of residents-visitors ratio, supply of on- and off-street parking, spatial functions, and demographic structure. The survey was carried out in three periods in February, March and April 2012. It should be noted that the parking fee had been increased between period 1 and 2 from previously 1.20 to 2.00 €/h (the survey also served to estimate the effect of this intervention), so that only the 2nd and 3rd period are suitable for cross-validation under ceteris paribus conditions.

The survey started with a record of all parking supplies. For each available parking space it was indicated, whether it is a regular short term parking space, a special parking space for loading, disabled drivers, or any other specific group. The demand of parking spaces was afterwards observed periodically every 30 min from 8 am to 10 pm. Each parked vehicle was registered along with the type of parking permission in terms of permanent permission, fill-in ticket, or no visible permission. The latter could be fee dodgers or users of the mobile parking service. In order to distinguish between the two, we recorded the license plates of these vehicles and checked them against the HANDY Parken database, which allowed us to determine, which of these cars had a valid electronic ticket during the observation period.

5 Observed Occupancy Rates

The cars parking on short-term parking spaces can be grouped into the following categories according to the type of parking permission:

  • permanent permission (mainly residents, but also businesses and other specific groups);

  • short term permission in terms of a prepaid fill-in ticket;

  • short term permission in terms of an electronic ticket; and

  • no permission; these are fee dodgers who forgot to buy a ticket or deliberately try their luck that they are not enforced and punished; the share is 12 % in the total sample; it matches well with the experience of the parking surveillance authority.

The repeated observations of parking cars (every 30 min from 8 am to 10 pm on 11 days) result in 28 × 11 = 308 data points in time for each street section. The objective of the occupancy model is to predict the occupancy rate of an area rather than the number of parked cars. We thus translated the car counts into fractions of available parking spaces occupied by different categories as shown in Fig. 3. The result reveals strong differences between the areas (Fig. 2). The share of short-term parkers is approx. 40 % in the city centre (area 1), whereas it is only 20 % in the residential area (area 2). The day curves are also different. The occupancy in the city centre is highest during the day and again during the opening hours of pubs in the evening, whereas the residential area shows the highest occupancy in the morning before the residents leave and in the evening after they are back.

Fig. 2
figure 2

Share between different types of parking permissions as fractions of the estimated number of available parking spaces in 1st district (area 1) and 6th district (area 2)

6 Analysis of Relationships Between Occupancy Rates and Real-Time Data Sources

This section includes the tests of linear relationships between the three kinds of real-time data and the occupancy of short term parking zones. The tests were performed using correlation coefficients. The results and conclusions presented in the following hold only true for Vienna. We do not have data outside Vienna, so we cannot discuss the applicability for other cities.

6.1 Counts of Short Term Parking Customers in Car Parks

Car park counts were collected from six car parks in or close to area 1 and three car parks in or close to area 2 (green triangles in Fig. 3). We computed several correlations of car park data with parking space occupancy. As an illustration, Fig. 3 shows the correlations of the inflow of short term parking customers into the car parks with the average parking occupancy of the corresponding street sections. We found no systematic relationship between car park inflow and parking space occupancy, so that the former are no suitable indicator of the latter.

Fig. 3
figure 3

Correlation coefficients of garage inflow with mean parking occupancy in the corresponding area (right top: 1st district; left bottom: 6th district); red points = street sections, green triangles = car parks, numbers = correlation between inflow of short term parkers into car parks and on-street parking occupancy

Most of the car park operators, who participated in the advisory board of the research project and provided the counts of car park customers, held the view that the garage inflow is just the overflow of on-street parking demand. This view is not confirmed at all by the survey data.

6.2 Traffic Flow Volumes

We examined traffic flow data of 830 major roads in the Vienna region. They were derived from a traffic model operated by the provider of real time traffic information in the Vienna reagion [24]. The Model uses the data of some 300 automatic counting sites as input ([11]). In order to test for the predictive power of the traffic flows, we computed correlation coefficients with mean occupancy rates in both target areas. The results are given in Fig. 4. Although we found some large correlations among the numerous candidates, they have different signs and don’t reveal any recognisable geographical pattern. We thus considered the relationship between traffic flow data and parking space occupancy too fragile to be used as predictor in the real-time occupancy model.

Fig. 4
figure 4

Correlation coefficients of traffic flow data with mean parking occupancy rates in area 1 (1st district; left graph) and in area 2 (6th district; right graph)

6.3 Mobile Phone Parking Data

These data were the most promising candidates for indicating the occupancy of short term parking zones in real time. Our hypothesis was that (i) mobile phone parkers are a congruent subset of all short term parkers, and that (ii) the latter can be added to the number of permanent parkers, which evolves according to its own day curve independent from short term parkers. The mobile network operator involved in the research project refused to provide the location data of mobile phone parkers for privacy concerns (see Section 3). For the vehicles observed in the parking space survey, however, we could retrieve this information from the HANDY Parken database.

Table 1 shows the correlation coefficients of the number of electronic tickets with other kinds of permissions and with the total occupancy rate; it sheds some light on the relationships between these quantities. The strong positive correlation between electronic tickets and fill-in tickets indicates that mobile phone parkers are indeed a congruent subset of all short term parkers (hypothesis 1). However, the assumption of independence between short term parkers and permanent parkers (hypothesis 2) was rejected. The negative correlation with permanent permissions and fee dodgers (without permission) indicates a displacement effect, which may occur in cases when the total demand exceeds parking space supply.

Table 1 Correlation coefficients (r) and significance (p) between electronic parking tickets and other parking permissions in 1st district (area 1) and 6th district (area 2)

This pattern appears similar in both areas. A difference between the areas however appears when it comes to the correlation between electronic tickets and total occupancy rate. Area 1 shows a high positive correlation, whereas area 2 shows a low correlation. The reason is the lower share of permanent parkers in area 1. It causes that the relationship between electronic tickets and total occupancy rate is less adversely affected than in area 2.

Figure 5 shows a linear regression \( \widehat{o}= kx+d \) of the total occupancy rate \( \widehat{o} \) (last row in Table 1) on the percentage x of parking space occupied by cars with electronic tickets. The model explains r2 = 43.5 % of the occupancy rate variance in area 1 and only r2 = 1.3 % of the variance in area 2.

Fig. 5
figure 5

Linear regression of the total occupancy rate on the percentage of parking space occupied by mobile phone parkers in 1st district (area 1) and 6th district (area 2)

7 Development of the Real-Time Model

In order to measure the improvement in prediction gained from the real-time information we defined two reference models, which differ in the underlying assumption:

  1. (1)

    The constant average model is based on the naive assumption that the occupancy rate stays at a constant average level at any time; it neglects the variability throughout the day. The expected average occupancy rate is simply estimated by

    $$ \overline{o}=\frac{1}{ DT}{\displaystyle \sum}_d{\displaystyle \sum}_t{o}_{t,d} $$
    (1)

    over all days d and time intervals t from a sample of D days each divided into T time intervals. The Root Mean Square Error (RMSE) of this predictor is given by the sample standard deviation

    $$ {\mathrm{s}}_o=\sqrt{\frac{1}{ DT}{\displaystyle \sum}_d{\displaystyle \sum}_t{\left({o}_{t,d}-\overline{o}\right)}^2} $$
    (2)
  2. (2)

    The average day curve model is based on the slightly more advanced assumption that the occupancy rate follows the same average day curve on all days. The expected occupancy rate in time interval t on any day is estimated by

    $$ {\overline{o}}_t=\frac{1}{D}{\displaystyle \sum}_d{o}_{t,d} $$
    (3)

    where o t,d is the occupancy rate on day d in time interval t. This model explains 65.8 % of the variance of the occupancy rate in area 1 and 54 % in area 2, respectively. The resulting day curves are shown in Fig. 2.

We tested the increase of predictive power gained from the real-time information against these models. The car park counts and traffic flow data were excluded from the outset, because no systematic relationship could be identified between these quantities and the parking space occupancy rate (see chapter 6); only the number of purchased electronic tickets was considered in the model. Predictions based on this indicator explain 43.5 % of occupancy rate variance in area 1 and 1.3 % in area 2 (Fig. 5). This is still inferior to predictions based on the average day curve model, which account for 65.8 % and 54 %, respectively (eq. 3). Nonetheless, it is possible that the real-time indicator improves the predictive power, if added to the average day curve model. We tested this possible improvement in three respects:

  • improvement of approximation of the occupancy rate in the current time interval;

  • improvement of prediction of the occupancy rate of future time intervals;

  • improvement of prediction of the occupancy rate in exceptional situations, in which the occupancy rate differs considerably from the usual day curve.

7.1 Approximation of the Occupancy Rate in the Current Time Interval

In order to determine the additional variance explained by the real-time indicator, we calculated the residuals of the average day curve model \( {\tilde{o}}_{t,d}={o}_{t,d}-{\overline{o}}_t \) and, analogously, the residuals of the number of purchased electronic tickets \( {\tilde{p}}_{t,d}={p}_{t,d}-{\overline{p}}_t \). The additional percentage of occupancy rate variance explained by a linear regression model

$$ {\widehat{o}}_{t,d}=k{\tilde{p}}_{t,d}+d $$
(4)

is r2 = 29.3 % × (1 − 65.8 %) = 10 % in area 1 and r2 = 3.6 % × (1 − 54 %) = 1.7 % in area 2, respectively. Figure 6 shows a plot of \( {\tilde{\mathrm{p}}}_{\mathrm{t},\mathrm{d}} \) against \( {\tilde{\mathrm{o}}}_{\mathrm{t},\mathrm{d}} \) for both areas along with the linear regression model according to eq. (4).

Fig. 6
figure 6

Linear regression of the residuals of occupancy rate \( {\tilde{\mathbf{o}}}_{\mathbf{t},\mathbf{d}} \) on the residuals of the number of mobile phone parkers \( {\tilde{\mathbf{p}}}_{\mathbf{t},\mathbf{d}} \) in 1st district (area 1) and 6th district (area 2)

7.2 Prediction of the Occupancy Rate of Future Time Intervals

The attempt to improve the prediction of the occupancy rate of a time interval in the near future by means of the number of purchased electronic tickets in previous time intervals is based on a linear model of the day curve of electronic tickets. The model uses the mean values \( {\overline{p}}_t \) and the covariance matrix S of electronic tickets purchased in previous time intervals to compute the predictor

$$ {\widehat{p}}_{t+i}={\overline{p}}_{t+i}+{\mathbf{S}}_{t+i;t-j,\dots, t}{\mathbf{S}}_{t-j,\dots, t;t-j,\dots, t}^{-1}\left({p}_{t-j,\dots, t}-{\overline{p}}_{t-j,\dots, t}\right) $$
(5)

of the number of electronic tickets, which will be purchased in time interval t + i, given the number of electronic tickets purchased in time intervals tj,…,t. The predictor of the occupancy rate in time interval t + i is again computed by a linear model, which uses the predictor of the number of electronic tickets \( {\widehat{p}}_{t+i} \) as predictor:

$$ {\widehat{o}}_{t+i}={\overline{o}}_{t+i}+k\left({\widehat{p}}_{t+i}-{\overline{p}}_{t+i}\right)+d $$
(6)

The model parameters S, \( {\overline{p}}_t \), \( {\overline{o}}_t \), k and d are estimated from empirical data.

A separate model was developed for both areas and subject to three cross-validation rounds. Within each round, two of the three survey weeks served for calibration, while the remaining week served for validation. In the prediction model given by eq. (6) we chose j = 0, i.e., only the electronic tickets purchased in the current time interval were used for prediction. Figure 7 shows the Root Mean Square Errors (RMSE) of occupancy rate predictions calculated 0, 30, 60 and 90 min ahead. Each chart shows a separate cross-validation round, which corresponds to one of the three weeks. The red line indicates the RMSE of predictions based on the average day curve model \( {\widehat{o}}_{t+i}={\overline{o}}_{t+i} \). The real-time indicator could in most cases not reduce the prediction error of the average day curve model.

Fig. 7
figure 7

RMSE of a cross validation of occupancy rate predictions based on an average day curve model without (red line) and with (blue bars) real-time indication of purchased electronic tickets for time intervals t + 0 to t + 3

A direction for future work would be to examine the prediction accuracy of an implementation in a Kalman filtering framework with measurement model p * t  = ho * t  +  t and process model o * t + 1  = F t o * t  + η t + 1 with observations \( {p}_t^{*}={p}_t-{\overline{p}}_t \), hidden state variables \( {o}_t^{*}=\left({o}_t-{\overline{o}}_t\right) \), observation model h = k −1, observation noise t  ∼ N(0,σ t ), state transition model F t  = cov(o t + 1,o t)/var(o t ) and process noise η t + 1 ∼ N(0,ξ t ).

7.3 Prediction of the Occupancy Rate in Exceptional Situations

Exceptional situations are by definition rare and will hardly come up numerously in three weeks survey period. We identified only one event, in which the occupancy rate differed considerably from the usual level. It was Shrove Tuesday in the 1st week, an event at which particularly many people visit the bars and pubs in the inner district of Vienna (area 1). This situation is included as one of three days in the upper left chart of Fig. 7. In this particular case it turns out that the real-time model predicts the occupancy rate clearly better than the average day curve model. The error improvement is strongest in the current time interval, but it still appears in predictions of future time intervals. This result indicates that the real-time information about electronic parking tickets might increase the accuracy of an occupancy model in situations, in which the demand of parking place deviates from the usual level.

8 Conclusions and Outlook to Model Application

We developed and tested a real-time occupancy model of short-term parking zones, which works without roadside infrastructure, but only uses existing real-time data sources. The following lessons we have learned from our work:

  • Among the tested real-time data sources (electronic parking tickets, counts of car parks, traffic flow data) only the number and location of electronic tickets is able to indicate the occupancy rate in a given time interval and street section. However, this indicator is not yet available due to privacy concerns of the major mobile network operator in Austria.

  • The occupancy rate of on-street parking spaces follows a strongly recurrent pattern on different days. An average day curve model thus predicts the occupancy at a given time interval very well. For the usual day-to-day variations of the occupancy rate it can hardly be outperformed by a real-time model.

  • Unusual deviations from the day curve due to exceptional events are better predicted (and can only be predicted) by the real-time model. We had only one exceptional event in our three weeks survey, but such events will occur more often in the long run. There are various factors that significantly influence human behaviour: holidays, day characteristics such as working day, weekend, day before or after holiday or weekend, big events, etc. This makes such a model very useful. Its further development requires data from longer periods. It is not necessary to anticipate the exceptional events; they would simply appear in the real-time signal, so that the model can learn from its own history for future events.

The unavailability of the location data of electronic parking tickets is not necessarily the end of the story. The negative decision was based on an extremely narrow interpretation of the legal regulations, so that we see a chance for a positive decision in a second attempt. A significant improvement can also be expected from the new standards for Intelligent Transport Services (ITS) and their subsection Cooperative IT, which are currently developed by the standard organisations ISO, CEN, and ETSI. The new standards will start at 2016 with the availability of new cars. The solution will be a combination of GPS-data, vehicle to vehicle (V2V) and vehicle to infrastructure (V2I and I2V) communication. They will deliver location data in the range of five meters. It can be used as real-time indicator of parked vehicles with unreached accuracy (among many other things).

A simple option to make the information service available to the users would be a stand-alone webpage as shown in Fig. 8, where the user can chose the area and time for which the occupancy information shall be provided. The main goal is however to integrate the service into existing information services, e.g.

Fig. 8
figure 8

Internet information tool based on the real-time model

  • the intermodal real-time traffic service AnachB.at;

  • the mobile payment service HANDY Parken;

  • the Viennese parking guidance system;

  • traffic information services of broadcasting stations.

The end users’ benefit of the information service is in either case a better knowledge of the current and forthcoming occupancy of on-street parking spaces at the destination. It could also include recommendations for alternative options in case of high parking pressure such as off-street parking facilities, intermodal transport, or public transport.