1 Introduction

In the market, traders guess the value of an asset and take a behavior based on their expectations about the asset value. The market disseminates information in the form of, for instance, prices that reflect traders’ expectations. Information, such as prices, affects traders’ expectations. As George Soros (1987, p. 2) says, cognition of participants and their participation in the market interfere with each other. Soros, a legendary investor, calls such interference reflexivity.

Soros (1987, p. 16) complains that economic theories strangely neglect reflexivity. However, modern economic studies have attempted to unveil these reflexive aspects by introducing interactive agents explicitly in their models. Learning-to-forecast experiments (LtFEs) were introduced by Marimon et al. (1993) and spawned a series of works: Heemeijer et al. (2009), Bao et al. (2012, 2017), and Colasante et al. (2019) investigated interaction between prices and expectations in the laboratory. In these experiments, subjects forecast a price and their forecasts were aggregated to determine the price; then, they forecast again considering the realized prices. These studies focused on expectation formation rather than price formation, requiring subjects to only forecast prices. LtFEs, based on price adjustment rules, relied on a computer to calculate prices from subjects’ forecasts.

In contrast, experimental economics has a long-standing tradition of studying price formation. Double-auction (DA) experiments pioneered by Smith (1962) reveal that simple laboratory markets can discover the price that is efficient by a certain criterion. In these classic experiments, values of tradable objects are given by an experimenter as experimental parameters; therefore, there is no need for subjects to form an expectation about the value of objects. Further, Smith et al. (1988) and Haruvy et al. (2007) investigated expectation formation in DA markets.Footnote 1 In these studies, assets brought about stochastic dividends. The probability distribution of dividends, given by an experimenter, determined the asset value.

The present study also investigates expectation formation in DA markets. However, the value of our asset is deterministic and set as an experimental parameter. We used a glass jar filled with small balls to define the asset value. We asked our subjects to guess the number of balls in the jar and to trade an asset whose value is equal to that number. Treynor (1987) conducted a similar jar-guessing experiment in his classroom. The mean estimate of the number of beans in a jar, as estimated by his students, was close to the true value, but when they were cautioned, the mean of their estimates became worse. Treynor (1987) conjectured that his warnings about a jar caused a shared error to creep into individual estimates. This study suggests that the heterogeneity among individual expectations is essential for the accuracy of the average expectation, which determines the price in the market.

Our study aimed to investigate whether the effect of the market on expectations, if any, deteriorates or improves the accuracy of expectations. We also wanted to examine how subjects’ expectations about the asset value determine its price in the market. Our experiment was conducted under the environment of the constant fundamental value, which Noussair et al. (2001) reported as a factor making bubbles smaller and less frequent in experimental markets, although it was not sufficient to eliminate the possibility of a bubble. Such a simple environment would provide a benchmark for future studies. We found that prices converged to the median expectation of the asset value. This convergence seemed to be a necessary consequence of the subjects’ behavioral tendency in trading, wherein the subjects with higher expectations tended to buy and those with lower expectations were apt to sell. Our subjects tended to revise their expectations, that is, their estimates of the asset value, in accordance with the market prices. Despite this, our experimental markets showed no definitive effect on the accuracy of the subjects’ estimations. This is because their revision process was consistent with the partial adjustment formula that targets the median estimate without any significant disturbances.

We explain the design of our experiment in Sect. 2, and analyze experimental outcomes in Sect. 3, where an analysis on trading behavior is in Sect. 3.2.2. To examine the process of individual subjects to revise their expectations, we use the partial adjustment model in Sect. 4, where Sect. 4.2 presents our analysis of the disturbance in this model. The disturbance has a potential to improve or deteriorate the estimation of subjects as a cohort. Section 5 discusses the notion of the wisdom of crowds, the comparison to previous studies, and the directions for future research. We provide concluding remarks in Sect. 6.

2 Experimental design

In our experiment, each subject had two roles: a forecaster and a trader. A forecaster estimates the number of balls in the jar and submit an estimate as a prediction of the value of an experimental asset. A trader transacts to buy or sell an asset in the DA market. We paid our subjects cash rewards according to their performance.

2.1 Asset

Our experimental asset—neither dividends nor coupon payments—had a deterministic fundamental value. We called this asset a “certificate”Footnote 2 to allow the subjects, many of whom were unfamiliar with finance, to easily understand the experimental procedures. A jar filled with bingo balls defined the value of the asset. After the completion of the period of the experimental market, transactions were settled at the value, i.e., the number of balls in the jar. Therefore, the profits from trading were calculated as follows:

$$\begin{aligned} \hbox {Seller's Profit}= & {} \hbox {Price} - \hbox {Value} \end{aligned}$$
(1)
$$\begin{aligned} \hbox {Buyer's Profit}= & {} \hbox {Value} - \hbox {Price} \end{aligned}$$
(2)

Acquired profits were converted into cash payments at the end of the experiment.

The transaction of the asset in our laboratory was similar to the futures contract in real markets, though we called the asset “certificate" in our instructions. In real world, traders often close their positions before the expiry by taking opposite positions. Such closeouts through reverse contracts were impossible in our market because we allowed our subjects to trade only once in a period. Therefore, all transactions, calculated as per equations (1) and (2), were settled in cash.

The life of the asset was one experimental period without the possibility of resale or buyback. Therefore, the only way to gain a positive profit was either to buy at a price lower or sell at a price higher than the real value of the asset. Expectations about price movements were almost useless, since there was no chance for capital gain. This allowed our subjects to concentrate on estimating the fundamental value of an asset without bothering themselves about the behavior of other subjects.Footnote 3 Experimental markets designed in this study provided minimal to no opportunity for speculation and thus could be used as a benchmark to assess the effect of speculation in speculative markets.

2.2 Jar-guessing

We used a 1.7-l transparent glass jar filled with balls to define the value of the asset. The experiment used Japanese style bingo balls of nine colors, each 12-millimeters in diameter, without any numbers on their surface.

Before the experimental market was opened, we showed our subjects a jar and asked them to guess the number of balls in the jar. 24 subjects participated in a session. If they crowded around a jar, it would be difficult for some of them to observe a jar. Therefore, we presented two jars with the same number of balls during the observation period. The subjects could observe either of the two jars revealed by the experimenter. After observation, they submitted their estimates as their predictions about the fundamental value through the computer provided in their cubicles.Footnote 4 We instructed all subjects to “submit your prediction within 30 s in general” and mentioned that “the remaining time would be displayed at the upper-right corner of your screen.” However, they could submit their estimates as predictions even after the required 30 s were over. We paid monetary rewards to the subjects based on the precision of their predictions.

Our experiment had no strict controls or restrictions about how the subjects observed the jar to make their estimates. When the experimenter announced that he would present jars, the subjects could leave their cubicles to watch a jar and then return to their seats to enter their estimate of the number of balls in the jar. They could return to their cubicle any time during the observation period. If some subjects continued their observation more than 2 min, the experimenter would gently notify the subjects to return to their seats. For example, he announced to all the subjects that “If you make a guess, you can go back to your seat and click a ready-to-start button.” We covered the jars after all subjects returned to their seats to avoid providing an unnecessary advantage of unlimited observation time for subjects who were incidentally seated near the jars.

We used this method to alleviate the inequality of observation among subjects and, at the same time, to also avoid an artificial restriction that could possibly hinder natural guessing.

2.3 Asset trading

In a session, the subjects experienced nine periods of asset trading but were unaware that the ninth was the last.Footnote 5 Each subject could bid or ask freely during a period until they executed a contract. Every subject could choose to be either a buyer or a seller in each period. A subject could trade up to one asset in each period. Once they had bought or sold an asset, they had to wait until the market closed, since reverse contracts of resale or re-purchase were impossible.

Assets were traded using Smith’s (1962) DA market system, where buyers’ English auction and sellers’ Dutch auction proceeded simultaneously. The subjects could execute their contracts once they chose a price from the order bookFootnote 6 and confirmed to execute the transaction. Our markets were computerized and used Fischbacher’s (2007) z-Tree software package.Footnote 7 The subjects observed how a market operated on their monitorsFootnote 8 and submitted their asks and bids using the mouse and keyboard.Footnote 9

Each trading period, separated from a period of jar observation and estimation, lasted for 5 min. The remaining time would be displayed at the upper-right corner of the monitor screen. The period ended before this time limit if no further transactions remained in the market.

2.4 Subject

For each session, as participants, we recruited 24 college students of Kyoto Sangyo University. A total of 48 students participated in our two sessions. All of them were undergraduate. Our recruitment excluded graduate students majoring in economicsFootnote 10 and foreign students.Footnote 11 We had no control over other attributes; hence, our sample contained subjects with varying age, gender, and academic majors.Footnote 12 We excluded those who had previously participated in our earlier experiments. Nobody participated in both the sessions. Thus, all subjects were novices to the experimental tasks of this study.

2.5 Procedures

Our experiment was conducted in a computerized laboratory. The subjects were seated in a cubicle with a monitor screen, a mouse, and a keyboard. Written instructions, which were also read aloud by an experimenter at the beginning of a session, were distributed to each subject.Footnote 13

After the experimenter read aloud instructions to the subjects, a jar was revealed and presented to the subjects; they were then asked to predict the number of balls in the jar and submit their estimates as predictions. There was no dry run. After submissions were made, the market was opened for trading assets. During a 5-min trading period, each subject could buy or sell an asset. When the trading period ended, they were required to submit a prediction of the number of balls in the jar again, and then the second trading period started. After the third prediction and subsequent trading period, we showed the same jar again without the cover, and then the subjects alternated their prediction and trading thrice. After the sixth prediction and trading, we showed the same jar once more, followed by three predictions and three markets.

In sum, our subjects alternated between jar estimates and asset trading nine times, and they saw the same jar three times. All subjects knew that the predictions and trading would occur three times after they saw the jar, but were unaware about how many times they would be able to observe the jar. We revealed the number of balls in the jar only after the experimental session. Profits obtained through predictions and trading were also announced at the end of the session.

We omitted a dry-run period to exclude prior interaction as far as possible. Our aim was to study the holistic process of inter-participant interaction. However, the trial market entails some interaction among subjects, so we decided to omit it. We also skipped a trial period of jar-guessing, even though we could design a non-interactive trial for it. This is because we wanted to observe how the market affects naive subjects who had no prior training of jar-guessing.

To reduce the noise caused by erroneous tradings, we designed a simple market and carefully instructed our subjects on how to trade in this market and showed them sample screenshots of a trading monitor. In each session, about 20 min was devoted to instruct participants about the procedure and computer-interface, despite which, we anticipated some errors and hesitation regarding decision-making among participants. However, a participant without complete understanding about the experimental market was not necessarily a nuisance in our experiment. Indeed, some participants may disturb a market, but we were also interested in assessing whether the market would absorb or magnify such a disturbance.

2.6 Rewards

We paid rewards to participants in cash (JPY), based on their performance. Profits and losses were accumulated and paid in cash at the end of an experimental session.

The reward for a completely accurate prediction was 200 yen, while 1 yen was deducted for every unit error. The minimum reward for a prediction was set at zero; hence, a prediction with an error of 200 or more units yielded no payment.

The profit from asset trading, calculated by formulae (1) and (2), could be negative (i.e., a loss). We endowed each subject with a show-up fee of 750 yen and a trading fund of 2000 yen. If a subject suffered a loss in a market, this initial payment would be reduced.Footnote 14 A subject who made no transactions during a market period made neither profit nor loss during that period.

2.7 Sessions

We conducted two sessions at the laboratory of Kyoto Sangyo University in Kyoto, Japan. Session 1 was held on 19th February, 2019, and Session 2 on 5th March, 2019. Each session lasted 78 min,Footnote 15 including 18 min of instruction, and comprised 24 participants each. The number of balls in the jar, the fundamental value of the asset, was 384. The total payment of rewards, in JPY, was 88 682 and 80 593 for Sessions 1 and 2, respectively.

3 Experimental results

3.1 Prediction of asset value

3.1.1 Downward bias and diminishing diversity

One of our initial findings was that the subjects’ predictions regarding the fundamental value—the number of balls in a jar—had a downward bias. Figure 1 shows the mean and median of the 48 predictions submitted in each period of the two sessions. They consistently remained below the true value, that is, 384. Figure 2 is a histogram of predictions made during the first period of the two sessions.Footnote 16 The fundamental value 384 lies at the midpoint of the fifth class [346, 422). The mode is immediately to the left of this class. This initial distribution is skewed right with the peak veering to the left; the skewness is 0.208. Our subjects alternated between jar-guessing and asset trading, and their trading experience in the laboratory market seemed to affect their predictions. The frequency table of Appendix B represents how the prediction distribution changed over nine periods in our experiment.

Fig. 1
figure 1

Mean and median of predictions in the two sessions

Fig. 2
figure 2

Prediction distribution at 1st period in the two sessions

Fig. 3
figure 3

Cumulative distribution of predictions in Session 1

Fig. 4
figure 4

Cumulative distribution of predictions in Session 2

Fig. 5
figure 5

Standard deviation and interquartile range of predictions in the two sessions

Fig. 6
figure 6

Kurtosis of predictions in the two sessions

Figures 3 and 4 represent the transition of the cumulative distribution of predictions in Sessions 1 and 2, respectively. Each figure contains three distribution lines for the first, mid, and last periods, that is, the first, fifth, and ninth periods. The lines became gradually steep in each session, implying diminishing diversity of predictions. The declining tendency of the standard deviation and the interquartile range in Fig. 5 also indicated that the market made the subjects’ predictions less heterogeneous. A decrease was clearer in the interquartile range than in the standard deviation. This suggests that some outliers persisted in their own predictions. The increasing kurtosis in Fig. 6 was conformable to the existence of adamant outliers.

Although the divergence in the predictions decreased due to the market experience, it seemed to have had no systematic effect on the precision of the predictions. The mean and median of the predictions changed erratically, as shown in Fig. 1. T-statistics of the mean prediction error were always significantly negative. The maximum T value, that is, the closest value to zero, was \(-3.913\) at \(t=1\) with a single-sided significance probability of \(1.461\times 10^{-4}\).

The mean and median in Fig. 1 diverged widely during the middle periods.Footnote 17 This implies that the prediction distribution was more asymmetric during these periods. The skewness increased in the middle periods, as seen in Table 17 of Appendix A and Fig. 15 of Appendix D. The jar observation may have caused predictions of the fundamental value to be distributed asymmetrically. We consider this possibility in Appendix D.

A table of statistics regarding submitted predictions is presented in Appendix A. Every figure in this subsection except for the histogram, which depicts frequencies in Table 18 in Appendix B, represents the statistical values in Table 17. This table in Appendix A also presents all values of the skewness and T-statistics for every period.

3.1.2 Time for guessing

Table 1 presents the observation time taken to study the jar. The time for the first observation began when the experimenter started the z-Tree computer program and displayed the jars after he finished reading the instructions. The time for the second and third observation started when the third and sixth period of the market closed, respectively.Footnote 18 An observation time of each subject ended when a subject clicked the ready-to-start button after he/she returned to the cubicle.Footnote 19

Table 1 Time for jar observation

Table 2 presents the time taken by subjects to submit their prediction. The time for the first, fourth, and seventh period began when all subjects clicked the ready-to-start button after they came back to their seats. The time for other periods, i.e., for \(t=2,3,5,6,8\), and 9, started when the latest market closed. A time of each subject ended when a subject submitted his/her prediction.Footnote 20

Table 2 Time to submit prediction

In both Tables 1 and 2, the time taken tended to decline. Learning or fatigue, associated with experimental tasks, could be a reason for this tendency. Some subject might not understand the experimental procedure sufficiently during earlier periods. As periods proceeded, they would become accustomed to the procedure and could be quicker in their decisions and actions. Such an effect of learning was probably important at the earliest periods because we omitted trial periods in our experiment.

The time for the jar observation decreased drastically at the third observation in Session 1, as seen in Table 1. It is possible that the subjects in Session 1 might be quick learners. This characteristic could cause the outcomes of Session 1 to differ from those of Session 2. In addition, the subjects in Session 1 took longer at the first and second observations than did the participants in Session 2. This could make the predictions in Session 1 more accurate than those in Session 2 if longer observations would entail more precise predictions. We discuss these topics in Appendices E and F,Footnote 21 although we found no clear correlation between the observation time and prediction accuracy.

During later periods, subjects might have gotten tired and avoided a careful consideration. In fact, one subject in Session 1 submitted the value 1’s as predictions for the jar at the eighth and the ninth period. This could be due to either disinterest or fatigue. However, this was merely an exceptional case. We think that the fatigue effect was not so important in our experiment because the activity of the subjects showed no clear tendency to decline (Table 3). Data in Table 3 indicate how many subjects did not vacuously revised their predictions, reporting the number of subjects who did not submit the same prediction as the preceding latest period. The table also contains the transaction volume at each market period. We can confidently state that many of our subjects were not very tired and actively continued revising their estimates or trading an asset even during the later periods.

Table 3 Number of nonzero revisions and transactions

3.2 Transaction prices

3.2.1 Price convergence

Prices in our experimental markets stayed below the fundamental value of the experimental asset, reflecting the downward bias of the subjects’ expectations about the asset value. Figures 7 and 8 show transaction prices of each session, where the horizontal dashed line represents the asset value defined by the jar. Each figure contains nine segmented lines, corresponding to nine periods of the market. Prices appeared to fluctuate less sharply as the market periods proceeded. In fact, Table 4 indicates a declining tendency of the standard deviation of the price. The price seemed to converge to a certain level, though it never converged to the fundamental value.

Fig. 7
figure 7

Price in Session 1

Fig. 8
figure 8

Price in Session 2

Table 4 Standard deviation of price
Fig. 9
figure 9

Absolute deviation of price in Session 1

Fig. 10
figure 10

Absolute deviation of price in Session 2

The price of our laboratory markets converged to the equilibrium price, but not to the fundamental value. We defined the equilibrium price at t by the median of predictions submitted directly before the t-th market period. The median prediction divided the subjects into two equal-sized groups. One group consists of subjects who predicted relatively larger values. They would buy an asset at the price of the median prediction because the asset was cheaper than their own valuation. In contrast, this price was expensive for subjects in the other group who predicted smaller values, and consequently they would sell. Therefore, the demand and supply would balance at the price that was equal to the median prediction, since every subject could trade only one asset in each period. The price of our experiment converged to the equilibrium price defined by the median prediction. Figures 9 and 10 show the absolute deviation of the transaction price from the equilibrium price, which declined to near-zero. Each panel (Figs. 9, 10) also involves the absolute deviation from the fundamental value, which is rather distant from zero. Prices were attracted to the median prediction which was always significantly smaller than the fundamental value 384.Footnote 22 Table 5 presents equilibrium prices, that is, the medians of the subjects’ asset valuations, at every period in each session.

Table 5 Equilibrium price/median prediction

In traditional DA experiments, the demand and supply are strictly controlled through parameters like values/costs that clearly define at what prices each subject could obtain a positive profit. In our experiment, expectations set by the subjects induced the demand and supply. We controlled them only indirectly, through the jar with balls. However, prices in our DA markets exhibited a good convergence to the equilibrium price.

3.2.2 Auctioning behavior

In our experimental markets, the subjects were able to trade only one asset in each period. Therefore, the demand and supply would be balanced, if half of the subjects wanted to buy and the other half wanted to sell. The median of submitted predictions divides subjects into two equal-sized groups. The demand and supply would equilibrate at the price of the median prediction, so long as subjects who submitted lower predictions than the price in the market wanted to sell and subjects of higher predictions wanted to buy.

In our experiment, any subjects could submit either a bid or an ask at their will during each period. They could switch their orders from buying to selling or vice versa at any time so long as their orders were not executed. If just a small fraction of the subjects wanted to sell and an overwhelming majority persisted in buying, the price in the market could be considerably higher than the median prediction. Now, we consider an extreme case where only one subject who submitted the second largest prediction wanted to sell. All other subjects specialized in buying and would never sell even at significantly higher prices than their predictions. In this case, only one asset would be transacted at the price between the largest and the second largest predictions.

Such an imbalance did not occur in our experiment. Let us define a proper buyer and a proper seller as follows. A subject who only bids during a market period is a proper buyer for that period. A proper seller only asks in a period. Some subjects submit both buy and sell orders or no orders within a certain period.Footnote 23 They are neither a proper buyer nor a proper seller. Table 6 shows the number of proper buyers and proper sellers at each period. Every period had a balanced number of each type.

Table 6 Number of proper buyers and proper sellers

Proper buyers usually predicted higher values, and proper sellers tended to estimate lower values. Table 7 summarizes the average rank of prediction according to the subject type. The smallest prediction is ranked 1, and the largest is ranked 24. In case of a tie, each prediction is allocated a mean rank. For example, if the smallest predictions are submitted by two subjects and other 22 predictions are larger, each of the smallest predictions is ranked 1.5. The average rank of predictions submitted by proper buyers was larger than that of predictions submitted by proper sellers at every period in both sessions.

Table 7 Average rank of predictions by proper buyers and proper sellers

Approximately half of the subjects wanted to buy and the rest wanted to sell. Subjects with higher expectations about the asset value were apt to bid, and subjects with lower expectations tended to ask. Hence, it was not surprising that the prices in our experimental markets converged at the median of predictions, although we need more studies to elucidate the exact mechanism of this convergence.

3.3 Interaction between prices and predictions

The divergence in expectations about the asset value decreased in our experiment, as shown in Sect. 3.1.1. Such a clustering of predictions occurred because all the subjects watched the same market, and tended to revise their predictions by referring to prices in the market.Footnote 24

Fig. 11
figure 11

Gap and revision of predictions in the two sessions

In fact, the revision of the prediction and the gap between the average price and the prediction were negatively correlated. A revision is the difference in predictions of a subject from one period to the next. A gap is the difference between a subject’s prediction and the average transaction price. A subject submitted a prediction and participated in trading in the following market period and then revised the prediction. This revision appeared to be based on the prices during the trading period directly before a revision. Figure 11 is a scatter diagram of the pooled predictions of the two sessions. The horizontal axis is the gap between the t-th prediction and the average price of the t-th market period. The vertical axis is the prediction revision form t to \(t+1\). The correlation was \( -0.337\).

A regression is useful for analyzing how the subjects revised their predictions, being based on the average price. We regressed the revision on the gap. The regression model is as follows.

$$\begin{aligned} V^e_{i,t+1,k}-V^e_{i,t,k}= a + b (V^e_{i,t,k}-{\bar{P}}_{t,k}) + v_{i,t,k} \; , \end{aligned}$$
(3)

where \(V^e_{i,t,k}\) is a t-th prediction for the asset value submitted by the subject i in Session k. \({\bar{P}}_{t,k}\) is the average price of the t-th market of Session k. If n transactions occurred at the t-th market of Session k; thus, \({\bar{P}}_{t,k} = (\sum _{j=1}^{n} {P}_{j,t,k})/n \) where \({P}_{j,t,k} \) is the j-th transaction price of the t-th market of Session k. The perturbation is denoted by \(v_{i,t,k}\). The explained variable is the revision of prediction, i.e., the difference in predictions between adjacent periods for each subject. The explanatory variable, the gap, is the discrepancy between the prediction of a subject before a revision and the average price in the market directly before a subject revised the prediction. As a subject was required to submit nine predictions in a session, each subject revised a prediction eight times. The average price of the ninth market is not used in our regression because no prediction was submitted after the ninth market.

Table 8 reports a pooled regression with 384 observations,Footnote 25 where \(t=1,2,\cdots ,8,\) \(i=1,2,\cdots ,24\), and \(k=1,2\). The intercept \({\hat{a}}\) was not significantly different from zero. The coefficient \({\hat{b}}\) was significantly negative. This means the subjects usually revised their predictions upward if their predictions were below an average price and downward if their predictions were above the average. However, the value of \({\hat{b}}\) implies that the ordinary revision was only by a value of 17% of the gap between a prediction and the average price. The value of \(\sqrt{R^2}\), which is equivalent to the absolute value of correlation, was low, suggesting this revision process was often considerably disturbed.

Table 8 Pooled regression of the two sessions
Fig. 12
figure 12

Gap and revision of predictions in the two sessions of Period 1

Table 9 shows a pooled regression with 336 observations, which excludes 48 observations at the 1st period of both sessions. In this regression, \(t=2,3,\cdots ,8\). We omitted the trial period in our experimental procedure; hence, our subjects might have been confused at the first period. However, the results from this additional regression were essentially same as the regression presented in Table 8 in that \({\hat{a}}\) was not significantly different from zero and \({\hat{b}}\) was significantly negative.Footnote 26 Both \( | {\hat{b}} |\) and \(\sqrt{R^2}\) were smaller in Table 9 than in Table 8 because some outliers with great revisions were excluded from the additional regression. Figure 12, a scatter diagram at \(t=1\), shows observations which were excluded from the regression presented in Table 9.

Table 9 Pooled regression without observations at 1st period

4 Estimation for individual prediction revision

4.1 Partial adjustment model

A pooled regression in Sect. 3.3 suggests our subjects revised their predictions by referring to prices in the market. We now examine how an individual subject revised predictions. Each subject’s point of reference probably differed. However, it was inevitably related to prices in a market, and the price in our laboratory markets tended to converge to the median of predictions, as shown in Sect. 3.2.1. Therefore, the median of submitted predictions would be a good approximation to the individual reference in the revision process, even though none of the subjects directly observed the median.

Thus, we start our analysis with the following regression equation for the subject i,

$$\begin{aligned} {\varDelta } V^e_{i, t} = \alpha _i + \beta _i ( {{\mathcal {M}}}_{t} - V^e_{i, t} ) + e_{i, t} \; , \end{aligned}$$
(4)

where \(V^e_{i,t}\) is a t-th prediction by the subject i, \( { \varDelta } V^e_{i, t} = V^e_{i, t+1} - V^e_{i, t} \), and \({{\mathcal {M}}}_{t}\) is the median of predictions submitted at the period t.Footnote 27 Every subject submitted nine predictions; then, they revised their predictions eight times at \(t=1,2, \cdots , 8\). The degree of freedom was only six; hence, our regression analysis of individual subjects would present just a sketchy description rather than a precise estimation about the subjects’ behavior.

We obtained 48 regression equations from the two sessions as 24 subjects participated in each session. (Tables of estimates are in Appendix C.) Estimates of the regression coefficient, \({\hat{\beta }}_i\)’s, often lay between 0 and 1. Only 13 out of 48 estimates were outside the (0, 1) open interval. Most estimates of the intercept, \( {\hat{\alpha }}_i \)’s, were non-significant. At the 5% level, only nine intercepts were significantly different from zero. Values of residuals, \( {\hat{e}}_{i, t} \)’s, were not very large. (See Sect. 4.2 for details.) Thus, most of our subjects seemed to revise their predictions in accordance to the following partial adjustment process.

$$\begin{aligned} {\varDelta } V^e_{i, t} = \beta _i ( {{\mathcal {M}}}_{t} - V^e_{i, t} ) \; , \end{aligned}$$
(5)

where \( 0< \beta _i < 1\). This coincides with an adaptive expectation model if \( {{\mathcal {M}}}_{t} = V \). Therefore, the above partial adjustment model implies that individual subjects would form adaptive expectations, if the median of individual predictions gave perfect foresight about the asset value V.

Now, we transform (5) as follows, in order to see how predictions could be revised.

$$\begin{aligned} V^e_{i, t+1} - {{\mathcal {M}}}_{t} = ( 1 - \beta _i ) \{ V^e_{i, t} - {{\mathcal {M}}}_{t} \} \end{aligned}$$
(6)

It is easily seen that \( V^e_{i, t+1} \) is closer to \( {{\mathcal {M}}}_{t} \) than \( V^e_{i, t} \;\), and \( V^e_{i, t+1} \) exceeds \( {{\mathcal {M}}}_{t} \) neither from above nor from below \( {{\mathcal {M}}}_{t} \), since \( 1> 1-\beta _i > 0 \). Revised predictions get closer to, but never overshoot, \( {{\mathcal {M}}}_{t} \). Therefore, the revised median \( {{\mathcal {M}}}_{t+1} \) stays in the neighborhood of the previous median \( {{\mathcal {M}}}_{t} \). Now, we introduce a notation about the order of predictions, in order to formalize this argument. \(V^e_{(n),t}\) denotes the n-th smallest prediction at the period t. If \( m < n \), then \( V^e_{(m),t} \le V^e_{(n),t}\). This second inequality is an inclusive inequality, since we gave a different placing number for every prediction even when ties exist. \({{\mathcal {M}}}_{t}\), the median of predictions at the period t, is the midpoint of \(V^e_{(12),t}\) and \(V^e_{(13),t}\), since each experimental session involved 24 subjects. Thus, we have the next proposition.

Proposition 1

If all subjects revise their prediction according to the partial adjustment formula (5), then the revised median \( {{\mathcal {M}}}_{t+1} \) lies between \(V^e_{(12), t}\) and \(V^e_{(13), t}\).

Proof

The equation (6) that is equivalent to (5) never revise predictions beyond the target \( {{\mathcal {M}}}_{t} \), since \( 1- \beta _i > 0\). This implies revisions of \( V^e_{(1), t}, V^e_{(2), t}, \ldots , V^e_{(12), t} \) are smaller than or equal to \( {{\mathcal {M}}}_{t} \), and revisions of \( V^e_{(13), t}, V^e_{(14), t}, \ldots , V^e_{(24), t} \) are greater than or equal to \( {{\mathcal {M}}}_{t} \). Therefore, the lower half predictions at the period \(t+1\) consist of revisions of \( V^e_{(1), t}, V^e_{(2), t}, \ldots , V^e_{(12), t} \,\).

Suppose the subject i submitted the 12th smallest prediction at the period t, i.e., \( V^e_{i, t} = V^e_{(12), t} \,\). \( V^e_{i, t+1} \le V^e_{(12), t+1} \,\), since \(V^e_{(12), t+1} \) is the largest in lower half predictions at \(t+1\). Further, \( V^e_{i, t} \le V^e_{i, t+1} \) because equation (6) adjusts a prediction closer to the target, as \(1> 1- \beta _i > 0\). Therefore, \( V^e_{(12), t} = V^e_{i, t} \le V^e_{i, t+1} \le V^e_{(12), t+1} \,\). After all, \( V^e_{(12), t} \le V^e_{(12), t+1} \,\).

\( V^e_{(13), t+1} \le V^e_{(13), t} \) follows in the same manner. Thus, we have the following inequalities.

$$\begin{aligned} V^e_{(12), t} \; \le \; V^e_{(12), t+1} \; \le \; V^e_{(13), t+1} \; \le \; V^e_{(13), t} \end{aligned}$$

This completes a proof, since \( {{\mathcal {M}}}_{t+1} \) lies between \( V^e_{(12), t+1} \) and \( V^e_{(13), t+1} \,\).

\(\square \)

Proposition 1 implies that the partial adjustment formula (5) confines the median of revised predictions \( {{\mathcal {M}}}_{t+1} \) in a neighborhood of the previous median \( {{\mathcal {M}}}_{t} \), which is a midpoint of \(V^e_{(12), t} \) and \( V^e_{(13), t} \,\). Some deviations from (5) are necessary to revise a median prediction beyond an interval \([V^e_{(12), t}, \; V^e_{(13), t}]\). This means the disturbance to (5) is inevitable not only to perturb appropriate predictions, but also to correct a bias of initial predictions.

Experimental outcomes entailed deviations from (5) more or less, whether temporal or persistent. Even if our subjects followed the partial adjustment formula, their target could deviate from the median prediction, depending on perturbations in transaction prices and variations about their focal points in the market. Now, we introduce \( \epsilon _{i, t} \) to represent a variability in the target, and \( u_{i, t} \;\), which denotes the disturbance to the revision process. Then we have the following generalized model.

$$\begin{aligned} {\varDelta } V^e_{i, t}= & {} \beta _i \{ ( {{\mathcal {M}}}_{t} + \epsilon _{i, t} ) - V^e_{i, t} \} + u_{i, t} \end{aligned}$$
(7)
$$\begin{aligned}= & {} \beta _i \{ {{\mathcal {M}}}_{t} - V^e_{i, t} \} + \beta _i \epsilon _{i, t} + u_{i, t} \end{aligned}$$
(8)

Thus, we have the next general model of prediction revision.

$$\begin{aligned} {\varDelta } V^e_{i, t} = \beta _i ( {{\mathcal {M}}}_{t} - V^e_{i, t} ) + \eta _{i, t} \; , \end{aligned}$$
(9)

where \( \eta _{i, t} = \beta _i \epsilon _{i, t} + u_{i, t} \,\).

4.2 Analysis of disturbances

The disturbance term \( \eta _{i,t} \) in a general adjustment model (9) is a driving force to correct or exacerbate a bias in initial predictions. The next proposition follows directly from Proposition 1.

Proposition 2

If \( {{\mathcal {M}}}_{t+1} \not \in [V^e_{(12), t}, \; V^e_{(13), t}]\), then \( \eta _{i, t} \not = 0 \) for some i, so long as all subjects follow (9) with positive adjustment coefficient \( \beta _i \) that is smaller than one.

Medians of predictions in our experiment showed only moderate fluctuations and no tendency to increase or decrease (see Fig. 13), suggesting \(\eta _{i, t}\) did not have enough power to mitigate or reinforce a downward bias of initial predictions.

Fig. 13
figure 13

Medians in each session

We estimated the disturbance term \(\eta _{i, t}\) in the prediction adjustment model (9) by estimates of the intercept and residuals of regression model (4) as follows.

$$\begin{aligned} {\hat{\eta }}_{i, t} = {\hat{\alpha }}_i + {\hat{e}}_{i, t} \end{aligned}$$
(10)

Statistics about \( {\hat{\eta }}_{i, t} \)’s are summarized in Table 10. The magnitudes of disturbances were not very large. Most \( {\hat{\eta }}_{i, t} \)’s were in a range of \( 0 \pm {{\mathcal {M}}}_t/4\) as is shown in Table 11. Table 12 reports Durbin-Watson ratio of regression for every subject. Positive serial correlations were not detected in \({\hat{e}}_{i, t}\)’s. Therefore, \({\hat{\eta }}_{i, t} \)’s also had no positive serial correlations. This implies that there were not enough accumulated effects of disturbances to drive predictions in one direction.

Table 10 Statistics of disturbance
Table 11 Frequency of disturbance in \(0\pm \alpha {{\mathcal {M}}}_t \) Region
Table 12 Durbin–Watson ratio of individual regression

Table 13 shows the correlation between \({\hat{\eta }}_{i, t}\) and the submitted prediction \( V^e_{i, t} \). They were positively correlated for all t. This implies that a revision of a larger prediction was often accompanied by a positive \({\hat{\eta }}_{i, t}\), and negative ones tended to occur when smaller predictions were revised. Therefore, \({\hat{\eta }}_{i, t}\) was apt to be positive or negative, when \( {{\mathcal {M}}}_{t} - V^e_{i, t} \) was negative or positive, respectively. This means \({\hat{\eta }}_{i, t}\) often had an effect that made a new prediction move away from, rather than closer to, the previous median \( {{\mathcal {M}}}_{t} \). The disturbance term \( \eta _{i, t}\) added inertia, rather than agility, to the prediction revision process in our experiment.

Table 13 Correlation between disturbance and prediction

4.3 Over-adjustment and overshoot

The disturbance term \( \eta _{i, t}\) did not have enough power to correct or exacerbate a prediction bias in our experiment. Although Proposition 2 insists the power of \( \eta _{i, t}\) is essential to release \({{\mathcal {M}}}_{t+1} \) from the neighborhood of \( {{\mathcal {M}}}_{t} \), it assumes the partial adjustment, i.e., \( 0< \beta _i < 1 \). If \(\beta _i > 1 \), the prediction revision process of (9) even with zero disturbances could extrude \( {{\mathcal {M}}}_{t+1} \) out of \( [V^e_{(12), t}, \; V^e_{(13), t}]\). Actually, some subjects followed a non-partial adjustment as is shown in Table 14. Adjustment processes of nine subjects in Session 1 and four subjects in Session 2 were non-partial, i.e., they contradicted the parameter restriction \(0< {\hat{\beta }}_i <1\). Six and two of them in Sessions 1 and 2, respectively, over-adjusted their predictions, i.e., \( {\hat{\beta }}_i > 1\).

Table 14 Non-partial adjustment
Fig. 14
figure 14

Absolute change of median in each session

Although the median prediction fluctuated moderately as is shown in Fig. 13, the absolute value of \( {\varDelta }{{\mathcal {M}}}_{t} = {{\mathcal {M}}}_{t+1} - {{\mathcal {M}}}_{t}\) was relatively large in Session 1 at \(t=1\). (See Fig. 14.) This is partly because Session 1 involved over-adjusting subjects much more than Session 2. In spite of this, \( |{\varDelta }{{\mathcal {M}}}_{t}| \)s were smaller in Session 1 than in Session 2 during every period except \(t=1\) and 4, in contrast to the largest value at \(t=1\) in Session 1. Six over-adjusting subjects by themselves were not sufficient to explain why \(| { \varDelta }{{\mathcal {M}}}_{1} |\) in Session 1 was the largest. A synergy of the over-adjusting subjects and the effect of disturbances is a key to reveal the reason.

Fig. 15
figure 15

Skewness of predictions in the two sessions

As Table 13 indicates, correlation between the prediction \(V^e_{i, t} \) and the disturbance \( \eta _{i, t}\) was relatively small at \(t=1\) of Session 1, suggesting weaker inertial effects of the disturbance. Therefore, in this period, reinforcement of adjustment by \( \eta _{i, t}\) was likely. If this reinforcement occurred to over-adjusting subjects, the revision of their predictions would be overshooting. An overshoot is the case when either of the next two conditions holds.

$$\begin{aligned}&0< {{\mathcal {M}}}_{t} - V^e_{i, t} < { \varDelta } V^e_{i, t} \end{aligned}$$
(11)
$$\begin{aligned}&0> {{\mathcal {M}}}_{t} - V^e_{i, t} > { \varDelta } V^e_{i, t} \end{aligned}$$
(12)

The magnitude and the direction of overshooting were calculated by the next formula.

$$\begin{aligned} { \varDelta } V^e_{i, t} - ( {{\mathcal {M}}}_{t} - V^e_{i, t} ) = V^e_{i, t+1} - {{\mathcal {M}}}_{t} \end{aligned}$$
(13)

Table 15 summarizes the aggregated value of (13) for all overshooting subjects, i.e., for all i which satisfies (11) or (12), at each period.Footnote 28

At \(t=1\) in Session 1, three subjects overshot \( {{\mathcal {M}}}_{1} \) from above, and only one did so from below this target. (See ‘Frequency’ of Table 15.) Further, these overshoots aggregated to \(-210\). (See ‘Aggregated Value’ of Table 15.) The first period of Session 1 showed clearly asymmetric overshooting in both aspects of the magnitude and the frequency. This period, as shown in Table 16, also involved an uneven distribution of six over-adjusting subjects, i.e., five of them predicted above \( {{\mathcal {M}}}_{1} \), probably just by chance. This transient imbalance, which led to prominently asymmetric overshooting in prediction revision, seems to be a fundamental reason for the largest absolute change of the median prediction at \(t=1\) in Session 1.

Table 15 Aggregation and frequency of overshoot
Table 16 Frequency distribution of six over-adjusting subjects in Session 1

5 Discussion

5.1 Wisdom of crowds

An aggregation of estimates is not uncommonly more precise than individual estimates. Galton (1907), Lorge et al. (1958), and Treynor (1987) showed the precision of collective estimate of many individual estimates, where each estimate was usually not precise in comparison to the estimation as a cohort, i.e., the mean/median of individual estimates. Surowiecki (2004) cited these studies to demonstrate examples of the wisdom of crowds, the notion also known as the collective intelligence. The market mechanism is possibly an example of this kind of wisdom, since the market price reflects various insights of many traders. Stock prices in real markets seem to fully reflect available information, and for individual investors to persistently outperform the market index is very difficult (see, e.g., Cowles 1933; Fama 1970; Carhart 1977). Surowiecki (2004) also described the market mechanism as a typical example of the wisdom of crowds, while referring to an empirical study of Maloney and Mulherin (2003), who reported that the stock price of the firm responsible for the crash of the space shuttle Challenger had plunged most sharply soon after the crash, long before the cause of the accident was determined.

Contrary to the results of these previous studies, our subjects, as a cohort, underestimated the number of balls in the jar, and prices in our laboratory markets never revealed the true value of the asset. The results could have been influenced by cultural bias as our experiment was conducted in Japan, with only Japanese subjects. According to Rieger et al. (2014), who had surveyed the risk preferences in 53 countries, significant cross-country differences in risk aversion depended on cultural factors. Their studies indicated that ambiguity aversion was much higher in Japan than in the USA, and Japanese people were slightly more risk averse when compared with global samples. Our results also possibly indicated that our subjects, all Japanese, were considerably risk-averse, and hence, their bids to buy assets were low, resulting in significantly low prices, which in turn led to lower expectations. However, our subjects would offer higher prices to sell assets if they were risk-averse.Footnote 29 Therefore, risk aversion is insufficient to explain the downward bias found in our experiment, so long as our subjects did not estimate the asset value only from the buyers’ viewpoint. In addition, their inexperience in jar-guessing and other similar contests could also be a reason for imprecise estimates, but not their downward bias.

The reasons behind the negative bias in estimations could be a topic of interest for psychologists. Economists would be interested in the process of how the bias evolves in the market, rather than the bias in and of itself. Unfortunately, we did not discover any systematic effect of the market on prediction errors. Mean/median errors of predictions were erratic rather than systematic. In other words, collective predictions had no clear tendency to improve or worsen in our experimental markets.

However, we found that experience in the market reduced the diversity of predictions. Our subjects watched prices that converged at the median of submitted predictions. They adjusted their predictions toward the median prediction, even though they could not directly observe predictions of others. Hence, their predictions became similar as they participated in the same market. This can be a threat to the collective intelligence of the market. Surowiecki (2004) and Page (2007) insisted that individual judgements should be diverse for the wisdom of crowds to emerge. Similarly, Treynor (1987) demonstrated the ways in which the shared error among investors deteriorated the accuracy of the market price.Footnote 30 Therefore, if the market reduces the diversity of individual expectations, it may destroy its own intelligence. Having said that, 24 traders in our experiment is too small to be called a crowd.Footnote 31 However, in light of the partial adjustment model of the prediction revision, we can guess that a larger laboratory market would reproduce the diminishing diversity of predictions, which may also hinder the emergence of the collective intelligence.

Our laboratory markets showed no intelligence in revealing the true value of the asset even during the early periods, when the subjects’ predictions remained relatively heterogenous. Despite this, our market was efficient in disclosing a collective opinion as a cohort. Prices in our laboratory converged to the median of the subjects’ predictions about the jar, i.e., to the equilibrium price that would balance demand and supply. It is well established that the price in the DA market converges to the equilibrium price. Demand-supply conditions that determine an equilibrium are given as parameters in traditional DA experiments that originated from Smith (1962). In contrast, the demand and supply in our laboratory were induced from the subjects’ expectations about the jar, and these expectations changed during an experimental session. Despite such an insecure environment, the price converged to the equilibrium price. In conclusion, a failure in the market price to reveal the fundamental asset value was due to the preconceptions of the traders, and not the inefficiency of the market.

Conversely, if the initial median prediction had been precise, the market price would have revealed the true value of the asset. In view of the convergence of the price in our laboratory market, this revelation process would be robust to disturbances. Therefore, if subjects had enough practice to estimate the asset value, the market price would be a precise and stable estimator of the fundamental value. However, if we shocked the market, e.g., by changing the fundamental value at the stage when expectations of traders became homogenous, what would happen? Is a shock in the late stage more severe to the market stability than at the early stage with heterogenous expectation? We need further experiments to examine this topic.

5.2 Comparison to other studies

Smith et al. (1988), so-called SSW,Footnote 32 and Haruvy et al. (2007) studied the relation between the price formation and the expectation formation in experimental markets. Learning-to-forecast experiments, known as LtFEs,Footnote 33 also examined the relation, though their subjects concentrated on price forecasting since experimenters used a computer to make prices from submitted forecasts. In these experiments, subjects were required to forecast prices in the future. While our experiment also investigated the interdependence between price and expectation, the task of our subjects was to estimate the fundamental value of the asset, and not its price in the market. Other than this, our experiment differs from SSW’s and LtFEs in many respects.

The life of the asset in our experiment was set for only one period unlike SSW’s and LtFEs. Any inter-period trading which may bring capital gains was impossible in our experiment. This denied the profitability of the trend following behavior in which followers expect the future price to rise when the past prices increased. Even intra-period trading to secure a short-term profit was impossible, since our subjects could trade only once in a period and hence reverse contracts were excluded. They could neither resell an asset, nor buy it back from other subjects. Therefore, the uncertainty about the behavior of others, which is a cause for price deviations according to Akiyama et al. (2017), was irrelevant in our experimental market. No room for speculation in our market probably prevented bubbles, which often emerged in SSW’s and LtFEs.Footnote 34

Also, contrary to SSW’s, the fundamental value of the asset was constant in our experiment. According to experiments by Noussair et al. (2001), constant fundamentals made bubbles less pronounced. Similarly, Kirchler et al. (2012) demonstrated that the declining fundamental value was the main driver for mispricing in experimental markets. We suspect that changing fundamentals could cause bubbles even in our non-speculative markets. An endogenously changing asset is a good prospect, since Soros (1987, p.57) said that a reflexive connection is interesting only if fundamentals are endogenously changeable.Footnote 35 What would happen if we changed the number of balls depending on the price in the market? We need further experiments to investigate this.

Even though no bubble occurred in our experiment, the fair price that reflected the fundamental value was never found in our markets because of a persisting downward bias in the subjects’ expectations. This is not surprising because our subjects received no feedbacks about their errors and had no clues to correct estimates, which was distinct from SSW’s and LtFEs. Real markets also lack the feedback about the fundamentals. Despite the feedback about errors in forecasting prices or profits of firms, or the information from the financial statements of firms, investors in the real world never know the true fundamental value. Our experiment was similar to the real world in this respect.

In SSW’s and LtFEs, subjects knew the fundamentals since the probability distribution of dividends was revealed, even if they might not regard it fundamental. We determined the fundamental value by a jar with balls, not by a probability distribution. Our subjects probably regarded the jar as a fundamental since the transaction was settled at the value defined by the jar, but never knew the right value. Knight (1957) and Keynes (1921) contemplated the uncertainty that has no mathematical probability. Further, Ellsberg (1961) experimentally discovered the effect of the ambiguity, which was not specified by the probability distribution. Although the value of our experimental asset was deterministic, our subjects never knew its fundamental value. It was ambiguous for the subjects not because of the uncertainty due to the mathematical probability, but because of the limitation of their cognitive ability.

To extract the effect of this ambiguity, we need additional experiments. For example, we may introduce the uncertainty of the fundamental value through the probability distribution determined by the distribution of submitted predictions about the jar. What if subjects participated in both markets: the market of the ambiguous asset and the market of the asset of measurable uncertainty?

5.3 Future issues

We need more work to examine the exact mechanism of price convergence. In our experiment, approximately half of the buyers with higher estimations of the asset value were bidding up—toward their estimations. The other half were asking to sell, conceding their order prices toward their lower estimated values. Such a process might necessarily make the price to converge at the median of estimations. However, the following questions arise: Why was the number of buyers approximately same as the number of sellers? Why did order prices of each side meet at the middle of the estimated values? Were there no inequalities in bargaining power? We need to scrutinize our experimental data more elaborately to answer these questions. Additional experiments might be necessary.

We studied our subjects’ expectation revision process with a simple linear model in Sect. 4. The disturbance term in this linear model had a positive correlation with the subjects’ expected value (see Table 13), suggesting some nonlinearity in the nature of the revision process. A non-linear model is probably more appropriate to represent the behavior of our subjects. However, nine periods in our experiment were too short for examining individual behavior. It is necessary to conduct a more-than-12-period session.

We modeled our subjects’ expectation formation as the partial adjustment process to the median of submitted predictions. Even though the subjects had no direct information about predictions submitted by others, this model provided a good approximation of the subjects’ behavior, which was not surprising since prices in markets tended to converge to the median of predictions. Nevertheless, targets of adjustment probably differ among subjects, depending on their focus about what to observe in the market. The variety of the target, represented by \( \epsilon _{i, t} \) in our model (7), is part of the driving force to correct or exacerbate a prediction bias. Eye tracking of subjects would reveal this variety. Interviews and questionnaires to subjects after the experimental sessions would also be useful.

6 Concluding remarks

In our experiment, the subjects’ predictions about the number of balls in a jar had a downward bias. Transaction prices of the asset whose value was defined by the jar also showed a downward bias. Prices reflected the downward bias of the subjects’ expectations about the fundamental value because of their auctioning behavior. Subjects who predicted higher values tended to bid. Lower value predictors were apt to ask. Consequently, the prices converged at the median prediction, which had a downward bias in our experiment. We need additional studies to reveal the precise mechanism of this convergence to the equilibrium price defined by the median prediction.

As the subjects alternately repeated jar-guessing and asset trading, their expectations about the fundamental value changed. Although our market neither improved nor deteriorated the accuracy of predictions submitted by the subjects, it made them less heterogenous. We examined the prediction revision of our subjects using the partial adjustment model, which targets the median of submitted predictions. Despite our experimental design that did not allow the subjects to observe predictions of others, this model was useful to explain experimental outcomes, probably because prices in the market reflected the prediction median. Further investigation is necessary to uncover the exact target of the adjustment process, which may vary among subjects.

If all subjects strictly followed the partial adjustment process with the target of the median prediction, the central tendency of their predictions would almost stay at the initial region. Therefore, some deviation from the partial adjustment is inevitable not only to disturb precise predictions, but also to correct a bias in initial expectations. Such deviations possibly come from over-adjusting behavior, perturbations in the market, individual variation in the target, and personal uncertainty.