The concept of scaffolding is well known in psychology. This concept was originally proposed by the Russian developmental psychologist Lev Vygotsky (1986) and represents the distance between the actual developmental level of a child and their potential developmental level (See Fig. 3). However, Vygotsky never used the term scaffolding. His famous theory, the zone of proximal development (ZPD), indicates the transitional learning area between what a child can do alone and what a child cannot do without the help of others. The overlapping area between the two stages indicates the developmentally undetermined area, in which a child may acquire knowledge or skill through appropriate support from the teacher. The two areas located before and after the ZPD are the area of learning already completed and that of learning yet to be completed. The crossover in-between indicates the area of learning that can be accomplished with guidance and encouragement, which is the proximal zone where actual learning occurs.
The idea of the ZPD was picked up and later developed by psychologists J. Bruner and others under the name “scaffolding.” Through this scaffolding, a child obtains new skills and knowledge by being offered “support points” or “stepping stones” (Wood et al. 1976). The concept was further interpreted by J. Hoffmeyer for the biological sciences under the name semiotic scaffolding, which represents a web of checks and balances established through communication between organisms (i.e., a historically created interaction mechanism) (Hoffmeyer 2015, p. 154). A concrete example is revising the preceding act at a later moment. Emmeche (2015) rephrases Hoffmeyer’s point by stating that semiotic scaffolding guides a system’s behavior to follow a more definite sequence of events, indicating the network of semiotic controls. Favareau (2015) uses a framework of cybernetic jargon to show that the definite sequence of events from semiotic scaffolding is a type of interactional constraint and possibility bias.
Let us explain our view regarding time progression schematically based on semiotic scaffolding. For this purpose, a situation should be presented in which there is no semiotic scaffolding, which is similar to an artist’s white canvas before drawing. However, we also will use the following two examples throughout for illustration purposes: one is a parent teaching a child how to ride a bicycle, and the other is a bird teaching a young bird to fly. Thus, Fig. 4a is an undefined space that includes only a frame. That is, there is no punctuation and hence “no difference that makes a difference,” and only the arena (time and space) of activity is provided, such as the day/season and environment for the riding or flying. In the jargon of the ZPD, only the zones, the areas focused on learning, are granted. Here, an action that occurs inside the space has no way of being evaluated due to the absence of criteria. There is no difference; thus, the action does not lead to any time.
Figure 4b provides vertical and horizontal lines that form a matrix. The lines can be considered pre-scaffolding for an organism to act upon the space and make interpretations. Only with the presence of agency can the matrix function as scaffolding; however, there is no agency, and no rule has been established for utilizing the matrix at this stage. Greater degrees of freedom are observed in this figure; thus, our action can be started from any place downward from the upper right corner or radially from the central space. The zones in the ZPD, such as the Known, Can be Known, and Not Known (Fig. 3), may correspond to this stage, in which no learning direction is specified. This stage is similar to the stage for checking the bicycle as a vehicle beforehand or for the mother bird stopping to feed her young. For both riding and flying, they still remain a set-up or static scheme that needs a rule to make it workable. Unless acted upon, the scheme is simply a picture corresponding to the C-series, and Fig. 4b remains in a pre-scenario (See also Table 1). Such a scheme is similar to a clock’s face without the two hands.
The next step is to introduce a rule onto the matrix regarding use. If rules are set, then the lines in the space become scaffolding that can “guide a system’s behavior to follow a more definite sequence of events” (Emmeche 2015: p. 275). If a child occupies a box in the upper right corner and continues to occupy boxes vertically one by one, then their actions will determine the direction of movement. For example, the Greek alphabet is inscribed vertically as follows: ➔ (See Fig. 4c). Once the rule for progression has become clear, the writing can act as a sort of primitive clock because it has a series of progressive punctuations; however, what type of time would it keep? We do not know at this point. This stage also corresponds to the ZPD, in which the parent briefly holds the child’s bicycle to assist the first and second pedal strokes, parent ducks actually push their ducklings out of the nest to force them to use their wings, and parent eagles fly with their young together to illustrate the process of catching the wind in the air. Definite stepping stones exist in the above scenarios.
Next, the matrix can be used to approach a time indicator (i.e., a calendar), in which the rule becomes clearer because each box counts a single day and the days move horizontally from left to right (See Fig. 4d). If the timekeeper is a bystander-observer or the third-person observer (e.g., someone looking at next year’s calendar), the scheme appears to him/her to be in unidirectional time, indicating before/after relationships. Such time becomes the self-running B-series time. B-series time corresponds to linearly progressive clock time or the ST in our civilization, in which everyone agrees on how to read the punctuation following the suggestions of physicists. This time is objective in nature and indicates only earlier/later or before/after relations. B-series time shows a sequence but remains tense-less; therefore, it does not tell us anything about the past, present, or future. The B-series clock is globally synchronized and externalized because it is read by all of the concerned bystanders, the third-person observers, who do not participate in its making. The ingenuity of B-series time leads to the inevitable selection of the tense-less time code whenever the space-time coordinate is introduced.
If, however, the timekeeper is a first-person agent (or someone actually using this calendar) positioning June 18th in one of the unit boxes that indicates “today,” the scheme appears to her to be an ongoing continuation from the past to the future via the present. Such time becomes self-narrating A-series time, for which she can put a check mark on her own decision. A-series time has been defined as subjective time assuming a single agent has access to its own memory and anticipation (Nomura et al. 2018). The essence of A-series time is centered around tense, or a sense of duration or an unbroken succession of events that constitute one’s own past, present, and future. For example, we know that the timing of saying “good night,” which indicates an individual’s end of the day, may not occur at midnight but instead is a choice of the individual. A-series time seems to presume agential capacity in that the transformation of tense is made possible by a single experiencing subject coordinating different tenses (See Table 1).
However, if you and I are in a communicative relationship, neither you nor I alone can make a unilateral decision over the definition of a day’s passage. Deciding the beginning and end of the day becomes a matter of working out or negotiation because we are second-person negotiators working toward collaborative decisions. In this case, turn taking of punctuation and mutual time alignments are necessary to co-fabricate the present moment of now or today from within. Such locally synchronized time corresponds to E-series time, figuratively speaking. The second-person negotiators as the participants (i.e., internal observers) have a vision of the scenario and anticipate near-future punctuations in advance and in response to each other. Therefore, June 18th is jointly checked based on the shared knowledge that June 19th lies ahead and can be expected, as is the case for athletes guided by the lead car at 5-m intervals. Thus, the matrix (Fig. 4d) can be the semiotic scaffolding or predictive model that assists us in negotiating and deciding the beginning and ending of the day and in providing a scheme for future joint action. This assisting guidance may also represent a reverse causality because the scheme that extends to the future prompts us to define the current position (Matsuno 2016, 2017). E-series time control permits one to foresee the very next step and possesses the C-series scheme as its prediction model. Both the novice bicycle rider and the young flying eagle have attained their own schemes (i.e., prediction models) for riding or flying, which will be an interaction with the parent’s or mother eagle’s schemes and adjustments of their temporal spans to achieve joint actions.