Background

Observational measurement of eating and feeding provides a window into behaviors of children and their families that participants may not self-report. This methodology provides the researcher with a unique glimpse into eating behaviors, which can yield rich quantitative and qualitative data. A wide variety of behaviors can be observationally captured. For example, one can examine individual eating behaviors or familial interactions around food, how much food is consumed under certain conditions, environmental factors (where a meal is eaten, how it is served, who is present at the meal), and linguistics of conversation around food. Most studies of eating, feeding and mealtime behaviors have used self-report measures [1, 2], which although less resource intensive, is limited by social desirability bias [3,4,5]. Observational methodology can capture behaviors of which people are unaware or do not to report, or that are not explicitly asked about. Some prior work [3, 5, 6] has found that self-report and observational measures of eating and feeding behaviors are only weakly correlated. These weak associations may be because these different methodologies are measuring different constructs, with questionnaires measuring parents’ perceptions of behaviors, and videotaped observations measuring objective observations. The weak correlations could also be due to social desirability bias on questionnaires, or the Hawthorne effect [7] that comes with knowing one is being video recorded. Other work [8] has found good correlations between parent report and observed mealtime behaviors, when the constructs from both measures are tightly mapped and highly concordant. It is important to note that self-report measures only capture constructs and behaviors that the questionnaire specifically asks about, and therefore can leave aspects of eating or feeding interactions unquantifiable. Video recorded observational techniques allow for the preservation of a nuanced interaction, which can be reviewed when new questions arise, and re-examined for different facets of behavior. The strengths and limitations of employing observational and self-report methodology for eating and feeding behaviors are summarized in Table 1. The researcher must carefully weigh the strengths and limitations of each methodology, while considering how best to test their hypothesis when choosing their methodological approach. In summary, recognizing that all methods have strengths and limitations, observational methodology captures unique aspects of eating and feeding that enrich our understanding of these behaviors and the contexts in which they occur.

Table 1 Advantages and challenges of self-report measures vs. observational coding for measurement of eating and feeding behaviors

Despite the many benefits of observational coding of eating and feeding behaviors, there are potential challenges which deserve consideration in study design and planning. Herein, we will briefly describe some of the challenges. Technical considerations of video recording eating behaviors should not be underestimated. These include ensuring adequate video and audio quality to capture desired behaviors, getting video recording devices to participants along with instructions for set-up, recording and return of devices and data, in addition to storage and management of video files. This can be logistically challenging, and requires a detail oriented staff who can communicate well with participants throughout the process to maintain data quality. Obtaining video recordings of eating and feeding behaviors is more time and resource intensive than questionnaires as it requires more participant time, more involved data management and logistics, and coding of behaviors. Furthermore, as described below, coding behaviors can be challenging and time intensive. As previously mentioned, the Hawthorne effect with regard to participants’ behavior while being video recorded will always be a limitation. However, most of these challenges are surmountable with careful design and flexible thinking and, in our opinion, are outweighed by the quality of the data the can be yielded from observational methodology.

The objectives of this paper are to outline methodological considerations when designing, implementing, coding and analyzing data from a study using observational coding of eating and feeding behaviors in children and their families, and highlight best practices for doing so. An overview of the steps in such a study is presented in Fig. 1.

Fig. 1
figure 1

A flowchart of basic steps in the study of observational eating and feeding behaviors

Designing the behavioral protocol to be observed

When designing a behavioral protocol to capture eating or feeding behaviors, the authors believe that it is important to start with a hypothesis. What is the behavior or variable that the researcher is most interested in? The answer to this question will guide elements of the behavioral protocol such as whether the video recording should be in a naturalistic environment (i.e., at home) or in a laboratory setting. If, for instance, the researcher is interested in testing if maternal-child food talk varies based on portion size of a meal presented, then researcher may opt to conduct their study in a laboratory based setting in which food can be precisely portioned, and audiovisual equipment can capture the verbal and physical interaction. If, on the other hand, the researcher is interested in examining whether the structure of family meals (e.g., location of the meal, whether the family says a blessing before the meal, if the meal is eaten at a table, etc.) is associated with child weight status, obtaining videos from a naturalistic environment would be more appropriate. Being hypothesis driven will also be important in determining the parameters placed around the video recording – including how the food is presented, types and portions of foods served, length of video recording or food exposure, people present during food consumption, etc. There are many considerations that contribute to careful design of a protocol, which are essential to ensuring that the data obtained will be optimal quality to test the hypothesis.

The setting

The setting of a protocol to capture eating and feeding behaviors is a fundamental consideration in study design, as the setting may have a strong influence on participants’ behavior [9]. The setting of a protocol can be a naturalistic environment (e.g., in the home or at a restaurant), a laboratory, or a semi-naturalistic environment with controlled elements (e.g., a simulated laboratory restaurant). The advantages and disadvantages to each type of setting, (summarized in Table 2) should be carefully weighed when choosing a study design to best test a hypothesis. Naturalistic settings theoretically capture people exhibiting more naturalistic behaviors [10]. For instance, in a home mealtime recording participants may be more at ease and may be more likely to eat and interact in a more natural manner, allowing for the capture of data that is closer to “real life”. Additional advantages include being able to capture the eating interaction between multiple family members or participants more easily than arranging for a laboratory visit for multiple people. The researcher may also gain insights into the ways in which food is prepared and served, who is present at the meal, and what additional events are going on at the same time (e.g., TV on, people coming in and out of the home, mobile devices present, etc.) [11]. The day-to-day variability of eating in naturalistic settings is another important consideration [12, 13] – (e.g., just because a family orders in pizza one night, or has grandmother over to eat with the family, does not necessarily mean that this is an everyday practice). Additional disadvantages to observations in a naturalistic setting include the lack of experimental control with regard to the ways in which the foods are served or eaten, difficulties in measurement of food intake, and who is present at the mealtime [14]. Technical considerations around video recording are also a major consideration, as sound quality, lighting, movement of the participants during the mealtime out of the view of the camera, and background noise may all interfere with the quality of data captured. Some of these issues can be overcome if a research assistant delivers the video recorder to the participant(s) and instructs the family on its use, retrieving it after the first session or after subsequent sessions [8], however this may come at the expense of social desirability bias, as it is an additional reminder to the participants that they are being observed, and is labor intensive [15]. In addition, it is important to consider that for all individuals present in a video recording for research purposes, informed consent or assent must be obtained [16].

Table 2 Advantages and challenges of different settings for the observation of behavioral eating and feeding

In a laboratory setting, the researcher has much more control over the experimental design and implementation [17]. The manner in which the protocol is set up can be standardized, including the physical environment, the ways in which food is presented (e.g., buffet style, family style, individualized portions), who is present for the protocol, and length of time the participants are exposed to a food. In addition, the type and amount of food served can be carefully controlled and standardized across participants. The researcher can also manipulate the independent variables, and can more accurately measure intake [17]. Technical issues such as video and sound quality are easily controlled, and the researcher can troubleshoot any issues that arise in the moment. However, as in all data collection relying on video recorded data, the laboratory setting is not exempt from technical challenges (e.g., battery or power issues, video recorded malfunctioning, etc.). A challenge of the laboratory environment is that participants may interact with each other and food differently in an unfamiliar environment [18]. A final setting, the semi-naturalistic environment, consists of a replication of a naturalistic environment in a laboratory setting that allows for tighter control of experimental and technical elements, but may put the participants at ease. Examples of these types of arrangements include a laboratory designed like a kitchen and dining room [19] or a simulated fast-food restaurant [20], equipped with “hidden” cameras. While the participants know about the presence of the cameras, they may eat and interact in a more naturalistic manner due to the camera not being in front of them on a tripod as a constant reminder.

For all settings, the impact of social desirability bias and the Hawthorne effect [7] on participants’ behavior due to the presence of a video camera is difficult to gauge–participants may behave differently because they are aware of being observed, to avoid embarrassment, or to “look good” to the researcher, which may differ from their typical behavior. Observations in naturalistic or semi-naturalistic environments may theoretically lessen this effect, but cannot eliminate it.

When choosing a setting, it is important for the researcher to weigh the strengths and weaknesses of each setting type, as mentioned above. The authors feel that a guiding question can be whether or not the hypothesis requires testing of a tightly controlled exposure (e.g., time, food type, amount consumed, presentation, etc.). If the hypothesis requires a tightly controlled exposure, then a laboratory setting is likely best. However, if the research design does not require a tightly controlled exposure and the researcher wishes to capture more “typical” behavior, then a naturalistic setting should be chosen. Lastly, the researcher may choose a semi-naturalistic setting if they want to both control for an exposure as well as capture more typical behavior.

The set-up

Regardless of the setting of the video recording, the authors suggest that it is important to explain the purpose of the protocol to participants in a scripted and standardized manner, carefully veiling the purpose of the study and the hypothesis being tested. The participants should not be aware of the specific goals of the study or the specific behaviors being observed, to avoid biasing their behavior. Special attention needs to be paid so that the script is not leading, but also meets the ethical obligation to inform the participants about the study. For instance, if the amount of food consumed in a particular condition is the outcome to be measured, it is important to avoid implying to the participants that they are expected to eat the food (which could serve to increase their consumption). Rather, instructing the participants to “eat as much or as little as you’d like”, allows for more flexibility in the participants’ behavior. In a home mealtime recording, the same considerations apply when explaining the purpose of the video recording to the participants. The authors have found that explanations can be as vague as, “we want to learn how families do dinner”.

Technical issues

When video recording, the frame of the video should to be standardized so that the desired behavior is captured [21]. The authors have found that there are many considerations for the researcher that should be outlined explicitly in the protocol to optimize quality data collection. The researcher must consider if they want just an individual in the frame, an entire group or family or a dyad. If multiple people are present, do they all need to be facing the camera, or is it acceptable for some people to have their backs to the camera? If there is an index participant who is the focus of the study, can other people be eating with him or her, and does the index participant need to be in the center of the frame? Is it important to have the food being eaten in view of the camera, and if so, does this include the plate/dish/wrapper from which the individuals are eating, and/or any larger serving containers? Does the view of the participants need to be unobstructed?

The length of the recording should be specified [22]. Does the researcher need a fixed length of time for the eating segment (this is easy to do in a laboratory setting), or should the participants be allowed to decide when they are “done”? In a naturalistic setting, clear instructions should be provided to the participants with regard to how to set up the video recorder, when to turn it on or off and how to set up the video frame. In a laboratory setting, these details should be articulated in a standardized written protocol. Files of the video recordings, regardless of setting, should be labeled with the participant’s unique identifying number after the file is obtained.

Lighting and sound quality can vary greatly in naturalistic settings [21] and, in the author’s experience, are more easily controlled in a laboratory setting. There are additional challenges of returning the video camera, or digital files containing the video recording to the research team when recordings are completed by participants in naturalistic settings. Associated costs and logistics must be considered in study planning.

Special considerations for naturalistic settings

There are some additional special considerations for observed eating or feeding behaviors in naturalistic settings. Given variability across days, it is important to consider how many events or meals are needed to achieve acceptable reliability across meals. In addition, the number of meals necessary to achieve reliability across meals depends on the behavior being evaluated. It is also important to obtain a sense from the participants as to whether or not the meal or eating interaction was “typical” for the family. This is important to know given the inherent variability of naturalistic settings (e.g., child is unwell or has a tantrum, unexpected mealtime visitor, etc.) and the common desire for studying “typical” behavior.

After the meal, it is often necessary to document who was present and what was served, as this may not be apparent when reviewing the video recordings. The authors have found that this can be done via phone call by a research assistant, or self-recorded by the participant. This information should be collected in a standardized manner across mealtimes or eating events and participants.

Lastly, the authors have encountered instances in which interactions or behaviors captured on video cause concern for the safety of participants, including concerns for child abuse or neglect. Prior to implementing a protocol and collecting data, it is important to have a plan in place regarding how to report these concerns to the appropriate agency to protect the well-being of participants. In the United States, researchers are Mandated Reporters [23], meaning that the law requires researchers to report their suspicions of child abuse or neglect to Children’s Protective Services. The laws and agencies may vary in different countries.

Coding

Coding is a rigorous, systematic and often iterative process that consists of identifying a target behavior or event, and the best way to capture it from video recordings [10]. It is important to consider the potential coding approach during study design [10], and then revisit the feasibility of this approach through review of videos prior to development of the coding scheme. If an event or behavior does not occur in the video recordings as frequently as the researcher had hypothesized (or occurs in nearly every video), the researcher would be prudent to reconsider the approach prior to starting to code.

Questions that need to be answered early in this process to inform the development of a coding strategy include: which individuals (e.g., an individual, a family, a dyadic interaction?) and/or what events (e.g., food being served, mobile device use while eating, conversation, bites of food) are being coded [24]? If events or behaviors are contingent upon one another (e.g., child compliance with maternal prompts to take a bite), multiple behaviors will need to be coded in relation to one another in time.

Coding approaches: Time and frequency considerations

The time interval for coding must be carefully selected. Videos can be coded as a whole, in intervals or as event based. The selection of coding interval depends on the level of detail that the researcher wishes to capture. For instance, to capture general information about a family mealtime, a researcher may choose to code a video in its entirety for simple binary or categorical variables (e.g., in what room of the house did the meal take place? Which family members were present at the meal? Did the mother sit down at the table? Was the TV audible)? More descriptive questions can be answered as well, (e.g., on a scale of 1–5, how oppositional was the child during the meal?) However, if a researcher desires to know more granular information about the meal or eating event, they may wish to use frequency-based, duration or interval coding.

Frequency or event-based coding records the occurrence of each event in a specific period. This coding approach is best used for discrete behaviors that are easily counted (i.e., each time a participant picks up his/her fork and puts food in his/her mouth), as well as shorter observational periods, as frequencies can be labor intensive to capture. If the hypothesis seeks to examine the temporal relationship of one event or behavior to another, it is important to also capture the timing of each event.

Duration coding captures the absolute value or percent of the time that a behavior occurs during an observation period (e.g., minutes in which the child is seated at the table during the meal). Duration coding can be used to capture the percentage of time that an individual is engaged in a specific behavior (e.g., percentage of the protocol that a mother was engaged with her mobile device). This type of coding is best used for behaviors that last more than a few seconds and that occur repeatedly throughout a meal or protocol.

Lastly, interval coding is an alternative approach that can alleviate some of the challenges of frequency or duration coding [10]. In this method, the researcher examines whether a behavior or event occurs at all in multiple smaller time intervals during the observation. For instance, a researcher may code whether or not a participant took a bite of food within each 10 s window of a 4-min video observation. This can provide an estimate for frequency and also duration.

Coding schemes and guides

Coding schemes can be created anew to answer specific questions or previously developed coding schemes may be applied [24]. While many prior studies have coded behavioral aspects of eating and feeding behaviors, few have published their coding manuals in detail. This may be due to word count limitations in print journals (a factor that is less pressing in the age of primarily digital publication), as well as authors’ desires to protect their intellectual property. If a coding scheme is not published, authors will often provide it upon request. The authors argue that in order for the science eating and feeding behavior observations to advance swiftly, more authors should share their coding methodology using open access platforms. Existing coding schemes that capture mealtime and/or eating behaviors are presented in Table 3. These coding schemes may need to be modified depending on the hypotheses being tested [24]. Chorney et al. [24], have published a detailed guide to developing and refining coding schemes, which is a helpful resource.

Table 3 Eating and feeding behavior coding schemes for children over 12 months of age, and their families

The authors have found that there is great value in developing a new coding scheme which can capture novel behaviors or events. To begin, the researcher should watch many video observations to get a sense of how frequent the behavior/event is and whether it is best captured qualitatively, categorically, or as a count. These observations will guide the time segment considerations in coding previously described. While watching the videos, the researcher should take notes to describe the behavior in a specific and detailed manner, defining the specific parameters of the behavior or event, and differentiating between categories if applicable. It is equally important to describe what does not qualify as meeting the specified parameters of a behavior (e.g., a sip of water does not count as a bite of food). This level of specificity will improve the likelihood that the codes will be applied in a reliable manner. Coding categories can be binary (e.g., did an event happen? Yes, vs No) or categorical (e.g., the child took a small, medium or large serving of food). An excessive number of categories will make reliability difficult to achieve, but an insufficient number of categories may lose important nuance in the data. After drafting a coding scheme, the researcher should apply it to several videos and refine it. Explicit directions should be included for the application of the coding scheme including where to access and store data, how many passes should be used to code, if all behaviors or events should be coded in the same pass or one at a time, etc.

Refining a coding scheme and reliability

Once the initial draft of a coding scheme is created, two to three coders should independently apply it to a preselected random sample of video observations to achieve familiarity with the coding scheme. The authors usually start with 5 observations. Coders then meet to compare their codes and discuss disagreements or points of confusion, which may lead to the modification of the coding scheme. This iterative process will continue, with the application and subsequent modification of the coding scheme, until the researcher feels that the final coding scheme has been developed. On all codes where disagreements occurred, the team must come to a consensus. Alternatively, one researcher can code all videos with a coding scheme, and a second coder can be brought in later to establish inter-rater reliability. This practice is sometimes necessary given limited resources or the complexity of a coding scheme, however can be higher risk if reliability cannot be established, and the original coding scheme needs to be modified which would result in re-coding of videos.

It is important to note that reliability can be difficult to establish at times, even for seemingly simply codes. Each coder brings with them their own set of experience and biases, which influence how they interpret and code behaviors. Even in simple coding approaches, such as frequency or event based coding, there can often be challenges in establishing reliability between raters, especially for rare events or if events are easily overlooked. Difficulty in establishing reliability may be due to a variety of factors such as a lack of explicit detail in the coding guide or need for more simplistic coding conceptualization, or a need for training sessions and discussion of disagreements in the coders. A resource recommended by the authors for improving and troubleshooting reliability issues for videotaped behavioral observations is Haidet et al., 2009 [22].

Once it seems that the coders are similarly interpreting the coding scheme, all coders should independently apply the coding scheme to 20–30% of video segments, again randomly sampled. Inter-rater reliability should be calculated based on a Cohen’s kappa [22] (for categorical codes) or an intraclass correlation coefficient (for continuous codes, such a Likert scales or counts), which should exceed 0.7 or 0.8, respectively, for each code [25]. If reliability is achieved, then the remainder of the videos may be coded by a single coder. If raters are not reliable for all codes, disagreements should be discussed, the coding scheme refined, and the process of independently applying the new coding scheme to new videos must be repeated. After reliability is established for all codes, the videos used for “training” and reliability establishment must be coded with the final coding scheme. Furthermore, it is important to periodically check inter-rater reliability to ensure no significant drift has occurred during coding of the corpus.

When selecting coders, it is best to train individuals who are blind to the research hypotheses, to avoid unintentional bias. It is important to consider the complexity and nuance of the behavior being coded, and whether a specific coder skill set or background is needed [24] (e.g. choosing between undergraduate students vs. doctoral students). It is also recommended to train more coders than the researcher anticipates needing to guard against staffing changes impeding progress.

Finally, the authors have found that it is useful to pause after coding about 25–30% of the corpus of videos and review the frequencies of behaviors being coded. Is this a rare event that is worth capturing at all? If so, should the definition be changed? Should the code be abandoned or the time segment broadened (e.g. does this behavior occur in this 5 s segment versus does this behavior ever happen at all in this video)? There is often a need to balance the labor and time that goes into coding an uncommon event with the value of that event data to the greater project.

Analyses

Once the data have been painstakingly coded, the next step is to analyze the data, which largely depends on how the data were coded [26]. While original hypotheses will drive the study design and therefore the analysis plan, there is still some flexibility. It is helpful to first examine the frequencies, means, ranges, and standard deviations of codes. For counts of behaviors, one can examine a summary score of all counts in a video segment (e.g., number of bites during a meal) latency to first event (e.g. time to first bite), or change in event rate over time (e.g., bite rate at the beginning of the meal vs middle vs the end). If interval coding was performed (i.e., whether a behavior was present (vs. not) in each 5 s interval), the proportion of time engaged in that behavior can be calculated. If certain behaviors or events occur infrequently, then code categories can be created (e.g., did the child ask for second helpings >5 times, 4–5 times, 1–3 times, or not at all). Categories can be collapsed (e.g., did the child ask for second helpings >3 times, 1–3 times, or never) or changed into whole video coding (e.g., did the child ever as for second helpings? Yes vs. no). Macro-level behavioral coding can be applied using a simple spreadsheet to capture codes, and video playing software. Micro-level coding such as time-window sequential analysis [27] will require additional training and use of coding software, such as Observer XT [28]. A detailed resource on analyzing data from behavioral observation is the text by Suen and Ary, 2014 [26].

Best practice highlights

The authors outline below what they consider to be best practices for observational coding of eating and feeding behaviors in children and their families.

  • Start with a hypothesis. The researcher should be sure to understand the question they are asking and how they want to test their hypothesis. This will guide selection of the setting.

  • Select the setting based on the hypothesis. If the researcher wants to examine a behavioral response to a specific exposure, then a tightly controlled laboratory experiment may be the best fit. Whereas if the observer wants to examine behaviors in a naturalistic setting, for instance differences in foods served when families eat at the kitchen table vs. other locations, then a naturalistic setting would be a better fi.

  • Carefully plan the protocol using an extensively detailed guide. The authors recommend writing out explicit step by step instructions that are detailed so that each protocol can be set up as similarity as possibly to improve scientific rigor. It is important to specify the “who, what, where, when” and especially the “how” of a behavioral protocol.

  • Pilot the protocol. As unforeseen events can occur even with the most seemingly straightforward protocols, it is important to pilot and refine the protocol before involving participants. This will highlight areas of challenge that can be avoided or minimized when collecting valuable participant data.

  • Double check video and audio recording devices. Prior to an encounter with a participant, it is paramount to ensure the recording devices are working properly to minimize technical issues that may impair data collection.

  • Review data once collection has started, pilot a coding scheme and adjust protocol as necessary. Once data collection has started, it is important to review the video recordings as well as develop and pilot a coding scheme. This will help the researcher understand if their protocol is successfully testing their hypothesis, or if changes need to be made before data collection is complete.

  • Train raters on reliably coding scheme application through an iterative process. This will likely involve training of the coder as well as multiple revisions of the coding scheme to add more detail, specificity around the concepts described

  • Trouble-shoot reliability issues. Reliability should be calculated at the beginning of coding as well as periodically throughout the coding process to protect against drift. When difficulties in reliability present it is important to examine the coding scheme (e.g. definitions of codes may need to be specified or categories may need to be collapsed), the coding approach (e.g. frequency based vs. binary coding, or widening a time interval if too narrow). It is important to examine both the coding scheme, as well as its application by the coder. Some coders may not have the necessary insights, observation skills or attention to detail to apply all coding schemes, therefore it behooves the researcher to train more coders than they think are necessary.

  • Analyze the data to test the hypothesis. At the end of the long road of observational data collection and coding, the analysis allows for the opportunity to test the hypothesis. Examine univariate data (e.g. frequencies, means, standard deviations) first. Categories with low frequencies may need to be collapsed into larger categories.

Conclusions

Observational coding of eating and feeding behaviors is a resource and time intensive process, but can yield invaluable data. Researchers can minimize the challenges, and maximize the benefits of this methodology by carefully considering their hypotheses prior to data collection, and designing their protocol and coding approach to address their questions. Additional sharing of methodology and coding schemes around eating and feeding behaviors could advance the science and field.