Introduction

As obesity has become a global epidemic [1, 2], what is considered a “normal” or desirable body size has also increased, especially among certain racial-ethnic and gender groups [35]. A growing number of studies have found that body image is related to health behaviors including eating and dieting, particular among young people in the USA [3, 4, 6], and thus affects obesity. A full understanding of the obesity epidemic is therefore likely to include social norms along with two related mechanisms: feedback between population-level characteristics and individual behavior, and the role of social networks in defining who is influencing whom. For example, a well-known study by Christakis and Fowler posits that social contacts of individuals directly or indirectly influence obesity-related behaviors and that social networks structure therefore shapes who becomes obese [7]. We call this the “social influence hypothesis,” and in the following discussion, we will focus our attention on its application to obesity.

One source of confusion over the value or plausibility of the social influence hypothesis in the public health community is that models are often equated with correlational models, such as the one proposed by Christakis and Fowler. Epidemiologists and others have described the need for a priori specification of a causal mechanism in order to provide causal inference from such models [8, 9], yet mechanisms may be unknown, disputed, or untested. However, correlational modeling is only one approach. Other types of models are possible, including informal mental models, maps representing a territory, and mechanistic models of behavioral and physical phenomena [10], such as the mathematical models of classical mechanics. Ultimately, a “model” is any simplified representation of reality that omits many details in order to provide insight into a problem [11]. Expanding the range of models epidemiologists utilize in research and policy analysis enables us to address more diverse problems and gain different types of insights. The need for a fuller range of models is especially pertinent to social epidemiology, as much of the field addresses issues that arise with social systems, including social influence. Particularly pressing are problems of endogeneity and feedback, which render statistical identification of causes difficult or impossible [12]. The remainder of this paper will address generative and mechanistic approaches to the problem of social influence with a focus on obesity. We expand on prior work summarizing the use of systems science tools in public health, including a recent article by Luke and Stamatakis [13•], by focusing on the issues of influence and norms.

Many types of social influences are theoretically possible, including mechanisms that do not involve norms such as market forces, advertising, neighborhood crime, and social capital. Natural experiments and randomized trials further suggest that social influence may be either permissive (encouraging behavior) or proscriptive (limiting behavior), depending on the (dis)similarity of those interacting. These mechanisms may suggest very different policies and interventions, including inoculation, induction, rewiring networks, identifying opinion leaders as champions, and de-normalizing of socially acceptable (but unhealthy) behavior [14•].

In this paper, we highlight how employing a broader range of models may help us understand the processes by which norm formation and change occur in a more direct way, helping researchers adjudicate some of these debates about what statistical signatures norms do or do not have. To that end, we will provide examples of approaches used by several teams who are members of the National Collaborative on Childhood Obesity Research (NCCOR) Envision network, including statistical, social network, agent-based, and system dynamics [15].

A second source of confusion related to the social influence hypothesis stems from different presuppositions about where social norms exist or “live.” That is, do social norms exist as variables within individuals, as an emergent property of systems of individuals, or do they have an independent existence from individuals as part of the environment individuals observe and act within? The confusion arises in part because data about norms is collected from individuals, while the focus of study is usually effects at the aggregate level. Furthermore, norms are closely related to behaviors; as defined by Coleman, a norm concerning an action (which would include behaviors) exists “when the socially defined right to control the action [behavior] is held not by the actor but by others” ([16],p. 243). Deeper insights into dynamics that span both individual and collective levels may require new approaches that focus on mechanisms.

As the psychologist and philosopher of social science Paul Meehl noted, the problem here is not one of better statistical methods, but developing better theory specification through more explicit models [17]. In this paper, we recognize this by emphasizing the use of models to specify and test different formulations of the social influence hypothesis according to the type of influence or mechanisms. That is, rather than take a position on where social norms “live,” we call for a more critical and pluralistic approach where norms are formulated according to the type of influence hypothesized. To support this, we provide the following taxonomy of influences in the form of logically plausible mechanisms (see Table 1).

Table 1 Taxonomy of social influence

The paper focuses on social influence as a mechanism shaping behavior. We do not claim that our provisional list of mechanisms in Table 1 is exhaustive; as noted above, many types of influence are possible. Nevertheless, the list encompasses a wide range of mechanisms operating at multiple levels and will serve to illustrate how these might be modeled. Social influence requires that there be at least two units interacting with one another. In what follows, we will refer to these units as “actors.” While our focus is primarily on individual human beings as actors, we will also demonstrate the flexibility of the definition; for example, one type of actor may be a customer, and another a restaurant.

Defining Social Influence

Abraham and Michie [18] reviewed behavior change techniques that have been applied to physical activity and/or healthy eating and other health risk behaviors (e.g., HIV/AIDS). Eight of these techniques employ what might be considered social influence mechanisms: providing information on approval by others, providing encouragement, modeling or demonstrating a behavior, providing feedback on performance, using follow-up prompts, providing opportunities for social comparison, planning social support or change, and prompting identification as a role model. Many of these techniques are either explicitly drawn from, or can be tied to, well-established behavioral theories: theory of reasoned action [19], theory of planned behavior [20], information–motivation–behavioral skills [21], social cognitive theory [22], theory of social comparison [23], social support theory [24], and control theory [25]. These behavior change techniques are largely drawn from the clinical psychology and health behavior within the context of dyadic (therapist–patient) interactions.

While therapeutic relationships may serve as a model of one type of social interaction, social influence clearly extends beyond dyadic interactions to relationships between individuals and larger social and political systems. Kelman [26] proposed a framework of influence at operating at multiple levels, with three processes (compliance, identification, and internalization) linking individual behavior to norms outside the individual, shaped by the structure of social interaction. We therefore turn to social networks as a means of understanding this structure.

Social Network Models

We imagine that many types of norms arise from the local social environment, especially social networks. In turn, norms help shape and re-enforce social networks. In what follows, we focus on actor-by-actor (one-mode) networks. Networks are typically represented as graphs. Actors in a network are referred to as nodes, and relationships (including interactions) between nodes are represented as edges [27, 28]. A focal actor in a network (for example, a person filing out a questionnaire) is called an ego; egos have “neighbors” to which they are tied (friends, contacts), whom we refer to as alters. If the network is egocentric, all real-world information about the network is collected from, and revolves around, the ego; an example is the General Social Survey (GSS) [29]. If alters are also egos, then the network is sociometric; the National Longitudinal Study of Adolescent Health (Add Health) is a well-known example [30]. Specification of interaction without a network structure is also possible and, in many situations, is plausible [31].

The lowest level of interaction is the dyad, comprises of only two agents. Kelman’s work (discussed above) examined influence within therapist–patient dyads [26]. Some intriguing experimental studies of dyadic interaction offer further insights into rules for people’s behavior. Yakusheva et al. [32] studied college freshman women randomly assigned dormitory roommates; an ego rooming with an obese alter tended to gain fewer pounds, showing heterogeneity in the “freshman 15.” McFerran et al. [33] found that eating behaviors were modified according to an alter’s apparent body size. Thus, influence may vary in a manner analogous to infectivity factors in infectious disease [34]. Focusing on dyads alone is only appropriate for models where exogenous ties (outside the dyad) can be ignored; researchers are advised to address this question before conducting analyses solely at the dyadic level, since this implies the absence of a larger network [35•].

The next level of interaction is the triad, comprised of three agents. While there are 16 possible triad configurations [36], the most important for studying norms and influence are fully and mutually connected triads (where all three nodes share ties with the other two nodes). Such triads are the smallest completely connected networks (“cliques”) possible; they have high rates of social cohesion and homophily. Homophily is the tendency of individuals to select friends who are similar to themselves in background characteristics (as in the old adage “birds of a feather flock together”) that can channel and reinforce the flow of important information and health-related behaviors [37, 38]. Based on theories of social capital [16, 39], Simmelian ties [40] and structural balance [41], complete triads that persist over time would exhibit especially strong normative influences (else they would fall apart). The relationship of norms to cohesion is central to Coleman’s work on social capital, providing an explanation for the function of norms in maintaining social cohesion through sanctions of norm violation [16]; relevant sanctioning for obesity might include table manners or shaming. However, we know of no attempts to explicitly modeling the role of social triads or cliques in norms or obesity-related behaviors; given that clique membership is tied to (non)smoking behavior [42, 43], this is likely a rich area of research.

Norms and influence also operate at higher levels, where diffusion processes operate, and location (center, periphery) within the network may be important for adoption of new behaviors [44]. Individuals occupying central positions in networks tend to behave in a manner following larger group norms [45, 46]. Because they are subject to fewer influencers and are less invested in maintaining the status quo, individuals at the periphery may be easier to influence, yet have less influence on the system as a whole [14•]. Tools for identification of network position include block modeling [47] and centrality formulae [28]. Nonetheless, structural positions of nodes may obscure other explanations of position (e.g., homophily), requiring more sophisticated tools that can tease apart processes and handle dyadic and other dependencies [37]. Ties tend to be non-random, and exponential random graph model (ERGM) models can capture departures from randomness as well as dependency of ties (dyads), using the R packages ergm [48] and pnet [49]. The joint co-evolution of ties and behaviors may be examined using the R package Simulation Investigation for Empirical Network Analysis (R-SIENA) [50]. We emphasize that these models are generative, in the sense that a series of “random” (a misnomer) networks are proposed, then either accepted or rejected based on model fit. Authors who have employed these techniques to study obesity include de la Haye [51, 52], Valente [53], Shoham [54], and Simkins [55].

In spite of the growing interest in social network models to understand obesity, many challenges remain. The Add Health study [30] remains the gold standard for its nearly complete social network data, measured over multiple waves over time. However, the adolescent data were collected in the 1990s and do not capture the current prevalence of adolescent obesity in the USA; the adolescent data further lack detailed information on physical activity and dietary behavior. Additional longitudinal datasets, with detailed information on obesity-related behaviors, are necessary. Another challenge is understanding the impact of online social media (OSM), with a recent Pew survey finding 82 % of teens using OSM, mostly Facebook [56]. The role of OSM in shaping norms around obesity-related behaviors is unknown, although some preliminary work on tobacco and alcohol use has been done by Moreno [57] and Valente [58]. Finally, if networks are important for structuring norms and influencing behavior, they should also be useful in designing behavioral interventions [14•]. Given that interventions are expensive and some network-based approaches might fail to have an effect, simulations are crucial tools in deciding, which approaches have promise. Simulation models of whole networks have also been created, including Bahr [59], El-Sayed [60], and Shoham (under review). We now turn to one approach to simulation, agent-based models.

Agent-Based Models

Agent-based modeling (ABM) is a relatively new type of computational simulation model with wide applicability to social dynamics. In an ABM, each individual actor (or “agent”) is explicitly represented in computer code. Agents are generally given a set of adaptive rules for interaction with their environment and with other agents. A population of these individuals is simulated as an “artificial society” on a computer, with output through time generated “from the bottom up” by the decentralized interactions and decision processes of the agents [6164]. In this way, dynamic mechanisms that link individual decision making or behavior to social or population-level outcomes can be effectively studied.

Several features make ABM a potentially powerful approach for the study of social norms and social influence in obesity. First, because each actor is individually represented and modeled in an ABM, the approach permits consideration of substantial diversity among agents—in demography, social exposure, biology, psychology, etc. No aggregation of agents into homogenous pools or compartments is required; both actors and outcomes far in the “tail” of a distribution can be studied effectively. For the study of obesity (characterized in many populations by distributions with substantial skew and variance), and for the study of social influence (where pronounced heterogeneity by social context, age, or psychology may be important), these can be important considerations.

Second, the agent-based approach allows great flexibility in cognitive assumptions about individual decision making and information processing. To translate individual state and external information into a behavior, an individual agent may follow simple heuristic rules, complex cognitive or statistical processing models, or even representations of multiple conscious and unconscious brain processes. Agents may also differ from one another in the form of decision making used, and can adapt not only their behavior but even their rule set through time (for example, by learning). The processes that underlie body weight change and obesity are known to involve multiple brain systems, heterogeneity in decision making, and several forms of dynamic adaptation [6568].

A third potential advantage of the ABM methodology for the study of social influence lies in its ability to effectively incorporate social and physical space. ABMs provide extensive capability to directly include rich and explicit representations of the context which governs agent interaction and exposure—including geographic spaces [6971], full or partial social network structures [72], and neighborhood effects [73]. Since the specific form and structure of social networks and geography often determine by whom an individual may be influenced socially, this is a potentially important advantage for modeling social influence effectively.

Finally, ABM provides a powerful platform for the study of co-evolving individual behavior and social context, with dynamic feedback in both directions. ABM includes both a direct representation of the individual level and the ability to capture dynamic changes in population- or neighborhood-level outcomes. This enables the approach to capture multiple pathways of feedback between micro-behavior and macro-outcomes. For example, agents can be influenced by extant social norms, by observing distribution across their peers of key characteristics, by direct social contagion, by messaging, or even by the presence or absence of others. At the same time, the model can capture how these environmental contexts change through time as a direct result of the behavior of individual actors. Thus, an individual agent can be influenced by a social norm at time t even as he or she contributes (along with peers) to changes in the state of the norm at time t + 1. The ABM approach also allows multiple dynamic mechanisms to be layered within the same model—for example, social influence via concurrent processes of homophily and contagion.

The ABM approach has been widely used within public health for the study of infectious [7476], although its application to obesity and chronic disease is relatively recent [64]. In social science, ABM has been used extensively to study social norms—including models of social sanctioning [77, 78], segregation [79, 80], cooperation [81, 82], culture [83], agriculture [69], ethnocentrism [84], retirement [85], corruption [86], and civil violence [87].

Several recent studies have applied ABM to the specific topic of social influence in obesity, helping to identify potential mechanisms through which social influence may affect dynamics of body weight. Zhang and colleagues use ABM to study the interaction of homophily and social imitation as dual influences on individual BMI and behaviors such as physical activity and screen time (under review). Bahr et al. [59] explore the implications of similar mechanisms for effective targeting and application of weight management interventions. Hammond and Epstein [88] introduce a specific hypothesized mathematical mechanism (“follow the average”) by which individual BMI may co-evolve with population-level aggregate BMI. Chen and colleagues (under review) apply this mechanism to an empirical data set of youth, while Hammond and Ornstein (under review) extend it to include more empirically grounded assumptions and accurate physiology before applying to a different dataset. Opportunities for further application of the technique to gain insight into mechanisms and dynamics of social influence in obesity are substantial and promising, but must overcome potential challenges such as the relative scarcity of longitudinal data revealing dynamic patterns and the limited toolsets and training currently available in the technique.

System Dynamic Models

System dynamic (SD) models typically divide the population into different groups based on their adoption of a practice, a physical feature (e.g., BMI), or other relevant characteristics. Social influence is then incorporated in at least three distinct ways. First, the number of people in each category may change due to social influence from those in other categories. For example, a large literature model contagion as the process of communication between groups that have adopted a practice or product and those who are potentially interested, and how the rate of conversion depends on both population sizes [8991]. Obesity has been treated in the same manner, in which a subset of normally weighted individuals is seen to become obese due to interaction with overweight and obese subpopulations [92]. A second group of models captures the dependence of individual’s goals on social norms. For example, BMI or weight averages for a group of similar individuals are seen to influence the weight towards which those individuals aspire. One study finds a significant adjustment of energy intake to reach that socially sanctioned average BMI [93]. Thirdly, by categorizing individuals based on their weight groups, the inertial effect of norms are captured in that individuals are naturally staying in their current weight group, unless a separate mechanism changes their weight status. For example, groups’ current energy intake or physical activity determines the base level around which the actual values are adjusted using an anchoring and adjustment formulation [94, 95].

System dynamics modeling process may also be used to provide a more nuanced understanding of the mechanisms through which norms are generated and modified [96]. Group model building processes [97, 98] identify what social influences different stakeholders in a community perceive and how those influences shape their reactions. Such studies draw heavily on participatory group model building sessions and other qualitative tools, and the resulting qualitative models can inform more nuanced quantitative models with a richer appreciation of social determinants of health [99, 100]. For example, as part of Envision, Hovmand and Brennan [101] have focused on modeling the social determinants of childhood obesity in a low-income urban neighborhood. The approach used group model building methods to involve community members including children, youth and young adults, parents, service providers, and clergy in the process of developing a causal map of the influences and consequences of childhood obesity in the community. Specific examples of community based norms that emerged included the fact that while girls tended to seek to lose weight through unhealthy diets, boys tended to seek to gain weight or “bulk up” in an effort to avoid bullying. Other examples included community norms related to “pressure to be cool” that led to fights, shoplifting, and early gang-related behavior. Understanding the causal mechanisms leading to the formation of norms and the consequence of norms at the neighborhood level provides both a better picture of their endogenous influence on obesity trends as well as how to measure them in ways that are most salient to community members.

The structure of SD models is built based on qualitative and process data (e.g., group model building), and social and biological theories (e.g., models of body weight dynamics [102]), and parameters are specified based on theory, expert input, or estimated using aggregate data regarding number of people in different population groups [103], parameterized based on statistical estimates from prior literature [102], or methods that connect individual level dynamics to aggregate models and data [104]. However, the application of these approaches to obesity research is at early stages, and many opportunities exist for incorporating norms into SD models of obesity that are grounded in empirical data and useful for policy analysis. Additionally, SD models could be combined and compared with ABMs in this domain to benefit from the detailed actor network data while keeping the broad model boundary typical to SD models [31].

Intervention and Policy Relevance of Models

Anecdotal evidence suggests that researchers and policy makers are divided regarding the utility of mathematical and simulation models in developing health-related policy and programs. On the one hand, there has been strong support and many examples of useful mathematical models, especially in the areas of infectious disease and tobacco control. On the other hand, evidence-based medicine and evidence-based policy generally demand empirical, not simulated, findings, with randomized intervention trials considered the best evidence for program and policy development.

Mathematical modeling offers many promises, but also faces many challenges as the analysis process often involves making assumptions and subjective decisions (especially when addressing complex public health problems and when data are limited or impossible to collect). However, these factors are in fact among the key reasons why mathematical and simulation modeling is needed. For example, models lay bare assumptions often implicit in other analytical techniques, allowing them to be examined and even tested directly. Similarly, models can help guide future empirical efforts iteratively, by focusing attention on those conceptual variables that matter most, and sometimes even by identifying new ones.

Furthermore, some societal problems are not well served by traditional, quantitative research methods, leading to debate about the methods considered critical for the collection of relevant evidence. Many authors have noted that complex systems solutions are needed to address problems like obesity and related chronic disease [105107]. Complexity extends to the policy process, which is rarely linear, operates under limited information, and presents temporal challenges including simultaneity of events and heterogeneity of lag times; all of this makes policy evaluation particularly difficult [108].

Stakeholder support has grown for rigorous complex systems thinking regarding health problems, paralleling development in other fields to address the problems of industry engineering, war, disaster response [105]. Examples with a specific health focus include applications to smoking and infectious disease outbreaks. SimSmoke is a simulation model that assesses the impact of past tobacco control policies and generates predictions about the future effect of policies on smoking prevalence and premature mortality attributable to smoking [109]. Soon after influenza emerged in North America in 2009, the World Health Organization (WHO) convened an informal mathematical modeling network of health experts and modeling groups, which made recommendations to limit the spread of the H1N1 virus [110]. The US National Institutes of Health has also supported modeling projects, including the Cancer Intervention and Surveillance Modeling Network (CISNET) [111] and Models of Infectious Disease Agent Study (MIDAS) collaborations [112]. Such efforts have now extended to modeling obesity, including the UK Government sponsored Foresight “Tackling Obesities: Future Choices” study, which applied systems science approaches to obesity and provided recommendations for solutions including government policies [107]. Two recent US Institute of Medicine (IOM) reports further recommend the use of systems science-guided thinking, research, and intervention approaches to fight obesity (IOM, 2010, 2012). Finally, the NCCOR Envision project (which the coauthors are all members of) is applying the methods discussed in this paper to identifying key policy levers for reducing childhood obesity [15].

Conclusions

The diverse modeling approaches described above can be applied for both discovery and policy analysis in the context of social influence in obesity. The discovery goal can be pursued in the form of both statistical identification of causal mechanisms, and the generation of diverse insights and hypotheses. The network analysis methods allow for teasing out different social influence mechanisms regulating energy intake, physical activity, and other obesity-relevant traits, especially if network data is available over time. Other mechanistic models can also be used for estimating the strength of networks of causal pathways. However, pursuing the statistical identification goal often requires time series data at the right level of granularity and utilization of advanced statistical and estimation methods. Social epidemiologists should make additional use of mechanistic models in obesity research in general and in analyzing social influence hypothesis in particular.

The findings from previous type of research, combined with traditional statistical estimates, can inform the development of realistic agent-based models representing complex interactions among actors. System Dynamics models can also use this data in aggregate and combine it with broad model boundary and feedbacks crossing diverse domains to represent core mechanisms operating on obesity. Both these types of models are suited for the second type of discovery: generating hypotheses on complex interactions that can explain observed regularities, illustrating core dynamics and uncertainties, guiding data collection, and communicating complex insights with different audiences.

A major value of such models is their ability to integrate within a single tool both complex social influence pathways and other types of obesity drivers, from economic factors to the built environment. Such integration is crucial for theoretical understanding if the obesity trend is not the result of any single causal mechanism, but the emergent outcome of complex interactions. The resulting systemic perspective is also indispensable for policy analysis. The broad boundary of these models allow for tracking outcomes of interventions that are distant in time or space: from policies that target individuals or small groups (e.g., social network-enabled interventions) to those targeting social institutions (e.g., public campaigns). While increasingly there are examples of mechanistic models applied to well-specified policy problems in the obesity domain, there is much room for models that leverage our enhancing understanding of social influence to design innovative new obesity interventions and assess the existing ones in vitro.