DieTryin: An R package for data collection, automated data entry, and post-processing of network-structured economic games, social networks, and other roster-based dyadic data

Ross, Cody T.; Redhead, Daniel

doi:10.3758/s13428-021-01606-5

Open access
Published: 02 August 2021

Volume 54, pages 611–631, (2022)
Behavior Research Methods

Abstract

Researchers studying social networks and inter-personal sentiments in bounded or small-scale communities face a trade-off between the use of roster-based and free-recall/name-generator-based survey tools. Roster-based methods scale poorly with sample size, and can more easily lead to respondent fatigue; however, they generally yield higher quality data that are less susceptible to recall bias and that require less post-processing. Name-generator-based methods, in contrast, scale well with sample size and are less likely to lead to respondent fatigue. However, they may be more sensitive to recall bias, and they entail a large amount of highly error-prone post-processing after data collection in order to link elicited names to unique identifiers. Here, we introduce an R package, DieTryin, that allows for roster-based dyadic data to be collected and entered as rapidly as name-generator-based data; DieTryin can be used to run network-structured economic games, as well as collect and process standard social network data and round-robin Likert-scale peer ratings. DieTryin automates photograph standardization, survey tool compilation, and data entry. We present a complete methodological workflow using DieTryin to teach end-users its full functionality.

Introduction

In the psychological and sociological sciences, there is a long history of developing and applying social network methods (Borgatti et al. 2009), i.e., methods that quantify relationships (termed edges or ties) between individuals (termed nodes or actors). At the same time, there have been increasing calls to expand the general scope of data collection to include more technologically, geographically, and culturally diverse groups (Henrich et al. 2010; Nielsen & Haun, 2016; Nielsen et al. 2017; Broesch et al., 2020). Interdisciplinary and international social network research has enabled many broad-reaching empirical advances, e.g., about the forms and functions of social relationships (Cutrona, 1986), the evolutionary functions of emotion and inter-personal sentiments (Gervais & Fessler, 2017; Gervais, 2017), and the drivers of friendship (Dijkstra et al. 2013), personality (van Zalk et al. 2020), religion and religiosity (Power, 2017), and social status (von Rueden et al. 2019). Such methods have also been crucial to applied research, e.g., in public health (Holt-Lunstad et al. 2010; Smith & Christakis, 2008).

Likewise, in fields as diverse as evolutionary biology and cultural anthropology, there is an increasing uptake of field research concerning the properties, causes, and consequences of social networks and associated social-relational characteristics. Field researchers have conducted examinations of the mechanisms that govern the emergence and stability of social bonds (e.g., Silk et al., 2009; Rucas et al., 2006; Hooper et al., 2013; von Rueden et al., 2019), economic, physical, and emotional well-being (e.g., Crittenden & Zes, 2015; Koster & Leckie, 2014; Ready & Power, 2018; Pisor et al. 2020), knowledge and skill acquisition (e.g., Franz & Nunn, 2009; Hill et al. 2014; Barrett et al. 2017; Lew-Levy et al., 2020), disease transmission (e.g., Read et al. 2008; Salathé et al., 2010; Ready et al. 2020b), and a variety of other phenomena.

While the vast majority of human studies assess the structure of social relationships through survey and interview methods, recent studies (e.g., Rucas et al. 2010; Gervais, 2017) have introduced novel and valuable methodologies for administering network-based experimental economic games (see also Pisor et al., 2020). These newly-developed games marry the strengths of experimental economic games to reliably measure preferences (Henrich et al. 2001, 2005) and the strengths of socio-relational data to measure how individual-specific characteristics and inter-personal relationships structure behavior in real-world networks; i.e., they extend economic games from measuring only the effects of a focal individual on allocation decisions, and allow measurement of recipient- and dyad-specific effects.

Classical economic games, like the Dictator and Ultimatum games (Henrich et al. 2001, 2005), are able to measure specific preferences of focal respondents, e.g., how willing they are to give resources to an anonymous same-community recipient/alter at a cost to themselves. While cross-cultural comparisons of such characteristics have been hugely influential, the simplistic game structure precludes investigations of how variation in focal, alter, and dyadic characteristics might affect site-level preferences for specific behaviors, e.g., giving/keeping resources. The three network-structured economic games (termed RICH games, or recipient identity-conditioned heuristics games) introduced by Gervais (2017) generalize classical economic games for studying cooperation, exploitation, and punishment, and measure dyad-level behaviors using a full-community photograph roster. According to Gervais (2017), “...these RICH economic games tap the norms and motives that regulate enduring social relationships” [p. 127]; in other words, they allow researchers to study not just the behavioral preferences of focal respondents towards anonymous alters, but additionally how focal and alter characteristics (e.g., age, sex, wealth, religion, social network centrality, prestige, dominance, etc.), as well as dyadic characteristics (e.g., kinship, friendship, group-membership, etc.), are related to dyadic behaviors like resource transfers, exploitation, and spiteful punishment, permitting much more nuanced investigation of the drivers of cross-cultural variation in inter-personal behavior.

Despite their enormous promise for testing a wide array of questions in cross-cultural psychology, economics, and anthropology, full-community roster-based methods for collection of experimental economic game data and other dyadic measures have not been used as widely as other data-collection methods. Researchers studying social networks and inter-personal sentiments in bounded or small-scale communities generally face a trade-off between the use of roster-based and name-generator-based methods. A core theoretical distinction between these data collection techniques is that roster-based methods solicit data from participants’ recognition of names or photographs, while name-generator-based methods solicit recall data (Ferligoj and Hlebec, 1999). Roster-based methods scale poorly with sample size, and can more easily lead to respondent fatigue (especially when administered verbally with names, rather than visually with a roster of photographs). However, they require less post-processing (i.e., record linkage and de-duplication), and generally yield higher quality data, as recognition memory is widely regarded as more accurate than recall memory. Roster-based methods have been shown to capture a comparatively larger number of nominations (that also include the nominations collected via recall methods), and are less susceptible to the biases associated with recall (Bahrick et al. 1975; Hlebec, 1992; Hammer, 1984; Sudman, 1985). Name-generator-based methods, in contrast, scale well with sample size and are less likely to lead to respondent fatigue. However, they may be more sensitive to recall bias (Brewer, 2000), and they entail a large amount of highly error-prone post-processing after data collection, in order to link elicited names and nicknames to unique identifiers.

In order to make the use of powerful, roster-based research methodologies easier for field researchers working in large field-sites (e.g., small-scale societies or bounded communities), we introduce an R package, DieTryin, that allows for roster-based dyadic data to be collected and processed as rapidly as name-generator-based data. DieTryin was specifically developed to run network-structured economic games—e.g., Gervais' (2017) RICH games—as well as collect and process standard social network data and dyadic Likert-scale peer ratings. DieTryin automates photograph standardization, survey tool compilation, and data entry.

While other high-tech tools for social network data collection have been developed—e.g., Breadboard (Human Nature Lab, 2020a), Trellis (Human Nature Lab, 2020b), and Open Data Kit (ODK Team, 2020)—they generally rely on respondents interacting with electronics on the ‘front-end’. This can limit their usefulness to researchers like cross-cultural psychologists and anthropologists working in areas where the front-end use of electronics might be problematic. DieTryin instead allows for network-structured economic game data and social network data to be collected using simple, physical photograph rosters and game tokens (e.g., poker chips), and moves the high-tech functionality to the ‘back end’ where machine learning algorithms are used to automatically code edge-list data from photographs of token allocations.

In what remains of the paper, we provide: a) a brief overview of the benefits and limitations of different network measurement and sampling regimes, b) a review of data collection methods using a photograph roster, including a review of the RICH games methodology, and c) a description of the DieTryin R package and its unique functionality, including a step-by-step walk-through of the workflow for collecting and processing data from RICH economic games and other roster-based designs.

Measuring social relationships

Self-reported network data

Field studies of social networks typically employ one of two interview/survey techniques for collecting relational data. The first is the name generator method, which entails participants freely listing the names of other individuals within the community with whom they have a specific kind of relationship (Marsden, 1990). The second widely-used approach is a roster-based design, whereby the researcher generates a list of all members of a population and then asks each participant to report whether they have a specific kind of relationship with each-and-every individual on the roster (Marsden, 2005). In exceptional cases, social networks can also be created from long-term ethnography or observation (see Ready et al., 2020a; DeTroy et al. 2021), focal or scan sampling (see Altmann, 1974; Amato et al. 2013), and direct GPS tracking or proximity detection (see Davis et al. 2018; Wood et al., 2021). Each of these methods carries its own costs and benefits.

The principal benefits of the name generator method are that: (i) it is comparatively fast to implement (i.e., the time burden of data collection and entry scales roughly linearly with sample size), and (ii) it captures extra-community ties. The principal costs of the name generator method are that: (i) there is more work on the back-end for researchers to match the names (and possibly nicknames) recalled by respondents with unique personal identifiers, especially in communities where it is common for several individuals to share the same first and last name (or nickname), and (ii) various forms of recall bias may impact data quality: e.g., some individuals may be more likely than others to forget real ties (Bell et al. 2007; Brewer, 2000), some alters with specific characteristics may be more likely to be remembered or named by others in the community (Marin, 2004), and differences across interviewers in terms of personality type and data collection style can influence the number of names elicited (Harling et al., 2018). These costs and benefits are more or less inverted in roster-based designs.

In roster-based designs: (i) data collection and entry burden scales with the square of sample size, and (ii) the method fails to easily capture ties to extra-community individuals not appearing on the roster. However, there are some benefits: (i) there is no work on the back-end to link names to personal identifiers, as the roster can be constructed to differentiate between individuals with the same name a priori, and (ii) recall bias is minimized by presenting each focal respondent with a prime—be it a name or a photograph—of each alter in the community. While these trade-offs need to be considered on a case-by-case basis, DieTryin aims to increase the feasibility of roster-based methods by automating the data collection and entry process so that the time burden scales linearly with sample size.
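To make the scaling argument concrete, the short base R sketch below counts the directed dyads that a full roster-based design must cover. It assumes that every one of n respondents makes a decision about each of the other n - 1 individuals; the sample sizes are illustrative only.

```r
# Directed dyads in a full roster design: each of n respondents
# makes a decision about each of the other n - 1 individuals.
n_dyads <- function(n) n * (n - 1)

n <- c(10, 50, 100, 300)
burden <- data.frame(n = n, interviews = n, dyads = n_dyads(n))
print(burden)
```

The number of interviews grows linearly with n, while the number of dyadic entries grows roughly with its square, which is the burden that automated entry is meant to absorb.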

Experimental network data

In an attempt to reduce some of the potential biases (e.g., recall bias, researcher demand bias, social-desirability bias, etc.) associated with self-report techniques for capturing social networks, social scientists have recently advanced network-based extensions of several classical games developed in experimental economics (Centola, 2010; Kearns et al. 2009; Suri and Watts, 2011). Such experiments are generally devised to test specific predictions from game theory, and are frequently conducted among relatively homogenous samples of undergraduate students in laboratory settings using networked computers or administered as online network-based games (e.g., Colman, 2016; Charness et al. 2014). Within these paradigms, researchers provide monetary incentives for participation and pose a specific social problem between a dyad or group, where there may be competing interests, and where participants’ decisions dictate their financial payoffs.

These network-based experimental games generally have high internal validity and have made many important contributions to the understanding of human preferences and behavior (e.g., Cassar, 2007; Rand et al. 2014; Rand et al. 2011; Ohtsuki et al. 2006; Suri & Watts, 2011). However, these paradigms have also been criticized for their potential lack of generalizability and external validity (Hagen & Hammerstein, 2006). In response, experimental game paradigms are now being administered in naturalistic settings (e.g., Rucas et al., 2010; Gervais, 2017; Pisor et al., 2020) to test their validity in real-world contexts.

Sampling

To successfully capture relational data with either self-report or experimental measurement instruments, researchers generally need to collect data about the social relationships between most, if not all, individuals within a focal community. This is important because the statistical properties of higher-order network characteristics can be very sensitive to even a relatively small proportion of missing data (Granovetter, 1976; Frank, 2005). In most contexts, researchers will restrict their samples and strategically collect data from participants of a specifically defined group, often based on: (i) residence within a delineated geographic area (e.g., residents between two rivers), (ii) membership in a specific group (e.g., children in classrooms), or (iii) participation in mutually-shared events (e.g., ‘regulars’ at a bar or pub), and only consider the reported relationships within the sample (Laumann et al. 1989). Only in certain cultural contexts, where participants reside in relatively small and geographically-isolated populations, is capturing ‘complete’ population networks feasible.

A complete workflow with DieTryin

In the remainder of the paper, we provide step-by-step instructions on how to use DieTryin to prepare, collect, and process the data resulting from network-structured economic games and other dyadic data collection methodologies. We also provide an accessible tutorial workflow at: https://github.com/ctross/DieTryin, which contains additional annotated R code and example photograph datasets. Our tutorial trains end-users to run a full workflow using DieTryin to prepare for, conduct, and manage a real-world, roster-based social network study. Bug-reports, feature requests, and other relevant comments should be made through GitHub, where the package will be maintained.

Although DieTryin has much functionality specifically tailored to Gervais’ (2017) RICH economic games, it is considerably more general and can be used to facilitate collection, entry, and processing of round-robin peer ratings and roster-based self-report measures. These measures are widely used in psychology and other related disciplines to capture perceptions of peers—e.g., their personality, status, dominance, or behavioral profiles—as well as reports of dyadic characteristics—e.g., church co-membership, directed food/money transfers, or kin relationships. In Fig. 1, we provide a brief visual overview of each of these data collection methods.

Installation and setup

Much of the functionality of DieTryin is made possible by R (R Core Team, 2019) and the imager (Barthelme, 2019), MASS (Venables & Ripley, 2002), and xtable (Dahl et al. 2019) packages and their dependencies, as well as by the LaTeX software system. The user must install these programs in order to reproduce our workflow.

Installation and loading of DieTryin is then simple: just run three lines of code from R:
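The listing itself is not reproduced here. A likely form, assuming that the devtools package is already installed and that the package is installed from the GitHub repository named above (ctross/DieTryin), would be:

```r
# Assumes devtools is installed (install.packages("devtools") otherwise)
# and that the machine has internet access.
library(devtools)
install_github("ctross/DieTryin")  # repository named in the tutorial URL
library(DieTryin)
```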

Next, we set a path to where we will save all of the files related to our project, and initialize a directory structure there:
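A minimal sketch of this step is given below. The project location is a hypothetical placeholder, and the call is guarded so that the snippet does nothing on a machine where DieTryin is not installed; only the form setup_folders(path) is taken from the text.

```r
# Hypothetical project location; replace with a permanent folder.
path <- file.path(tempdir(), "DieTryinProject")
dir.create(path, showWarnings = FALSE, recursive = TRUE)

# Initialize the directory structure (runs only if DieTryin is installed).
if (requireNamespace("DieTryin", quietly = TRUE)) {
  DieTryin::setup_folders(path)
}
```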

This step creates a directory—titled RICH—with all of the sub-folders needed to organize the workflow. The standard function call—i.e., setup_folders(path)—sets up data storage only for the three RICH games, but additional folders—for example, to store data on friendship ties—can be added as well, by setting: add=games_to_add, where games_to_add is a vector of folder names that should be created.

Standardization of photographs

The next step of the workflow is to create the photograph roster used in data collection. Once a full sample of informed and consenting research participants is selected, photographs of each participant can be taken, and the research process started. The raw photos of all respondents who will take part in the network-structured economic games and/or do a social network interview should be copied into the RICH/RawPhotos directory. These should be jpg-formatted images. The filenames must include the unique identifier (ID) codes for the respondents: e.g., AOC.jpg, KW1.jpg, JLO.jpg. These filenames should all be of the same character length and contain a letter as the first character. Raw photographs, however, will frequently vary in terms of size, aspect ratio, orientation, zoom, and centering (e.g., see the example database of raw photographs in Fig. 2), and response validity can be improved by standardization. In large field-sites, manual adjustment of respondent photographs can be time-consuming. To facilitate the standardization of respondent photographs, DieTryin includes a function that largely automates the process of image standardization.
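Before running the standardization step, it can help to check that the raw filenames follow the conventions just described. The helper below is a hypothetical base R sketch, not a DieTryin function: it flags files that lack a jpg extension, that do not start with a letter, or whose name length differs from the most common length in the folder.

```r
# Hypothetical helper: check raw photo filenames against the stated
# conventions (jpg extension, letter first, one common name length).
check_photo_names <- function(files) {
  modal_len <- as.integer(names(which.max(table(nchar(files)))))
  ok_ext    <- grepl("\\.(jpg|JPG)$", files)
  ok_first  <- grepl("^[A-Za-z]", files)
  ok_length <- nchar(files) == modal_len
  data.frame(file = files, ok = ok_ext & ok_first & ok_length,
             stringsAsFactors = FALSE)
}

name_check <- check_photo_names(
  c("AOC.jpg", "KW1.jpg", "JLO.jpg", "7XY.jpg", "LONGID.jpg")
)
print(name_check)
```

In practice, the input would be the result of list.files() on the RICH/RawPhotos directory.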

Once the full set of respondent photographs is copied into the RICH/RawPhotos folder, the full directory of images can be rapidly processed with a single line of code:
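A hedged sketch of such a call follows. The argument names are those described in the text; the path argument, the project location, and all argument values are assumptions chosen for illustration (three-character ID codes, as in AOC.jpg). Because the function is interactive, the call is guarded and is skipped in non-interactive sessions or when DieTryin is not installed.

```r
# Hypothetical project location; replace with a permanent folder.
path <- file.path(tempdir(), "DieTryinProject")

# Illustrative values only; 'path' as an argument is an assumption.
photo_args <- list(
  path = path, pattern = ".jpg",
  start = 1, stop = 3,          # ID occupies characters 1 to 3 of the filename
  size_out = 1000,              # output width in pixels
  asr = 1.6,                    # aspect ratio
  border_size = 10,             # black border, in pixels
  id_range = NULL, id_names = NULL, spin = TRUE
)

if (interactive() && requireNamespace("DieTryin", quietly = TRUE)) {
  do.call(DieTryin::standardize_photos, photo_args)
}
```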

The arguments to standardize_photos are intuitive: pattern is simply the file extension (accepting either ".jpg" or ".JPG"), start and stop give the locations of the first and last letters of the unique identifier in the filename of each photograph, size_out gives the width of the exported photograph in pixels, asr gives the aspect ratio, and border_size gives the number of pixels on the boundary of each photograph to be colored black.

In normal contexts, the standardize_photos function is programmed to run over all photographs in the RICH/RawPhotos directory. However, if only a small range of photographs (or a single photograph) needs to be processed, then this can be specified. For example, by changing id_range=NULL to id_range=c(1:5), only the first five photographs in the directory will be processed. Likewise, changing id_names=NULL to id_names=c("BYC", "FN1") instructs the program to only process the photographs of the individuals with those specific ID codes. This can be useful to correct user-introduced errors in specific processed photographs.

Finally, the argument spin=TRUE can be set to FALSE, in order to skip the photo-rotation step (discussed below) if all photographs are already correctly oriented. Otherwise, for each photograph in the RICH/RawPhotos directory, standardize_photos will open a two-step process. First, the raw image will be displayed as it is stored by R. The user must then click in one of four quadrants on the displayed photograph: a click in the upper-left will apply no rotation, a click in the upper-right will apply a 90° clockwise rotation, a click in the lower-right will apply a 180° rotation, and a click in the lower-left will apply a 270° clockwise rotation (see Fig. 3a). Next, the correctly oriented picture will be displayed. The user must then click and drag a bounding-box giving the approximate area to be used as the final processed photograph (see Fig. 3b). Repeat these steps for each photo. Each standardized photograph will be saved to the RICH/StandardizedPhotos directory, where they can be checked by the user for quality (see Fig. 3c). Figure 4 presents the photographs from Fig. 2 after applying the standardization procedure.
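The quadrant rule can be summarized in a few lines of base R. This is an illustrative sketch of the mapping described above, not the package's internal code; it assumes pixel coordinates with the origin at the top-left corner of the image, and the function name is hypothetical.

```r
# Map a click at pixel (x, y) to a clockwise rotation in degrees.
# Assumes the origin is the top-left corner, so small y means "upper".
click_to_rotation <- function(x, y, width, height) {
  left <- x < width / 2
  top  <- y < height / 2
  if (top && left) {
    0
  } else if (top && !left) {
    90
  } else if (!top && !left) {
    180
  } else {
    270
  }
}

click_to_rotation(x = 900, y = 50, width = 1000, height = 800)  # upper-right
```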

Building the survey tool

Once the respondent photographs are standardized, they can be printed out, laminated, and arranged on boards. To randomize the positions of photographs on the game boards and generate a survey tool that can be used to efficiently record data, DieTryin provides the build_survey function. Replicable randomization of photographs is done using the seed argument. However, if a specific order of photographs is desired, this can also be specified as we show below. Then, a single line of code will generate a LaTeX document and compile it to PDF:
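A hedged sketch of such a call is shown here. The arguments seed, n_panels, n_rows, n_cols, and ordered are named in the text; path, pattern, start, and stop are assumed to mirror the photograph-standardization step, and all values are illustrative (two panels of 4 by 5 photographs give 40 slots). The call is guarded so that it only runs where DieTryin is installed and standardized photographs exist.

```r
# Hypothetical project location; replace with a permanent folder.
path <- file.path(tempdir(), "DieTryinProject")

# Illustrative values; path/pattern/start/stop are assumed arguments.
survey_args <- list(
  path = path, pattern = ".jpg", start = 1, stop = 3,
  n_panels = 2, n_rows = 4, n_cols = 5,
  seed = 1,          # record this value: it is reused at data entry
  ordered = NULL     # NULL randomizes; a vector of IDs fixes the order
)

if (requireNamespace("DieTryin", quietly = TRUE) &&
    dir.exists(file.path(path, "RICH", "StandardizedPhotos"))) {
  do.call(DieTryin::build_survey, survey_args)
}
```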

The seed argument gives the starting value for the random number generator used to randomize the order of photographs, the n_panels argument gives the total number of panels/boards of photographs to be displayed, and the n_rows and n_cols arguments give the number of rows and columns of photographs per panel/board. It is generally important to randomize the ordering of photographs on the panels a few times over the course of a single field-season, especially when conducting the economic games. By changing the seed argument, the order of photographs on the survey tool can be changed. Later, this same seed value will be passed to the data-entry functions in order to facilitate data entry. If the user wishes to pass in a specific order of photographs, a vector of IDs can be passed in via the ordered argument, which will take precedence over seed, otherwise set ordered=NULL to randomize order.
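The role of the seed can be illustrated with a small base R sketch. It uses set.seed() and sample() to shuffle a hypothetical roster and assign panel, row, and column positions; it demonstrates replicable randomization in general and is not claimed to reproduce the ordering that build_survey generates for the same seed.

```r
# Illustrative only: shuffle IDs reproducibly and lay them out on panels.
assign_positions <- function(ids, n_panels, n_rows, n_cols, seed) {
  per_panel <- n_rows * n_cols
  stopifnot(length(ids) <= n_panels * per_panel)
  set.seed(seed)
  shuffled <- sample(ids)
  slot <- seq_along(shuffled) - 1            # zero-based slot index
  data.frame(
    id    = shuffled,
    panel = LETTERS[slot %/% per_panel + 1],
    row   = (slot %% per_panel) %/% n_cols + 1,
    col   = (slot %% per_panel) %% n_cols + 1,
    stringsAsFactors = FALSE
  )
}

ids <- sprintf("P%02d", 1:39)                # hypothetical ID codes
positions <- assign_positions(ids, n_panels = 2, n_rows = 4, n_cols = 5, seed = 1)
head(positions)
```

Calling the function twice with the same seed returns the same layout, which is what allows a data-entry step to recover the recipient order from the seed alone.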

Figure 5 illustrates an example survey tool compiled by DieTryin. Frame (a) illustrates the exported survey tool. Frame (b) illustrates how data on directed coin allocations may be recorded using the survey tool. Note that the header of the survey tool can be customized by editing the header.txt file in the RICH/Survey directory before running the build_survey function.

Assembling the game boards

Once the survey tool is compiled, the standardized respondent photographs can be arranged on the game boards in the same order as in the survey tool. For even moderately large field-sites, it is common to need multiple boards/panels. Figure 6 illustrates a layout of 39 respondents distributed over two panels.

For field-sites up to around 300 respondents, it can be feasible to run RICH games and other forms of dyadic data collection using a photograph array of all residents. For larger field sites, it might be necessary to create a large number of panels, and then present only a random subset of these panels to each focal respondent, randomizing the set of panels between respondents. This ensures balanced potential payoffs across all recipients in the field-site, without requiring each focal to make a decision with respect to each and every alter. Balanced potential payoffs are important for ensuring that all recipients feel that everyone in the community has equal and fair access to the possible benefits from the games. Another possible strategy is to run the economic games and other forms of dyadic data collection independently within sub-communities of feasible size. Either of these methods comes with inferential trade-offs that must be considered in light of the research goals.

Manually entering data

For some kinds of data, especially data from the allocation or costly reduction games—where multiple coins can be allocated to each person—it is often best to enter data manually using the enter_data function:
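A hedged sketch of such a call is given below. Only the seed argument is confirmed by the text; the remaining arguments are assumed to mirror those used to build the survey tool, and all values are illustrative. The seed must match the one printed on the PDF. Because the function opens pop-up windows, the call is guarded and skipped in non-interactive sessions.

```r
# Hypothetical project location; replace with a permanent folder.
path <- file.path(tempdir(), "DieTryinProject")

# Assumed arguments (mirroring the survey layout); seed is from the text.
entry_args <- list(
  path = path, pattern = ".jpg", start = 1, stop = 3,
  n_panels = 2, n_rows = 4, n_cols = 5,
  seed = 1
)

if (interactive() && requireNamespace("DieTryin", quietly = TRUE)) {
  do.call(DieTryin::enter_data, entry_args)
}
```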

The function will open a pop-up window in R, and start a two-step data entry process. In the first step, the header data (i.e., name, date, and ID code) is entered. In the second step, the network data is entered. If one is entering data from the first game of a given respondent, then type: Y, and the header data will be cleared. Otherwise, if one wants to save some time and carry the header data forward (e.g., if one is entering data on a second game from the same person), type: N, to carry the prior header data forward.

Now enter the header data as shown in Fig. 7a. Data must be supplied for ID and Game, but other entries can be left blank if desired. In the pop-up window, the entry for Game must take one of three special values for RICH games data (G for the giving/allocation game, L for the leaving/taking game, and R for the reduction/punishment game). Other question types can be given arbitrary names (corresponding to those supplied in the add argument of the setup_folders function). The argument Order gives the order of the panels as presented to the respondent, with A being the first panel in the PDF, B being the second, and so forth. The seed argument is prefilled during the call to the enter_data function. As long as the seed in the function call matches the seed printed on the PDF, the recipient IDs will be properly sorted and ready for data entry. Once the header data is entered, simply close the window in R; there is no need to save or hit ctrl+s.

A new pop-up window will open, as shown in Fig. 7b. Now, if coins were placed on a recipient’s ID, click on that ID code, type the number of coins placed, and then move on to the next ID. If no coins were placed on a recipient, just leave the ID code alone. There is no need to type in the zeros, as they will be filled automatically. Once the data are entered, simply close the window. Game data will be saved as a csv file in the appropriate Results directory. If errors are made in data entry, the resulting csv file can be edited by hand, or the enter_data function can be run again; if the function is run again, it will overwrite the previous (erroneous) version of the person-specific results file. The data in Fig. 7b and c correspond to the data recorded on the PDF in Fig. 5b.
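The automatic zero-filling can be pictured with a short base R sketch. The helper name and the ID codes are hypothetical; the sketch only illustrates the idea that entries are recorded for recipients who received coins and that every other roster member is assigned zero.

```r
# Hypothetical helper: expand sparse entries to a full allocation vector.
fill_zeros <- function(roster_ids, entered) {
  stopifnot(all(names(entered) %in% roster_ids))
  out <- setNames(rep(0, length(roster_ids)), roster_ids)
  out[names(entered)] <- entered
  out
}

allocation <- fill_zeros(
  roster_ids = c("AOC", "KW1", "JLO", "BYC"),
  entered    = c(KW1 = 2, BYC = 1)
)
print(allocation)
```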

When entering data for the three RICH games, DieTryin accepts only numerical values (i.e., the number of coins placed on a given alter). This requirement is relaxed when entering data from other question types: text strings can be used, for example, to define qualitative or categorical ties.

Data compilation

After the data collection protocols have been completed by each participant and all data for each game have been entered by the researcher, then the individual-level csv files can be compiled into a single data-set and checked for basic data-entry errors. To do so, we run the compile_data function:

This will build two files for each game (a summary table and an edge-list) and store them in the Results folder. The summary table gives a basic self versus other coin allocation count, and checks that the sum of entries (i.e. the total number of coins/tokens appearing on the game board) is correct. If the checksum cell in the summary table for any respondent is not equal to the total number of coins used in a given game, then there was likely a mistake during data collection or data entry. Note, for example, the data entry error in Fig. 7, where the sum of entries is 19, even though 20 coins were distributed according to Fig. 5b. If the checksum for a given respondent is wrong, the corresponding game board photographs, surveys, and csv files should be checked and revised (in this example, the allocation on row 3, column 3, of game-board panel A must be changed from a 1 to a 2 to match Fig. 5b). The compile_data function must be rerun to integrate changes. Once all of the summary tables appear correct, we can continue with the workflow.
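The checksum logic can be reproduced on any edge-list with base R. The sketch below uses a small, made-up edge-list with hypothetical column names (respondent, alter, coins), in which a self-directed row stands for coins kept; it is not the file format written by compile_data.

```r
# Made-up edge-list; a row with respondent == alter represents coins kept.
edges <- data.frame(
  respondent = c("AOC", "AOC", "AOC", "KW1", "KW1"),
  alter      = c("AOC", "KW1", "JLO", "KW1", "JLO"),
  coins      = c(10, 6, 3, 12, 8),
  stringsAsFactors = FALSE
)
total_coins <- 20   # coins handed to each respondent in this game

# Sum of entries per respondent, compared against the expected total.
checksum <- tapply(edges$coins, edges$respondent, sum)
flagged  <- names(checksum)[checksum != total_coins]
print(flagged)      # respondents whose records should be re-checked
```

Here the entries for AOC sum to 19 rather than 20, so that respondent's board photographs and csv file would be the ones to revisit.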

Payout calculation

Once reliable edge-list data for all games have been compiled as above, payouts for each respondent can be calculated. To do so, we run the calculate_payouts function:
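A hedged sketch of such a call follows. The arguments game, GV, LV, KV, and RV are named in the text; path, pattern, start, and stop are assumed, and the monetary values are purely illustrative. The call is guarded so that it only runs where DieTryin is installed and a Results folder exists.

```r
# Hypothetical project location; replace with a permanent folder.
path <- file.path(tempdir(), "DieTryinProject")

# Illustrative values; path/pattern/start/stop are assumed arguments.
payout_args <- list(
  path = path, pattern = ".jpg", start = 1, stop = 3,
  game = "GLR",   # use data from all three RICH games
  GV = 1,         # value of a coin in the giving game
  LV = 0.5,       # value of a coin in the leaving game
  KV = 1,         # value of a coin kept for self in the reducing game
  RV = -4         # reduction value of a token in the reducing game
)

if (requireNamespace("DieTryin", quietly = TRUE) &&
    dir.exists(file.path(path, "RICH", "Results"))) {
  do.call(DieTryin::calculate_payouts, payout_args)
}
```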

The argument game indicates which game data will be used in payout calculations. All combinations of G for the giving/allocation game, L for the leaving/taking/exploitation game, and R for the reduction/punishment game, are accepted: i.e., type "G", "L", "R", "GL", "GR", "LR", or "GLR". The argument GV gives the monetary value of the coins used in the giving game, LV gives the monetary value of the coins used in the leaving game, KV gives the monetary value of the coins kept for self in the reducing game, and RV gives the reduction value of the tokens in the reducing game. In the case that some recipients never appear as respondents—presumably due to temporary absence from the community—coins directed to these recipients are refunded to donors. The total payouts to each individual are displayed in R and saved as a csv file in the RICH/Results folder.

Automatically entering data: Binary indicators

While RICH games data are often best entered manually, since there can be several coins allocated to each recipient, it can be useful to collect additional binary dyadic data: e.g., “With whom have you shared food in the last 30 days?” using the same photograph roster. By placing tokens of a known color on the photograph roster to indicate directed ties and then photographing the resulting game boards, a researcher can implement an automated data entry workflow with DieTryin.

Photographs of the game boards, however, normally suffer from rotation, skew, or shearing that can complicate automated data entry. To correct these visual distortions, DieTryin uses a two-step process in R. First, the user must identify the corners of each game board with a click. Then, DieTryin will identify the camera position relative to the game board, and apply an algorithm that corrects any distortions. See Fig. 8 for unprocessed photographs of the game boards and Fig. 9 for photographs of the game boards after algorithmic distortion correction and cropping.

Next, once undistorted images have been produced, they are automatically fed into a classification algorithm. Under the hood, each individual respondent photograph is extracted from the overall image array using the dimensions of each panel, and the number of photograph rows and columns it contains. The recipient’s ID code is mapped onto this image. For each recipient, pre- and post-allocation photographs are then analyzed and compared. Figure 10 provides an outline of the method used to identify the presence/absence and color of tokens.

For a user to implement automatic data entry, all photographs of a given respondent’s game boards must be pasted into the RICH/ResultsPhotos directory. To account for variation across respondents in the lighting of the game boards, there should also be a photograph of each panel for each individual with no tokens placed (the control condition). The file-names of these images must be formatted as: Blank_AAR_A.jpg. The first string is the word Blank followed by an underscore, then the respondent’s ID code, then another underscore, and finally the board/panel ID. For each additional question/game, another set of photographs is provided. These file-names must be formatted as: Game_AAR_A.jpg, but in this case, Game can be replaced with an arbitrary text string.
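Before running the classifier, it can be useful to confirm that the file-names follow this convention and that every game photograph has a matching blank. The base-R sketch below does this for a hypothetical set of file-names; the game label FoodSharing and the ID codes are invented for illustration, and the check assumes that labels and ID codes contain no underscores.

```r
# Hypothetical contents of the photo directory.
files <- c("Blank_AAR_A.jpg", "Blank_AAR_B.jpg",
           "FoodSharing_AAR_A.jpg", "FoodSharing_AAR_B.jpg",
           "Blank_BXK_A.jpg", "FoodSharing_BXK_A.jpg", "FoodSharing_BXK_B.jpg")

# Split "Game_ID_Panel.jpg" into its three components.
parts <- strsplit(sub("\\.jpg$", "", files), "_", fixed = TRUE)
stopifnot(all(lengths(parts) == 3))

photos <- data.frame(
  file  = files,
  game  = vapply(parts, `[`, character(1), 1),
  id    = vapply(parts, `[`, character(1), 2),
  panel = vapply(parts, `[`, character(1), 3),
  stringsAsFactors = FALSE
)

# Every game photograph needs a blank (control) photograph
# of the same respondent and panel.
key        <- paste(photos$id, photos$panel)
blank_keys <- key[photos$game == "Blank"]
missing_blank <- photos[photos$game != "Blank" & !(key %in% blank_keys), ]
print(missing_blank$file)
```

In this example, panel B for respondent BXK has a game photograph but no blank, so it is flagged.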

Then, to speed up the classification algorithm, photographs of the game boards can—optionally—be resized to a smaller dimension:

This line of code will copy all game board photographs, downsize them by a factor of 5, and save them in a new folder. If the files do not need to be resized, set scaler=1.

Next, the user must pre-process the images. The pre_process function opens an interactive window that displays each photo array. The user must click the top-left corner of the photograph array, then the top-right, bottom-right, and bottom-left, in that order. This provides DieTryin with the location information needed to crop out only the photograph array, and correct any rotations or distortions. The user will need to process the blank boards and the boards for at least one other question/game. First run:

where game is the game ID code, and panels are the board/panel ID codes. Then, we can wrangle all of the above data for each game into the list structure needed for the classifier:

Each of the lists above can be extended as needed. See GitHub for more details on batch processing/vectorization across several games from the same respondent.

Now we can run the data entry function. This is the most computationally-intensive step in the data entry process:

The function defaults generally work well, so only the hue thresholds should need to be set by the user in most cases. There are three main parameters that control classification performance: thresh ∈ (0,1), which controls how much the percent difference in hue density must diverge between control and treatment cases for a tie to be declared, and lower_hue_threshold ∈ (0,360) and upper_hue_threshold ∈ (0,360), which give the lower and upper bounds of the hue range corresponding to the token color (see Fig. 10 step E). To identify good hue threshold values for a given token color, it is helpful to use a color picker app that can interactively plot the HSL (Hue, Saturation, and Luminance) values for the pixels corresponding to tokens on a given photograph. We provide a simple interactive application in R for this purpose, via the function get_hue(file.choose()), but many online tools are also available.
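The role of the three main parameters can be sketched in base R as follows. This is a simplified illustration rather than the package's classifier: the helper names and pixel values are hypothetical, "hue density" is assumed to mean the proportion of pixels whose hue falls inside the token's hue range, and the divergence is assumed to be the simple difference between the game and blank proportions.

```r
# Proportion of pixels whose hue (in degrees) lies inside [lower, upper].
# rgb: 3 x n matrix of red, green, and blue values in 0-255.
hue_density <- function(rgb, lower, upper) {
  hue <- grDevices::rgb2hsv(rgb, maxColorValue = 255)["h", ] * 360
  mean(hue >= lower & hue <= upper)
}

# Declare a tie when the in-range hue density rises by more than 'thresh'
# between the blank (control) and game (treatment) photograph of a cell.
is_tie <- function(control_rgb, game_rgb, lower, upper, thresh) {
  gain <- hue_density(game_rgb, lower, upper) -
    hue_density(control_rgb, lower, upper)
  gain > thresh
}

# Hypothetical cell of 100 pixels: warm, skin-like color in the blank photograph.
control <- matrix(c(200, 150, 120), nrow = 3, ncol = 100)
# Same cell with a blue token covering 30 of the 100 pixels.
with_token <- cbind(control[, 1:70],
                    matrix(c(30, 60, 200), nrow = 3, ncol = 30))

print(is_tie(control, with_token, lower = 200, upper = 260, thresh = 0.05))
print(is_tie(control, control,    lower = 200, upper = 260, thresh = 0.05))
```

Comparing against the blank photograph, rather than using the game photograph alone, is what protects against recipient photographs that already contain pixels in the token's hue range.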

Additionally, several more technical parameters can be changed from their defaults to control classification. lower_saturation_threshold ∈ (0,1) sets the saturation below which hue values are excluded from measurement (because they are essentially grey) in step (D) of Fig. 10. lower_luminance_threshold ∈ (0,1) and upper_luminance_threshold ∈ (0,1) set the luminance values beyond which hue values are excluded from measurement (because they are essentially black or white, respectively) in step (D) of Fig. 10. iso_blur controls the width (in pixels) of the isoblur applied in step (C) of Fig. 10 (a value of 0 turns off isoblur). border_size ∈ (0,1) controls the width (in percent) of the excluded border in step (B) of Fig. 10. histogram_balancing is an indicator variable for whether histogram balancing should be applied to enrich the photographs prior to processing. Finally, direction ∈ {"forward", "backward"} indicates whether the distortion correction algorithm should be run in forward or backward mode. Forward mode is fast but produces lower-quality images (that may nevertheless permit perfect classifications), and backward mode is slower but produces higher-quality corrected images (see imager documentation for further details about these modes).
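The exclusion rules for grey, near-black, and near-white pixels, and the trimming of a border, can be sketched in base R. The function names and threshold values here are hypothetical (they are not the package defaults), and the standard HSL definitions of saturation and lightness are assumed.

```r
# Flag pixels whose hue is informative: not grey, not near-black, not near-white.
# rgb: 3 x n matrix of red, green, and blue values in 0-255.
usable_pixels <- function(rgb, lower_sat = 0.1, lower_lum = 0.1, upper_lum = 0.9) {
  x  <- rgb / 255
  mx <- apply(x, 2, max)
  mn <- apply(x, 2, min)
  lum   <- (mx + mn) / 2                 # HSL lightness
  denom <- 1 - abs(2 * lum - 1)
  sat   <- ifelse(denom > 0, (mx - mn) / denom, 0)  # HSL saturation
  sat >= lower_sat & lum >= lower_lum & lum <= upper_lum
}

# Pixel indices kept along one dimension after trimming a fractional border.
inner_range <- function(n, border_size) {
  trim <- floor(n * border_size)
  seq(trim + 1, n - trim)
}

px <- cbind(grey  = c(128, 128, 128),
            black = c(5, 5, 10),
            white = c(250, 250, 245),
            blue  = c(30, 60, 200))
print(usable_pixels(px))
print(range(inner_range(100, 0.1)))
```

Only the blue pixel survives the filters in this example, so only its hue would contribute to the hue density of the cell.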

Running the above steps on the images presented in Fig. 8 yields an edge-list as output (see Table 1). The classification model with default settings generally works well, but performance can be sensitive to input parameters, including the legal range of hues attributable to each token, and the required divergence in hue density between control and treatment photographs. These parameters can be optimized by the user prior to fieldwork using simulated allocations. Tokens of cool hues, like green, blue, and purple, are generally easier to classify correctly, as they are less likely to overlap with skin hue than tokens of warm colors, like red or orange. Surprisingly, use of black-and-white recipient photos can decrease classification accuracy, since the hue values of such photos in the control condition can vary substantially with ambient lighting.

Table 1 The directed ties present in Fig. 11, as inferred by the classifier. Inferred ties should always be checked visually, by plotting these classified ties back onto the raw images


To check that the inferred edge-list is correct, the user can run the check_classification code:

This code will map the inferred ties back onto the photograph array, and save the resulting output in the RICH/ClassifiedPhotos folder, where each panel can be checked for accuracy. See Fig. 11, which demonstrates perfect classification in this test. More broadly, we repeated this process with 26 different allocations of 20 tokens per round across the 39 possible recipients, and found that 520 of 520 true ties were correctly identified, and that 0 of 494 non-ties were mistakenly classified as ties. In other words, with minor tuning of essential parameters, excellent automatic classification is possible using DieTryin.
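The counts reported for this validation exercise follow directly from the design, as the short base-R calculation below shows. The variable names are ours; the numbers are those given in the text.

```r
rounds     <- 26  # test allocations
recipients <- 39  # photographs on the roster
tokens     <- 20  # tokens placed per allocation

true_ties <- rounds * tokens                  # cells that held a token
non_ties  <- rounds * recipients - true_ties  # cells left empty

detected_ties   <- 520  # true ties classified as ties
false_positives <- 0    # non-ties classified as ties

sensitivity         <- detected_ties / true_ties
false_positive_rate <- false_positives / non_ties

print(c(true_ties = true_ties, non_ties = non_ties,
        sensitivity = sensitivity,
        false_positive_rate = false_positive_rate))
```

The same bookkeeping can be reused when tuning thresholds on simulated allocations before fieldwork.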

If the inferred classification of tokens is good, header data can be appended and the results saved as a csv file using the annotate_data command:

This function will export a data file of the same format as the standard enter_data function. Once all data are entered, the final edge-list can be compiled in the same way as was done for manual entry:

Automatically entering data: Likert-scales

In addition to simple binary directed ties, researchers often desire dyadic peer ratings between respondents using Likert-scales. DieTryin supports dyadic Likert-scale ratings through the use of tokens with different colors: e.g., blue = − 1, purple = 0, green = 1, blank = NA. Up to five token colors are supported. Data are prepared as before, but now we run:

where vectors of lower and upper hue thresholds are supplied, and matched to a vector of color codes. The check_classification code can be run as before, with no modifications. In Fig. 12, for example, we attempt to classify categorical ties using three token colors. All ties are again correctly classified (see Table 2). More broadly, we repeated the process with 26 different allocations of 9 tokens of each of 3 colors per round across the 39 possible recipients, and found that 702 of 702 true ties were correctly identified, that 0 of 312 non-ties were mistakenly classified as ties, and that no tokens of one color were mistakenly classified as a different color.
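Once an edge-list with a token color column is available, mapping colors to Likert values is a simple lookup. The base-R sketch below uses the example coding from the text (blue = −1, purple = 0, green = 1, blank = NA); the edge-list itself, its column names, and the ID codes are hypothetical stand-ins for classifier output.

```r
# Hypothetical classifier output: one row per detected token.
edge_list <- data.frame(
  respondent = c("AAR", "AAR", "AAR", "BXK"),
  recipient  = c("BXK", "CMQ", "DPL", "AAR"),
  color      = c("blue", "green", "purple", "green"),
  stringsAsFactors = FALSE
)

# Example coding from the text: blue = -1, purple = 0, green = 1.
likert_code <- c(blue = -1, purple = 0, green = 1)
edge_list$rating <- unname(likert_code[edge_list$color])

# Expand to all respondent-recipient pairs so that blank cells become NA.
ids   <- c("AAR", "BXK", "CMQ", "DPL")
dyads <- expand.grid(respondent = unique(edge_list$respondent),
                     recipient  = ids,
                     stringsAsFactors = FALSE)
dyads <- dyads[dyads$respondent != dyads$recipient, ]

ratings <- merge(dyads, edge_list,
                 by = c("respondent", "recipient"), all.x = TRUE)
print(ratings)
```

Keeping blanks as NA, rather than 0, preserves the distinction between a neutral rating and no rating at all.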

Table 2 The directed ties present in Fig. 12, as inferred by the classifier. Note that the edge list now has a column giving token color which can then be mapped to ordered categories in R


Automatically entering data: Input-free coding

In the examples above, user input is required to identify the corners of the game boards in haphazardly taken photographs. If data are collected using more careful photography, where images are all cropped and squared at the time of data collection, the workflow can be further automated to remove all user input. This approach demands much less data entry effort at the computer, but greater attention to detail in the field in order to collect perfect game-board photographs. Third-party Android or iOS apps, like Tiny Scanner, provide functionality that makes such data collection possible.

As before, the data must be read by the pre_process function, but now with the argument pre_processed=TRUE, to indicate that the photographs have already been prepared for the classifier:

The data are again wrangled to prepare them for the classifier:

and then the classifier is run. In this case, however, the argument pre_processed=TRUE must be set as below:

As before, the inferred classifications should be visually checked for accuracy using the check_classification function. If the classification is good, header data are then appended as before using the annotate_data function. Once all data are entered, they can be compiled into a single edge-list using the compile_data function.

Discussion

There has been a recent drive to make tools available for researchers interested in collection of social network and relational data, e.g., Breadboard (Human Nature Lab, 2020a), Trellis (Human Nature Lab, 2020b), and Open Data Kit (ODK Team, 2020). These tools, however, are generally geared towards researchers working in areas where research participants have access to and familiarity with computer interfaces. DieTryin offers a low-cost alternative for socio-relational data collection, and caters to a different sub-field of researchers (especially cross-cultural psychologists and anthropologists) working in areas where data quality and validity may be improved by employing technologically simpler research tools, i.e., using physical photograph rosters, rather than tablet computers, to collect data. Nevertheless, DieTryin streamlines the data collection, entry, and cleaning process, through the use of high-tech functionality on the ‘back end,’ where machine learning algorithms are used to automatically code and validate edge-list data from photographs of token allocations. It is our hope that DieTryin will contribute to the compilation of a rich corpus of cross-cultural socio-relational data, and will help to alleviate some of the validity concerns associated with the use of name-generator-based methodologies.

Building richer cross-cultural databases

Even comparatively simple cross-cultural studies of behavioral variation in classical economic games, like the Dictator and Ultimatum Games (e.g., Henrich et al., 2001; Henrich et al., 2005), have had a profound impact on our understanding of human behavior, cognitive processes, and cultural variation. New methodologies for conducting network-structured economic games with roster-based methods, e.g., as introduced by Rucas et al. (2010) and Gervais (2017), have the potential to further extend cross-cultural studies of human behavioral variation and unpack the effects of focal, alter, and dyadic characteristics. The data generated under these methods are thus richly informative, reflecting both positive and negative ties between all community members at a dyadic level, and are broadly applicable to tests of theory in fields as diverse as behavioral economics, cross-cultural psychology, and evolutionary anthropology (Pisor et al., 2020). Although network-structured economic games generally, and RICH games specifically, have been used primarily by anthropologists studying rural social networks (Rucas et al., 2010; Gervais, 2017; Pisor et al., 2020), they can just as easily be deployed to study social relations in other bounded communities, like classrooms, sport teams, or urban neighborhoods.

These games, however, have not yet been applied across anthropological and psychological study sites as broadly as classical economic games (e.g., Henrich et al., 2001, 2005; Purzycki et al., 2016), presumably due to the difficulty of scaling roster-based social-network methods. The time burden of in situ data collection with paper and pencil, and of on-site data entry with standard, spreadsheet-based methods, scales with the square of sample size and rapidly becomes prohibitive. In this paper, we have introduced an R package that dramatically reduces the time burden of data collection and entry, hopefully stimulating wider use of RICH games and associated dyadic data collection methods across study sites. Moreover, by reducing the amount of time needed to collect, enter, and process data, DieTryin allows more questions to be asked of respondents during a given interview, permitting collection of important control variables. Reducing the time burden of data collection also facilitates longitudinal study designs, permitting study of how dyadic game behavior changes with time and with time-varying covariates.
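The quadratic scaling can be made concrete by counting directed dyads: with n community members who each respond about every other member, there are n(n − 1) entries to record. The community sizes in this base-R illustration are arbitrary examples.

```r
# Number of directed dyads (respondent -> recipient, excluding self)
# for a few example community sizes.
n <- c(10, 39, 100, 200)
directed_dyads <- n * (n - 1)
print(data.frame(n = n, directed_dyads = directed_dyads))
```

Doubling the community from 100 to 200 members roughly quadruples the number of entries, from 9,900 to 39,800.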

Validity

A growing literature has called into question the reliability and validity of some traditional survey methods for measuring social networks, especially the name-generator method (Bernard et al., 1982; Bernard et al., 1984; Brewer, 2000; Krackhardt & Kilduff, 1999). That is, doubt has arisen about whether these tools: (i) indeed capture what the researcher believes that they do, (ii) yield consistent results if and when repeated, and (iii) lead to well-founded inferences (O’Reilly, 1988). Although self-reports of social relationships are commonly used (Kashy & Kenny, 1990; Romney & Weller, 1984; Moody et al., 2007), an individual’s perception of, or statements about, their social ties are not necessarily impartial or accurate accounts of their social world (Krackhardt, 1987; Freeman, 1992).

Several important biases occur when using such measurements. For example, individuals may preferentially remember and nominate those with desirable qualities or high positions within their social group (Marin, 2004). Likewise, individuals are at times unable to accurately remember their interactions (Bernard et al., 1984; Brewer, 2000; Bell et al., 2007), and question/roster order effects can impact responses, especially if the data collection burden leads to respondent fatigue or boredom (Pustejovsky & Spillane, 2009).

As such, self-report data collection is not a replacement for direct observational data collection. For example, spot-check time allocation data (e.g., Borgerhoff et al., 1985; Koster et al., 2013) provide a more accurate representation of true time budgets than simple self-reports, which are prone to exaggeration, limited recall, and even self-deception. There is a general awareness of the benefits of collecting observed behaviors (e.g., ethnographically documented food transfers), self-reported behaviors (e.g., transfer ties as recalled by respondents), hypothetical behaviors (e.g., who respondents claim they would visit for a small loan or advice), and experimental behaviors (e.g., transfers in a network-structured economic game), as each method has its own strengths and weaknesses, but in unison, they help cross-validate one another. DieTryin is not a solution to all validity issues associated with collection of self-report data, but it makes full-roster data collection feasible. This minimizes many validity concerns associated with data collected using the name-generator method. Specifically: (i) recall bias is attenuated by providing a visual prime for each community member, (ii) record linkage and de-duplication issues (Steorts et al., 2016) associated with post-processing of name-generator-based data are bypassed through the use of a full-community roster, and (iii) the speed of name-generator-based collection is maintained, reducing respondent fatigue.

Conclusion

While it is broadly acknowledged that roster-based designs for collecting social network data provide more reliable data than free-recall name-generator-based approaches, the logistical challenges of collecting such data, such as time burden and participant fatigue, have prevented widespread use of such designs. In this paper, we introduce the DieTryin R package, which streamlines the process of collecting and managing roster-based social network data, making the approach as feasible as, if not more feasible than, free-recall name-generator designs. DieTryin expedites standardized, reproducible, and robust collection and curation of social-relational data in ecologically-valid contexts. It allows researchers across the social sciences to obtain comparable data that can be used to test broad-ranging questions across a wide array of populations.

References

Altmann, J (1974). Observational study of behavior: Sampling methods. Behaviour, 49(3–4), 227–266.
Amato, K R, Van Belle, S, & Wilkinson, B (2013). A comparison of scan and focal sampling for the description of wild primate activity, diet and intragroup spatial relationships. Folia Primatologica, 84(2), 87–101.
Bahrick, H P, Bahrick, P O, & Wittlinger, R P (1975). Fifty years of memory for names and faces: A cross-sectional approach. Journal of Experimental Psychology, 104(1), 54.
Barrett, B J, McElreath, R L, & Perry, S E (2017). Pay-off-biased social learning underlies the diffusion of novel extractive foraging traditions in a wild primate. Proceedings of the Royal Society B, 284(1856), 20170358.
Barthelme, S (2019). imager: Image Processing Library Based on ‘CImg’. https://CRAN.R-project.org/package=imager. R package version 0.41.2.
Bell, D C, Belli-McQueen, B, & Haider, A (2007). Partner naming and forgetting: Recall of network members. Social Networks, 29(2), 279–299.
Bernard, H R, Killworth, P, Kronenfeld, D, & Sailer, L (1984). The problem of informant accuracy: The validity of retrospective data. Annual Review of Anthropology, 13(1), 495–517.
Bernard, H R, Killworth, P D, & Sailer, L (1982). Informant accuracy in social-network data v. an experimental attempt to predict actual communication from recall data. Social Science Research, 11(1), 30–66.
Borgatti, S P, Mehra, A, Brass, D J, & Labianca, G (2009). Network analysis in the social sciences. Science, 323(5916), 892–895.
Borgerhoff, M, Caro, T M, Chisholm, J S, Dumont, J P, Hall, R L, Hinde, R A, & Ohtsuka, R (1985). The use of quantitative observational techniques in anthropology. Current Anthropology, 26(3), 323–335.
Brewer, D D (2000). Forgetting in the recall-based elicitation of personal and social networks. Social Networks, 22(1), 29–43.
Broesch, T, Crittenden, AN, Beheim, BA, Blackwell, AD, Bunce, JA, Colleran, H, ..., Mulder, MB (2020). Navigating cross-cultural research: Methodological and ethical considerations. Proceedings of the Royal Society B, 287(1935), 20201245. https://doi.org/10.1098/rspb.2020.1245
Bruno, J, Ingo, J, & Frese, D (2020). Pexels: The best free stock photos & videos shared by talented creators https://www.pexels.com
Cassar, A (2007). Coordination and cooperation in local, random and small world networks: Experimental evidence. Games and Economic Behavior, 58(2), 209–230.
Centola, D (2010). The spread of behavior in an online social network experiment. Science, 329(5996), 1194–1197.
Charness, G, Feri, F, Meléndez-Jiménez, M A, & Sutter, M (2014). Experimental games on networks: Underpinnings of behavior and equilibrium selection. Econometrica, 82(5), 1615–1670.
Colman, AM (2016). Game theory and experimental games: The study of strategic interaction. Elsevier.
Crittenden, A N, & Zes, D A (2015). Food sharing among Hadza hunter-gatherer children. PloS One, 10(7), e0131996.
Cutrona, C E (1986). Objective determinants of perceived social support. Journal of Personality and Social Psychology, 50(2), 349.
Dahl, DB, Scott, D, Roosen, C, Magnusson, A, & Swinton, J (2019). xtable: Export Tables to LaTeX or HTML. https://CRAN.R-project.org/package=xtable. R package version 1.8-4.
Davis, G H, Crofoot, M C, & Farine, D R (2018). Estimating the robustness and uncertainty of animal social networks using different observational methods. Animal Behaviour, 141, 29–44.
DeTroy, S E, Ross, C T, Cronin, K A, Van Leeuwen, E J, & Haun, D B (2021). Cofeeding tolerance in chimpanzees depends on group composition: A longitudinal study across four communities. Iscience, 24(3), 102175.
Dijkstra, J K, Cillessen, A H, & Borch, C (2013). Popularity and adolescent friendship networks: Selection and influence dynamics. Developmental Psychology, 49(7), 1242.
Ferligoj, A, & Hlebec, V (1999). Evaluation of social network measurement instruments. Social Networks, 21(2), 111–130.
Frank, O (2005). Network sampling and model fitting. In PJ Carrington, & J Scott (Eds.) Models and methods in social network analysis (pp. 31–56). Cambridge: Cambridge University Press.
Franz, M, & Nunn, C L (2009). Network-based diffusion analysis: A new method for detecting social learning. Proceedings of the Royal Society B: Biological Sciences, 276(1663), 1829–1836. The Royal Society London.
Freeman, L C (1992). Filling in the blanks: A theory of cognitive categories and the structure of social affiliation. Social Psychology Quarterly, 118–127.
Gervais, M M (2017). RICH economic games for networked relationships and communities: Development and preliminary validation in Yasawa, Fiji. Field Methods, 29(2), 113–129.
Gervais, M M, & Fessler, D M (2017). On the deep structure of social affect: Attitudes, emotions, sentiments, and the case of “contempt”. Behavioral and Brain Sciences, 40, e225. https://doi.org/10.1017/S0140525X16000352.
Granovetter, M (1976). Network sampling: Some first steps. American Journal of Sociology, 81(6), 1287–1303.
Hagen, E H, & Hammerstein, P (2006). Game theory and human evolution: A critique of some recent interpretations of experimental games. Theoretical Population Biology, 69(3), 339–348.
Hammer, M (1984). Explorations into the meaning of social network interview data. Social Networks, 6(4), 341–371.
Harling, G, Perkins, J M, Gómez-Olivé, F X, Morris, K, Wagner, R G, Montana, L, ..., Berkman, L (2018). Interviewer-driven variability in social network reporting: Results from health and aging in Africa: A longitudinal study of an INDEPTH community (HAALSI) in South Africa. Field Methods, 30(2), 140–154.
Henrich, J, Boyd, R, Bowles, S, Camerer, C, Fehr, E, Gintis, H, & McElreath, R (2001). In search of Homo Economicus: Behavioral experiments in 15 small-scale societies. American Economic Review, 91(2), 73–78.
Henrich, J, Boyd, R, Bowles, S, Camerer, C, Fehr, E, Gintis, H, ..., et al. (2005). “Economic man” in cross-cultural perspective: Behavioral experiments in 15 small-scale societies. Behavioral and Brain Sciences, 28(6), 795–815.
Henrich, J, Heine, S J, & Norenzayan, A (2010). The weirdest people in the world? Behavioral and Brain Sciences, 33(2–3), 61–83.
Hill, K R, Wood, B M, Baggio, J, Hurtado, A M, & Boyd, R T (2014). Hunter-gatherer inter-band interaction rates: Implications for cumulative culture. PloS One, 9(7), e102806.
Hlebec, V (1992). Recall versus recognition: Comparison of the two alternative procedures for collecting social network data. In A Ferligoj, & A Kramberger (Eds.) Developments in statistics and methodology, (Vol. 9 pp. 121–128).
Holt-Lunstad, J, Smith, T B, & Layton, J B (2010). Social relationships and mortality risk: A meta-analytic review. PLoS Medicine, 7(7), e1000316.
Hooper, P L, DeDeo, S, Caldwell Hooper, A E, Gurven, M, & Kaplan, H S (2013). Dynamical structure of a traditional Amazonian social network. Entropy, 15(11), 4932–4955.
Human Nature Lab (2020a). Breadboard. Yale Institute for Network Science https://breadboard.yale.edu/
Human Nature Lab (2020b). Trellis. Yale Institute for Network Science https://trellis.yale.edu/
Kashy, D A, & Kenny, D A (1990). Do you know whom you were with a week ago Friday? A re-analysis of the Bernard, Killworth, and Sailer studies. Social Psychology Quarterly, 55–61.
Kearns, M, Judd, S, Tan, J, & Wortman, J (2009). Behavioral experiments on biased voting in networks. Proceedings of the National Academy of Sciences, 106(5), 1347–1352.
Koster, J M, Grote, M N, & Winterhalder, B (2013). Effects on household labor of temporary out-migration by male household heads in Nicaragua and Peru: An analysis of spot-check time allocation data using mixed-effects models. Human Ecology, 41(2), 221–237.
Koster, J M, & Leckie, G (2014). Food sharing networks in lowland Nicaragua: An application of the social relations model to count data. Social Networks, 38, 100–110.
Krackhardt, D (1987). Cognitive social structures. Social Networks, 9(2), 109–134.
Krackhardt, D, & Kilduff, M (1999). Whether close or far: Social distance effects on perceived balance in friendship networks. Journal of Personality and Social Psychology, 76(5), 770.
Laumann, E O, Marsden, P V, & Prensky, D (1989). The boundary specification problem in network analysis. Research Methods in Social Network Analysis, 61, 87.
Lew-Levy, S, Kissler, S M, Boyette, A H, Crittenden, A N, Mabulla, I A, & Hewlett, B S (2020). Who teaches children to forage? Exploring the primacy of child-to-child teaching among Hadza and BaYaka Hunter-Gatherers of Tanzania and Congo. Evolution and Human Behavior, 41(1), 12–22.
Marin, A (2004). Are respondents more likely to list alters with certain characteristics?: Implications for name generator data. Social Networks, 26(4), 289–307.
Marsden, P (1990). Network data and measurement. Annual Review of Sociology, 16, 435–463.
Marsden, P V (2005). Recent developments in network measurement. Models and Methods in Social Network Analysis, 8, 30.
Moody, J, et al. (2007). To tell the truth: Measuring concordance in multiply reported network data. Social Networks, 29(1), 44–58.
Nielsen, M, & Haun, D (2016). Why developmental psychology is incomplete without comparative and cross-cultural perspectives. Philosophical Transactions of the Royal Society B, 371(1686), 20150071.
Nielsen, M, Haun, D, Kärtner, J, & Legare, C H (2017). The persistent sampling bias in developmental psychology: A call to action. Journal of Experimental Child Psychology, 162, 31–38.
ODK Team (2020). Open data kit https://opendatakit.org/
Ohtsuki, H, Hauert, C, Lieberman, E, & Nowak, M A (2006). A simple rule for the evolution of cooperation on graphs and social networks. Nature, 441(7092), 502–505.
O’Reilly, P (1988). Methodological issues in social support and social network research. Social Science & Medicine, 26(8), 863–873.
Pisor, A C, Gervais, M M, Purzycki, B G, & Ross, C T (2020). Preferences and constraints: The value of economic games for studying human behaviour. Royal Society Open Science, 7(6), 192090.
Power, E A (2017). Social support networks and religiosity in rural South India. Nature Human Behaviour, 1(3), 1–6.
Purzycki, B G, Apicella, C, Atkinson, Q D, Cohen, E, McNamara, R A, Willard, A K, ..., Henrich, J (2016). Moralistic gods, supernatural punishment and the expansion of human sociality. Nature, 530(7590), 327.
Pustejovsky, J E, & Spillane, J P (2009). Question-order effects in social network name generators. Social Networks, 31(4), 221–229.
R Core Team (2019). R: a language and environment for statistical computing, R Foundation for Statistical Computing, Vienna. https://www.R-project.org/
Rand, D G, Arbesman, S, & Christakis, N A (2011). Dynamic social networks promote cooperation in experiments with humans. Proceedings of the National Academy of Sciences, 108(48), 19193–19198.
Rand, D G, Nowak, M A, Fowler, J H, & Christakis, N A (2014). Static network structure can stabilize human cooperation. Proceedings of the National Academy of Sciences, 111(48), 17093–17098.
Read, J M, Eames, K T, & Edmunds, W J (2008). Dynamic social networks and the implications for the spread of infectious disease. Journal of The Royal Society Interface, 5(26), 1001–1007.
Ready, E, Habecker, P, Abadie, R, Dávila-Torres, C A, Rivera-Villegas, A, Khan, B, & Dombrowski, K (2020a). Comparing social network structures generated through sociometric and ethnographic methods. Field Methods, 32(4), 416–432.
Ready, E, Habecker, P, Abadie, R, Khan, B, & Dombrowski, K (2020b). Competing forces of withdrawal and disease avoidance in the risk networks of people who inject drugs. PloS One, 15(6), e0235124.
Ready, E, & Power, E A (2018). Why wage earners hunt: Food sharing, social structure, and influence in an Arctic mixed economy. Current Anthropology, 59(1), 74–97.
Romney, A K, & Weller, S C (1984). Predicting informant accuracy from patterns of recall among individuals. Social Networks, 6(1), 59–77.
Rucas, S L, Gurven, M, Kaplan, H, & Winking, J (2010). The social strategy game. Human Nature, 21(1), 1–18.
Rucas, S L, Gurven, M, Kaplan, H, Winking, J, Gangestad, S, & Crespo, M (2006). Female intrasexual competition and reputational effects on attractiveness among the Tsimane of Bolivia. Evolution and Human Behavior, 27(1), 40–52.
Salathé, M, Kazandjieva, M, Lee, J W, Levis, P, Feldman, M W, & Jones, J H (2010). A high-resolution human contact network for infectious disease transmission. Proceedings of the National Academy of Sciences, 107(51), 22020–22025.
Silk, J B, Beehner, J C, Bergman, T J, Crockford, C, Engh, A L, Moscovice, L R, ..., Cheney, D L (2009). The benefits of social capital: Close social bonds among female baboons enhance offspring survival. Proceedings of the Royal Society B, 276(1670), 3099–3104.
Smith, K P, & Christakis, N A (2008). Social networks and health. Annual Review of Sociology, 34, 405–429.
Steorts, RC, Hall, R, & Fienberg, SE (2016). A Bayesian approach to graphical record linkage and deduplication. Journal of the American Statistical Association, 111(516), 1660–1672. https://doi.org/10.1080/01621459.2015.1105807
Sudman, S (1985). Experiments in the measurement of the size of social networks. Social Networks, 7(2), 127–151.
Suri, S, & Watts, D J (2011). Cooperation and contagion in web-based, networked public goods experiments. PloS One, 6(3), e16836.
van Zalk, M H, Nestler, S, Geukes, K, Hutteman, R, & Back, M D (2020). The codevelopment of extraversion and friendships: Bonding and behavioral interaction mechanisms in friendship networks. Journal of Personality and Social Psychology, 118(6), 1269.
Venables, W N, & Ripley, B D (2002). Modern Applied Statistics with S (4th edn.). New York: Springer.
von Rueden, C R, Redhead, D, O’Gorman, R, Kaplan, H, & Gurven, M (2019). The dynamics of men’s cooperation and social status in a small-scale society. Proceedings of the Royal Society B, 286(1908), 20191367.
Wood, B M, Harris, J A, Raichlen, D A, et al. (2021). Gendered movement ecology and landscape use in Hadza hunter-gatherers. Nature Human Behaviour, 5, 436–446.

Acknowledgements

C.T.R. and D.R. were supported by the Department of Human Behavior, Ecology, and Culture at the Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Human Behavior, Ecology, and Culture, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
Cody T. Ross & Daniel Redhead

Corresponding author

Correspondence to Cody T. Ross.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Ross, C.T., Redhead, D. DieTryin: An R package for data collection, automated data entry, and post-processing of network-structured economic games, social networks, and other roster-based dyadic data. Behav Res 54, 611–631 (2022). https://doi.org/10.3758/s13428-021-01606-5

Accepted: 26 April 2021
Published: 02 August 2021
Issue Date: April 2022
DOI: https://doi.org/10.3758/s13428-021-01606-5

Introduction