Synchronous development in open-source projects: A higher-level perspective

Bock, Thomas; Hunsen, Claus; Joblin, Mitchell; Apel, Sven

doi:10.1007/s10515-021-00292-z

Synchronous development in open-source projects: A higher-level perspective

Open access
Published: 13 October 2021

Volume 29, article number 3, (2022)
Cite this article

Download PDF

You have full access to this open access article

Automated Software Engineering Aims and scope Submit manuscript

Synchronous development in open-source projects: A higher-level perspective

Download PDF

Thomas Bock ORCID: orcid.org/0000-0001-6906-3489¹,
Claus Hunsen²,
Mitchell Joblin³ &
…
Sven Apel¹

2126 Accesses
3 Citations
Explore all metrics

Abstract

Mailing lists are a major communication channel for supporting developer coordination in open-source software projects. In a recent study, researchers explored temporal relationships (e.g., synchronization) between developer activities on source code and on the mailing list, relying on simple heuristics of developer collaboration (e.g., co-editing files) and developer communication (e.g., sending e-mails to the mailing list). We propose two methods for studying synchronization between collaboration and communication activities from a higher-level perspective, which captures the complex activities and views of developers more precisely than the rather technical perspective of previous work. On the one hand, we explore developer collaboration at the level of features (not files), which are higher-level concepts of the domain and not mere technical artifacts. On the other hand, we lift the view of developer communication from a message-based model, which treats each e-mail individually, to a conversation-based model, which is semantically richer due to grouping e-mails that represent conceptually related discussions. By means of an empirical study, we investigate whether the different abstraction levels affect the observed relationship between commit activity and e-mail communication using state-of-the-art time-series analysis. For this purpose, we analyze a combined history of 40 years of data for three highly active and widely deployed open-source projects: QEMU, BusyBox, and OpenSSL. Overall, we found evidence that a higher-level view on the coordination of developers leads to identifying a stronger statistical dependence between the technical activities of developers than a less abstract and rather technical view.

On the fulfillment of coordination requirements in open-source software projects: An exploratory study

Article 08 October 2020

Tracing distributed collaborative development in apache software foundation projects

Article 15 November 2016

Developer Dynamics and Syntactic Quality of Commit Messages in OSS Projects

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The success of large software projects relies on the extent to which developers coordinate their efforts. This is especially true for large-scale open-source software (OSS) projects, to which often numerous globally distributed and independent developers contribute (Herbsleb 2007). When multiple developers contribute to interrelated source-code fragments, changes that lack coordination often introduce unintentional side effects. Developers must coordinate their interdependent activities to prevent conflicting changes, to avoid bugs, or to keep the code simple and maintainable (Cataldo et al. 2008, 2009; Bird 2011; Kwan et al. 2011). In large-scale projects, developer coordination is absolutely crucial to ensuring high-quality software and to supporting high developer productivity (Cataldo and Herbsleb 2013).

Since software developers in OSS projects are often globally distributed, they mostly communicate via the Internet to discuss software issues or enhancements or to review code changes (Wu et al. 2003). Mailing lists, issue trackers, and instant messengers are the most commonly used communication channels for coordination of developers in OSS projects (Storey et al. 2017). We dedicate attention to analyzing developer communication on mailing lists because they are historically rich and well-established sources of data for discussions regarding software architecture and reviewing of code changes (Rigby et al. 2008; Ramsauer et al. 2019). In a recent study on 37 OSS projects, Mannan et al. 2020) have shown that about 89% of such discussions take place on the project’s mailing list. Mailing lists are a greater source of longitudinal data than more recently introduced social-coding platforms (e.g., GitHub), because their usage dates back more than 10 years (see Table 2). Mailing lists are also used to discuss the outcomes of developer conferences and similar events where complex issues and long-term plans for feature development are discussed. Even developers in OSS projects who work for corporations may use mailing-list discussions to communicate their intentions to others as public communication is one of the basic concepts in OSS projects (Riehle 2015).

To obtain deeper insights into the fundamentals of developer coordination and the role communication plays in OSS projects, we investigate the relationship between co-editing activities on source-code artifacts and communication activities on the developer mailing list. For this purpose, we replicate and extend an empirical study of Xuan and Filkov 2014) on synchronous development in OSS projects, which we will refer to as the original study. The authors of the original study identified pairs of developers co-editing files to explore the relationship between developer productivity and communication activities. Their major finding was that time intervals rich in co-editing activities are correlated with time intervals rich in e-mail activities and, more importantly, that during these synchronized periods developer productivity was higher.

The original study already provided interesting and useful insights on developer collaboration and developer communication. Nonetheless, they relied on a rather technical, low-level view. Regarding developer collaboration, they limited their perspective to co-edits of individual files. There is reason to believe, though, that this perspective only covers technical edits to files which are likely to be a noisy indication of the content-wise relationship between the edits. Developers co-editing a file may not change any interrelated source code because a file can contain lots of independent functionality. Conversely, highly interrelated source code that is scattered across multiple files will also not be captured by a file-level abstraction. To raise the abstraction level, we analyze co-edits on related source code in terms of features. A feature is a characteristic, user-visible behavior or configuration option of a software product (Czarnecki and Eisenecker 2000; Apel et al. 2013). The information a feature usually conveys is richer and more closely mirrors a developer’s mental model of the software than files. For that reason and also due to the fact that the concept of features is apparently used by developers (Berger et al. 2015; Queiroz et al. 2017; Hunsen et al. 2016, 2020), our overarching research question is whether there is a difference in developers’ collaboration and coordination on features and files. Technically, the code belonging to a feature may be scattered across several files and several features may be tangled within a file (Apel et al. 2013), which needs to be taken into account when developers coordinate.

Compared to the original study, we also take a more nuanced view on communication activity by grouping individual e-mails together according to the thread of communication they belong to. In the original study, all e-mails sent to the mailing list are considered equally likely to be related to each other. We extend the original study by lifting this message-based view of developer communication to a conversation-based view, which incorporates the context of e-mails by grouping e-mails according to threads. Since e-mails belonging to the same thread address a relatively narrow topic space, the likelihood of these e-mails being content-wise related is higher (Bird et al. 2008). A heuristic solely based on temporally close-by e-mails sent to the mailing list likely misses meaningful communicative associations between developers. Hence, we investigate the question of whether there is a difference in the dependence of social and technical activities using a message-based or a conversation-based view of the complex processes involved in developer coordination.

By means of an empirical study, we investigate whether the different abstraction levels (file-based vs. feature-based and message-based vs. conversation-based) affect the relationship between commit activity and e-mail communication observed in the original study using state-of-the-art time-series analysis. More specifically, to learn whether developers engineer their mutual contributions on features, we investigate whether synchronous development occurs more frequently or with a higher degree of synchronicity on features than on files. Knowing about differences between abstraction levels could be exploited for improving developer coordination (e.g., to predict on which parts of the source code a developer is likely to work on next). Furthermore, we investigate whether synchronous development is temporally aligned with coordination on the mailing list. To find out whether developers working on the same file or same feature contemporaneously also communicate, that is, to measure synchronization, we use dynamic time warping (Rabiner and Juang 1993), a state-of-the-art time-series analysis technique.

It is important to note that, when we investigate whether co-editing activity is accompanied by communication on the mailing list, we cannot be sure that the mailing-list communication is related to the co-editing activity. However, it is a difficult task to find out which e-mails are related to the co-editing activity and which not, as e-mails sent by a developer shortly before or after a commit could also cover completely unrelated topics (especially if there are many commits and e-mails within a short period of time); when relating only e-mails whose subject is related to the commit we may omit related e-mails that have a different subject. For that reason, we propose two different approaches, which we call the lower-bound approach and the upper-bound approach: Whereas the upper-bound approach considers all e-mails sent to the mailing list to identify time intervals rich in e-mail activities (as in the original study), the lower-bound approach considers only e-mails whose subject is topically related to the co-editing activity following a very strict matching procedure. We call them upper-bound and lower-bound because the former considers all messages without restrictions, ending up in the maximum amount of considering communication activity, and the latter considers only messages related to co-editing activity, which is a very small subset of the total set of e-mails. Hence, the actual amount of the communication that is content-wise related to the co-editing activity lies in-between these bounds. For the upper-bound approach, we additionally perform manual checks to explore to which extent the content of e-mail communication is related to temporally close-by collaboration on the source code.

For the purpose of the study, we analyze a combined history of 40 years of data for three highly active and widely deployed open-source projects: QEMU, BusyBox, and OpenSSL. We investigate synchronous collaboration on source code and coordination on mailing lists using different abstraction levels. Overall, we found evidence that a more abstract and higher-level view describes developer collaboration and coordination more accurately than a less abstract and more technical view. That is, developers collaborate more frequently and more synchronously on features than on files. For some of our approaches and projects, a conversation-based representation of developer coordination reveals a stronger statistical relation to co-editing source-code artifacts than a message-based representation.

In summary, we make the following contributions:

We replicate the original study on a different data set: three highly active and widely deployed open-source projects. Regarding the existence of synchronous development, we are able to confirm the results of the original study. However, we cannot confirm the results of the original study regarding code growth and implementation effort in synchronous development nor the relationship between the number of synchronous collaboration activities and the number of synchronous communication activities.
We propose two methods for raising the abstraction level of exploring synchronization between developers’ collaboration and communication activities:
- Instead of viewing files as the primary artifacts on which developers are expected to coordinate, we lift the abstraction level to the higher-level perspective of features (which often crosscut the underlying file decomposition).
- We lift the view of developer communication from a message-based model, which treats each e-mail individually, to a conversation-based model, which is semantically richer due to grouping e-mails that represent conceptually related discussions.
We introduce the continuous variable synchronicity degree to quantify the significance of co-editing artifacts. (Previously only binary variables were used.)
We propose an upper bound and a lower bound for determining whether e-mail communication is related to co-editing activity, as relating e-mail communication to co-editing activity is not trivial.
We manually investigate whether e-mail communication is content-wise related to temporally close-by collaboration activities. Our results indicate that only between 29% and 47% (depending on the subject project) of temporally aligned collaboration and communication activities are content-wise related.
We use a novel technique based on dynamic time warping to measure synchronization of activities across source code and mailing lists to adequately take care of the dynamic nature of socio-technical congruence.
We report on an extensive empirical study of three highly active and widely deployed OSS projects. We found that feature-based collaboration captures developer collaboration more accurately than file-based collaboration. In general, our results indicate that a more abstract and higher-level view leads to a stronger statistical dependence between developers’ pairwise technical activities than a less abstract, technical view.

A full replication package is available on a supplementary Web site.^{Footnote 1}

2 Background

Xuan and Filkov (2014) define synchronous development as the situation where two developers contribute to the same source-code file within a short period of time. In the original study, they consider two different kinds of synchronous activities: co-commit bursts and e-mail bursts. To explore the temporal relationship between co-commit bursts and e-mail bursts, they construct continuous curves by smoothing time series of bursts. In the end, they calculated the correlation of these curves to measure the synchronization of collaboration activities and communication activities.

In this section, we introduce the algorithms and concepts of co-commit bursts and e-mail bursts as well as the continuous curves in detail, as used by the authors of the original study.

2.1 Co-commit bursts

Version-control systems (VCS), such as Git, are frequently used to manage the codebase of software projects. In a VCS, developers can access the source code from a main repository, modify parts of the code, and submit their patches, for example, to the mailing list (Sommerville 2010; Ramsauer et al. 2019; Draheim and Pekacki 2003). Code changes can implement bug fixes, refactorings, or further enhancement of the software. Developers often discuss and review code changes on the project’s developer mailing list (Mannan et al. 2020) and then someone else may merge the discussed changes into the main repository (Storey et al. 2017). The VCS stores all code changes in the form of commits together with meta-data such as author information and modification timestamps.

When two developers commit to the same source-code artifact (i.e., file) within a short period of time, Xuan and Filkov 2014) call this a co-commit burst (short, C-burst). For two commits to be included in a burst, the time difference between the commits must not exceed a specified time window, denoted by $\xi $. The time window resembles the fact that developers may have different preferences of how quickly and often they contribute code. Note that looking at only pairs of developers is not a limitation, as groups of more than two collaborating developers end up in separate C-bursts for each pair of developers that are part of such a group. Hence, group-wise collaboration can be considered as the composition of the collaborations of individual developer pairs.

As we describe in Algorithm 1 (adapted from Xuan and Filkov 2014, for each pair of developers (Lines 2–22), it is checked whether the two developers are authors of mutual commits to the same source-code artifact that have a time^{Footnote 2} distance of, at most, $\xi $, and whether these commits have been made to, at least, one common artifact (Line 7). If so, these commits form a C-burst (Lines 4–10), where each burst is represented by a start time and an end time. Finally, overlapping bursts of the same developer pair are merged (Lines 11–19). This algorithm has a complexity of ${\mathcal {O}}(|\overline{D}|^2\cdot |\overline{c_{max}}|^2)$, with $|\overline{D}|$ being the number of developers and $|\overline{c_{max}}|$ being the maximum number of commits of a single developer in the project.

In Fig. 1, we show an example of four commits made by one pair of developers, D1 and D2. In the commits $c_1$ and $c_2$, both D1 and D2 change artifact A3. Using a time window $\xi =5$ days, $c_1$ and $c_2$ were created within the time window and form a burst. Analogously, $c_2$ and $c_3$ form a C-burst due to the change of artifact A5. Since both bursts overlap at $c_2$, they are merged into one burst in the end. $c_4$ also changes the same artifact as $c_3$, but these commits have a larger distance than the time window. Hence, $c_3$ and $c_4$ do not form a C-burst.

In addition to identifying C-bursts, the original study analyzed how C-bursts are related to code growth $\varDelta L$ and implementation effort $\varDelta W$, defined as follows: Let $L_{\text {Add}}$ denote the number of added lines of code (LOC) per commit and $L_{\text {Delete}}$ the number of deleted LOC per commit. Then, $\varDelta L = L_{\text {Add}} - L_{\text {Delete}}$ and $\varDelta W = L_{\text {Add}} + L_{\text {Delete}}$ (Xuan and Filkov 2014).

2.2 E-mail bursts

Xuan and Filkov (2014) use a message-based model to identify e-mail bursts. An e-mail burst (short, E-burst) arises if two persons each send an e-mail to the mailing list within a defined time window $\xi $. For determining E-bursts, Xuan and Filkov use almost the same approach as for identifying C-bursts: For each pair of developers, they iterate over all the e-mails sent by one developer and search for all e-mails of the other developer whose creation dates have an absolute time difference of less than or equal $\xi $ to the e-mail of the first developer. As opposed to the C-burst identification, there are no further conditions to be checked. Hence, all detected e-mails of two different developers within the time window $\xi $ form an E-burst, where each burst is represented by a start time and an end time. Similar to C-bursts, overlapping E-bursts of the same developer pair are merged.

2.3 C-curves and E-curves

To check whether two developers coordinate their collaboration, that is, to check whether C-bursts and E-bursts of a developer pair are synchronized, Xuan and Filkov (2014) introduced the notions of C-curves and E-curves. They compute a C-curve (or E-curve) for each developer pair denoting the number of commits (or e-mails) that are part of a burst aggregated for each day of the time series, as we illustrate in Fig. 2. By comparing the C-curve and the E-curve of a developer pair, they investigate whether synchronous development and communication activities of the developer pair are temporally related. Since coding collaboration and e-mail communication do not take place at exactly the same time, it is not useful to directly compute the overlap of the resulting curves. Therefore, they applied Gaussian smoothing on each of the curves to also be able to align slightly off-set C-bursts and E-bursts. To compare the smoothed curves, they used the Pearson correlation coefficient to check whether C-curve and E-curve of a developer pair are dependent or independent from each other.

3 Research approach

In our study, we extend the original study by lifting the abstraction level in two ways and by changing the methodology of comparing C-curves and E-curves. On source code, we consider synchronous development based on files and features. Additionally, we introduce a metric to quantify the synchronicity of C-bursts. On mailing lists, we differentiate between message-based communication (considering all synchronously sent e-mails from two developers) and conversation-based communication (considering only e-mails belonging to the same thread). When identifying E-bursts, we use two different approaches to determine a lower-bound and an upper-bound for identifiable coordination. Finally, we use a sophisticated time-series analysis technique to check whether C-bursts and E-bursts of a pair of developers are synchronized.

3.1 Research questions

Before we state our research questions, let us reiterate the precise meaning of the terms collaboration, communication, and coordination:

Collaboration:: means that two developers work together by contributing to common source-code artifacts.
Communication:: means that two developers talk to one another on the mailing list (i.e., exchanging e-mails).
Coordination:: means developers are collaborating and communicating in (content-wise related) temporally aligned manners.

To obtain deeper insights into the fundamentals of developer coordination in OSS projects, we investigate the relationship between co-editing activities on source-code artifacts and communication activities on the mailing list. The idea is that developers rely on the characteristic information conveyed by features and conversation threads for building a mental model of the software and the processes around it, which in turn drives the communication and coordination with other developers (Espinosa et al. 2001; Scozzi et al. 2008; Cannon-Bowers et al. 1993). So, the overarching question is whether there is a difference in the statistical dependence of social and technical activities between a semantic, high-level view and a rather technical, low-level view of the complex processes involved in developer coordination. That is, we investigate whether developers collaborate more frequently and more synchronously on features than on files and whether a conversation-based representation of developer coordination reveals a statistically stronger relation to co-editing source-code artifacts than a sole message-based representation. Specifically, we will address the following two research questions regarding each abstraction level of collaboration (files and features) and coordination (message-based communication and conversation-based communication):

RQ1::: Which abstraction level of the source code captures the collaboration of developers best: files or features? That is, which of the two abstraction levels of the source code leads to identifying a stronger statistical dependence between technical activities of developer pairs?
RQ2::: Which abstraction level of the mailing list captures the coordination of developers best: message-based communication or conversation-based communication? That is, which of the two abstraction levels of the mailing list leads to identifying a stronger statistical dependence between technical activities and social activities on the mailing list?

3.2 Files and features

We perform the extraction of C-bursts, as defined in Sect. 2, in two separate analyses for files and features. In the file-based analysis, the commits from two developers within a certain time window form a C-burst if the commits change the same file. One could also think of considering a C-burst if the commits just change a file in the same folder, as files in the same folder may be semantically related to each other. However, projects differ in how they organize files into folders. Folders may be deeply nested, having files at different nesting levels. High-level folders may be too coarse-grained (co-editing code in the same folder may be not related at all), whereas low-level folders may be too fine-grained (missing the relations between files at different levels of nested folders). As it is not obvious and mostly project-dependent which nesting level of folders would be appropriate for C-burst identification, we stick to a file-based analysis, which has been established in the original study.

In the feature-based analysis, the commits from two developers within a certain time window form a C-burst if the commits change the same feature. A feature is a characteristic, user-visible behavior or configuration option of a software product (Czarnecki and Eisenecker 2000). There are different ways of implementing features in source code, one common way is the use of preprocessor directives (Apel et al. 2013; Ernst et al. 2002). For feature extraction from the source code, we rely on C preprocessor directives (#ifdef, #endif, etc.) (Kernighan and Ritchie 1988). In Fig. 3, we demonstrate a short example: All the code which is in-between #ifdef and #endif belongs to the feature stated in the same line as the #ifdef directive (in the example, the feature is called CLOCK_MONOTONIC.

Note that one line of code can belong to multiple features, for example if nested #ifdef directives are used or more than one feature is stated at the beginning of #ifdef. All code changes that affect one of the lines between #ifdef and #endif account for the change of the corresponding feature(s). Note that features may be scattered across multiple files, possibly tangled with other features (Apel et al. 2013). All the changes surrounded by #ifdef directives together with the same feature name belong to the same feature, even if they are part of different files. When analyzing co-edits to features, in our study, code changes which do not belong to a feature (i.e., not surrounded by #ifdef directives) are ignored. We introduce the tools we use to extract feature information in Sect. 4.2.

3.3 Synchronicity degree

The method to identify synchronous development described in Sect. 2 is limited because it does not quantify the magnitude of the overlap among the commits of a C-burst. Essentially, the variable denoting synchronous development is binary. To gain precision, we model the overlap of synchronously changed artifacts within a burst using a continuous variable. This is beneficial because synchronous commits from two developers can contain changes to one common artifact while most of the other changes are to artifacts that are touched by only one of the developers (Bird et al. 2011). For this reason, we introduce the synchronicity degree, a metric capturing the overlap based on the number of lines of code (LOC) each of the two developers adds to the artifacts changed in a C-burst. We calculate the synchronicity degree individually for each C-burst. Formally, we define the synchronicity degree $ deg _{ sync }$ for a C-burst c of the developers A and B as follows:

$$\begin{aligned} deg _{ sync }(c) = \sqrt{\frac{ add (A, syncArt (c))}{ add (A,art(c))} \cdot \frac{ add (B, syncArt (c))}{ add (B,art(c))}}, \end{aligned}$$

(1)

where $ add (A,x)$ denotes the number of code lines added by developer A to the list of code artifacts x in C-burst c, $ syncArt (c)$ denotes the list of synchronously changed artifacts in C-burst c (i.e., the set of all artifacts changed by both A and B in their respective commits), while art(c) is the set of all artifacts changed in C-burst c. In other words, to determine the synchronicity degree, we calculate the geometric mean of the code changes made by the two developers involved in a C-burst. Specifically, the metric incorporates the size of changes to synchronously changed artifacts of each developer, normalized by the changes to all artifacts in the C-burst. To let the synchronicity degree assign high values only to C-bursts that have a high portion of synchronously changed artifacts, and to down-weight C-bursts that have a highly imbalanced number of changes to non-synchronously changed artifacts, we use the geometric mean, this way reducing the weight of higher values compared to the arithmetic mean (as we also show in the following examples).

Table 1 Examples of the synchronicity degree $ deg_{sync} $ for different numbers of added LOC by developers A and B in C-burst c

Synchronous development in open-source projects: A higher-level perspective

Abstract

Similar content being viewed by others

On the fulfillment of coordination requirements in open-source software projects: An exploratory study

Tracing distributed collaborative development in apache software foundation projects

Developer Dynamics and Syntactic Quality of Commit Messages in OSS Projects

1 Introduction

2 Background

2.1 Co-commit bursts

2.2 E-mail bursts

2.3 C-curves and E-curves

3 Research approach

3.1 Research questions

3.2 Files and features

3.3 Synchronicity degree

3.4 E-mails and e-mail threads

3.5 Upper-bound and lower-bound approach for determining coordination

3.6 Time-series analysis of C-curves and E-curves

4 Study design

4.1 Subject projects

4.2 Data extraction

4.3 Variables

4.4 Null model

4.5 Hypotheses

4.5.1 Hypotheses related to C-bursts

4.5.2 Hypotheses related to E-bursts

4.5.3 Hypotheses related to C-bursts and E-bursts

4.5.4 Statistical tests

5 Results

5.1 C-bursts

5.2 E-bursts

5.3 C-bursts and E-bursts

6 What is discussed within E-bursts?

7 Discussion

7.1 C-bursts (H1)

7.2 E-bursts (H2)

7.3 C-bursts and E-bursts (H3)

7.4 Research questions (RQ1 and RQ2) and perspectives

8 Threats to validity

8.1 Internal validity

8.2 External validity

9 Related work

10 Conclusion

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Appendix

Appendix

1.1 Dynamic time warping and Sakoe–Chiba band

1.2 Algorithms

1.3 Result tables

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation