Introduction

The global comic market has been rapidly growing. The market size was valued at USD 14.7 billion in 2021 and is expected to expand at a compound annual growth rate of 4.8% from 2022 to 2029 (Fortune Business Insights 2022). A driving force underlying the rapid expansion of the market is the widespread availability of e-books, which enables us to easily access comics around the world. The COVID-19 pandemic also helped the growth of the comic market because the lockdown in many countries increased the demand for comics. Manga, Japanese comics, has been especially popular on a global scale. Many manga have been translated into different languages. For example, over 510 million copies of One Piece have circulated worldwide; One Piece is recorded in the Guinness Book of Records as the best-selling comic in the history (The Mainichi 2022). The history of the development of manga culture in Japan, its social background, and its global prevalence have been studied (Ito 2005; Brenner 2007; Schodt 2013).

Our social networks are complex but characterized by common properties such as heterogeneity and community structure (Barabási 2016; Newman 2018). Because literally all storylines of manga rely on interactions among characters, social networks among characters, which are often called character networks, may provide a backbone of the story of manga and influence its popularity. More generally, character networks have been analyzed for quantitatively characterizing fiction works (Labatut and Bost 2019; Perc 2020). A number of problems can be addressed through analyses of character networks, such as summarization (Tran et al. 2015; Bost et al. 2019), classification (Ardanuy and Sporleder 2014; Holanda et al. 2019), and role detection (Weng et al. 2007; Jung et al. 2013). Character networks have been studied for a variety of fiction such as novels (Elson et al. 2010; Ardanuy and Sporleder 2014; Bonato et al. 2016; Chaturvedi et al. 2016; Min and Park 2019; Gessey-Jones et al. 2020), plays (Stiller et al. 2003; Mutton 2004; Moretti 2011; Rieck and Leitte 2016), movies (Weng et al. 2007; Park et al. 2012; Agarwal et al. 2014; Yeh and Wu 2014), and TV series (Park et al. 2012; Tan et al. 2014; Nan et al. 2015; Chen and Choi 2016; Bost et al. 2019). However, character networks for comics have been rarely explored except for Marvel comics (Alberich et al. 2002; Gleiser 2007), a graphic novel (Labatut 2022), and a few manga titles (Murakami et al. 2020). One possible reason for this is the difficulty of automating data collection for comics (Labatut and Bost 2019).

In the present study, we analyze character networks of 162 popular Japanese manga that span decades and test the following two hypotheses. First, we expect that popular manga tend to produce realistic social networks. Therefore, we hypothesize that structural and temporal properties of the character networks of manga are similar to those of human social networks. If this hypothesis is supported, it could shed light on how to compose popular manga. Second, our data enables us to investigate how trends of manga have changed over approximately 70 years. Therefore, we hypothesize that character networks of modern manga and old ones are systematically different. We also examine differences in the network structure between manga mainly targeting boys versus girls. Through these analyses, we provide a new understanding of manga culture and its historical development.

Methods

Data collection

We focus on manga of which more than 10 million copies have been published in Japan as of January 2021 (mangazenkan 2021). Since the physical size of the manga may affect the number of panels per page and we adopted the page as the unit of analysis, we excluded the manga that had not been published in the paperback pocket edition (112 mm × 174 mm). Note that the paperback pocket edition is the most common for the boys’ and girls’ manga in Japan. As a result, we included 162 manga (see Additional file 1 for the list of the 162 manga).

We prepared a data table of time-stamped copresence of characters for each manga as follows. For each manga, we manually examined volumes one through three. Examining only three volumes is due to a logistic limitation. All the selected 162 manga had at least three volumes. Then, we first extracted all the characters whose name, blood relation to a named character, or job title that uniquely identifies the character, is known. To ensure the reproducibility of this work, we avoided to use any other information sources (e.g., other volumes of the same manga, the Internet, or anime) to attempt to identify more characters. In practice, such additional information sources would not contribute to identifying many more characters. Second, we recorded the copresence of characters on each page as interaction between the characters. Note that the copresence is the most common definition of interaction in constructing character networks in fictional works (Labatut and Bost 2019). We used copresence on a page rather than in a single panel within a page because it is common that characters appearing in different panels on the same page have some interactions (e.g., two characters talk to each other by alternately occupying successive panels).

Character networks

The original data from which we construct the temporal and static character networks are equivalent to a temporal bipartite graph in which the two types of nodes are characters and pages, and an edge connects a character and a page in which the character appears. We construct a character network by projecting the bipartite graph onto the space of character nodes. In other words, in the character network, we connect two characters by an edge if and only if they appear on the same page at least once (see Fig. 1). The weight of an edge is the number of pages on which the two characters are copresent. Because we are interested in interaction between the characters, we exclude the isolated nodes in the character networks from the analysis.

Fig. 1
figure 1

Schematic illustration of one-mode projection, with which we construct a character network from a temporal bipartite graph of a manga

Bipartite configuration model

To generate a randomized bipartite network, we choose a pair of edges in the original bipartite graph uniformly at random, denoted by (cp) and \((c', p')\), where c and \(c'\) are characters and p and \(p'\) are pages. If c and \(c'\) are the same or p and \(p'\) are the same, we discard the two edges and redraw them. Then, we rewire the two original edges to \((c, p')\) and \((c', p)\). We repeat this procedure 10, 000 times excluding the discarded edge pairs to generate a randomized temporal bipartite network. We construct a static randomized character network from the generated randomized bipartite graph by one-mode projection.

Coefficient of variation

To quantify the heterogeneity of a variable, we measure the coefficient of variation (CV), which is defined as the standard deviation divided by the average. If a variable takes only one value, the CV is 0. If a continuous-valued variable obeys an exponential distribution, the CV is 1. A power-law distribution has a CV value substantially larger than 1.

Degree assortativity coefficient

The degree assortativity coefficient measures the extent to which nodes with similar degrees tend to be adjacent to each other in a network (Newman 2002, 2003). It is defined by

$$\begin{aligned} r=\frac{\sum _{i=1}^{N}\sum _{j=1}^{N}(A_{ij} -k_i k_j/2M)k_i k_j}{\sum _{i=1}^{N}\sum _{j=1}^{N}(k_i \delta _{ij}-k_i k_j/2M)k_i k_j}, \end{aligned}$$
(1)

where N is the number of nodes, M is the number of edges, \(k_i\) is the degree of node i, and \(\delta _{ij}\) is the Kronecker delta. Tha value of r ranges between \(-1\) and 1. A positive value indicates an assortative network, in which nodes with similar degrees tend to be adjacent to each other. A negative value suggests a disassortative network, in which nodes with different degrees tend to be adjacent to each other.

Clustering coefficient

The clustering coefficient quantifies the amount of triangles in a network (Watts and Strogatz 1998). The local clustering coefficient for node i is defined by

$$\begin{aligned} C_i=\frac{\text {(number of triangles including node }i)}{k_i (k_i-1)/2}. \end{aligned}$$
(2)

The denominator gives the normalization such that \(0 \le C_i \le 1\). The clustering coefficient, denoted by C, is defined by the average of \(C_i\) over all the nodes in the network, i.e.,

$$\begin{aligned} C=\frac{1}{N}\sum _{i=1}^{N}C_i. \end{aligned}$$
(3)

Interevent time

An interevent time (IET) refers to the time between two consecutive events. It should be noted that we regard a page as the time unit to simplify the analysis although manga story does not always follow the chronological order. Copresence of two characters on a page defines a time-stamped event on the edge of the character network, where we identify the page number as discrete time for simplicity. Therefore, an IET on edge (ij) is the time between two consecutive copresence events of characters i and j. A time-stamped event for a given node is the presence of the character on a page. Therefore, an IET for node i is the time between two consecutive appearances of character i.

SI model

We use the susceptible-infectious (SI) model (Kermack and McKendrick 1927) to numerically investigate contagion. Because infection does not spread from one connected component to another, we run the SI model on the largest connected component of the character network (see Additional file 1 for the number of nodes in the largest connected component, denoted by \({\overline{N}}_{(L)}\), for the 162 manga). We assume that just one character is initially infectious and that the other \({\overline{N}}_{(L)}-1\) characters are initially susceptible. If a susceptible character co-appears with an infectious character on a page, the susceptible character becomes infectious with probability \(\beta\). Different infectious characters appearing on the same page independently attempt to infect each susceptible character on the page. Once a character contracts infection, it stays infectious forever. We run the SI dynamics with each of the \({\overline{N}}_{(L)}\) characters as the sole character that is initially infectious.

Temporal correlation coefficient

We quantify the persistence of edges over time by the temporal correlation coefficient (Nicosia et al. 2013; Thompson et al. 2017). First, we calculate the topological overlap for node i at time t by

$$\begin{aligned} O_{it} = \frac{\sum _{j=1}^{N} A_{ij}^{t} A_{ij}^{t+1}}{\sqrt{\sum _{j=1}^{N} A_{ij}^{t} \sum _{j=1}^{N} A_{ij}^{t+1}}}, \end{aligned}$$
(4)

where \(A_{ij}^t\) is the adjacency matrix of an unweighted network at time step t. In other words, \(A_{ij}^t = 1\) if characters i and j are copresent on page t, and \(A_{ij}^t = 0\) otherwise. We then define the average topological overlap for node i by

$$\begin{aligned} O_{i} = \frac{1}{T-1} \sum _{t=1}^{T-1} O_{it}, \end{aligned}$$
(5)

where T is the number of pages. The temporal correlation coefficient for the entire temporal network is given by

$$\begin{aligned} O = \frac{1}{N} \sum _{i=1}^{N} O_{i}. \end{aligned}$$
(6)

Partial correlation coefficient

The partial correlation coefficient measures the extent of association between two variables while controlling for the influence of one or more additional variables. The three-way partial correlation coefficient between \(x_i\) and \(x_j\) conditioned on \(x_k\) is given by

$$\begin{aligned} r_{ij|k}=\frac{r_{ij}-r_{ik}r_{jk}}{\sqrt{\smash [b]{1-r_{ik}^2}}\sqrt{\smash [b]{1-r_{jk}^2}}}, \end{aligned}$$
(7)

where \(r_{ij}\) is the Pearson correlation coefficient between \(x_i\) and \(x_j\).

Results

Similarities to empirical social networks

Fig. 2
figure 2

Degree distribution for the A character and B page nodes in the bipartite graph for One Piece. We also show the Poisson distributions with the same mean

Our original data are equivalent to a temporal bipartite graph in which the two types of nodes are characters and pages, and the edges connect characters to the pages in which they appear. The pages are ordered in time. We show the descriptive statistics of the bipartite networks for 162 manga in Additional file 1. We show the degree distribution for the character nodes for One Piece and the Poisson distribution with the same mean in Fig. 2A. The CV of the original degree distribution of the characters in One Piece is 1.66. This value is approximately 11.1 times larger than that of the Poisson distribution. Therefore, similar to human social networks (Barabási 2016; Newman 2018), the characters in manga have heterogeneous numbers of connections, and there are a small number of characters that appear disproportionately frequently on various pages. We obtained similar results for the other manga (see Additional file 1).

In contrast, the CV of the degree distribution for the page nodes is 0.478 for One Piece. The CV for the Poisson distribution with the same mean is 0.592. Therefore, we argue that the characters do not appear uniformly randomly over the pages. As we show in Fig. 2B, there are typically two or three characters on a page in One Piece, and pages containing none or just one character are relatively rare. We obtained similar results for the other manga (see Additional file 1). These results suggest that the high heterogeneity of the characters in terms of the frequency of appearance on pages and the tendency of a page typically containing two or three characters are two common properties of manga. Therefore, in the following analyses, we consider the bipartite configuration model, in which the degrees of all character and page nodes are preserved and the edges are otherwise randomly placed, as a null model. Then, we examine properties of character networks that we can explain by the null model versus those we cannot.

Fig. 3
figure 3

Character network for One Piece. The thickness of an edge is proportional to the weight of the edge. The node with an arrow represents the protagonist, Monkey D. Luffy

We show the weighted character network, which is the projection of the bipartite graph onto the space of character nodes, for One Piece in Fig. 3 (see Additional file 2: Fig. S1 for the networks for all 162 manga). By definition, the edge weight is equal to the number of pages in which the two characters simultaneously appear. We show a summary of descriptive statistics of the character networks over 162 manga in Table 1 (see Additional file 1 for the statistics for each manga). The character networks vary widely in size from manga to manga. Although multiple protagonists may exist in a manga, for simplicity, we define the protagonist as the node with the largest node strength (i.e., weighted degree) in the character network. The protagonist is the character that appears in the largest number of pages for most manga (\(97.5\%\)). In fact, the protagonist for One Piece, indicated by an arrow in Fig. 3, is Monkey D. Luffy, who is generally known as the most central character in One Piece.

Table 1 Structural properties of character networks for 162 manga

We compared various properties of character networks between the empirical character networks and their randomization obtained by the one-mode projection of the bipartite network generated by the bipartite configuration model. We first compare three strength-related indices, i.e., the average strength, \(\langle s \rangle\), the CV of the strength, \(s_{\mathrm{CV}}\), and the strength of the protagonist, \(s_{\text {P}}\), of the character networks for all 162 manga between the original and randomized networks in Fig. 4A–C. These figures suggest that randomized character networks well explain the strength-related indices of the original character networks.

Fig. 4
figure 4

Comparison between the original and randomized character networks for 162 manga. For the randomized networks, we show the mean and standard deviation on the basis of 1,000 realizations by the circle and error bar, respectively. A \(\langle s \rangle\): average node strength. B \(s_{\mathrm{CV}}\): CV of the node strength. C \(s_{\mathrm{p}}\): strength of the protagonist. D \(\langle k \rangle\): average degree. E \(k_{\mathrm{CV}}\): CV of the degree. F \(k_{\mathrm{p}}/(N-1)\): normalized degree of the protagonist. G \(\alpha\): coefficient in the relationships between the degree and strength, i.e., \(s_i \propto k_i^{\alpha }\). H r: degree assortativity coefficient. I C: clustering coefficient. J \(\tau _{\mathrm{CV}}^{\mathrm{node}}\): CV of IETs on nodes. K \(\tau _\mathrm{CV}^{\mathrm{edge}}\): CV of IETs on edges. L \(N_\mathrm{I}/{\overline{N}}_{(L)}\): final epidemic size (i.e., fraction of infected nodes at the last time step)

Figure 5A shows the strength distribution for One Piece. We find that the strength obeys a heavy-tailed distribution over some scale. This result is consistent with Fig. 4B, which shows that the CV of the strength for most manga is substantially larger than 1; the CV for One Piece is 1.48. The heavy-tailed strength distributions shown in Fig. 5A for One Piece and implied in Fig. 4B for a majority of manga are consistent with those for empirical social networks (Barrat et al. 2004; Wang et al. 2006).

In contrast to the strength-related indices, the randomized character networks do not explain the degree-related indices of the original character networks. The randomized networks tend to overestimate the average degree, \(\langle k \rangle\), of the original character networks (see Fig. 4D). This result suggests that characters tend to repeat interacting with relatively few characters. The CV of the degree, \(k_{\mathrm{CV}}\), for the original networks tends to be larger than that for the randomized networks (see Fig. 4E). Let \(k_{\mathrm{p}}\) denote the degree of the protagonist. The fraction of the other characters that the protagonist is adjacent to, \(k_\mathrm{P}/(N-1)\), is \(0.899 \pm 0.103 \, (\mathrm{mean} \pm \mathrm{standard \, deviation}\) based on the 162 manga) and \(0.971 \pm 0.0413\) for the original and randomized character networks, respectively (see Table 1 and Fig. 4F). Therefore, the protagonist is adjacent to most of the other characters in both original and randomized networks. Based on these results, we conclude that the character networks are strongly protagonist-centered, in which the protagonist interacts with most of the other characters, while other characters tend to interact only with fewer characters than expected for the randomized networks, but including the protagonist. This interpretation is consistent with the aforementioned observation that \(\langle k\rangle\) and \(k_{\mathrm{CV}}\) are smaller and larger for the original than randomized networks, respectively.

Fig. 5
figure 5

Strength and degree distributions for the characters in One Piece. A Strength distribution. B Degree distribution. C The relationship between the degree and strength. Each circle represents a character. The coefficient \(\alpha\) is determined by the linear regression \(\log s_i = \alpha \log k_i +b\), where b is an intercept; the solid line shows the linear regression. Variable \(R^2\) represents the coefficient of determination

We show the degree distribution for One Piece in Fig. 5B. The CV of the degree, \(k_{\mathrm{CV}}\), for One Piece is 0.77. Figure 4E shows that the CV of the degree for most of the manga is smaller than 1. In fact, this result is consistent with that for empirical social networks with similar number of nodes (Read 1954; Zachary 1977), while large empirical social networks tend to have heavy-tailed degree distributions implying a large CV (Barabási et al. 2002; Mislove et al. 2007).

We found that the strength is super-linearly scaled with the degree, i.e., \(s_i \propto k_i^{\alpha }\) with \(\alpha >1\), in the character networks, where \(\propto\) indicates “proportional to”. Note that the absence of correlation between the strength and degree would yield \(\alpha =1\) (Barrat et al. 2004). Specifically, in Fig. 5C, we show the relationship between \(k_i\) and \(s_i\) for One Piece. We obtained \(\alpha =1.94\) with the coefficient of determination \(R^2=0.847\). We obtained similar results for the other manga (see Fig. 4G). These results are consistent with power-law relationships between the strength and degree with \(\alpha > 1\), which has been observed in empirical social networks (Barrat et al. 2004; Wang et al. 2005). Note that the randomized character networks also yield \(\alpha > 1\) and that \(\alpha\) for the original and randomized networks are highly correlated with a small number of exceptions.

The character networks are disassortative. In fact, the degree assortativity coefficient, denoted by r, of the character network is \(-0.345\) on average (see Table 1). The values of r for all but two manga are negative (see Fig. 4H). This result is in stark contrast with various observations that social networks are more often than not assortative with positive r (Newman 2002; Newman and Park 2003). We argue that the character networks are disassortative because they are strongly protagonist-centered. In fact, star graphs are disassortative with large negative values of r (Piraveenan et al. 2008; Estrada 2011). The degree disassortativity is also observed for empirical egocentric social networks (Batagelj and Mrvar 2000; Gupta et al. 2015). Figure 4H indicates that r is also negative for the randomized character networks although the correlation between r for the original and randomized networks is low. Therefore, we conclude that the degree disassortativity in our character networks is a consequence of the protagonist-centered nature of the original character-page bipartite network.

The character networks are highly clustered. Specifically, the clustering coefficient, denoted by C, over the different manga is 0.790 with the minimum value of 0.598 (see Table 1). This result is consistent with the observations that empirical social networks have high clustering coefficients (Watts and Strogatz 1998; Saramäki et al. 2007). In fact, randomized character networks also have similarly large C although the spread is large between the empirical and randomized networks. Therefore, we conclude that a high clustering coefficient is a consequence of one-mode projection of the bipartite graph, which is known (Newman 2001; Ramasco et al. 2004).

Fig. 6
figure 6

Survival function, \(P(\tau )\), of IETs on A nodes and B edges for One Piece. Each line in A and B corresponds to a node and an edge, respectively. We only considered nodes that had at least 100 events and edges that had at least 50 events for A and B, respectively

We found that heavy-tailed distributions of IETs are simultaneously present for nodes and edges in the character networks. We show the survival functions of the IETs (i.e., probability that the IET, denoted by \(\tau\), is larger than the specified value) for individual nodes and edges in One Piece in Fig. 6A and B, respectively. The relatively slow decay in Fig. 6 suggests heavy-tailed distributions for both nodes and edges across some scales of \(\tau\). The CV values for IETs on nodes and edges for One Piece are \(1.72 \pm 0.697\) and \(1.92 \pm 0.535\), respectively. We obtained similar results for the other manga, as we show in Fig. 4J and K. These figures also indicate that the randomization of the bipartite network does not preserve this feature, yielding CV values close to 1 regardless of the CV value for the original temporal character networks. It should be noted that a Poisson process produces an exponential IET distribution, which yields CV = 1. These results support that IETs for nodes and edges in the original character networks are non-Poissonian and heterogeneously distributed, which is consistent with properties of empirical social networks (Fonseca dos Reis et al. 2020).

We simulated stochastic contagion processes on the character network, in which one character can infect another if and only if they appear on the same page. The purpose of this analysis is to examine whether contagion on the character network occurs in a manner similar to that on empirical temporal contact networks, which show different contagion patterns from the case of static contact networks. Related to heavy-tailed IET distributions, we found that epidemic spreading occurs more slowly in the temporal character networks than in randomized counterparts. We ran the SI model. By assumption, an infectious character independently infects each susceptible character coappearing on the same page with probability \(\beta =0.2\). For One Piece, we show in Fig. 7 the time course of the fraction of the infectious characters averaged over all the runs. We also show the corresponding averaged time courses for each of the 1,000 randomized temporal networks by the blue lines. Figure 7 indicates that the infection occurs more slowly in the original temporal network than in the randomized temporal networks. We obtained similar results for the other manga (see Fig. 4L). These results are qualitatively the same as those observed for empirical social temporal networks (Karsai et al. 2011; Masuda and Holme 2013).

Fig. 7
figure 7

Time courses of the fraction of infectious characters for the original temporal network and 1,000 randomized temporal networks for One Piece. We used the SI model starting from one initially infectious character. Each curve represents the average time course of the fraction of infectious characters over \({\overline{N}}_{(L)}\) runs. We set the infection probability per contact event to \(\beta =0.2\)

Trends of network structure over decades

The 162 manga span nearly 70 years, from Astro Boy with the first volume being published in 1952 to Jujutsu Kaisen in 2018 (see Additional file 1 for detailed information on the 162 manga). The trend of the structure of manga character networks may have changed over the \(\approx 70\) years, reflecting the transitions of Japanese society including in economics and fashion. In this section, we explore possible existence of such trends.

We first calculated the Pearson correlation coefficient between the year of publication of the first volume, denoted by y, and various indices of network structure on the basis of all 162 manga. We show the correlation coefficients and their 95% confidence intervals (CIs) in Fig. 8A. We find that the number of characters, N, tends to decrease over decades (i.e., negative correlation between N and y). The CV of the node degree, \(k_{\mathrm{CV}}\), and that of the node strength, \(s_{\mathrm{CV}}\), are negatively correlated with y with moderate effect sizes. In general, these and other network indices may be affected by the number of nodes, N (van Wijk et al. 2010). Therefore, we also examined the partial correlation coefficient between y and each index by partialing out the effect of N. The results of the partial correlation coefficient for \(k_{\mathrm{CV}}\) and \(s_{\mathrm{CV}}\), shown by the lines with squares in Fig. 8A, are consistent with those of the Pearson correlation coefficient, although the partial correlation is closer to 0 than the Pearson correlation. We also find that the average degree, \(\langle k \rangle\), is positively correlated with y in terms of the partial correlation. These results suggest that more recent manga tend to be denser and more homogeneous in the node’s connectivity, such that various characters, not just the protagonist, tend to have more connections in more recent manga.

Fig. 8
figure 8

Correlation between the year of publication of the first volume, y, and indices of network structure. For the partial correlation coefficient, we partial out the influence of the number of nodes. According to a standard, the effect size is said to be large, moderate, or small when the correlation coefficient is \(> 0.5\), \(> 0.3\), or \(> 0.1\), respectively (Cohen 1988). The horizontal lines indicate 95% confidence intervals. N: number of nodes, \(\langle k \rangle\): average degree, \(k_{\mathrm{CV}}\): CV of the degree, \(k_{\mathrm{p}}/(N-1)\): normalized degree of the protagonist, \(\langle s \rangle\): average strength, \(s_{\mathrm{CV}}\): CV of strength, \(s_{\mathrm{p}}\): strength of the protagonist, r: degree assortativity coefficient, C: clustering coefficient, and O: temporal correlation coefficient

How the trend of the character network changes over time may depend on genres of manga. A most major categorization of Japanese manga is the one based on the gender of their main readership (Toku 2007; Prough 2010). Therefore, we classify the 162 manga into 124 boys’ and 38 girls’ manga based on the genre section on their Wikipedia pages (see Additional file 1 for the genre of each manga), while their classification is necessarily subjective. We show the Pearson and partial correlation coefficients between each index and y, separately for the boys’ and girls’ manga in Fig. 8B and C, respectively.

We find that there tend to be less characters for more recent manga for both boys’ and girls’ manga. The boy’s and girls’ manga are different in the following aspects in terms of the partial correlation coefficients, i.e., when we control for the number of characters. The trends that we identified for all the 162 manga are even more pronounced in the boys’ manga. The effect sizes of the partial correlation for \(\langle k \rangle\), \(k_{\mathrm{CV}}\), and \(s_{\mathrm{CV}}\) for the boys’ manga are larger than those for the 162 manga. In addition, the degree assortativity coefficient, r, is positively correlated with y. Because r is negative for most manga, this result implies that r tends to be closer to 0 for more recent boys’ manga. On the other hand, the two protagonist-related indices, i.e., \(k_{\mathrm{P}}/(N-1)\) and \(s_{\mathrm{P}}\), are not correlated with y. These results suggest that the character networks in more recent boys’ manga tend to be less protagonist-centered, in which non-protagonist characters have more connections among them.

The trend for the girls’ manga is opposite to that for the boys’ manga except that the downward trend in the number of characters is common. The two indices for which the partial correlation was negative for the boys’ manga, i.e., \(k_{\mathrm{CV}}\) and \(s_{\mathrm{CV}}\), are positively correlated with y for the girls’ manga in terms of the partial correlation. On the other hand, the two indices for which the partial correlation was positive for the boys’ manga, i.e., \(\langle k \rangle\) and r, are negatively correlated with y for the girls’ manga. Furthermore, the normalized degree of the protagonist, \(k_{\mathrm{P}}/(N-1)\), for which we did not confirm a correlation for the boys’ manga, is positively correlated with y. Overall, these results indicate that the character networks in the girls’ manga have shifted toward more protagonist-centered networks.

Discussion

We have examined character networks of 162 Japanese manga that span over 70 years. Our main findings are as follows. First, the structural and temporal properties of the character networks of manga are similar to those of human social networks. Second, the trend of the manga character networks has shifted over decades. Third, this trend shift is substantially different between boys’ and girls’ manga.

Similarities between character networks and human social networks have been investigated for some fictions such as Marvel comics (Alberich et al. 2002; Gleiser 2007), a graphic novel (Labatut 2022), Shakespeare’s plays (Stiller et al. 2003), and myths (Mac Carron and Kenna 2012). Our results of the high clustering coefficient and the heavy-tailed distribution of the strength in the character networks are consistent with these previous studies. However, we ascribe the high clustering coefficient to the effect of one-mode projection rather than to similarity to empirical social networks because the randomized character networks also have high clustering. Our results of the disassortativity is also consistent with the results for Marvel comics (Gleiser 2007), a graphic novel (Labatut 2022), and myths (Mac Carron and Kenna 2012). Therefore, we infer that the heavy-tailed distribution of the strength and disassortativity in the character networks are common properties for various types of fictions. Although the aforementioned previous studies investigated only static properties of the character networks, we further revealed temporal properties of the character networks, such as the long-tailed IET distributions, which are in fact consistent with empirical social networks (Karsai et al. 2011; Masuda and Holme 2013; Fonseca dos Reis et al. 2020).

The bipartite configuration model, which is a standard random bipartite graph, has been shown to be reasonably accurate at explaining some features of the empirical character networks including statistics of the node’s strength, \(\alpha\) values, disassortativity, and the high clustering coefficients. In contrast, there are other features of the empirical character networks that deviate from the expectation from the bipartite configuration graph. Such features include statistics of the degree, the IET distributions, and speeds of epidemic spreading. We suggested that part of these differences originates from the protagonist-centered nature of the character networks. In other words, the protagonist interacts with most characters, while non-protagonist characters tend to interact only with the protagonist and a smaller number of other characters than expected by the configuration model. There are positive support of protagonist-centered social networks in the real world when the network is egocentric (Batagelj and Mrvar 2000; Gupta et al. 2015).

We also found that more modern manga tend to have fewer characters, be denser, and be less protagonist-centered for the boys’ manga, which may reflect a modern change in the society that places more emphasis on diversity and teamwork (Dell’Era and Verganti 2010; Curşeu and Pluut 2013). In contrast, the character networks in the girls’ manga have shifted toward more protagonist-centered, although the downward trend in the number of characters is common. In general, girls’ manga in Japan tend to revolve around issues of love and friendship with a focus on inner feelings of the protagonist (Prough 2010; Takahashi 2014). Our results suggest that more recent girls’ manga may describe the relationships between the protagonist and a few other characters in depth.

There are many future directions of investigation. First, we focused on 162 manga with high circulations. By analyzing manga with low circulation as well, we may be able to discover static and temporal properties of character networks that readers favor, contributing to understanding why some manga are more popular than others. Second, we analyzed only the first three volumes of each manga because of a logistic limitation. Examining all volumes would allow us to understand the narrative structure, which have been analyzed for novels (Gessey-Jones et al. 2020), movies (Weng et al. 2007), and TV series (Park et al. 2012). For this purpose, a wide variety of time series and temporal network analysis tools, such as change-point detection, temporal centrality, and temporal community structure, may be useful (Holme and Saramäki 2012, 2019; Masuda and Lambiotte 2020). Third, character networks are probably signed in most cases, connecting characters by positive or negative ties. It is worth to deploy sentiment analysis based on text (Min and Park 2019) and facial expressions (Soleymani et al. 2015) to construct and analyze signed networks of characters in manga. Temporal network analysis of signed character networks may also reveal complicated dynamics of relationships among characters (e.g., an enemy later becomes an ally). Fourth, manga in different genres such as action, adventure, sports, and comedy, may have different structures of character networks. In addition, we have only analyzed the manga that have been published in the paperback pocket edition, which is the most common for the boys’ and girls’ manga in Japan. Analyzing manga of other sizes may help us understand variation of character networks over a wider range of genres. Fifth, studying character networks in comics in countries other than Japan warrants future work. For example, character networks may reflect societal differences from country to country.

In conclusion, to the best of our knowledge, the present study of character networks for 162 manga titles is unprecedented in scale in quantitative studies of comics and has enabled us to discover their general features. We also introduced new tools and views to the analysis of fiction character network data, such as the bipartite configuration model, the concept of protagonist-centeredness, and temporal network analysis. We hope that this study triggers further quantitative studies of character networks in manga and other types of fictions.