Introduction

Khulusi et al. (2020) have recently surveyed a large amount of the literature that focuses on the unique link between musicology and visualization by classifying 129 related works according to the types of data that were visualized and the visualization techniques that were applied to respond to certain research inquiries. The intersection of musicology and visualization brings a diversity of innovative applications designed for a variety of purposes. As Khulusi et al. (2020) explain, musicologists are served by interactive tools to analyze musicological data, on the one hand, and on the other, applications are tailored to the broad public with the aim of communicating and teaching aspects of music in a more intuitive, playful manner.

According to Magnuson (2008), the Common Practice Period (1650–1900) of Western classical music is a macrocosm in that rules and practices were comparatively unified and composers could be described as belonging to comparatively long periods such as the late Renaissance, Baroque, Classical or Romantic. By contrast, the music of the twentieth century is both pluralist and microcosmic in that, throughout, composers explored more personal and individual approach to music creation, forming their own microcosms or ‘small universes’. This article visualizes on maps both the macrocosm of the Common Practice Period and the microcosms of twentieth century classical music. This article falls into the realm of an experiment in music education, not musicology. The ultimate objective is to explore parts of the classical music heritage and stimulate interest in discovering composers located on ‘maps’ in the vicinity of better-known composers by proposing, first, network graphs tracking their interconnections based on personal (composer-to-composer) musical influences, and second, multidimensional scaling graphs that locate composers’ relative positions while preserving the pairwise distances among them. In this last case, the distances originate from a matrix of composer similarity indices computed on the basis of musical (personal and style) influences and ‘ecological’ data (or composers’ features).Footnote 1 In an age offering either inculcation through lists of prescribed composers and compositions to explore, or music recommendation algorithms that automatically propose works to listen to next, this article shows an alternative path that might promote the active rather than passive discovery of composers and their music in a less restrictive way than inculcation through prescription. All visualization techniques were developed using Python programming and libraries.

The article applies these visualization techniques to The Classical Music Navigator, thereafter referred to as CMN, a website created by Charles H. Smith (2000), and available at http://people.wku.edu/charles.smith/music/.Footnote 2 C. H. Smith created it as a reference work and experiment in music education. According to the CMN, it consists of five compilations of material: (1) An alphabetically-arranged, ‘Composers’ list containing basic data, major works, and influences of 500 individuals (composer-to-composer influences); (2) A ‘Basic Library’ list of works culled from this composer list (and re-arranged by musical genre); (3) A ‘Geographical Roster’ in which the names of the 500 composers are listed under the names of the countries with which they were (/are) associated; (4) An alphabetical ‘Index of Forms and Styles’ listing the names of composers associated with each subject entry; and (5) A ‘Glossary’ of terms used in the CMN. Although the CMN is a ‘short list’ of 500 composers, the initial pool was a ‘long list’ of 750 composers. The long list was then shortened to 500 on the basis of ‘objective’ criteria to avoid mere advocacy or subjective preferences.Footnote 3 Overall, the CMN aimed at reflecting a composer’s status (at the time it was put together). The 500 individuals are those who scored highest on a combination of eleven variables such as, among others, the length of a composer entry in the Grove’s Dictionary of Music and other catalogs, the total number of recordings referring to each composer, the total number of recordings over the past five years (as of c. 2000), the holdings of sheet music and other items in 50,000 libraries in the U.S. and worldwide (through searches in the OCLC WorldCat database). For further details, see the CMN website section on Statistics.Footnote 4

The CMN would be more likely to meet its original educational objective if it incorporated tools making use of the ability of the human visual system to identify patterns and trends, which we explore in this article. In the next section, we discuss the use of social networks for analysing musicological data and describe the development and collection of the CMN personal musical influence database. Section “An application of network graphs” applies standard network graph theory to the CMN influence database. Section “Composer similarity indices and heat maps” illustrates the method underlying the computation of similarity indices and then Section “Multidimentional scaling (MDS) and classification maps” applies multidimensional scaling (MDS), a technique that transforms the composers’ similarity/distance matrix into MDS maps. The section also uses a support-vector machines algorithm to group composers into several classes, while several data-fitting parameters are used to explore further the music styles of the twentieth century. The final section concludes and discusses a series of issues related to natural language processing as applied to music information discovery, collaborative filtering, recommender systems based on convolutional recurrent neural networks, and the ability of these systems to surprise pleasantly a user (serendipity).

Social networks and classical composers

Although composers typically compose music alone, creative work also depends on influences, interaction, and collaboration. Indeed, certain periods and places are considered hotspots of creativity where new musical ideas are shared and movements arise. For McAndrew and Everett (2015), “[t]alent and status attract network connections, but networks may independently foster creative output. Composers acquire tacit as well as formal musical knowledge from networks of teachers and peers; they build on this to create their own styles and music innovations. They use social networks to signal their own and others’ talent to patrons, agents, concert promoters and publishers.”

Influences are not strictly limited to a composition teacher or even social interactions between peers. As argued by Pfitzinger (2017), even if one of the most personal and profound influences is that of individual composition teachers, in general, the character or style of composing is made up of accumulated influences of countless other musicians and composers. As Conte et al. (2014) wrote in a eulogy for Conrad Susa “[a]rtists may or may not have children, but in the words of Plato, they produce ‘eternal progeny’. These progeny are their musical works. They also aid in the creation of the works of others.” Hence, a composer is also a product of all the musical influences they receive throughout their lives, for example, when studying scores or performing music of others. As further argued by Georges (2017), “Western classical music evolved gradually, branching out over time and throwing off many new styles. This overall development is not due to simple creative genius alone, but to the influence of past masters and genres, as constrained or facilitated by the cultural conditions of time and place”. Smith and Georges (2014) even propose studying similarities between composers through their common and distinct composer-to-composer musical influences, assuming that a greater number of common influences is likely to be reflected in more similar musical styles. Their approach permits indices of similarity between pairs of composers to be computed, as reported in Section “Composer similarity indices and heat maps”.

Besides the fact that influences and networks are important to the development of creative works, the analysis of networks helps us focus on questions such as the composers who are highly influential, innovative, pure information sinks, at the periphery of their group of contemporaries or who stand between groups, schools, or periods, giving the network connectivity and cohesion. Furthermore, such analysis may benefit music historians and composers wishing to identify the pedagogical influences of particular composers on their students, whether to describe particular schools of composition, or trace compositional lineages and examine how a composer might fit into a compositional family. As the composer Pfitzinger (2017) puts it, “[h]ow did my teachers’ teachers influence them? And their teachers? And further back? If I am a compositional descendant of Beethoven or Mahler or Widor or Chadwick, has their compositional style affected me? Can I see it in how I write or the types of piece I choose to compose?” Network theory can be used efficiently to trace genealogies and lineages in the form of ‘shortest paths’ from one composer to another, as described in Section “An application of network graphs”. For a network analysis of teacher-student connections, see the study by Jänicke and Focht (2017) based on musicological data from the Bavarian Musicians Encyclopedia Online (Bayerisches Musiker Lexikon Online, BMLO) that now includes about around 28,000 musicians. The data used in this study is an ambitious digital humanities effort more akin to the preservation of musical heritage than an educational one as we try to pursue here.

Here we use the data on composer-to-composer musical influences collected by Smith (2000) (see the Composers page of the CMN) and apply standard graph theory to visualize and study the network of all 500 composers. For articles that discuss social networks in other musical worlds, see McAndrew and Everett (2015) and Crossley et al. (2015), among others.Footnote 5 As the CMN database reflects more than six centuries of classical music, it enables us to include the composers of both the Common Practice Period (1650–1900) and twentieth century music. Unlike Pfitzinger’s (2017) database of composers, which focuses exclusively on actual ‘teacher-student’ relationship among composers, the CMN network includes all reported composer-to-composer musical influences. The CMN website gives some information on how the list of influences was developed. In Version ‘2.1’, the influences noted were based on more than 20,000 opinions derived from biographical and dictionary sources, and dissertations and other databases available online including information retrieved from liner notes for recordings, programme notes and reviews. For the electronic searches, the opinions were typically retrieved by entering pairs of composers’ names in online search engines together with the word “influence”.Footnote 6,Footnote 7 On average about five opinions/sources were used to corroborate a specific composer-to-composer influence, on the assumption that an influence that is mentioned in five different sources is likely to be a real one. However, the five-times-mentioned influence could not be an absolute standard. In particular, for lesser-known/lesser-studied composers, a lower standard had to be accepted.Footnote 8 At the end, there is a trade-off in choosing the relevant number of sources corroborating an influence, between Type I versus Type II errors, that is, errors of commission (including an influence that should not have been included in the list), and errors of omission (excluding an influence that should have been included in the list).Footnote 9

An application of network graphs

A graph or network is a set of nodes (also referred to as vertices), with a finite number of members n. The CMN database is a network of n = 500 nodes/composers, i1, i2, …in (or, i, j, k…). From the CMN, we can extract an n × n (row × column) adjacency matrix. Think of the n rows as ‘influencers’ and the n columns as ‘influencees’. A cell (i,j) in this matrix is equal to 1 if composer i has influenced composer j, otherwise it is equal to 0. When we observe a value of 1, we say that there is an edge (or link) between i and j. The CMN is a ‘directed’ network, that is (i,j) = 1 does not necessarily imply (j,i) = 1 (e.g., that Alkan influenced Debussy does not, of course, imply that Debussy influenced Alkan). The ‘out-degree’ of a node i, \(k_{i}^{{{\text{out}}}}\), is the number of composers that i influenced. The ‘in-degree’ of a node i, \(k_{i}^{{{\text{in}}}}\), is the number of composers that influenced i. The degree of a node is defined as \(k_{i} = k_{i}^{{{\text{out}}}} + k_{i}^{{{\text{in}}}}\). A directed path in a network is a sequence of distinct composers i1, i2, i3,…, iK−1, iK, so that i1 influenced i2, i2 influenced i3, etc. (This is of interest when tracing composers lineage.) The geodesic distance between two nodes i1 and iK in a directed network is the length of the shortest directed path between them.

There are several algorithms that can be used to visualize a network through a pictorial representation of the nodes and edges. There can be very different layouts or representations of the network itself depending on the algorithms used. In Section “An application of network graphs”, we use the ForceAtlas2 algorithm (Jacomy et al., 2014), a forced-based layout whereby the algorithm modifies an initial (random) node placement by continuously moving the nodes according to a system of forces based on a metaphor of springs and electric charged particles. The ‘spring-electric’ layout uses the attraction formula of springs (between nodes connected with an edge) and the repulsion formula of electrically charged particles (between any nodes). It uses the attraction force (or restoring force) formula of springs,\(F_{a} (i_{1} ,i_{2} ) = k_{a} dist(i_{1} ,i_{2} )\), where dist(,) is the initial geometric distance between two nodes. The more one stretches something, the harder it becomes to keep stretching. Or as one stretches something out, there is a restoring force (of opposite sign) that one has to compete with. Thus, given the initial random position, connected nodes with closer geometric distance dist(,) attract less (the restoring force is lower) than for more distant connected nodes. It also uses the repulsion formula of electrically charged particles (electrons), \(F_{r} (i_{1} ,i_{2} ) = {{k_{r} } \mathord{\left/ {\vphantom {{k_{r} } {(dist(i_{1} ,i_{2} )}}} \right. \kern-\nulldelimiterspace} {(dist(i_{1} ,i_{2} )}})^{2}\) so that closer nodes (charges) repulse more. Hence the spring-electric analogy suggests that initially closer nodes attract less but repulse more. These forces create a movement that converges to a balanced state.Footnote 10

Figure 1 shows a balanced state of the CMN network using ForceAtlas2 algorithm.Footnote 11 Nodes in blue represent composers from the Medieval and Renaissance periods; green represents Baroque; red is Classical; cyan is Romantic; and magenta, composers of the twentieth century. The size of the nodes reflects the number of influences (the ‘out-degree’) of a composer. The larger nodes are known as ‘hubs’—composers who have influenced many other composers in the network. For readability, Fig. 1 labels only the ‘top-150’ composers according to a ranking proposed in the CMN. Figure 1 shows that the network of influences structures and reflects music periods, with composers from the same period (e.g., Renaissance, Baroque, Classical, etc.) appearing closer together, in clusters, reflecting a stronger density of ‘intra-period’ influences. As referred before, this appears like a ‘macrocosm’ for the Common Practice Period (1650–1900), with somewhat unified views on music and music rules and practices (the five pillars of the Common Practice Period–Tonality, Vocabulary, Texture, Sonority, and Time Organization) to which composers belonged to for rather long periods of time (i.e., Late Renaissance, Baroque, Classical or Romantic periods).Footnote 12 We will contrast this shortly with the microcosms or small universes of the twentieth century classical music (Figs. 5, 6, 7 and 8), a period unique in its pluralism and its partial or total modification of earlier music rules.

Fig. 1
figure 1

The macrocosm of the CMN influence network. ForceAtlas2 algorithm. Note: Blue = Medieval and Renaissance; Green = Baroque; Red = Classical; Cyan = Romantic; Magenta = twentieth century. Node size represents the out-degree. (Color figure online)

Quantitative metrics remain essential to shed further information that would be quite difficult to extract from the initial CMN database or from Fig. 1.Footnote 13 The density of a network is the ratio of actual edges in the network, to all possible edges (an arrow between any two nodes) given by n*(n − 1) where n is the number of nodes. The network density (a number between 0 and 1) gives a sense of how closely knit the network is. In our case, we have 3724 edges and 500 nodes so that the composer network density is 3724/(500 × 499) = 0.0149. Hence, the CMN network is on the lower edge of the (0, 1) range, but still far from 0. As the CMN is a directed network, we may also be interested in the density of the reciprocal edges of the network, that is the ratio of the number of edges pointing in both directions to the total number of edges in the graph. The reciprocity of the CMN = 0.0417, so that one can infer that there are 155 edges where any composer i both influenced j and was influenced by j.

A second structural measure is the network global (or average) clustering coefficient. A clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together. The CMN global clustering coefficient is C = 0.2189 and is computed as the average of the local clustering coefficients of all nodes, \(C = \left( {1/N} \right)\mathop \sum \limits_{i = 1}^{n} C_{i}\). The local clustering coefficient of a node quantifies how close its neighbours are to being a clique and is given by \(C_{i} = N_{i} /\left( {k_{i} \left( {k_{i} - 1} \right)} \right)\) where Ni is the number of edges among ‘direct’ neighbors (both influencers and influencees) of i (and therefore the number of triangles through i), \(k_{i} = k_{i}^{{{\text{out}}}} + k_{i}^{{{\text{in}}}}\) is the sum of the in- degree and out-degree of i and \(k_{i} \left( {k_{i} - 1} \right)\) is the maximum (potential) number of edges among the neighbours of i. The local clustering coefficient can be viewed as a type of centrality measure (see below), albeit one that takes small values for ‘important’ individuals. For example, for i = Debussy, Ci = 0.0417. This implies that there are Ni = 430 edges among all neighbours of Debussy. When Ci is close to zero, as is the case for Debussy, i is a ‘star’ − i has a lot of neighbors but his/her neighbors are little connected between themselves, whereas if Ci is close to 1, then the neighborhood of i forms a clique of composers (i is connected to its neighbours, and all the neighbors are connected to each others).Footnote 14 A traditional method to assess real network results is to compare them to those obtained from a purely random network (an Erdos–Renyi graph) with same number of nodes and edges as the tested network. For instance, a well-established result (see Ravasz & Barabási, 2003) is that Ci is independent of ki in random networks while in real networks, Ci decreases with ki (i.e., composers with a large number of neighbours (edges) have a low value for Ci and vice-versa). Figure 2 shows this negative relationship for the CMN as well as a few ‘stars’ with low local clustering coefficients (e.g., Stravinsky, Debussy, J.S. Bach, JS, and Wagner). The intuition for this negative relationship is roughly as follows. On the one hand, composers tend to group in community sharing mostly neighbors within the same community. A composer with a low degree tends to be connected to nodes within his/her single community, leading to a denser community and a relatively high local clustering coefficient. On the other hand, a composer with a high degree tends to be connected to several communities, which increases the maximum (potential) number of edges among his/her neighbours, reducing this composer’s local clustering coefficient. As stars are relatively few in real social networks, the global clustering coefficient computed as the average of the local clustering coefficients tends to be high, and typically higher than the one computed for purely random graphs. This will be shown, below, to lead to the small world characteristic of social networks.

Fig. 2
figure 2

Local clustering coefficient versus composer’s degree. (Color figure online)

Another structural measure of interest starts with the notion of geodesic path (or shortest path) which is the shortest possible series of nodes and edges that stand between any two nodes. In absence of a direct influence of composer i on k, but if i has influenced j and j has influenced k, then the shortest path length (or geodesic distance) between i and k is 2. This may lead to conjectures about whether or not an ‘indirect’ or residual musical influence of i exists on the music of k. As pointed out by Pfitzinger (2017), this might be of interest to musicologists. Shortest paths computed between any two composers may help discovering influence lineages. For example, taking Debussy as the target, we can discover in the CMN network all composers that are one step away (direct influence), two-step away, etc.Footnote 15 The concept of six degrees of separation was made famous by Milgram (1967) in his study of “The Small World Problem”.Footnote 16 Watts and Strogatz (1998) have studied the small world property of networks more formally and Humphries and Gurney (2008) proposed a measure to determine if a network has a small-world characteristic, which is given by \(S = (C/C_{rand} )/\left( {L/L_{rand} } \right)\), where C and L are the clustering coefficient (defined above) and the characteristic path length of the network (i.e., the average shortest path length for the entire network),Footnote 17 while Crand and Lrand are the same concepts for a randomly constructed network. Networks that show both a small average path length and a high clustering coefficient are known as small-world network. For the CMN we find that S = (0.2189/0.0153)/(1.1366/3.29) = 41.43 >  > 1, which confirms that the CMN network has a small-world characteristic.

Beyond some structural measures we can also discuss which nodes are the most ‘important’ in the network through measures of centrality, in particular, degree centrality, betweenness centrality and eigenvector centrality. Figure 3 shows the placement of composers according to their ‘in’ and ‘out’ centrality. Composers with a high in-degree who also have a high out-degree may be viewed as ‘communicators’ or ‘facilitators’ of the network (e.g., Debussy, JS Bach, WA Mozart, Ravel, Liszt, Wagner, Stravinsky, Bartók, Brahms). They are influencers building on the shoulders of others. There are seen as prestigious or knowledgeable (‘authorities’) and other composers seek to be influenced by them. At the other extreme, some composers have low in- and low out-degrees, composing at the periphery. Hildegard, Lara, Milán, and Amram are actually disconnected from the rest of the network (no ‘in’ or ‘out’ influences in the network). Composers with a low out-to-in ratio are information sinks—to some extent, in purely quantitative terms, they do not spread or transfer the knowledge they may have learned from others, at least regarding to the low quantity of direct influences they have had). Some composers are a low source of influence because they are viewed as too indiscriminate in their own sources of influences. For others, this may reflect voluntary isolationism and lack of interest in transmitting knowledge. Perhaps Sullivan is an example of the former. He had little influence on others and might have suffered from his reputation among the musical establishment of writing frivolous music. Britten is perhaps an example of the latter. Of course, chronologically more recent composers are, by definition, sink of influences. Time is playing against them in being influencers, which also reflects a database limitation bias because the CMN has few composers born in the second part of the twentieth century. Finally, composers with a high out-to-in ratio are perhaps best described as outsiders and innovators, defining new directions in music while relying less on previous influences. Composers such as Carissimi, Corelli, D. Scarlatti, Chopin, R. Schumann, and Schoenberg, leading to a transition from a period or style to another, all fit this description to some extent.

Fig. 3
figure 3

In-degree versus out-degree. (Color figure online)

It is common in network analysis to provide a visual check of the node (composer) distribution of ‘in’ and ‘out’ degree centrality. A random network will typically have a binomial degree distribution, or at the limit when n is large, a Poisson degree distribution—distributions that underestimate the presence of nodes with large degrees—while real networks have highly right-skewed distribution—the bulk of the distribution occurring for low degree nodes with just a small number of nodes having a very high degree. For the CMN out-degree distribution, we would expect that the bulk of composers have influenced very few composers while a few composers have influenced many. Similarly, for in-degree distribution we would expect that the bulk of composers have been influenced by very few influencers while a few composers might have been influenced by many. Real networks are often said to follow a Pareto or power law distribution (also known as scale-free distribution).Footnote 18 Figure 4 shows the fraction pk of nodes with degree k, (pk = nk/n), in level (left part of Figure) and in log–log transformation (right). The out-degree average is 7.5, the median is 2, and the standard deviation is 15.9. The corresponding numbers for the in-degree distribution are: average = 7.5, median = 7, sd = 4.3.Footnote 19 Typically, if the in- and out-degree distributions followed a power law then the log–log transformation would appear as a downward sloping straight line (log pk = c − α log k for some constant α, c > 0). It is clear in Fig. 4 that the in and out-degree distributions of the CMN are highly right-skewed as expected from real-world networks, but that the in-degree distribution does not follow a power law for low values of kin (the frequency of composers associated with zero to four influencers is lower).Footnote 20 This reveals a bias in the long list of selected CMN composers; only well-established/known/connected composers were selected at the expense of little established/known/disconnected composers. This bias reveals the stated CMN educational purpose as opposed to preservation of the classical music heritage for musicology purposes.

Fig. 4
figure 4

Distributions of out-degree and in-degree node sizes. Note: out-degree average = 7.51, median = 2, sd = 15.923; in-degree average = 7.51, median = 7, sd = 4.35. (Color figure online)

Table 1 shows two other measures of centrality, betweenness centrality and eigenvector centrality. Betweenness centrality tries to capture nodes that are important not because they have a high in- or out-degree, but because they stand between groups, giving the network connectivity and cohesion. If a composer is often found on shortest paths (as defined above) between any two composers, they will score high on this measure. Table 1 shows that composers with high betweenness-centrality are present in all major periods. Network connectivity can be observed by the presence (in the top-25 list—not shown here) of composers such as Josquin des Prez (transitional stage in Renaissance music), Sweelinck and Schütz (transition from Renaissance to early Baroque), and F. Couperin and Purcell (both composers incorporated in their compositions different Baroque styles, French and Italian for the former, English, Italian and French for the later who is also an important source of influences on English musical renaissance of the early twentieth century). Finally, eigenvector centrality cares not only about the number (quantity) of connections (influences) but also the ‘quality’ of these connections (i.e., whether their connections are also well connected). In a directed network, there are two measures of eigenvector centrality–the left-(or in-) eigenvector which takes into account the quantity and quality of the composers who influenced a subject composer, and the right-(or out-) eigenvector which accounts for the quantity and quality of the composers the subject composer influenced. Given this definition, it is intuitive that the top-10 composers, according to the out-eigenvalue, are likely to be from earlier periods (Renaissance and Baroque), while the top-10 composers with respect to the in-eigenvalue, are from the twentieth century, which reflects the advantage of coming chronologically later. Modern composers had the opportunity to cherry-pick among an increasingly large pool of influencers from earlier periods, an opportunity that, say, a Renaissance composer did not have (partly also because knowledge of medieval and earlier music is limited). For example, Ligeti (1923–2006) was, according to the CMN, influenced by a rather long list of important and influential composers of all periods from the Renaissance to the twentieth century.Footnote 21 Finally, observe that the top-10 lists for degree centrality measures (in and out) and eigenvector centrality (in and out), have no intersection, which illustrates the role of adjusting for influences quality.

Table 1 Composers sorted by several measures of centrality (top-10)

A characteristic of the network of composers studied up to now is that it connects composers from the medieval to the renaissance, all the way to the twentieth century. We now focus on the period since 1862. The choice of 1862, the birth year of Debussy, reflects Griffiths (1978) argument in his ‘Concise History of Modern Music: From Debussy to Boulez’, that Debussy is a transitional figure from late romantic to the modern period. The pool of composers is reduced to 249, so that the adjacency matrix is of dimension 249 × 249. Some modern composers have been influenced by composers from earlier periods, but we only account for musical influences among composers of this period. The number of nodes is 249 and the number of edges is 1325. Hence, the density of the network is 0.02146. The twentieth century network is denser than the previous network, reflecting on average a larger number of influences (among all potential links) per composer.

We color-coded composers (nodes) by style in Figs. 5, 6 citizenship (or regions of affinity) in Fig. 7, and gender and ethnic background in Fig. 8. Figure 6 zooms on Fig. 5 for improved visibility. To ensure the same final ‘equilibrium’ placement of nodes across maps, a pseudorandom number generator was used so that the initial placement of nodes, although statistically random, was created in a deterministic manner. Unlike composers from earlier periods, many twentieth century composers are little-known to the public at large, and Figs. 5, 6, 7 and 8 may help in discovering composers through their connections with better-known composers (whose larger node size (out-degree) reflects their influence).

Fig. 5
figure 5

The microcosms of the 20th Century network. Color by style. ForceAtlas2 algorithm. Note: Late Romantic: deepskyblue; Light Classical: lightblue; Impressionist: blue; Nationalist: gray; Vernacularist: blueviolet; Expressionist: magenta; Serial: pink; Neoclassical: forestgreen; Avant-Garde: saddlebrown; Experimentalist: sandybrown; Mystical: yellow; Eclectic: gold; Neoromantic: springgreen; Minimalist: cyan. (Color figure online)

Fig. 6
figure 6

The microcosms of the 20th Century network. Color by style. ForceAtlas2 algorithm—Zoom level 1. Note: Late Romantic: deepskyblue; Light Classical: lightblue; Impressionist: blue; Nationalist: gray; Vernacularist: blueviolet; Expressionist: magenta; Serial: pink; Neoclassical: forestgreen; Avant-Garde: saddlebrown; Experimentalist: sandybrown; Mystical: yellow; Eclectic: gold; Neoromantic: springgreen; Minimalist: cyan. (Color figure online)

Fig. 7
figure 7

The microcosms of the 20th Century network. Color by citizenship. ForceAtlas2 algorithm. Note: ‘Red’: Americans; ‘Hotpink’: British; ‘Green’: Austrians; ‘Yellowgreen’: Germans; ‘Blue’: French; ‘Deepskyblue’: North-Europeans; ‘Cyan’: Russians; ‘Magenta’: Italians; ‘Gold’: South-Americans; ‘Yellow’: Spanish; ‘Darkorange’: Central-Europeans; ‘Teal’: Other Nationalities. (Color figure online)

Fig. 8
figure 8

The microcosms of the 20th Century network. Color by gender and ethnic background. ForceAtlas2 algorithm. Note 1: Women: in fuchsia. Men and ethnic background: yellow for African-American and African-European composers; green for some Latin-American composers, blue for the rest (essentially, ‘white-males’). Women: L. Boulanger, Beach, Clarke, Larsen, Tailleferre, Tower, Zwilich, Gubaidulina, Musgrave, Crawford, Oliveros, and Monk. African-American/-European composers: Samuel Coleridge-Taylor (English), William Grant Still, Scott Joplin, and George Walker. Latin-American: Leo Brouwer (Afro-Cuban); Agustín Barrios Mangoré (Paraguay); Carlos Chavez, Silvestre Revueltas, Agustín Lara, and Manuel Ponce (Mexico); M. Camargo Guarnieri and Heitor Villa-Lobos (Brazil); Alberto Ginastera and Astor Piazzolla (Argentina); Antonio Lauro (Venezuela). Note 2: Lauro, Lara, and Barrios are too far off the center of the map to be shown here. (Color figure online)

In an interesting thesis Pauls (2014) describes the state of the twentieth century classical music as if there were ‘two centuries in one’. As Pauls puts it: “An outstanding feature of the twentieth-century has been the divergence of European ‘art’ music into two general areas which do not overlap to the same extent that they do in previous centuries. That is, the performing repertoire is at odds, sometimes dramatically so, with a competing canon of works considered to be of greater importance from an evolutionary historical point of view”. This feature can roughly be seen in Fig. 5 where composers located towards the West, South, and East edges of the map make up the bulk of the performing repertoire, pursuing (to some extent) the romantic style of the nineteenth century and, more generally, pursuing the five pillars of the Common Practice Period mentioned earlier. As we move towards the center, the North, and especially the Northeast of the map, however, we find composers who have changed most or all the musical elements and rules of the ‘Common Practice Period’ and are often viewed (loosely) as the ‘avant-garde’ of the music of their time. Magnuson (2008) offers an interesting discussion about which of the five pillars of the Common Practice Period have been basically maintained, generally modified, or completely modified by the different styles of twentieth century (e.g., impressionism, primitivism, neoclassicism, expressionism, serialism, indeterminism, minimalism, neo-romanticism, etc.). He assumes that when a composer either generally modifies or completely changes more than one of these five elements, then, a new music (or Uncommon Practice) is created.

Associating a unique style (node color) to one composer (as in Fig. 5) is a restrictive and misleading assumption as many composers explored different styles over their lifetime. The CMN provides a list of styles for most composers. When more than one style was provided, additional manual searches were done, reviewing the bios of composers in several sources such as Wikipedia and composers websites, to ascertain the dominant style during the composer lifetime. We understand the limitations of this approach and Section “Multidimentional scaling (MDS) and classification maps” deals with this issue in a different way.Footnote 22 However, for the time being, we deliberately take this short-cut in Figs. 5, 6 to explore the main styles of the twentieth century.

According to Magnuson (2008), the music of the twentieth century is unique in its pluralism as “[c]omposers began to explore a more personal and individual approach to music creation, forming their own microcosms or ‘small universes’. No longer bound to the rules formed by one musical approach, they customized sound to suit their own views and preferences.” For Magnuson, there were three important small universes or microcosms near the turn of the twentieth century (see Figs. 5, 6): Impressionism (e.g., Debussy, Ravel, in the Southwest), Primitivism (e.g., Stravinsky, Bartók, in the East and Center), and Expressionism (e.g., Schoenberg, Berg, Webern, in the Northwest).

Impressionism is represented by Debussy, Ravel, and other composers pictured with a blue node and located in the South/Southwest.Footnote 23 Impressionism was a reaction to the state of music at the end of the nineteenth century, that is, late Romantic composers who lived well into the twentieth century, who made some concessions to the new century, but essentially belong to the Common Practice Period. These essentially Romantic composers are located (with deepskyblue nodes) in the East/Southeast and West/Southwest parts of Fig. 5.Footnote 24 Expressionism followed the path of the Common Practice Period but completely mutated its basic pillars of tonality, vocabulary, texture, sonority, and time organization. Besides Schoenberg, Berg and Webern, other expressionists gravitate around them in Figs. 5 and 6.Footnote 25 Expressionism itself led to Serialism and some representative composers, at least during part of their life, are located with a pink node.Footnote 26 Primitivism (Stravinsky, Bartók) positioned itself somewhere between Impressionism and Expressionism, and eventually led to 1) Neo-Classicism (composers pictured with dark-green nodes in Figs. 5 and 6 and essentially grouped in the East part) and to 2) the revival of Nationalism as a source of inspiration (composers with gray nodes), a trend that began with Glinka, Smetana and Dvorák in the nineteenth century.Footnote 27 Note that nationalist composers are spread in the South part of the map closer to late-Romantic composers. This reflects the idea that Nationalism is the continuation of a nineteenth century trend and that the network of influences of nationalists is also driven by their citizenship (see below in Fig. 7).

Other styles, represented by the Avant-Garde (in saddlebrown) and Experimentalists (in sandybrown) in the Nord-East of Figs. 5 and 6, reflect the exploration of the ‘Uncommon Practice’. According to Magnuson (2008), new technology created ElectronicismFootnote 28 while in the second half of the twentieth century there has been an unprecedented attention to new elements of TexturalismFootnote 29—the relationships of timbre, density of pitch and rhythm being given a new primordial status relative to melody and harmony. Reactions to these styles created Indeterminism/Chance/Aleatory music,Footnote 30 a reaction against the total control that is the basis for integral Serialism. Minimalism (Riley, Reich, Glass, Adams–in cyan in Figs. 5 and 6) opposed the ideas of atonality itself and reintroduced the vocabulary and sonority of the Common Practice Period. For Magnuson (2008), neo-RomanticismFootnote 31 (in springgreen in Figs. 5 and 6) opposed these things too, but also represents a complicated relationship between today’s composer (and listener) and the music of the past (as opposed to the late Romantic composers, mentioned above, who belong to the Common Practice Period). Popular music, Jazz, exotic influences and the criss-crossing of styles led to Eclecticism—choosing diverse elements from many different sources. As argued by Magnuson (2008), this is the essence of the twentieth century but certain composers are put in this group as they simply cannot be placed into neat categories due to their originality and individuality.Footnote 32

Figures 7 and 8 focus on citizenship, gender, and ethnic background.Footnote 33 Figure 7 shows that citizenship (and possibly geography) still matters in the influence network.Footnote 34 We see a concentration of North-European composers (deepskyblue)Footnote 35 in the East and towards the center; Russian composers (cyan)Footnote 36 in the South-East and British composers (hotpink)Footnote 37 in the South-East/ South. In the South-West we see a series of Spanish (yellowgreen)Footnote 38 and Italian (magenta)Footnote 39 composers. In the South-West/West, we find a concentration of French composers (darkblue).Footnote 40 In the West we encounter both German (yellowgreen) and Austrian composers (darkgreen).Footnote 41 American composers (red) firmly occupy the center and the Northeast side of the map. We also notice here composers from several nationalities, whose music is of a much more experimental and avant-garde style.Footnote 42 Their presence here tends to suggest that the influences of nationality/geography may have dissipated somewhat in the second part of the twentieth century.

In Fig. 8, color-coding permits to identify gender (fuchsia for Women). All in all, there are 12 women born since 1862 (out of 249 composers in the CMN).Footnote 43 To put things in perspective, there were just four women (out of 251 composers in the CMN) born before 1862.Footnote 44 For men, we also show ethnic background (yellow for African-US/African-European composers, green for Latin-American composers, and blue for the rest, i.e., essentially ‘white males’). Although there are two Asian composers (Toru Takemitsu, Japan and Dun Tan, China) in the database, they are lumped together with the rest (in blue) in Fig. 8.Footnote 45Footnote 46 Fig. 8 therefore shows how much ‘white-male’ dominant the twentieth century composers network (as portrayed by the CMN) remains. As mentioned in the Introduction, the short list of 500 composers included in the CMN was based on objective criteria with the aim of reflecting composers’ status (at the time it was put together), instead of being a mere advocacy instrument or reflecting subjective preferences. This said, any list of 500 (when lists of several thousand composers exist) will always be open to criticism that ‘some other’ composers should have been included into the list. First, there is a well-known discourse within music schools and gender and women studies departments that the narrative and canon formation of Western classical music has privileged white men of the ‘European’ tradition (e.g. Citron, 1990, 1993). In so far as the 500 composers were selected based on items and catalogues that reflect this narrative, then the CMN exposes this bias.Footnote 47 Second, the CMN does not cover more recent developments in classical music (Dun Tan, born in 1957, is the youngest composer in the CMN). The problem here is not the CMN per se (available online in 2000), but that 20 years later new and important composers have emerged. Finally, through the work of musicologists, once-forgotten composers are regularly re-evaluated or re-discovered.Footnote 48 The re-discovery process and the emerging of new composers may of course have an impact on catalogues, length of composer entries in dictionaries, etc., so that an eventual update of the long and short lists of composers as candidates to inclusion in the CMN might be required, at some point, to reflect changes in the relative status of composers.

Network graphs show how musical influences structure and reflect both the macrocosm of the Common Practice Period and its main components, and the late nineteenth and twentieth century musical styles—Impressionism, Expressionism, and Primitivism (and its development as Neo-Classicism)—and their interactions with composers’ nationality, gender, and ethnic background. Yet, the criss-crossing of styles used or developed by twentieth century composers makes it difficult to associate only one style per composer. We therefore propose to extend and enrich the network analysis of composers analysed here by computing similarity indices across composers taking account of composer-to-composer musical influences, style influences, and ecological characteristics (features) that best describe the 500 CMN composers. This permits to develop a multidimensional scaling analysis where composers located closer on the map are more ‘similar’.

Composer similarity indices and heat maps

Beyond establishing lists of musical influences, one objective of the CMN was linked to early efforts in music information retrieval (MIR). For instance, the CMN site explains that many introductions to the classical music world are in the business of inculcation through lists of ‘mandatory’ composers and compositions to explore. Yet, most people explore new subjects by starting with the familiar, and in the case of music, this may mean hearing a composer or a composition that one likes and then searching for more music of the same type. The site gives the following example:

“Suppose you hear the Ravel G major piano concerto on the radio, and take an immediate liking to it. Our database will help you extend this interest to other music by making it possible for you to quickly identify: additional works by Ravel, other piano concerti, other works for piano in general, other concerti in general, composers allied to the same general period and style (Impressionism) as Ravel, other French composers, composers and styles that influenced Ravel, and composers influenced by Ravel.”

Thus, the CMN anticipated the general idea of a recommender system that is now commonly and automatically implemented in Spotify, Apple Music, Pandora, YouTube and other music streaming platforms that have algorithms proposing works to listen to next. These algorithms and their improvement are largely indebted to the field of MIR, which develops innovative content, context, and user-based searching schemes, music recommendation systems, and novel interfaces to make the vast store of music available to all.Footnote 49

This section illustrates a method to compute similarity indices across composers. The general method has been explored and progressively refined in several papers such as Smith and Georges (2014, 2015) and Georges (2017). Here, we assemble all elements together, using two basic sets of information given in the CMN. First, we have used the list of composer-to-composer influences as explained in the earlier sections. Second, we extracted 42 general musical style influences (e.g., African music, Native American music, Spanish music, Indian music, folk music (by specific regions), popular music (by specific regions), world music, jazz, ragtime, blues, electronic, gamelan music, nature sounds, birdsong, etc.), also provided in the ‘musical influences’ list of the CMN ‘Composers’ page.Footnote 50 Finally, we extracted 298 ‘ecological’ categories from the ‘Index of Forms and Styles’ page of the CMN so that each of the 500 composers are associated with a subset of these ecological categories (i.e., characteristics such as time period, geographical location, school association, instrumentation emphases, etc.).Footnote 51 An 8-page list of all ecological categories is available in Smith and Georges (2015). Once this information was gathered for the 500 composers of the CMN database, bilateral similarity indices were constructed by means of pairwise comparison of presence-absence data, using similar concepts to those encountered in bibliographic coupling (e.g., Glänzel & Czerwon, 1996; Sen & Gan, 1983). In essence, we inferred similarities among composers by assuming that if two composers share many of the same musical (personal and style) influences and ecological features, their music will likely have some similarity. On the other hand, if two composers have very distinct sets of musical influences and ecological categories, then their music is likely to have little similarity.

Technically, for any pair of composers (i,j) for i,j \(\in\) C (among the n × n possible pairs of n = 500 composers in the set C of CMN composers), we have a set Ai of all attributes k (personal influences, style influences, and ecological categories) that apply to composer i, and a set Aj of all attributes k that apply to composer j. Given all attributes nk in the database we can therefore count the number a of attributes shared by i and j, the number b of attributes that characterize i but not j, the number c of attributes that characterize j but not i, and the number of attributes d that neither characterize i nor j (d = nk − a − b − c). Among several similarity indices, we have used the centralised cosine similarity measure given by the following formula:

$$CSC_{i,j} = {{(ad - bc)} \mathord{\left/ {\vphantom {{(ad - bc)} {\sqrt[{}]{(a + b)(c + d)(a + c)(b + d)}}}} \right. \kern-\nulldelimiterspace} {\sqrt[{}]{(a + b)(c + d)(a + c)(b + d)}}}$$
(1)

This measure is equivalent to a Pearson correlation coefficient, r, taken between two Boolean vectors of attributes describing a pair of composers (i,j), where each Boolean vector is a series (of length nk) of 1’s and 0’s depending on whether an attribute belongs or not to a composer. It can be shown that values of the centralised cosine measure range from − 1.0 to 1.0. A value of 1.0 indicates that two composers are identical. A value of − 1.0 indicates that two composers are complete opposite. A value of 0 shows that the two composers are independent (unassociated).Footnote 52

This methodology permits to compute a 500 × 500 matrix of similarity indices between pairs of composers (i,j). Let us call the similarity matrix Scomb where the subscript ‘comb’ refers to the fact that we combined all attributes k (musical influences and ecological categories) in the calculation. Besides computing Scomb, we can compute similarity matrices by restricting attributes to one specific category. For example, we can compute a 500 × 500 similarity matrix Secol, with similarity indices between pairs of composers based only on attributes that focus on the 298 ecological categories (as is done in Smith & Georges, 2015). Or we can compute a 500 × 500 similarity matrix Sinfl, based on personal (composer-to-composer) musical influences only (as is done in Smith & Georges, 2014). The Scomb-based indices are now included in the CMN composers webpage through lists of top-15 most similar composers to each of the 500 composers. Table 2 provides lists for 14 major composers together with the similarity score.Footnote 53,Footnote 54 The classical music website soclassiq (https://soclassiq.com/en/) has also recently included our index permitting to explore, for each composer, a list of the most similar composers.Footnote 55

Table 2 Most similar composers to the top 14 composersa (after the CMN site)

Figure 9 visualises Scomb with a heat map.Footnote 56 In this graph, composers have been ordered chronologically, from Hildegard (born in 1098) until Dun Tan (born in 1957). Along the diagonal, composers are compared to themselves, which gives a similarity score of 1, and is translated into a tiny black dot in Fig. 9. Moving off the diagonal implies comparing different composers. Dark blue reflects high similarity, while a light yellowish color suggests that the two composers are unassociated (independent) and any whiter shading implies negative values (opposition between composers). In general, as we move further away from the diagonal composers become more distinct. We can also visually detect ‘intra-period’ similarities and ‘inter-periods’ dissimilarities or independence. For example, Renaissance composers (in the upward left corner) tend to be relatively similar to each other, forming a small dark blue ‘square’ area, but Renaissance seems largely independent of other periods, leading to a long yellow ‘rectangle’ area in the rest of the rows (or columns). Note that the CMN database tends to have a larger number of composers associated with more recent periods. For example, as seen above, half (249) of the database is made of composers born since 1862. This feature is translated as larger, more diffused, blue ‘square’ areas (of intra-period similarity) as we move down the diagonal.

Fig. 9
figure 9

Heat map (500 composers). (Color figure online)

Multidimentional scaling (MDS) and classification maps

Multidimensional Scaling (MDS) is a technique that generates a map displaying the relative position of a number of objects based on a given set of pairwise distances between them. In this section we apply the MDS methodology to the 500 × 500 matrix of similarity/distances between composers, Scomb. In this context, the MDS algorithm seeks to position (i.e., assign coordinates to) the 500 composers in an N-dimensional space according to an optimisation procedure that preserves as well as possible the initially computed bilateral distances (di,j) between composers.Footnote 57. Choosing N = 2 optimizes composers location in a two-dimensional space (x,y), by generating the coordinates (xi, yi) and (xj, yj). The coordinates are computed so as to minimize a loss function called ‘stress’, which is the sum of squared errors of the actual distance between two composers di,j, and the predicted distance di,j * computed by the algorithm, and where the predicted distances depend on the number of dimensions kept and the exact algorithm that is used.Footnote 58 Stress values near zero are the best.Footnote 59

Figure 10 shows the MDS map computed using Python.Footnote 60 A general objective of this section is to gauge whether the MDS map places composers according to our general expectations. Note that the date of birth of composers is never used directly when computing the similarity indices between composers nor is it used through MDS to locate composers on the map.Footnote 61 However, in order to assess the placement of composers, we tagged composers within 10 periods (Medieval, Renaissance, Baroque, Pre-Classical, Classical, Post-Classical, Early Romantic, Middle Romantic, and late Romantic, and Modern), using their birthdate as a criterion of decision, and the color of the dots/composers in Fig. 10 represents this tagging. Interestingly we see that the MDS map unfolds the history of classical music, starting with Medieval and Renaissance composers in the East, and, as we move counter-clockwise, progressing towards Baroque, Classical, Romantic and eventually twentieth century composers.

Fig. 10
figure 10

Multidimensional scaling analysis (500 composers). (Color figure online)

Although the first visual check suggests that the MDS methodology places composers on the map according to general expectations (e.g. Baroque composers together, Classical composers together, etc..) we want to produce a more convenient visual check with a painted contour around composers belonging to the same period while producing a map that is esthetically more pleasing than Fig. 10. In order to realize this, we decided to use a support vector machines classification algorithm (SVM). Typically, a classification algorithm tries to determine the class to which the object of the analysis belongs to. In the case of music composers, we could have a trained data set of composers and their features (characteristics) from which the algorithm would extract classes.Footnote 62 Thereafter, using a test data set (additional composers not included in the trained data set) the algorithm would be used to predict the probability that an additional composer belongs to a specific class or group on the basis on his/her own features (i.e., his/her positioning on the two-dimensional graph). There are several classification algorithms, both using supervised learning (SVM, K-nearest neighbors) and unsupervised learning (K-means clustering). Weiss (2017) and Weiss et al. (2018) apply several of these methods using audio features to characterise and then classify composers.

In our context, SVM should be applied normally to a set of composers described by a series of features. Here, though, we apply SVM directly to the two dimensions of the MDS map, which characterises composers according to the two axes in Fig. 10. The two dimensions do not represent directly measured features of composers but instead coordinates ultimately derived from a distance matrix computed on the basis of musical (personal and style) influences and ecological features of composers. The reason why we chose this strategy is that our interest is not in predicting to which class an additional composer (not included in the trained data set) belongs to. Instead, SVM is used here as an algorithm that draws painted contours around composers of a period in which they have been initially tagged, providing a more convenient visual check of the placement of composers on the MDS map, while also producing a more ‘aesthetic’ map. Finally, note from Fig. 10 that we need to use non-linear SVM because it would be impossible to draw straight contour lines to separate each groups of composers. In other words, we need to bend the lines to separate the classes. For this specific problem we decided to use a non-linear kernel, the radial basis function or Gaussian kernel.Footnote 63

Figure 11 shows the results of applying non-linear SVM to the MDS map of Fig. 10 while forcing overfitting so that we have a perfect matching between the ten classes identified by the SVM algorithm (and represented with painted contours) and the ten sets of dots of a specific color, each representing a music period wherein composers have been pre-identified.Footnote 64,Footnote 65 Besides the overfitting (perfect matching), we also see, more clearly here than in Fig. 10, that the MDS methodology does a good job of positioning composers according to their periods. Relative to a chronologically arranged dictionary of composers, Fig. 11 provides a visual arrangement from which we can also infer the similarity of composers on the basis of their proximity on the map—providing some guidance on which composer to listen to next. If one likes Mozart, why not try to discover Cimarosa, Salieri, Grétry or J.C. Bach?

Fig. 11
figure 11

The macrocosm of the Common Practice Period—Support vector machines on MDS map. (Color figure online)

We now try to apply the method above to shed some light on the classical music of the twentieth century that is notoriously difficult to discover. We apply (and over-fit) the SVM algorithm assuming that each composer of the twentieth century might be tagged with a single main style, that is, mutually exclusive categories such as Impressionist, Expressionist, Neo-classical, etc.Footnote 66 These categories are given in the legend of Fig. 12 which also includes a category ‘Before’ representing all those composers from earlier periods, who belong without much ambiguity, to the Common Practice Period. Unlike results in Fig. 11, Fig. 12 shows that overfitting the data creates a very complex map with many ‘islands’ of seemingly isolated composers. If we believe that the MDS methodology is accurate in positioning pair of composers according to their bilateral distance, so that composers closer on the map are more similar (as Fig. 10 suggests), then a conclusion can be drawn—Categorizing classical composers of the twentieth century is a rather complex task and identifying composers by one unique style and generating the painted contour to which they belong (through an overfitting classification algorithm) provides little pedagogical guidance in terms of communicating music and trends of the twentieth century. Perhaps in this case, the strategy of data overfitting is the problem and should be revised.

Fig. 12
figure 12

The microcosms of twentieth century classical music—Support vector machines on MDS map. Note: Figure 12 is produced using SVM parameters that over-fit the musicological data. (Color figure online)

Figure 13 shows the results of using a non-linear SVM algorithm on a MDS map for our list of 249 composers born since 1862 (to include Debussy, a transitional figure from late romantic to the modern period), while reducing the overfitting of the data.Footnote 67 As shown here, with reduced over-fitting there is no perfect matching between painted contours and the sets of composers (dots) tagged with a same color/style. Still, one color/style tends to dominate within each contour, from which an appropriate classification might be inferred for the whole contour. For example, the green, mauve, yellow, dark grey, orange, and pink contours represent, predominantly, composers related to Late-Romantism, Impressionism, Neo-classicalism, Neo-Romanticism, Expressionism and Serialism, and Experimentalism and Avant-Garde (of their time), respectively. Hence, the map produces a visual framework that helps in communicating classical music trends of the twentieth century, in a move from left to right that is also suggestive of the extent to which composers partially or totally changed the five pillars (music rules) of the Common Practice Period.Footnote 68 Note that choosing a higher degree of overfitting for the SVM algorithm would lead to seemingly more homogeneous contours and the appearance of nationalist and minimalist contours, at the cost of a more complex, less pedagogical, map.

Fig. 13
figure 13

SVM on MDS map for twentieth century composers. (Color figure online)

To rationalize, in musical terms, non-homogenous painted contours, recall that many composers of the twentieth century did not have a unique style that can easily identify them, while their position assigned on the MDS map captures a much richer aspect of the complexity of a composer by considering the distance metric of Section “Composer similarity indices and heat maps”, based on many musical (personal and style) influences and ecological characteristics. A composer tagged (through a coloured dot) with a unique style could be similar (in several different ways) to composers generally associated with another style, and therefore be regrouped with them inside a painted contour. For example, the early style of Szymanowski or Casella (dark blue dots) is typical of the impressionist current, but both composers have been, later, strongly associated with neoclassicism, which ultimately explains their position in the yellow contour. Honegger is a Neo-romantic composer (light green dot) whose music remained strongly influenced by German romanticism, yet, he is located in the Neoclassical (yellow) contour most likely because of his inclusion to Les Six—a group of French neoclassical composers often seen as reacting against both the musical style of Wagner whose music eventually led to Late/Post Romantic current (green contour), and the impressionist music of Debussy (mauve contour). Ginastera is located in the orange contour because he extensively used serial techniques during what he referred to as his Neo-Expressionist period (in the last twenty years of his life). Yet, he was also a nationalistic composer (brown dot) and his works from the Nationalism period often integrate straightforward Argentine folk themes. It is at first surprising to see Schoenberg at the North-West edge of the neoclassical contour. His name is typically associated with Expressionism, atonality, and twelve-tone serialism so that one would expect to see him in the orange contour. Yet, as Schoenberg always claimed, he viewed his music as extending the traditionally opposed Romantic styles of Brahms and Wagner—in his words: “I do not attach so much importance to being a musical bogeyman as to being a natural continuer of properly-understood good old tradition!”Footnote 69 His position close to the border of the Late Romantic (green) contour most likely reflects this fact and shows, among other examples given above, that trying to capture similarities between composers according to a rich set of attributes is, in terms of music education and discovery, a method that complements and improves on a classification of composers based on rigid categories.

Besides visualizing twentieth century musical currents, these maps can be used to discover less well-known composers. Stravinsky is one of the most important and influential composers of the twentieth century. After his ‘Primitivism’ period (e.g., The Firebird and The Rite of Spring) that locates him close to Bartók’s through his own Primitivism period (e.g., the Miraculous Mandarin) on Fig. 13, he eventually turned to Neoclassicism with other well-known composers such as Prokofiev and Hindemith. But Fig. 13 can help discover, say, Alexandre Tansman, at the southern edge of the yellow contour, an important musical figure of Neoclassicism even if somewhat forgotten by the public at large. To back up his position relative to neighbors on the map, we suggest a quick read of Wikipedia’s description. While pursuing his musical career in Paris, Tansman was, according to Wikipedia, influenced by Stravinsky but also by Jacques Ibert and Albert Roussel (close on the map). Darius Milhaud tried to persuade him to join Les Six—the group of French neoclassical composers referred above. Of this group, Auric, Tailleferre, and Milhaud himself can be found in near vicinity on the map (while Poulenc, also of Les Six, is admittedly further away, located South of Prokofiev and close to Ravel). Still according to Wikipedia, Tansman was one of the most respected members of the international music group École de Paris, along with Bohuslav Martinů and Alexander Tcherepnin (also neighbors on the map).Footnote 70

Going over each twentieth century composer to rationalise his or her position on the MDS map goes beyond the objective of this article but musicologists could contribute by exploring further the results, leading to advances in ways one might capture the similarity/distance between composers in Section “Composer similarity indices and heat maps”. For example, the similarity matrix could be computed on the basis of the 298 ecological categories only, while eliminating the musical (personal and style) influences (in terms of Section “Composer similarity indices and heat maps”, using Secol instead of Scomb). This might improve the accuracy of the placement of composers on the MDS map, putting those who have very similar ecological niches even closer on the map. As pursued further in the conclusion, however, this might also reduce the ‘endogenous’ serendipity that the current map offers in terms of composer discovery, when exploring the music of a little-known composer in near vicinity of a better-known composer.

Conclusion

This article falls into the realm of an experiment in music education and exploration, not musicology. By proposing maps tracking composers’ interconnections based on composer-to-composer (i.e. personal) influences and composers’ similarities based on musical (personal and style) influences and ecological data, our ultimate objective is to encourage classical music exploration, suggesting to users ‘new-to-them’ composers they might listen to. In this perspective, we strongly believe in an approach that focuses on a limited number of composers since discovering and listening to important compositions of 500 composers is a lifelong process for most while a more ambitious target might quickly become overwhelming. This rationalizes our choice of visualizing the CMN whose stated purpose is less of a musicological reference than an educational resource. Of course, the results presented here must be understood under this context and limitations. But, in principle, it is possible to extend the analysis presented here to more recent and ambitious musicological database. Of note, musiXplora has rich teacher/student relationships and musical instruments classifications.Footnote 71 The Bavarian Online Encyclopaedia of Musicians mentioned before could also be used, although this would also shift emphasis from music education towards musicology and tools serving musicologists.

In an age offering either inculcation through lists, assembled by music educators, of prescribed composers and compositions, or music recommendation algorithms, built by software engineers and data analysts, that automatically propose works to listen to next, this article offers a path that might promote the active rather than passive discovery of composers and their music (as with automatic recommender systems) in a less restrictive way than inculcation through prescription. Yet, as explained below, the active composer discovery proposed here has a few disadvantages so that it is best positioned as a tool that complements inculcation and the use of a recommender within music streaming systems.

One disadvantage of our approach to music discovery is that a user who decides to listen to a composer nearby a better-known composer still faces the challenge of discovering his or her important compositions. In this case, we argue that the CMN website (among many others) remains a source of information by proposing a list of important compositions for most of the 500 composers in the database. The overall approach would therefore promote active discovering of composers nurtered through a prescribed list of compositions. Second, a weakness of the CMN is that it does not cover the more recent developments in classical music. Extending the CMN so as to include more recent composers is possible, although the process of collecting the musicological data should take advantage of new approaches in digital humanities, including the use of natural language processing (NLP) for music knowledge discovery (e.g. Oramas et al., 2018). With respect to new composers, however, inculcation remains an essential source to music education. For example, Rutherford-Johnson (2017) in his Music after the Fall, offers a retrospective of modern composition and culture since 1989. The book also compiles several lists of composers and their compositions to listen to.

Although we believe that a pre-informed search about who and what to listen to is desirable when using music streaming services, we must also acknowledge the new developments and strong potentials of music recommender systems, from simple (collaborative filtering) to more complex approaches such as audio models using convolutional recurrent neural networks. Collaborative filtering permits to make predictions about the interests of a user, and thus an eventual recommendation, by collecting listening profiles from many users and searching for common and distinct listening patterns among them. As new songs, by definition, are not in the music profile of any users, they cannot be easily recommended, however. More complex audio models typically start with loading segments of audio files, extracting timbral features from the audio signals (e.g., low-level features such as the time domain zero-crossing rate, the spectral centroid, spectral bandwidth, spectral rolloff, the mel-frequency cepstral coefficients, etc.). Once the structured audio features are extracted, a classifier is used, for example an artificial neural network (although in principle it could be a support vector machine algorithm), either for classification purpose (which music genre the different audio segments of the training set belong to), prediction (which genre a new audio file—not part of the initial training set, belongs to), and music recommendation systems (if a new composition is classified in a specific genre, it can be recommended to those who listen to this genre). An early application is Tzanetakis and Cook (2002).

Yet, audio models come with their own problems. As mentioned by Weiss et al. (2018), “[e]xtracting score-like information from audio—referred to as automatic music transcription—is a complex problem where state-of-the art systems do not show satisfactory performance in most scenarios.” In other words, the audio processing algorithms needed to extract meaningful audio features are often error-prone and do not reach a high level of specificity regarding human analytical concepts. For example, notes specified by a musical score are hard to extract from an audio file. These problems eventually have led to the adoption of convolutional neural networks (CNNs), traditionally used for image classifications, by a shift of emphasis away from structured audio features towards images (e.g., the spectrogram of a music audio) to improve the accuracy of music classification and recommendation systems (see Costa et al., 2017).Footnote 72 The spectrogram, being an ‘image’, its use with CNNs makes sense; but its horizontal axis represents the time dimension so that recurrent neural networks (RNNs)—typically used with sequential data, may be useful for temporal summation of the spectrogram features. This idea has led to an architecture combining CNN and RNN for music classification, processing the spectrogram in sequence from CNN to RNN (Choi et al., 2016) or in parallel (Feng et al., 2017). Besides these developments, semantic techniques in natural language processing (that search for the semantic meaning of words and sentences through word ‘embeddings’) are also adapted to music in a way that captures meaningful tonal and harmonic relationships (see Chuan et al., 2020).

We conclude with a well-known issue that the high accuracy (relevance) of some automatic music recommender systems tends to generate the same type of music so that people get bored quickly. The much-discussed concept of serendipity is the idea of a recommender system that can (pleasantly) surprise a listener. Measuring serendipity is not easy or straightforward. One cannot simply import it from a library (e.g., Python sklearn) unlike accuracy metrics such as, say, ‘precision’, ‘recall’, ‘discounted cumulative gain’ or ‘(mean) average precision’, etc. We argue however that the MDS maps presented in this article endogenously includes a degree of serendipity. First, the object of the study is the composer, not the composition (e.g., discovering the music world of Stravinsky is much more than knowing about his ‘Rite of Spring’). Second, composers that are closer on the map are more similar not only because they share the same ecological characteristics but also because they share the same musical (i.e. composer-to-composer, and style) influences. However, as argued in Georges (2017), composers who are similar in their composer-to-composer musical influences may have nevertheless produced music that might sound different in that they belong to different ecological niches (referred in that article as adaptation or music speciation and evolution). Listening to a composer that is in near vicinity to another better-known composer on the MDS maps may, in that sense, lead to new discoveries with sustained serendipity. Further research could possibly compare current MDS maps with maps based on a distance matrix that excludes composer-to-composer musical influences (i.e., excluding the information from the influence network), gaining relevance in terms of similarity accuracy at the possible cost of lower serendipity.