Sentiment analysis of popular-music references to automobiles, 1950s to 2010s

Wu, Chenyang; Le Vine, Scott; Bengel, Elizabeth; Czerwinski, Jason; Polak, John

doi:10.1007/s11116-021-10189-1

Sentiment analysis of popular-music references to automobiles, 1950s to 2010s

Open access
Published: 02 May 2021

Volume 49, pages 641–678, (2022)
Cite this article

Download PDF

You have full access to this open access article

Transportation Aims and scope Submit manuscript

Sentiment analysis of popular-music references to automobiles, 1950s to 2010s

Download PDF

Chenyang Wu^1,2,
Scott Le Vine ORCID: orcid.org/0000-0003-1455-5640^2,3,
Elizabeth Bengel³,
Jason Czerwinski³ &
…
John Polak²

4285 Accesses
1 Citation
4 Altmetric
Explore all metrics

Abstract

In recent years, there has been a scholarly debate regarding the decrease in automobile-related mobility indicators (car ownership, driving license holding, VMT, etc.). Broadly speaking, two theories have been put forward to explain this trend: (1) economic factors whose impacts are well-understood in principle, but whose occurrence among young adults as a demographic sub-group had been overlooked, and (2) less well-understood shifts in cultural mores, values and sentiment towards the automobile. This second theory is devilishly difficult to study, due primarily to limitations in standard data resources such as the National Household Travel Survey and international peer datasets. In this study we first compiled a database of lyrics to popular music songs from 1956 to 2015 (defined by inclusion in the annual “top 40”), and subsequently identified references to automobiles within this corpus. We then evaluated whether there is support for theory #2 above within popular music, by looking at changes from the 1950s to the 2010s. We demonstrate that the frequency of references to automobility tended for many years to increase over time, however there has more recently been a decline after the late 2000s (decade). In terms of the sentiment of popular music lyrics that reference automobiles, our results are mixed as to whether the references are becoming increasingly positive or negative (machine analysis suggests increasing negativity, while human analysis did not find a significant association), however a consistent observation is that sentiment of automobile references have over time become more positive relative to sentiment of song lyrics overall. We also show that sentiment towards automobile references differs systematically by genre, e.g. automobile references within ‘Rock’ lyrics are in general more negative than similar references to cars in other music genres). The data generated on this project have been archived and made available open access for use by future researchers; details are in the full paper.

The Evolution of Public Perceptions of Automated Vehicles in China: A Text Mining Approach Based Dynamic Topic Modeling

Analyzing Parking Sentiment and its Relationship to Parking Supply and the Built Environment Using Online Reviews

Article 05 March 2021

The italian music superdiversity

Article 17 September 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

There are two sharply distinct theories to explain the reversal of the long-term growth trends (a.k.a. ‘Peak Car’) in young people’s car-related mobility indicators (driving license holding, car ownership, driving mileage, etc.) observed in many high-income countries beginning in the 1990s/2000s (Blumenberg et al. 2016; Ciari and Axhausen 2015; Kuhnimhof et al. 2012). One school of thought focuses on changing economic circumstances and external constraints on young people’s mobility. This incorporates candidate explanators such as GDP per capita (Bastian et al. 2016), declining workforce participation and income levels (Bayart et al. 2020; Blumenberg et al. 2016; Delbosc and Currie 2014a), increasing costs of owning and operating a car (Bastian et al. 2016; Chatterjee et al. 2018; Klein and Smart 2017), and the advent of mechanisms that have made acquiring a driving license more onerous, time-consuming, and/or expensive (Thigpen and Handy 2018).

Such exogenous explanators by themselves have not been convincing to the research community as fully and completely accounting for young adults’ declining automobility indicators. Other factors such as possible shifts in cultural mores, values and sentiment among young adults towards the automobile have also been raised. McDonald, for instance, writes that “…there is not agreement among researchers…the second set of work [i.e. theory] acknowledges the importance of economic factors but argues that they do not fully explain observed declines…Identifying factors contributing to declines has been difficult, but [includes]…changing attitudes to travel generally and cars in particular. Some posit that the car is no longer a status symbol having been replaced by smartphones. Evaluating this is difficult” (McDonald 2017, pp. 3–5).

The difficulty in discriminating between these two competing schools of thought is at core due to data limitations: the data resources traditionally employed to both observe and model mobility trends (national and regional-scale household travel surveys) provide little or no information regarding respondents’ attitudes towards the car, and thus whether or not such attitudes may have shifted over time. Such datasets are relatively strong in demonstrating that young adults’ mobility indicators have shifted in unexpected ways since the 1990s/2000s [e.g. in the UK driving-license holding by people under age 30 peaked in the early 1990s, see Le Vine and Polak (2014)], however relatively weak in providing uncontested explanations for such trends. Although researchers have employed survey methods to probe attitude towards cars (Brown and Handy 2015; Thigpen and Handy 2018), the surveys are cross-sectional with limitations in capturing changing attitudes over time.

The motivation for the present research is thus to advance the state-of-knowledge regarding the hypothesis that attitudes towards the car have changed over time. Due to the data limitations noted in the previous paragraph, we devised a novel data-compilation strategy using lyrics from popular music covering a 60-year period from the 1950s through the 2010s. While such data cannot yield unambiguous conclusions about the possibility of social attitudes having shifted away from the car, this research strategy employs the corpus of popular music lyrics as an attempt to proxy for young adults’ social attitudes towards cars during this time period; we note that popular music is consumed most heavily by younger adults (Kalia 2015). Using the song lyrics, we then compiled a structured database of bars within the lyrics that reference automobility, and analyzed this database of ‘tokens’ using both sentiment analysis (a.k.a. “Natural Language Processing”) techniques and manual classification by the research team to quantitatively evaluate the research question.The datasets created for this paper have been archived at https://doi.org/10.17605/OSF.IO/UM5XB, and are available open access for future research use.

The remainder of this paper is organized as follows: “Literature review” section reviews the relevant literature, and Sect. “Data” section describes our data-compilation protocol. “Results” section then presents the results, and “Conclusions” section summarizes and concludes the paper.

Literature review

This section first reviews the observations of declining automobile orientation among young adults and factors (both socio-economical factors and attitudinal factors) that appear relevant to the effect. We next review sentiment analysis approaches and their application in the context of transport, as well as the link between popular music and broader culture evolution.

Young adults’ declining automobile orientation

Many researchers have investigated reasons for this effect over the past two decades, with a marked peak in research productivity on this line of enquiry in the period 2011–2015.

Table 1 summarizes factors that have been investigated and reported in the literature. Overall, socio-demographic characteristics and the built environment of their residence are the most extensively investigated factors. It appears to be broadly agreed that delayed transitions to adulthood (i.e. longer years of getting education and living with parents, delayed marriage), re-urbanization (i.e. living in dense urban areas with good public transport accessibility), and increased financial pressure on young people each play roles in the overall effect.

Table 1 Summary of literature in peak car

Full size table

The wide adoption of Information and Communication Technology (ICT) was expected by some to impact young people’s attitude towards cars. It was expected that the use of smartphones, social media and other electronic devices may decrease young people’s car ownership and car use, but results in the literature are mixed: Thigpen and Handy (2018) found that ICT is linked with delays in young people’s driving license acquisition, whereas Brown and Handy (2015) and Le Vine and Polak (2014) found the opposite. The authors are unaware of contributions to this research question published after the onset of the COVID-19 pandemic, during which ICT use has undergone a step change.

Graduated driving license (GDL) schemes have been implemented in many developed countries, such as Australia, Canada, New Zealand and US. Although implemented differently in different parts of the world, GDL usually comprises of three stages: learner permit, provisional license and full license. To pass through the stages, driving license applicants need to pass a series of tests, and fulfill requirements such as minimum age, minimum driving practicing hours, and/or minimum period of holding a learner permit/provisional license. Whether GDL helps explain reduced automobile orientation has been investigated on a number of studies, but the result again is mixed. Raimond and Milthorpe (2010) and Tefft et al. (2014) did not find evidence that GDL explains this effect, whereas Thigpen and Handy (2018) report that it does. It is worth noting that reduced automobile orientation among young adults is not homogenous across all social groups; for instance Williams (2011) reports that young people in the US that self-identify as not white are more likely to delay driving license acquisition than their white peers.

Symbolic/affective motives for car use

Compared with other travel modes such as public transport, the private car appears to have greater psychological value attached (Jensen 1999). Beyond the car’s instrumental function (speed, flexibility, convenience, privacy, cargo-carrying, etc.), it has been argued that car users may also be motived by the symbolic (prestige, success, etc.) and affective (enjoyment of driving, feeling of control, independence, etc.) factors of cars (Steg 2005). Van and Fujii (2011) and Van et al. (2014) add an additional non-instrumental factor that they term social orderliness of travel modes, which captures environmental friendliness, safety, altruism, and quietness. Van and colleagues compared six Asian countries and report that in countries where intentions for car use are low, the non-instrumental factors are significant predictors of car commuting.

The symbolic/affective function of cars has been found to be associated with car ownership and driving license acquisition. For car ownership, Belgiawan et al. (2016) found that symbolic/affective attributes including ‘independence’ and ‘arrogant prestige’ have significant impact on Indonesian undergraduate students’ car purchasing behavior: those who think owning a car suggests independence are more likely to purchase a car, whereas those who think cars demonstrate arrogance are less likely to own one. Zhu et al. (2012) report similar results in their study among Chinese undergraduate students, in which the psychosocial valuations of cars dominate the aspiration for Chinese students’ car ownership in contrast to the instrumental valuations. For driving license acquisition, the literature contains studies that argue that attitudinal factors such as the intention of ‘being independent’ and ‘feeling driving to school is cool’ significantly stimulate young people to acquire a driving license (Fylan and Caveney 2018; Thigpen and Handy 2018).

It has been argued that the symbolic/affective function of cars may be less prominent in the youth of the Global North than in the lesser-developed Global South (Lyons 2015; McDonald 2015). A consistent pattern has also been reported within China, where the level of development in different regions is large. Zhu et al. (2012) found that students studying at a university in Zhenjiang (a third-tier city in China, less developed) value the non-instrumental values of cars much higher than their peers studying at Shanghai (a first-tier city, one of the most developed Chinese cities).

Given these reports of the possible impacts of attitudinal factors on car use, many of which have been collected in bespoke surveys at relatively small scale, documenting whether there has been a broad shift in attitudes towards cars is an important research question.

Sentiment analysis

Sentiment analysis (SA), also termed opinion mining or emotion AI, aims at systematically identifying people’s opinions, attitudes and emotions towards an entity (Medhat et al. 2014; Pang and Lee 2008). These techniques are widely used in areas where the opinion of the customers/audiences is important. For example, SA has been used to address issues such as predicting election results (Choy et al. 2012; Ramteke et al. 2016), understanding customers’ view of products (Santhosh Kumar et al. 2017; Sari et al. 2018), and supporting investment decisions (Ren et al. 2019; Wu et al. 2014).

There are three main levels of classification in SA: document-level, sentence-level and aspect level (Liu 2012; Medhat et al. 2014). Document-level SA aims to identify whether an entire document presents a positive or negative sentiment. Sentence-level SA is more specific than the document level, but there is no clear and unambiguous threshold between document-level and sentence-level, as a sentence can be regarded as short document. Aspect-level SA is the most specific, which enables classification of the sentiment of aspects of phrases relative to a specific item or concept (termed the ‘entity’). For example, in the phrase “the voice quality of this phone is not good, but the battery life is long”, “This phone” is the entity, and “voice quality” and “battery life” are two aspects of the entity “this phone”.

SA techniques are subdivided into machine learning (ML) and lexicon-based approaches. The former relies on various ML techniques which include Naïve Bayes Classifier, Supportive Vector Machines Classifiers, Neural Networks, etc. The latter relies on a sentiment lexicon, a collection of known and precompiled sentiment terms. Many sentiment analysis algorithms have been proposed, and readers are referred to detailed reviews of sentiment analysis studies and algorithms (Mäntylä et al. 2018; Medhat et al. 2014; Yadav and Vishwakarma 2020).

Table 2 contains a summary of studies that have employed SA in the context of transport studies.

Table 2 Literature review of sentiment analysis in transport

Full size table

Overall, the number of studies that employed SA in traveler attitude analysis has tended to increase. The objectives of these studies are frequently to understand public opinion on a specific mobility service, especially public transport and shared mobility services. Two of the studies analyzed the relationship between sentiments and car sales: Wijnhoven and Plant (2017) test the predictive power on car sales of the ratio of positive to negative tweets, the total number of mentions, the percentage of negative comments, and Google trends. Wijnhoven and Plant report that social media sentiments have relatively very weak salience to improve predictions of car sales. However, Pai and Liu (2018) conversely find that sentiment analysis of social media postings can improve the accuracy of regression models predicting monthly total vehicle sales in the US.

The authors are unaware of published literature that establishes how people’s attitudes towards private vehicles have evolved over the multi-decade period of interest (the latter part of the twentieth century and early part of the twenty-first century). Table 2 shows that the majority of studies have employed contemporary social media postings as the data source for SA, which do not provide information prior to the onset of social media in the 2010s. Hence, other data sources that provide an artefact of attitudes towards cars over a longer timescale are desirable.

Cultural evolution in popular music

In general, popular music is more attractive to young people than other demographic groups (Kalia 2015). It has been argued that popular music helps youth to define their personal identity, serving to shape their behavior (Bogt et al. 2013) and partially reflecting matters that interest, worry, and concern its listeners (Christenson et al. 2019).

Similar to news media, the change of sentiment in popular music can potentially help document social changes. With respect to news media, Beckers et al. (2017) examined changing expectations for consumer price inflation in published news articles, and Cook et al. (2020) studied how references to drinking alcohol during pregnancy within newspapers have evolved over time. We are unaware of earlier literature discussing the association between popular music and young people’s attitude towards cars, but other aspects of cultural changes captured in popular music have been examined. For example, both Madanikia and Bartholomew (2014) and Christenson et al. (2019) find a significant increase (from the 1960s/70 s to the 2010s) in the proportion of songs with themes focusing on sex-related aspects of relationships, which likely reflects a cultural shift toward acceptance of sexuality outside of love relationships. Christenson et al. (2012) found an increase of songs referring to substance use, and in recent decades the use of alcohol and drugs were much likely to be portrayed positively.

In terms of the sentiment of music, popular music lyrics have in general tended to shift towards increasingly negative tone, from the 1950/60 s to the 2000s/10 s (Christenson et al. 2019; DeWall et al. 2011; Napier and Shamir 2018). Also, Pettijohn and Sacco found the sentiment of music to be linked with economic conditions. For music of the “Pop” genre, it is sadder, slower, and more comforting when the economy is experiencing hardship (Pettijohn and Sacco 2009). However, also during difficult economic periods, Country music has more positive lyrics than Pop, as well as being more musically upbeat and exhibiting the use of more happy-sounding major chords (Eastman and Pettijohn 2014).

This study addresses the gap by compiling a dataset that contains popular music lyrics over 60 years, with the objective of identifying whether there are changing patterns of references toward automobiles. We use both sentiment analysis algorithms and human analysts to identify the sentiment of songs and automobile references, and employ both descriptive analysis and regression to identify the association between sentiment towards cars and decades.

Data

Songs

The universe of popular music songs that we included in this analysis are the top 40 songs of each year in the US, as documented in the Billboard Year-End Hot 100 Singles (Billboard 2016), for the 60-year period 1956–2015.^{Footnote 1} Lyrics for the songs in our sample were sourced from www.genius.com, and song genre information was sourced from www.iTunes.com. The datasets created for this paper have been archived at (Le Vine and Wu 2021), and are available open access for future research use.

The distribution of songs by genre is presented in Fig. 1. The combined “Other” category of genres contains the following genres: Country, Easy listening, Electronic, Dance, Disco, Instrumental, Jazz, and Reggae.

It can be seen that Pop^{Footnote 2} is generally the most common genre across time. R&B/Soul also tended to be a consistently common genre over time, but from the 1990s its share has decreased. The trend of the prevalence of Rock genre songs is similar to R&B/Soul. The number of Rock songs in the top 40 peaked around year 1980 and then decreased afterwards. On the other hand, Hip-Hop/Rap first appears in the early 1990s and has since increased rapidly to become the second most prevalent, though there has been a decrease since the late 2000s decade. This shift towards Hip-Hop/Rap has also been observed by others, e.g. Ryan (2018) and Guan (2017).

It has been reported that successful popular music songs in recent years have become more likely to be performed by female artists (Kaplan 2018). We found that this trend is significant for Pop music (p < 0.01) and for all genres combined (p < 0.01), but not for other genres besides Pop (p = 0.14).

Automobile reference tokens

Upon compilation of the database of lyrics from the 2400 songs (60 years * 40 songs/year), we developed a set of uniform guidance for identifying tokens (and where to begin/end a token) within the lyrics that reference automobility (see “Appendix 1”). Two members of the study team then independently read the full set of lyrics to manually identify the tokens, yielding a token-identification match rate of 91%. Following a reconciliation process, the final database contains 535 tokens.

The distribution of tokens by genre is presented in Fig. 2. It shows that, although Pop is the most common genre in most of the years (see Fig. 1), the frequency of automotive references in Pop music is relatively low, especially in the years around 1975 and 2000. Hip-Hop/Rap, on the other hand, shows a very high number of automotive references. Rock also has a high number of automobile references before the 1990s but has since decreased sharply. The frequency of automobile reference in different genres is discussed in more detail in the “Frequency of automobile references” section.

We next classified the tokens by various criteria, with the motivation to analyze how the types of references to cars have evolved over time. The eight criteria are:

Cars (general)
Car brands
Car parts (see listing of observed car-part reference in “Appendix 1”)
Car passenger travel
Driving
Stationary cars (as opposed to driving)
Taxi/hitching a ride
Traffic conditions

The eight criteria are not mutually exclusive. For example, “We go to drive-in movies in a limousine/He takes me deep-sea fishing in a submarine” was classified to belong to both “Cars (general)” and “Driving”. However, “Cars (general)” does not necessary include all other criteria. For example, “Windshield wipers slapping time/I was holding Bobby's hand in mine” was specified to belong to “Car parts” but not “Cars (general)”.

Results

In this section, we first analyze general time trends in popular music (“Popular music trends, from 1950 to 2010s” section), followed by trends relating specifically to automobile references (“Frequency of automobile references” section) and then the sentiment towards cars (“Sentiment towards cars over time, Regression analysis” sections).

The change of popular music and the reference to automobile is analyzed by descriptive analysis, whereas the sentiment towards cars is also investigated by bivariate correlation and linear regression.

Popular music trends, from 1950 to 2010s

The change of average word count of the lyrics is presented in Fig. 3. We drop the category ‘Other’ from this point forward (as the number of songs belonging to this category is very small) and present only the four major genres.

The average word count has tended to increase over time, from under 200 words/song in the 1950s to 400–500 words/song in the 2000s/2010s. The average word count is much higher for Hip-Hop/Rap songs, followed by Pop and R&B, and lowest for Rock music. For Pop, R&B, and Hip-Hop/Rap, the average word count in general increases from the mid-1990s until year 2008 and drops afterwards. The time trend for Rock is different: word count has been more stable over time than for other genres. We note that the overall decreasing trend in words/song coincides with events that occurred in the late 2000s decade including the Global Financial Crisis, the rise of social media, and sustained increases in the price of gasoline; further investigation will be needed to establish the possibility of causality for any of these concurrent phenomena.

Figure 4 depicts the average duration of songs; this statistic peaked around 1990 and has subsequently dropped. Despite the higher average words/song of Hip-Hop/Rap, the duration of this genre is comparable to other genres. Overall, duration varies only weakly from genre to genre. Thus Hip-Hop/Rap songs tend to be characterized by much higher words/minute than other genres (i.e. 147 words/min in 2011–2015, compared to 107 words/min for all other genres combined during this period).

Frequency of automobile references

Figure 5 shows that the number of automobile references per song peaks at around year 2005 and drops afterwards. The change of automobile references over time is similar to the time-trend of word count (see Fig. 3): The number of automobile references per song was low and stable until the 1990s, then tended to increase until the late 2000s decade, and has since experienced a decreasing trend. Even with this post-2008 decreasing trend, the frequency of automobile references in the 2010s is high compared to the 1950s–1990s period. Again, our dataset does not indicate the reason(s) behind these patterns, thus we must leave them as items for future investigation.

It is noteworthy that this ‘peak’ in automobile references in popular music coincides very roughly with the ‘peak’ in car orientation among young US adults, possibly lagging it by several years. On the latter of these points, Kuhnimhof et al. (2012) report that car mileage per US adult age 20–29 decreased approximately 20% between the 2001 and 2008 waves of the National Household Travel Survey (NHTS). NHTS data were not collected for any years between 2001 and 2008, thus the time-trend in this statistic within this period is not knowable. However, Kuhnimhof et al. (2012) document that across five other high-income countries that have historical national travel survey datasets collected at differing frequencies and in different years, the ‘peak’ in this statistic also appears to have occurred “around the turn of the millennium” (p. 772). In terms of license-holding, Delbosc (2017) finds that youth licensing in the US declined from a ‘peak’ in the late 1990s (i.e. near the turn of the millennium, but clearly prior to the 2008 ‘peak’ in car references in popular music), mainly due to subsequent decreases in license-holding by teens, with the license-holding rates of young adults in their mid-20 s remaining more stable.

The increase in car references beginning around 1990 coincides with the increasing prevalence of Hip-Hop/Rap in the top 40, and automobiles are referenced at a much higher frequency in Hip-Hop/Rap songs than other genres, especially in the years leading up to the turn of the millennium (see Fig. 5). In the post-2000 period, car references in Hip-Hop/Rap have decreased sharply, however remain much higher than other genres. Hence, the popularity of Hip-Hop/Rap music is a partial explanation for the higher frequency of automobile references in more recent decades (if the genre mix in 2015 were the same as the year 1990, the number of automobile references would have been only 0.07/song, compared to the actual observation of 0.19/song).

To disentangle between the trend of word count shifting over time simultaneously with the changing frequency of car references, we examined the average number of automobile references per 100 words (see Fig. 6).

We find that the trend of curves in Fig. 5 (car references per song) and Fig. 6 (car references per 100 words) are in general similar; it can therefore be concluded that the changing words/song is not a satisfactory explanation for the change over time in the number of automobile references.

Figure 7 shows the change over time in the proportion of car references meeting each of the eight criteria listed in “Automobile reference tokens” section. We group the eight criteria into four groups:

Fig. 7 panel (a) shows the change of tokens associated with cars (general) and driving. In general, tokens associated with these two criteria are consistently high in all these years, and there are no major time trends.
Fig. 7 panel (b) shows the change of tokens associated with car parts and car brand. There is an increasing trend over time in the frequency of tokens associated with these two criteria.
Fig. 7 panel (c) shows the change of tokens associated with traffic conditions and stationary cars. They are mentioned at a lower frequency compared to Fig. 7 panels (a) and (b), and there are no major time trends for these two criteria.
Fig. 7 panel (d) shows the change of tokens associated with taxi/hitching a ride and car passenger travel. They are mentioned at the lowest frequency among all groups, and no clear trend over time is observed.

We then investigate the relationships between genre and each of these eight criteria; Fig. 8 contains their cross-tabulation.

We can see that Cars (general) and Driving, which are the most frequently mentioned criteria, are mentioned at a similar frequency across the four music genres. In contrast, Car parts and Car brands are mentioned more frequently in Hip-Hop/Rap and R&B/Soul music than in other genres. The number of tokens matching the other four criteria (i.e. at the right hand side of Fig. 8) is very low, hence meaningful comparisons cannot be drawn.

Sentiment towards cars over time

To perform the Sentiment Analysis, we employ two open-source algorithms: IBM “Alchemy Language” (which at the time of writing has been integrated into the Watson line of products)^{Footnote 3} and IBM Watson “Tone Analyzer.^{Footnote 4}

Both algorithms use Machine Learning approaches to identify sentiments, and have been widely applied to text sources including customer reviews (Gao et al. 2015; Shah et al. 2020) and social media posts (Cao et al. 2018; Jussila and Madhala 2019).

Their application in Sentiment Analysis of popular music lyrics is relatively rare. We are aware of two examples: Al Marouf et al. (2019) investigated the use of IBM Watson Tone Analyzer to analyze language and emotional tones in lyrics; and Napier and Shamir (2018) analyzed 6150 Billboard 100 songs from 1951 to 2016, reporting that popular music is tending over time to exhibit increasingly negative sentiment.

The ‘Alchemy Language’ algorithm yields output of ‘positivity/negativity’ of the textual input’s sentiment on a continuous scale of − 1.0 (strongly negative sentiment) to + 1.0 (strongly positive sentiment).

The ‘Tone Analyzer’ algorithm’s output includes scores of ‘emotion’ on a continuous scale of 0.0–1.0, for five emotions: Anger, Disgust, Fear, Joy, and Sadness. For the purposes of this research, we employ only the ‘Joy’ emotion score, and “Joy Watson” is used from here onwards to refer to this algorithm. Joy is defined for use in the algorithm as: Joy or happiness has shades of enjoyment, satisfaction, and pleasure. There is a sense of well-being, inner peace, love, safety, and contentment (Mahmud 2016).

In addition to the objective outputs provided by the two algorithms, two members of the research team also independently manually classified each ‘token’ (reference to automobility) on a binary scale (− 1 for negative, + 1 for positive). Table 3 shows the correlation matrix between the scores from the two algorithms and the two members of the study team. It can be seen that the correlations are much stronger between the outputs of the two algorithms (0.44) and between the outputs of the two human analysts (0.60) than between the algorithms and human analysts (all between 0.09 and 0.16; all are statistically significant at p < 0.05).

Table 3 Correlation matrix comparing coding outputs for car token sentiments

Full size table

We first document the change in sentiment towards cars over time, as shown in Fig. 9.^{Footnote 5} To determine whether this trend is independent of the concurrent trend in overall sentiment of all-lyrics (i.e. a background trend), the latter is also presented in Fig. 9.

Unlike the frequency of automobile references, there is not a clear trend break in sentiment of them (or of all lyrics) in the post-2000 time period.

It can also be seen that the results from the algorithms and human analysts are quite different. The algorithms show a decreasing trend in the sentiment of both automobile references and all popular music lyrics over time. However, the human analysts’ evaluations do not show a clear time trend for all-lyrics, and show an increasing trend in sentiment of references to automobiles (these time trends are confirmed in the correlation analysis presented below in Table 4).

Table 4 Correlation between sentiment and year (bold indicates significant at p < 0.05)

Full size table

A possible reason for the humans-algorithms differences is that the two algorithms are not trained specifically by music lyrics. The algorithm designers do not disclose the types of datasets used to train the two algorithms, however it is known that applications of the two algorithms have included social media postings and hotel reviews (Cao et al. 2018; Gao et al. 2015; IBM 2019). Of the two studies of which we are aware that employ the Watson Tone Analyzer algorithm on music lyrics (Al Marouf et al. 2019; Napier and Shamir 2018), both used only the algorithm’s determinations, without the inclusion of a comparison against human analysts’ judgments.

Comparing song lyrics to hotel reviews and social media, the syntax and content is quite different, for various reasons (choice of words constrained by need to rhyme, lack of sentence structure, use of double-entendres, audio cues such as voice tone that carry meaning but have no analogue in written text, etc.). Such differences may explain part or all of the divergence between the sentiment assigned by the human analysts and by the algorithms.^{Footnote 6}

A bivariate correlation (presented in Table 4) was undertaken to test whether the correlation between sentiment and year is significant. For sentiment scores obtained from the two algorithms, there is negative and significant association between sentiment and year, for both automobile references and all-lyrics, but the former is less negative. For the two human analysts, the association between automobile-reference sentiment and year is positive and significant, there is no significant association between all-lyrics sentiment and year, and the correlation between automobile references and year are also more positive than the same for all-lyrics.

Thus, while the humans’ analyses and sentiment algorithms’ analyses differ in the absolute correlation with time, they concur that automobile references have become more positive relative to all-lyrics.

Figure 10 presents analysis of the sentiment of automobile references, by both the algorithms and human analysts. The two human analysts found references to Car Brands and Car Parts to be the most positive, and references to Taxis/Hitching a ride and Traffic conditions to be the most negative. The two algorithms, by contrast, show little variation in positivity/negativity of sentiment with respect to the eight criteria.

Regression analysis

The cross-tabulation results presented in “Popular music trends, from 1950 to 2010s”–“Sentiment towards cars over time” sections demonstrate that there is systematic variation in references to automobiles, with respect to year, genre, automobile reference criteria, etc. In this section, we present results of regression analysis to estimate the strength/sign of the associations, and establish which are all else equal when considered simultaneously.

The independent variables included in the specification are:

year each song was published,
the gender of the artist (1 = female, or the percentage of members that are female in the case of multi-member group artists),
the classification of the token (as presented in “Automobile reference tokens” section), and
the genre of the songs as independent variables.

The sentiment scores of the tokens and songs that are obtained from the two algorithms (Alchemy and Joy Watson) and the two human analysts serve as dependent variables.

We first present, inTable 5,^{Footnote 7} the regression results of sentiment of tokens vs independent variables. Overall, all four models are statistically significant, but the goodness-of-fit of the two algorithm models are particularly low. Similar to results presented in Table 3, results from the two algorithms show similar patterns, as do the results from the two human analysts. However, results from algorithms and human analysts differ more sharply.

Table 5 Regression analysis for sentiment of automobile references

Full size table

As shown in Table 5, even when the influence of the independent variables is taken into account (genre, gender of artists, etc.), the negative association between sentiment of tokens and year remains significant for the two algorithm models. For the two human analysts’ models, the association becomes insignificant.

We hypothesized that artist gender may be associated with sentiment of automobile references, as discussed in “Songs” section. Results on this point are mixed: the all else equal effect of a song being performed by a female artist(s) was negative and significant in one human-analyst model, and not significant in the other three models.

For both algorithm and human analysts’ models, the genre Rock is significantly and negatively associated with sentiment of automobile references. The effect of Hip-Hop/Rap genre is found to be negatively associated with the sentiment for the “Joy Watson” model (p = 0.04). However, the Human Analyst #2 model finds a significant and positive association between the sentiment of automobile references and Hip-Hop/Rap genre (p < 0.01).

In terms of the eight criteria describes in “Automobile reference tokens” section, the majority of effects are insignificant. Two noteworthy findings are:

1.
Both algorithms find an all else equal negative link between the Taxi/Hitching a ride category and sentiment, and
2.
Both human analysts find positive all-else-equal relationships between sentiment and Car Brand (p < 0.01 for both), and negative association between sentiment and Traffic (p < 0.01 for both). A positive but weaker relationships is found between sentiment and Driving (p = 0.07; p = 0.13)

In summary, the clearest and most consistent finding from the regression analysis, which holds across both humans and both algorithms, is the negative all-else-equal effect of a song belonging to the Rock genre. Interestingly, in regression analysis with sentiment of all-lyrics as the dependent variables (and otherwise analogous to the regression analysis presented in this section, see “Appendix 2”), we also found across all humans and algorithms that Rock genre is negatively associated with all-song sentiment.

Finally, beyond this consistent observation with respect to Rock genre, we also found several other relationships that held across either humans or algorithms, but not across both of them.

Conclusions

In this study we first developed a novel database of references to automobility in popular music in the period 1956 to 2015, and subsequently interrogated this database to determine whether there have been systematic shifts in frequency of references and/or sentiment to automobiles over time. The lyrics of popular music songs is an ideal corpus for this analysis as it is continuously available over many decades, freely available to researchers, and is a historical artefact of data on attitudes that could not readily be compiled in the present day by survey methods. Several background trends in popular music (time trends in word count per song, song duration, all-lyrics sentiment, etc.) are observable; we undertook efforts to disentangle between these background trends and effects related specifically to automobile references within the lyrics.

On the motivating research question—whether there is empirical support for the “changing attitudes towards cars” hypothesis to explain the decline in young adults’ car-borne mobility—our conclusions are mixed; they diverge in terms of frequency-of-car-references and their sentiment. Specifically, our main findings are:

1.
A general upwards trend over time in the frequency of references to cars until the late 2000s, and a downward trend since (but remaining historically high). This inflection point coincides very roughly with findings by others that car mileage (Kuhnimhof et al. 2012) and driving license-acquisition (Delbosc 2017) in high-income countries began to decline “around the turn of the millennium” (Kuhnimhof et al. 2012, p. 772).
2.
Mixed results as to whether sentiment of these references has become more positive or more negative over time. Human-classification suggests increasingly positive sentiment of references to automobiles, however the sentiment analysis algorithms indicate the opposite. Unlike point #1 above, we did not find a clear trend break in sentiment to automobiles (within popular music lyrics) in the post-2000 period.
3.
Although the trends found by humans and algorithms are different, a consistent observation is that sentiment of automobile references have over time become more positive relative to sentiment of song lyrics overall.

We also report a minor finding: for both automobile references and all lyrics, the genre Rock is negatively associated with sentiment (across both human and algorithm analysis).

The datasets created for this paper have been archived at (Le Vine and Wu 2021), and are available open access for future research use.

We now conclude with a brief discussion of future research needs to advance this line of enquiry. First, the divergence of results between the sentiment analysis algorithms and human analysts needs more investigation; it may relate in part to the types of datasets used to train the algorithms, which are likely to be quite different from the non-traditional syntax and content that characterizes song lyrics. Second, other historical artefacts of late-20th/early-twenty-first century culture (e.g. newspaper/magazine archives, movie/television scripts, etc.) would be very useful, to enrich the findings we present and identify the extent to which they support the results from popular music lyrics. A promising direction would be to examine corpuses of text targeted at different demographic segments (as people belonging to different demographic group prefers different media), in recognition that the ‘Peak Car’ effects vary across demographic groups. An important direction for future research would be to establish whether the findings we present could be applied to influence attitudes as form of transport policy intervention.

Third, the fact that several indicators within popular-music lyrics appear to have trend breaks around late 2000s suggests that researchers should focus attention on this period. Fourth, international comparison across different societies, beyond the US, would also be potentially powerful, including both highly motorized societies (e.g. Germany, Japan) and those in the earlier stages of motorization (China, India, Brazil, etc.)

In closing, it is hoped that this line of enquiry will help the research community to distinguish between the ‘economic’ and ‘attitudinal’ theories to explain the decline in young adults’ automobility.

Notes

Data are presented as 5-year moving averages (i.e. the number of R&B songs in year 1960 in Fig. 1 is the average number of all R&B music from 1956 to 1960) for ease of distinguishing trends from year-on-year random variation.
Throughout this paper, we use the term “Popular” music to refer to music in the Top 40, and the term “Pop” to refer to one specific genre of music.
https://www.ibm.com/watson/services/alchemy-language-migration/
https://www.ibm.com/cloud/watson-tone-analyzer
In the interest of time efficiency, we sampled 10 songs from each year (a 25% sample) for purposes of analyzing whole-song sentiment, rather than perform this analysis for all 2400 songs.
Examples of a song whose lyrics were rated differently by the human analysts and the algorithms include:
1. 1.
  Ed Sheeran’s Photograph (2014). Both human analysts judged the lyrics to be negative, as it describes a stressful long-distance relationship. However, the algorithms determined its lyrics to be positive, which may be due to the frequency of relatively positive words/phrases in the song (e.g. “love”, “heal”, “never broken”). Similarly, The Ray’s Silhouettes (1957), Andy Williams’ Butterfly (1957) were judged to be negative by human analysts but positive by algorithms.
2. 2.
  The Door’s Light my fire (1967) was judged to be positive by both human analysts and negative by the algorithms. It describes a male’s wish to accelerate the relationship between himself and a female. There are many negative words in the lyrics (e.g. untrue, liar, lose), which may be the reason that the algorithms judge the song’s overall sentiment to be negative. Similar, Neil Diamond’s Sweet Caroline (1969) and James Taylor’s You’ve got a friend (1971) were judged to be positive by human analysts but negative by algorithms.
Table 5 used year as an integer-denominated independent variable. We also tested two other models that treated year as 10-year bands or before-after peak car (see “Appendix 3”). In general, the three sets of models provide similar outputs.

References

Al Marouf, A., Hossain, R., Kabir Rasel Sarker, M.R., Pandey, B., Tanvir Siddiquee, S.M.: Recognizing language and emotional tone from music lyrics using IBM Watson Tone Analyzer. In: Proceedings of 2019 3rd IEEE International Conference on Electrical, Computer and Communication Technologies, ICECCT 2019. Institute of Electrical and Electronics Engineers Inc. (2019). https://doi.org/10.1109/ICECCT.2019.8869008
Ali, F., Kwak, D., Khan, P., Islam, S.M.R., Kim, K.H., Kwak, K.S.: Fuzzy ontology-based sentiment analysis of transportation and city feature reviews for safe traveling. Transp. Res. Part C Emerg. Technol. 77, 33–48 (2017). https://doi.org/10.1016/j.trc.2017.01.014
Article Google Scholar
Baj-Rogowska, A.: Sentiment analysis of Facebook posts: the Uber case. In: 2017 IEEE 8th International Conference on Intelligent Computing and Information Systems, ICICIS 2017, pp. 391–395. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/INTELCIS.2017.8260068
Baradaran, S., Persson, C., Hogosson, M., Karlstrom, A.: A time-dynamic duration model for driver’s license holding in discrete-time (2016) KTH Royal Institute of Technology, SE-100 44 Stockholm, Sweden. https://www.trafikverket.se/contentassets/773857bcf506430a880a79f76195a080/forskningsresultat/bilaga_1_kth_skattning_av_korkortsmodell.pdf. Accessed 9 Nov 2020
Bastian, A., Börjesson, M., Eliasson, J.: Explaining “peak car” with economic variables. Transp. Res. Part A Policy Pract. 88, 236–250 (2016). https://doi.org/10.1016/j.tra.2016.04.005
Article Google Scholar
Bayart, C., Havet, N., Bonnel, P., Bouzouina, L.: Young people and the private car: a love-hate relationship. Transp. Res. Part D Transp. Environ. 80, 1–15 (2020). https://doi.org/10.1016/j.trd.2020.102235
Article Google Scholar
Beckers, B., Kholodilin, K.A., Ulbricht, D.: Reading between the Lines: Using Media to Improve German Inflation Forecasts (April 28, 2017). DIW Berlin Discussion Paper No. 1665, Available at SSRN: https://ssrn.com/abstract=2970466 or https://doi.org/10.2139/ssrn.2970466. Accessed 9 Nov 2020
Belgiawan, P.F., Schmöcker, J.D., Fujii, S.: Understanding car ownership motivations among Indonesian students. Int. J. Sustain. Transp. 10, 295–307 (2016). https://doi.org/10.1080/15568318.2014.921846
Article Google Scholar
Billboard: Hot 100 songs—year-end | Billboard [WWW Document] (2016). https://www.billboard.com/charts/year-end/2016/hot-100-songs. Accessed 9 Nov. 20
Blumenberg, E., Ralph, K., Smart, M., Taylor, B.D.: Who knows about kids these days? Analyzing the determinants of youth and adult mobility in the US between 1990 and 2009. Transp. Res. Part A Policy Pract. 93, 39–54 (2016). https://doi.org/10.1016/j.tra.2016.08.010
Article Google Scholar
Bogt, T.F.M.T., Keijsers, L., Meeus, W.H.J.: Early adolescent music preferences and minor delinquency. Pediatrics 131, 380–389 (2013). https://doi.org/10.1542/peds.2012-0708
Article Google Scholar
Bose, S., Saha, U., Kar, D., Goswami, S., Nayak, A.K., Chakrabarti, S.: Rsentiment: a tool to extract meaningful insights from textual reviews. In: Advances in Intelligent Systems and Computing. Springer, pp. 259–268 (2017). https://doi.org/10.1007/978-981-10-3156-4_26
Brown, R.E., Handy, S.L.: Factors associated with high school students’ delayed acquisition of a driver’s license insights from three northern California schools. Transp. Res. Rec. J. Transp. Res. Board 2495, 1–13 (2015). https://doi.org/10.3141/2495-01
Article Google Scholar
Cao, X., MacNaughton, P., Deng, Z., Yin, J., Zhang, X., Allen, J.: Using Twitter to better understand the spatiotemporal patterns of public sentiment: a case study in Massachusetts, USA. Int. J. Environ. Res. Public Health 15, 250 (2018). https://doi.org/10.3390/ijerph15020250
Article Google Scholar
Chatterjee, K., Goodwin, P., Schwanen, T., Clark, B., Jain, J., Melia, S., Middleton, J., Plyushteva, A., Ricci, M., Santos, G. Stokes, G.: Young People’s Travel – What’s Changed and Why? Review and Analysis. Report to Department for Transport. UWE Bristol, UK. https://www.gov.uk/government/publications/young-peoples-travel-whats-changed-and-why. Accessed 9 Nov 20
Choy, M.J., Cheong, M.L.F., Ma, N.L., Koo, P.S.: A Sentiment Analysis of Singapore Presidential Election 2011 using Twitter Data with Census Correction. (2012). Research Collection School of Information Systems. https://ink.library.smu.edu.sg/sis_research/1436. Accessed 9 Nov 2020
Christenson, P., Roberts, D.F., Bjork, N.: Booze, drugs, and pop music: trends in substance portrayals in the Billboard top 100–1968–2008. Subst. Use Misuse 47, 121–129 (2012). https://doi.org/10.3109/10826084.2012.637433
Article Google Scholar
Christenson, P.G., de Haan-Rietdijk, S., Roberts, D.F., ter Bogt, T.F.M.: What has America been singing about? Trends in themes in the US top-40 songs: 1960–2010. Psychol. Music 47, 194–212 (2019). https://doi.org/10.1177/0305735617748205
Article Google Scholar
Ciari, F., Axhausen, K.: Insights on the Swiss way to Peak Car. In: 14th International Conference on Travel Research Behaviour (IATBR), pp. 1–21, Windsor, UK (2015)
Collins, C., Hasan, S., Ukkusuri, S.V.: A novel transit rider satisfaction metric: rider sentiments measured from online social media data. J. Public Transp. 16, 21–45 (2013)
Article Google Scholar
Cook, M., Leggat, G., Pennay, A.: Change over time in australian newspaper reporting of drinking during pregnancy: a content analysis (2000–2017). Alcohol Alcohol. (2020). https://doi.org/10.1093/alcalc/agaa072
Article Google Scholar
Curry, A.E., Pfeiffer, M.R., Durbin, D.R., Elliott, M.R.: Young driver crash rates by licensing age, driving experience, and license phase. Accid. Anal. Prev. 80, 243–250 (2015). https://doi.org/10.1016/j.aap.2015.04.019
Article Google Scholar
Das, S., Sun, X., Dutta, A.: Investigating user ridership sentiments for bike sharing programs. J. Transp. Technol. 05, 69–75 (2015). https://doi.org/10.4236/jtts.2015.52007
Article Google Scholar
Delbosc, A.: Delay or forgo? A closer look at youth driver licensing trends in the United States and Australia. Transportation (Amsterdam) 44, 919–926 (2017). https://doi.org/10.1007/s11116-016-9685-7
Article Google Scholar
Delbosc, A., Currie, G.: Causes of youth licensing decline: a synthesis of evidence. Transp. Rev. 33, 271–290 (2013). https://doi.org/10.1080/01441647.2013.801929
Article Google Scholar
Delbosc, A., Currie, G.: Changing demographics and young adult driver license decline in Melbourne, Australia (1994–2009). Transportation (Amsterdam) 41, 529–542 (2014a). https://doi.org/10.1007/s11116-013-9496-z
Article Google Scholar
Delbosc, A., Currie, G.: Impact of attitudes and life stage on decline in rates of Driver’s License acquisition by young people in Melbourne, Australia. Transp. Res. Rec. 2452, 62–70 (2014b). https://doi.org/10.3141/2452-08
Article Google Scholar
Delbosc, A., Currie, G.: Using discussion forums to explore attitudes toward cars and licensing among young Australians. Transp. Policy 31, 27–34 (2014c). https://doi.org/10.1016/j.tranpol.2013.11.005
Article Google Scholar
Delbosc, A., Nakanishi, H.: A life course perspective on the travel of Australian millennials. Transp. Res. Part A Policy Pract. 104, 319–336 (2017). https://doi.org/10.1016/j.tra.2017.03.014
Article Google Scholar
DeWall, C.N., Pond, R.S., Campbell, W.K., Twenge, J.M.: Tuning in to psychological change: linguistic markers of psychological traits and emotions over time in popular US song lyrics. Psychol. Aesthetics Creat. Arts 5, 200–207 (2011). https://doi.org/10.1037/a0023195
Article Google Scholar
Eastman, J.T., Pettijohn, T.F.: Gone country: an investigation of billboard country songs of the year across social and economic conditions in the United States. Psychol. Pop. Media Cult. (2014). https://doi.org/10.1037/ppm0000019
Article Google Scholar
Effendy, V., Novantirani, A., Sabarah, M.: Sentiment analysis on Twitter about the use of city public transportation using support vector machine method. Int. J. Inf. Commun. Technol. 2, 57–66 (2016). https://doi.org/10.21108/ijoict.2016.21.85
Article Google Scholar
El-Diraby, T., Shalaby, A., Hosseini, M.: Linking social, semantic and sentiment analyses to support modeling transit customers’ satisfaction: towards formal study of opinion dynamics. Sustain. Cities Soc. 49, 101578 (2019). https://doi.org/10.1016/j.scs.2019.101578
Article Google Scholar
Fraedrich, E., Lenz, B.: Societal and individual acceptance of autonomous driving. In: Autonomous Driving: Technical, Legal and Social Aspects, pp. 621–640. Springer, Berlin (2016). https://doi.org/10.1007/978-3-662-48847-8_29
Fylan, F., Caveney, L.: Young people’s motivations to drive: expectations and realities. Transp. Res. Part F Traffic Psychol. Behav. 52, 32–39 (2018). https://doi.org/10.1016/j.trf.2017.11.011
Article Google Scholar
Gao, S., Hao, J., Fu, Y.: The application and comparison of web services for sentiment analysis in tourism. In: 2015 12th International Conference on Service Systems and Service Management, ICSSSM 2015. Institute of Electrical and Electronics Engineers Inc. (2015). https://doi.org/10.1109/ICSSSM.2015.7170341
Giancristofaro, G.T., Panangadan, A.: Predicting sentiment toward transportation in social media using visual and textual features. In: IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC, pp. 2113–2118. Institute of Electrical and Electronics Engineers Inc. (2016). https://doi.org/10.1109/ITSC.2016.7795898
Guan, F.: Rap dominated pop in 2017, and it’s not going anywhere anytime soon (WWW Document), Vulture (2017). https://www.vulture.com/2017/12/the-year-rap-overtook-pop.html. Accessed 19 Oct. 2020
Haghighi, N.N., Liu, X.C., Wei, R., Li, W., Shao, H.: Using Twitter data for transit performance assessment: a framework for evaluating transit riders’ opinions about quality of service. Public Transp. 10, 363–377 (2018). https://doi.org/10.1007/s12469-018-0184-4
Article Google Scholar
Hao, L., Panangadan, A., Abellera, L.: Understanding public sentiment toward i-710 corridor project from social media based on natural language processing. In: IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), pp. 2107–2112 (2016)
Hjorthol, R.: Decreasing popularity of the car? Changes in driving licence and access to a car among young adults over a 25-year period in Norway. J. Transp. Geogr. 51, 140–146 (2016). https://doi.org/10.1016/j.jtrangeo.2015.12.006
Article Google Scholar
IBM: IBM cloud docs/tone analyzer: case studies (WWW Document) (2019). https://cloud.ibm.com/docs/tone-analyzer?topic=tone-analyzer-caseStudies. Accessed 10 Sept. 2020
Jensen, M.: Passion and heart in transport—a sociological analysis on transport behaviour. Transp. Policy 6, 19–33 (1999). https://doi.org/10.1016/S0967-070X(98)00029-8
Article Google Scholar
Jussila, J., Madhala, P.: Cognitive computing approaches for human activity recognition from tweets—a case study of twitter marketing campaign. In: Springer Proceedings in Complexity, pp. 153–170. Springer (2019). https://doi.org/10.1007/978-3-030-30809-4_15
Kalia, A. (2015) Why do we listen to less pop music as we get older? (WWW Document). https://www.newsweek.com/why-do-we-listen-less-pop-music-we-get-older-329805. Accessed 8 Nov. 2020
Kaplan, K.: Computers crack the code of pop-song success: it helps to be “happy” and “female”—Los Angeles Times (WWW Document). Los Angeles Times (2018). https://www.latimes.com/science/sciencenow/la-sci-sn-pop-song-success-20180516-story.html. Accessed 20 Oct. 2020
Kinra, A., Beheshti-Kashi, S., Buch, R., Nielsen, T.A.S., Pereira, F.: Examining the potential of textual big data analytics for public policy decision-making: a case study with driverless cars in Denmark. Transp. Policy (2020). https://doi.org/10.1016/j.tranpol.2020.05.026
Article Google Scholar
Klein, N.J., Smart, M.J.: Millennials and car ownership: less money, fewer cars. Transp. Policy 53, 20–29 (2017). https://doi.org/10.1016/j.tranpol.2016.08.010
Article Google Scholar
Kuhnimhof, T., Armoogum, J., Buehler, R., Dargay, J., Denstadli, J.M., Yamamoto, T.: Men shape a downward trend in car use among young adults-evidence from six industrialized countries. Transp. Rev. 32, 761–779 (2012). https://doi.org/10.1080/01441647.2012.736426
Article Google Scholar
Kulkarni, G., Abellera, L., Panangadan, A.: Unsupervised classification of online community input to advance transportation services. In: 2018 IEEE 8th Annual Computing and Communication Workshop and Conference, CCWC 2018, pp. 261–267. Institute of Electrical and Electronics Engineers Inc. (2018). https://doi.org/10.1109/CCWC.2018.8301704
Le Vine, S., & Wu, C.: Sentiment Analysis of Popular-Music References to Automobiles, 1950s-2010s. Datafiles archived at Open Science Framework. (2021). https://doi.org/10.17605/OSF.IO/UM5XB
Le Vine, S., Polak, J.: Factors associated with young adults delaying and forgoing driving licenses: results from Britain. Traffic Inj. Prev. 15, 794–800 (2014). https://doi.org/10.1080/15389588.2014.880838
Article Google Scholar
Le Vine, S., Jones, P., Lee-Gosselin, M., Polak, J.: Is heightened environmental sensitivity responsible for drop in young adults’ rates of driver’s license acquisition? Transp. Res. Rec. (2014a). https://doi.org/10.3141/2465-10
Article Google Scholar
Le Vine, S., Latinopoulos, C., Polak, J.: What is the relationship between online activity and driving-licence-holding amongst young adults? Transportation (Amsterdam) 41, 1071–1098 (2014b). https://doi.org/10.1007/s11116-014-9528-3
Article Google Scholar
Licaj, I., Haddak, M., Pochet, P., Chiron, M.: Individual and contextual socioeconomic disadvantages and car driving between 16 and 24years of age: a multilevel study in the Rhône Département (France). J. Transp. Geogr. 22, 19–27 (2012). https://doi.org/10.1016/j.jtrangeo.2011.11.018
Article Google Scholar
Liu, B.: Sentiment Analysis and Opinion Mining. Morgan & Claypool Publishers (2012)
Book Google Scholar
Luong, T.T.B., Houston, D.: Public opinions of light rail service in Los Angeles, an analysis using Twitter data. In: IConference, pp. 1–4 (2015)
Lyons, G.: Transport’s digital age transition. J. Transp. Land Use 8, 1–19 (2015). https://doi.org/10.5198/jtlu.2014.751
Article Google Scholar
Madanikia, Y., Bartholomew, K.: Themes of lust and love in popular music lyrics from 1971 to 2011. SAGE Open 4, 215824401454717 (2014). https://doi.org/10.1177/2158244014547179
Article Google Scholar
Mahmud, J.: Another step closer to building empathetic systems (WWW Document) (2016). http://blog.alchemyapi.com/a-step-closer-to-building-empathetic-systems
Mäntylä, M.V., Graziotin, D., Kuutila, M.: The evolution of sentiment analysis—a review of research topics, venues, and top cited papers. Comput. Sci. Rev. (2018). https://doi.org/10.1016/j.cosrev.2017.10.002
Article Google Scholar
McDonald, N.C.: Are millennials really the “go-nowhere” generation? J. Am. Plan. Assoc. 81, 90–103 (2015). https://doi.org/10.1080/01944363.2015.1057196
Article Google Scholar
McDonald, N.C.: Changing automobility: Is it real? (2017). http://www.demand.ac.uk/wp-content/uploads/2017/03/26-EC1-McDonald.pdf. Accessed 9 Nov 2020
McDonald, N., Trowbridge, M.: Does the built environment affect when American teens become drivers? Evidence from the 2001 National Household Travel Survey. J. Safety Res. 40, 177–183 (2009). https://doi.org/10.1016/j.jsr.2009.03.001
Article Google Scholar
Medhat, W., Hassan, A., Korashy, H.: Sentiment analysis algorithms and applications: a survey. Ain Shams Eng. J. 5, 1093–1113 (2014). https://doi.org/10.1016/j.asej.2014.04.011
Article Google Scholar
Mendez, J.T., Lobel, H., Parra, D., Herrera, J.C.: Using Twitter to infer user satisfaction with public transport: the case of Santiago, Chile. IEEE Access 7, 60255–60263 (2019). https://doi.org/10.1109/ACCESS.2019.2915107
Article Google Scholar
Mondschein, A., King, D.A., Hoehne, C., Jiang, Z., Chester, M.: Using social media to evaluate associations between parking supply and parking sentiment. Transp. Res. Interdiscip. Perspect. 4, 100085 (2020). https://doi.org/10.1016/j.trip.2019.100085
Article Google Scholar
Napier, K., Shamir, L.: Quantitative sentiment analysis of lyrics in popular music. J. Pop. Music Stud. (2018). https://doi.org/10.1525/jpms.2018.300411
Article Google Scholar
Pai, P.F., Liu, C.H.: Predicting vehicle sales by sentiment analysis of twitter data and stock market values. IEEE Access 6, 57655–57662 (2018). https://doi.org/10.1109/ACCESS.2018.2873730
Article Google Scholar
Pang, B., Lee, L.: Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval (2008)
Pettijohn, T.F., Sacco, D.F.: Tough times, meaningful music, mature performers: popular Billboard songs and performer preferences across social and economic conditions in the USA. Psychol. Music 37, 155–179 (2009). https://doi.org/10.1177/0305735608094512
Article Google Scholar
Pratt, A.N., Morris, E.A., Zhou, Y., Khan, S., Chowdhury, M.: What do riders tweet about the people that they meet? Analyzing online commentary about UberPool and Lyft Shared/Lyft Line. Transp. Res. Part F Traffic Psychol. Behav. 62, 459–472 (2019). https://doi.org/10.1016/j.trf.2019.01.015
Article Google Scholar
Qi, B., Costin, A., Jia, M.: A framework with efficient extraction and analysis of Twitter data for evaluating public opinions on transportation services. Travel Behav. Soc. 21, 10–23 (2020). https://doi.org/10.1016/j.tbs.2020.05.005
Article Google Scholar
Rahim Taleqani, A., Hough, J., Nygard, K.E.: Public opinion on dockless bike sharing: a machine learning approach. Transp. Res. Rec. J. Transp. Res. Board 2673, 195–204 (2019). https://doi.org/10.1177/0361198119838982
Article Google Scholar
Raimond, T., Milthorpe, F.: Why are young people driving less? Trends in licence-holding and travel behaviour. In: ATRF 2010: 33rd Australasian Transport Research Forum, pp. 1–11 (2010)
Ramteke, J., Shah, S., Godhia, D., Shaikh, A.: Election result prediction using Twitter sentiment analysis. In: Proceedings of the International Conference on Inventive Computation Technologies, ICICT 2016. Institute of Electrical and Electronics Engineers Inc. (2016). https://doi.org/10.1109/INVENTIVE.2016.7823280
Ren, R., Wu, D.D., Wu, D.D.: Forecasting stock market movement direction using sentiment analysis and support vector machine. IEEE Syst. J. 13, 760–770 (2019). https://doi.org/10.1109/JSYST.2018.2794462
Article Google Scholar
Rérat, P.: A decline in youth licensing: a simple delay or the decreasing popularity of automobility? Appl Mobilities 00, 1–21 (2018). https://doi.org/10.1080/23800127.2018.1545737
Article Google Scholar
Ryan, P.: Rap overtakes rock as the most popular genre among music fans. Here’s why. USA Today (2018)
Santhosh Kumar, K.L., Desai, J., Majumdar, J.: Opinion mining and sentiment analysis on online customer review. In: 2016 IEEE International Conference on Computational Intelligence and Computing Research, ICCIC 2016. Institute of Electrical and Electronics Engineers Inc. (2017). https://doi.org/10.1109/ICCIC.2016.7919584
Sari, P.K., Alamsyah, A., Wibowo, S.: Measuring e-Commerce service quality from online customer review using sentiment analysis. J. Phys. Conf. Ser. (2018). https://doi.org/10.1088/1742-6596/971/1/012053
Article Google Scholar
Schoettle, B., Sivak, M.: The reasons for the recent decline in young driver licensing in the United States. Traffic Inj. Prev. 15, 6–9 (2014). https://doi.org/10.1080/15389588.2013.839993
Article Google Scholar
Schweitzer, L.: Planning and social media: a case study of public transit and stigma on Twitter. J. Am. Plan. Assoc. 80, 218–238 (2014). https://doi.org/10.1080/01944363.2014.980439
Article Google Scholar
Shah, J., Sagathiya, M., Redij, K., Hole, V.: Natural language processing based abstractive text summarization of reviews. In: Proceedings of the International Conference on Electronics and Sustainable Communication Systems, ICESC 2020, pp. 461–466. Institute of Electrical and Electronics Engineers Inc. (2020). https://doi.org/10.1109/ICESC48915.2020.9155759
Sivak, M., Schoettle, B.: Recent changes in the age composition of US drivers: implications for the extent, safety, and environmental consequences of personal transportation. Traffic Inj. Prev. 12, 588–592 (2011). https://doi.org/10.1080/15389588.2011.605817
Article Google Scholar
Sivak, M., Schoettle, B.: Recent changes in the age composition of drivers in 15 countries. Traffic Inj. Prev. 13, 126–132 (2012). https://doi.org/10.1080/15389588.2011.638016
Article Google Scholar
Steg, L.: Car use: lust and must. Instrumental, symbolic and affective motives for car use. Transp. Res. Part A Policy Pract. 39, 147–162 (2005). https://doi.org/10.1016/j.tra.2004.07.001
Article Google Scholar
Tefft, B.C., Williams, A.F., Grabowski, J.G.: Driver licensing and reasons for delaying licensure among young adults ages 18–20, United States, 2012. Inj. Epidemiol. 1, 1–8 (2014). https://doi.org/10.1186/2197-1714-1-4
Article Google Scholar
Thigpen, C., Handy, S.: Driver’s licensing delay: a retrospective case study of the impact of attitudes, parental and social influences, and intergenerational differences. Transp. Res. Part A Policy Pract. 111, 24–40 (2018). https://doi.org/10.1016/j.tra.2018.03.002
Article Google Scholar
Vaca, F.E., Li, K., Tewahade, S., Fell, J.C., Haynie, D.L., Simons-Morton, B.G., Romano, E.: Factors contributing to delay in driving licensure among US high school students and young adults. J. Adolesc. Heal. (2020). https://doi.org/10.1016/j.jadohealth.2020.05.003
Article Google Scholar
Van, H.T., Fujii, S.: A cross Asian country analysis in attitudes toward car and public transport. In: Proceedings of the Eastern Asia Society for Transportation Studies (2011)
Van, H.T., Choocharukul, K., Fujii, S.: The effect of attitudes toward cars and public transportation on behavioral intention in commuting mode choice—a comparison across six Asian countries. Transp. Res. Part A Policy Pract. 69, 36–44 (2014). https://doi.org/10.1016/j.tra.2014.08.008
Article Google Scholar
Wijnhoven, F., Plant, O.: Sentiment analysis and Google trends data for predicting car sales. In: 38th International Conference on Information Systems (2017), p. 1
Williams, A.F.: Teenagers’ licensing decisions and their views of licensing policies: a national survey. Traffic Inj. Prev. 12, 312–319 (2011). https://doi.org/10.1080/15389588.2011.572100
Article Google Scholar
Wu, D.D., Zheng, L., Olson, D.L.: A decision support approach for online stock forum sentiment analysis. IEEE Trans. Syst. Man Cybern. Syst. 44, 1077–1087 (2014). https://doi.org/10.1109/TSMC.2013.2295353
Article Google Scholar
Yadav, A., Vishwakarma, D.K.: Sentiment analysis using deep learning architectures: a review. Artif. Intell. Rev. 53, 4335–4385 (2020). https://doi.org/10.1007/s10462-019-09794-5
Article Google Scholar
Zhu, C., Zhu, Y., Lu, R., He, R., Xia, Z.: Perceptions and aspirations for car ownership among Chinese students attending two universities in the Yangtze Delta, China. J. Transp. Geogr. 24, 315–323 (2012). https://doi.org/10.1016/j.jtrangeo.2012.03.011
Article Google Scholar

Download references

Acknowledgements

An earlier version of this research was presented at the 97th Annual Meeting of the Transportation Research Board (Washington, DC, January 2018). The authors thank the anonymous peer reviewers and editor for helpful feedback on earlier versions of this manuscript, however the authors are solely responsible for any remaining errors. Author Wu acknowledges partial support via Southeast University start-up funding (#3221002109A1).

Author information

Authors and Affiliations

School of Transportation, Southeast University, Nanjing, China
Chenyang Wu
Urban Systems Lab, Department of Civil and Environmental Engineering, Imperial College London, London, UK
Chenyang Wu, Scott Le Vine & John Polak
Transpo Group and Department of Geography, SUNY New Paltz, New Paltz, USA
Scott Le Vine, Elizabeth Bengel & Jason Czerwinski

Authors

Chenyang Wu
View author publications
You can also search for this author in PubMed Google Scholar
Scott Le Vine
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth Bengel
View author publications
You can also search for this author in PubMed Google Scholar
Jason Czerwinski
View author publications
You can also search for this author in PubMed Google Scholar
John Polak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

The authors confirm contributions to the paper as follows: Study conception and design: Le Vine, Polak. Data collection: Bengel, Czerwinski. Human sentiment analysis: Bengel, Le Vine. Mathematical modelling: Wu. Manuscript preparation: Wu, Le Vine. All authors confirm their approval of the final version of the manuscript.

Corresponding author

Correspondence to Chenyang Wu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

John Polak: Deceased.

Appendices

Appendix 1: Rules employed during token-classification, listing of excluded ambiguous references, and listing of car parts

General rules of token-identification

1.
Reference must be to cars, traffic (flow of multiple vehicles), driving/passenger travel, car parts, or taxi/hitching a ride. Not roads. Not motorcycles. Yes trucks/RVs/ambulances/police-cars. Not when car terms are used purely allegorically (e.g. ‘junk in the trunk’ to refer to body shape)
2.
To identify each token: Line referencing “cars”, plus the previous line and/or following line if each rhymes. If two tokens defined this way are directly adjacent to one another, we define it as one token.
3.
We remove non-word noises (oop, oop, oop; dum-dee-dum, oh, etc. from the middle of tokens), to avoid confusing the algorithms
4.
We change spellings in tokens to standard spellings, and clean up syntax (e.g. “I’ma” to “I’m going to”)
5.
We remove racial/ethnic slurs from lyrics before processing them through the algorithms.

Specific borderline (ambiguous) references that were explicitly excluded:

Lover’s lane; Anything to do with a mechanic or garage or chop shop or gas station or car wash or drive-in unless token mentions cars as vehicles or driving; Hit the road; Roll with me; Buses; Motor City; Dead Man’s Curve; Took a wrong turn; Ride up; Go-karts; Bumper cars; Toot-toot, beep-beep; Call me for a ride; Let’s cruise, away from here; Motoring; Let me start you up; Pull up; Living in the fast lane (but yes to “speeding in the fast lane”); Buckle up.

List of specific cars parts that are referenced at least once

Radio, keys, brake, engine, turbo (charger), motor, engine noise (purr), window, glass, motor, white wall tires, sirens, roof, bumper, door, stereo, top, buckle, seat belt, honk, steering wheel, spare tire, trunk, amps, (gas) tank, hydraulics, front, back, passenger side, rear view, 18′s (rims), gas, fuel, wood panel, chrome, speaker/subwoofer, sun roof, title, ashtray, lift gate, throttle, license plate, ignition, paint color, paint type (e.g. candy gloss), TVs, car phone (car celly), rag top, chrome pipes, chrome hydraulics, low pro (file tires), car seats, stainless wheels, chrome wheels, white leather inside, alarm, bucket seat, butterfly doors, pedal, gears, dash, AC, button.

Appendix 2: Regression analysis of all-lyrics sentiment (see discussion in “Regression analysis” section)

See Table

Table 6 Regression analysis for sentiment of songs (i.e. all-lyrics)

Full size table

6.

Appendix 3: Regression analysis of token sentiment (10-year band model and before-after peak car model)

See Tables 7 and 8.

Table 7 Regression analysis for sentiment of automobile references (10-year band model)

Full size table

Table 8 Regression analysis for sentiment of automobile references (before-after peak car model)

Full size table

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wu, C., Le Vine, S., Bengel, E. et al. Sentiment analysis of popular-music references to automobiles, 1950s to 2010s. Transportation 49, 641–678 (2022). https://doi.org/10.1007/s11116-021-10189-1

Download citation

Accepted: 17 March 2021
Published: 02 May 2021
Issue Date: April 2022
DOI: https://doi.org/10.1007/s11116-021-10189-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Sentiment analysis of popular-music references to automobiles, 1950s to 2010s

Abstract

Similar content being viewed by others

The Evolution of Public Perceptions of Automated Vehicles in China: A Text Mining Approach Based Dynamic Topic Modeling

Analyzing Parking Sentiment and its Relationship to Parking Supply and the Built Environment Using Online Reviews

The italian music superdiversity

Introduction

Literature review

Young adults’ declining automobile orientation

Symbolic/affective motives for car use

Sentiment analysis

Cultural evolution in popular music

Data

Songs

Automobile reference tokens

Results

Popular music trends, from 1950 to 2010s

Frequency of automobile references

Sentiment towards cars over time

Regression analysis

Conclusions

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Additional information

Publisher's Note

Appendices

Appendix 1: Rules employed during token-classification, listing of excluded ambiguous references, and listing of car parts

General rules of token-identification

Specific borderline (ambiguous) references that were explicitly excluded:

List of specific cars parts that are referenced at least once

Appendix 2: Regression analysis of all-lyrics sentiment (see discussion in “Regression analysis” section)

Appendix 3: Regression analysis of token sentiment (10-year band model and before-after peak car model)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation