From alternative conceptions of honesty to alternative facts in communications by US politicians

Lasser, Jana; Aroyehun, Segun T.; Carrella, Fabio; Simchon, Almog; Garcia, David; Lewandowsky, Stephan

doi:10.1038/s41562-023-01691-w

From alternative conceptions of honesty to alternative facts in communications by US politicians

Article
Open access
Published: 25 September 2023

Volume 7, pages 2140–2151, (2023)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue Submit your manuscript

From alternative conceptions of honesty to alternative facts in communications by US politicians

Download PDF

10k Accesses
4 Citations
251 Altmetric
23 Mentions
Explore all metrics

Abstract

The spread of online misinformation on social media is increasingly perceived as a problem for societal cohesion and democracy. The role of political leaders in this process has attracted less research attention, even though politicians who ‘speak their mind’ are perceived by segments of the public as authentic and honest even if their statements are unsupported by evidence. By analysing communications by members of the US Congress on Twitter between 2011 and 2022, we show that politicians’ conception of honesty has undergone a distinct shift, with authentic belief speaking that may be decoupled from evidence becoming more prominent and more differentiated from explicitly evidence-based fact speaking. We show that for Republicans—but not Democrats—an increase in belief speaking of 10% is associated with a decrease of 12.8 points of quality (NewsGuard scoring system) in the sources shared in a tweet. In contrast, an increase in fact-speaking language is associated with an increase in quality of sources for both parties. Our study is observational and cannot support causal inferences. However, our results are consistent with the hypothesis that the current dissemination of misinformation in political discourse is linked to an alternative understanding of truth and honesty that emphasizes invocation of subjective belief at the expense of reliance on evidence.

Combating Misinformation by Sharing the Truth: a Study on the Spread of Fact-Checks on Social Media

Article 11 June 2022

Not just the facts: an index for measuring the information density of political communication

Article Open access 05 December 2020

Vindication, virtue, and vitriol

Article Open access 17 November 2020

Find the latest articles, discoveries, and news in related topics.

Artificial Intelligence

Main

Numerous indicators suggest that democracy is in retreat worldwide (for example, refs. ^1,2). Although symptoms and causes of this democratic backsliding are difficult to tease apart, the widespread dissemination of misinformation—on social media, in hyper-partisan news sites and in political discourse—is undoubtedly a challenge to democracies³. There is increasing evidence that exposure to misinformation can cause people to change their behaviour (for example, ref. ⁴). Exposure to misinformation has been identified as a contributing cause of voting for populist parties in Italy⁵ and has been causally linked to ethnic hate crimes in Germany⁶ (for a review of causal effects, see ref. ⁷). Note that we use ‘misinformation’ as an umbrella term to refer to any information that people consume that later turns out to be false. Misinformation can be spread unintentionally, when communicators mistakenly believe some item of information to be true, or it can be spread intentionally, for example, in pursuit of a political agenda. Intentionally disseminated misinformation is often referred to as ‘disinformation’. The psychological and cognitive consequences of disinformation are indistinguishable from those of unintentional misinformation, and we therefore use the latter term throughout.

Misinformation has several troubling psychological attributes. First, misinformation lingers in memory even if people acknowledge, believe and try to adhere to a correction⁸. Although people may adjust their factual beliefs in response to corrections⁹, their political behaviours and attitudes may be largely unaffected^10,11. Second, perhaps most concerningly, in some circumstances people may even come to value overt dishonesty as a signal of ‘authenticity’¹². A politician who routinely and blatantly misinforms the public is overtly violating the established societal norm of being accurate and truthful. This norm violation can identify the politician as an enemy of the ‘establishment’ and, by implication, an authentic champion of ‘the people’—dishonesty and misinformation thus become a sign of distinction¹². For example, polls have shown that around 75% of Republicans considered President Trump to be ‘honest’ at various points throughout his presidency (for example, NBC poll, April 2018). This perception of honesty is at odds with the records of fact checkers and the media, which have identified more than 30,000 false or misleading statements by Trump during his presidency (Washington Post fact checker¹³).

This discrepancy between factual accuracy and perceived honesty is, however, understandable if ‘speaking one’s mind’ on behalf of a constituency is considered a better marker of honesty than veracity. The idea that untrue statements can be honest, provided they arise from authentic belief speaking, points to a distinct ontology of honesty that does not rely on the notion of evidence but on a radically constructivist appeal to an intuitive shared experience as ‘truth’³. There have been several attempts to characterize this ontology of truth and honesty and the stream of misinformation to which it gives rise (for example, refs. ^3,14,15). A recent analysis³ of ontologies of political truth (see also ref. ¹⁶) proposed two distinct conceptions of truth: ‘belief speaking’ and ‘fact speaking’. Belief speaking relates only to the speaker’s beliefs, thoughts and feelings, without regard to factual accuracy. Fact speaking, in contrast, relates to the search for accurate information and an updating of one’s beliefs based on that information.

The first of these two ontologies echoes the radical constructivist truth, based on intuition and feelings, that also characterized 1930s fascism (for example, ref. ¹⁷). This conception of truth sometimes rejects the role of evidence outright. For example, Nazi ideology postulated the existence of an ‘organic truth’ based on personal experience and intuition that can only be revealed through inner reflection but not external evidence (for example, refs. ^17,18). Contemporary variants of this conception of truth can be found in critical postmodern theory¹⁹ and both right-wing and left-wing populism^20,21. The second ontology, based on fact speaking, aims to establish a shared evidence-based reality that is essential for the well-being of democracy²². This conception of truth aims to be dispassionate and does not admit appeals to emotion as a valid tool to adjudicate evidence, although it also does not preclude truth-finding from being highly contested and messy (ref. ²³ versus ref. ²⁴).

For democratic societies, a conception of truth that is based on belief speaking alone can have painful consequences because democracy requires a body of common political knowledge to enable societal coordination²². For example, people in a democracy must share the knowledge that the electoral system is fair and that a defeat in one election does not prevent future wins. Without that common knowledge, democracy is at risk. The attempts by Donald Trump and his supporters to overturn the 2020 election results with baseless claims of electoral fraud have brought that risk into sharp focus²⁵. To achieve a common body of knowledge, democratic discourse must go beyond belief speaking. In particular, democratic politics requires fact speaking by leaders—otherwise, they may choose to remain wilfully ignorant of embarrassing information, for example, by refusing briefings from experts that are critical of their favoured public-health policy. A corollary of this requirement is that the public considers fact speaking by politicians as an indicator of honesty rather than (only) belief speaking.

Although truth and honesty are closely linked concepts, with honesty and truthfulness being nearly synonymous²⁶, in the present context they need to be disentangled for clarity. Here we focus primarily on conceptions of honesty, which refers to a virtuous human quality and a socially recognized norm, rather than truth, which refers to the quality of information about the world. Thus, the two ontologies of truth just introduced describe how the world can be known—namely, either through applying intuition or seeking evidence, irrespective of the virtuous qualities (or lack thereof) of the beholder. This ontological dichotomy maps nearly seamlessly into the different conceptions of honesty that we characterize as belief speaking and fact speaking.

So far, there has been much concern but limited evidence about the increasing prevalence of belief speaking at the expense of fact speaking in American public and political life. We aim to explore this presumed shift in conceptions of truth and honesty by focusing on Twitter activity by members of both houses of the US Congress. The United States is not only one of the world’s largest democracies but it is also a crucible of the contemporary conflict between populism and liberal democracy and the intense partisan polarization this conflict has entailed²⁷. The choice of Twitter is driven by the fact that public outreach on Twitter has become one of the most important avenues of public-facing discourse by US politicians in the last decade²⁸ and is frequently used by politicians for agenda-setting purposes²⁹.

Our analysis addressed several research questions: can we identify aspects of belief speaking and fact speaking in public-facing statements by members of Congress? If so, how do these conceptions evolve over time? What partisan differences, if any, are there? Is the quality of shared information linked to the different conceptions of honesty? To answer these questions, we performed a computational analysis of an exhaustive dataset of tweets posted by US politicians, detecting links to misinformation sources and analysing text from tweets and news sources.

Identifying conceptions of honesty in political speech

We first sought to identify the two components of truth and honesty—belief speaking and fact speaking—in public-facing political speech by elected US officials. For our analyses, we collected a corpus of tweets from members of the US Congress between 1 January 2011 and 31 December 2022. After removing retweets and duplicates, our corpus contained a total of 4,527,814 tweets (see Methods for details). Twitter accounts were categorized by party affiliation.

To measure the conceptions of honesty in text, we created two dictionaries of words associated with each of the concepts. We followed a computational grounded theory approach³⁰ to incorporate both expert knowledge and computational pattern recognition. We started with a list of seed words for each conception, followed by computational expansion and iterative pruning and refinement through human input (see Methods for details).

We validated the dictionaries in three steps. First, to validate the candidate keywords (selected by the authors), we created a survey on Prolific and asked participants (N = 51) to rate each keyword’s representativeness of the two honesty components on two separate Likert scales. We then ran paired t-tests between each word’s representativeness ratings for belief speaking and fact speaking, respectively. Keywords that were rated as significantly more representative for belief speaking (fact speaking) were included in the belief-speaking (fact speaking) dictionaries. The final dictionaries include a total of 37 keywords for each component and are provided in Extended Data Table 1 (see Methods and Supplementary Notes 1 and 2 for details). Following the distributed dictionary representation approach³¹, we converted the keywords into vector embeddings using a pre-trained algorithm (GloVe). These representations capture nuanced contextual information and are amenable to a vector-similarity approach to establish overlap between each dictionary and the text or document of interest (see Methods for details).

In the second validation step, we applied the dictionaries to our tweet corpus and calculated the semantic similarity D_b and D_f between the article and the belief-speaking and fact-speaking dictionaries, respectively (see Methods for details). A positive semantic similarity means that a piece of text is more similar to the words contained in a dictionary, whereas a negative similarity means that it is more dissimilar. We then sampled tweets that had a high belief-speaking or fact-speaking similarity or were dissimilar to both honesty components. We again created a survey on Prolific with the same set-up as described for the keyword validation. Using tweets that a majority of human raters agreed were representative of belief speaking or fact speaking as ground truth, we found satisfactory agreement between the computed belief-speaking and fact-speaking similarity scores and human ratings, with AUC = 0.824 for belief speaking and AUC = 0.772 for fact speaking (see Methods and Supplementary Note 3 for details).

In the third validation step, we applied the dictionaries to historical articles from the New York Times (NYT) for three text categories: ‘opinion’, ‘politics’ and ‘science’ (see Methods for details). We found that articles in the science category are more similar to fact speaking than all articles on average (${ < {D}_{{{{\rm{f}}}}} > }_{{{{\rm{sci}}}}}- < {D}_{{{{\rm{f}}}}} > =0.033$), followed by articles in the opinion (${ < {D}_{{{{\rm{f}}}}} > }_{{{{\rm{op}}}}}- < {D}_{{{{\rm{f}}}}} > =0.006$) and politics (${ < {D}_{{{{\rm{f}}}}} > }_{{{{\rm{pol}}}}}- < {D}_{{{{\rm{f}}}}} > =-0.006$) categories. Articles in the opinion category show the highest similarity to the belief-speaking dictionary (${ < {D}_{{{{\rm{b}}}}} > }_{{{{\rm{op}}}}}- < {D}_{{{{\rm{b}}}}} > =0.013$), followed by articles in the science (${ < {D}_{{{{\rm{b}}}}} > }_{{{{\rm{sci}}}}}- < {D}_{{{{\rm{b}}}}} > =0.009$) and politics (${ < {D}_{{{{\rm{b}}}}} > }_{{{{\rm{pol}}}}}- < {D}_{{{{\rm{b}}}}} > =-0.007$) categories. The analysis of NYT content confirmed our expectation of articles in the science category being most similar to fact speaking, whereas articles in the opinion category being most similar to belief speaking. It did not confirm our expectation of politics being more similar to fact speaking than opinion articles and more similar to belief speaking than science articles.

Finally, to establish the uniqueness of our dictionaries and to differentiate the honesty conceptions from existing similar measures, we investigated the relationship between our two components to text features such as authenticity³², analytic language³³ and a moral component reflecting judgemental language³⁴, each measured using Linguistic Inquiry and Word Count 2022 (LIWC-22) (ref. ³⁵), and positive and negative sentiment measured using VADER³⁶. We calculated scores for each of these components for every tweet in the corpus. Both belief speaking and fact speaking are negatively correlated with ‘analytic’, although the correlation with belief speaking (r = −0.27) is about twice as high as with fact speaking (r = −0.16). Both honesty components are positively correlated with ‘authentic’, ‘moral’ and negative sentiment, whereas the correlation with positive sentiment is positive for belief speaking (r = 0.06) and sightly negative for fact speaking (r = −0.01). All correlations are highly significant (P < 0.001) but small—the correlation with the largest magnitude (r = −0.27) is observed between belief-speaking similarity and ‘analytic’. Details of the comparison with LIWC and VADER scores are summarized in Supplementary Note 4. In summary, these analyses show that belief speaking and fact speaking do not overlap greatly with existing related measures of text features.

Partisan and temporal dynamics of conceptions of honesty

After validating our dictionaries, we produced textual scatterplots³⁷ (see Methods for details) to illustrate individual terms that are characteristic of the two honesty components.

Figure 1 shows diagnostic words in a two-dimensional plot, with the x and y axes representing party and honesty conception, respectively. Each dot is a unigram from the Twitter corpus, and its colour is associated with party keyness (a word with positive party keyness occurs more often for texts from members of a given party than expected by chance). The closer to a corner a word is, the more it characterizes that particular conception of honesty and party dimension. See Methods for details on how words in the figure are represented. We see that Republican belief-speaking keywords, situated in the top-left corner, often refer to political opponents or ideologies (‘Biden’, ‘democrats’ and ‘conservatives’) or conservative values (‘freedom’ and ‘liberty’). In contrast, fact-speaking keywords by the same party are linked to economic (‘energy’, ‘taxpayer’ and ‘trade’) or foreign-policy aspects (‘China’ and ‘Chinese’) and the military. On the right-hand side of the figure, we find that Democrat belief-speaking tweets also regard politicians and political ideology (‘Trump’, ‘Democrats’ and ‘Republicans’) and social justice (‘colour’, ‘discrimination’ and ‘justice’), whereas fact-speaking texts particularly concern the climate crisis (‘climate’) and social welfare and healthcare (‘worker’, ‘care’ and ‘pre-existing condition’).

**Fig. 1: The figure depicts the distribution of keywords on a textual scatterplot.**

Supplementary Note 6 explores the topics of politicians’ communications further. Analysis of some controversial topics showed that these topics invoked more belief speaking or fact speaking than the average tweet, with only a few exceptions. For example, vaccine-related discourse involved far less belief speaking than other controversially discussed topics such as climate change or the opioid crisis for both parties.

We next examined the temporal trends of the two honesty components. For the following analyses, we use the centred and length-corrected belief-speaking and fact-speaking similarity scores ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ (see Methods for details). To arrive at a finer-grained picture of the variability of these components between individual politicians, we calculated the average belief-speaking similarity ${ < {D}_{{{{\rm{b}}}}}^{{\prime} } > }_{{{{\rm{acc}}}}}$ and fact-speaking similarity ${ < {D}_{{{{\rm{f}}}}}^{{\prime} } > }_{{{{\rm{acc}}}}}$ of tweets for each individual politician. Note that ${ < > }_{{{{\rm{acc}}}}}$ denotes an average over all posts from a given Twitter account. Figure 2a–d shows how the distribution of ${ < {D}_{{{{\rm{b}}}}}^{{\prime} } > }_{{{{\rm{acc}}}}}$ and ${ < {D}_{{{{\rm{f}}}}}^{{\prime} } > }_{{{{\rm{acc}}}}}$ shifted between the first (2011–2013, 331 Democrats and 514 Republicans) and last (2019–2022, 295 Democrats and 494 Republicans) 4 years of tweets contained in the corpus.

**Fig. 2: Belief-speaking and fact-speaking similarity in tweets by members of the US Congress for the period 2011–2022 shown separately for members of each party.**

For both parties, the mean belief-speaking similarity ${ < {D}_{{{{\rm{b}}}}}^{{\prime} } > }_{{{{\rm{party}}}}}$ increased considerably from −0.031 to 0.017 for Democrats (unpaired t-test t(558) = −11.317, P < 0.001, Cohen’s d = 0.850, 95% confidence interval (CI) of difference in means [−0.06, −0.04]) and from −0.040 to 0.012 for Republicans (t(493) = −10.819, P < 0.001, d = 0.854, 95% CI = [−0.06, −0.04]). Similarly, we see an increase in the similarity to truth-seeking ${ < {D}_{{{{\rm{t}}}}}^{{\prime} } > }_{{{{\rm{party}}}}}$ from −0.027 to 0.009 for Democrats (t(516) = −9.753, P < 0.001, d = 0.748, 95% CI = [−0.04, −0.03]) and from −0.038 to −0.003 for Republicans (t(483) = −8.442, P < 0.001, d = 0.671, 95% CI = [−0.04, −0.03]). This overall increase in both belief-speaking and truth-seeking similarity also becomes apparent in Fig. 2e,f, and is especially pronounced after the presidential election in late 2016.

This parallel increase for both belief speaking and fact speaking could reflect the fact that, in recent years, topics concerning fake news have become increasingly central to political discourse³⁸, resulting in opposing claims and counterclaims (for example, Donald Trump routinely accused mainstream media such as the NYT of spreading ‘fake news’²⁹). Whereas those claims mainly represented belief speaking, they were accompanied by increasing attempts by the media, and other actors, to correct misinformation through fact-speaking discourse.

Relating honesty components to information trustworthiness

To test our hypothesis that belief speaking is preferentially associated with dissemination of misinformation, we analysed the association between belief speaking and fact speaking to the quality of the information that is being relayed. To assess information quality, we examined links to websites external to Twitter that were shared by the accounts. We followed an approach used in similar research in this domain^39,40 and used a trustworthiness assessment by professional fact checkers of the domain to which a link points. We used the NewsGuard information nutrition database⁴¹ and an independently compiled database of domain trustworthiness labels⁴² (see Methods and Supplementary Notes 7 and 8 for details).

As of the beginning of March 2022, the NewsGuard database indexed 6,860 English language domains. Each domain is scored on a total of 9 criteria, ranging from ‘does not label advertising’ to ‘repeatedly publishes false information’. Each category awards a varying number of points for a total of 100. Domains with less than 60 points are considered ‘not trustworthy’. The majority of indexed domains (63%) are considered trustworthy. After excluding links to other social media platforms (for example, Twitter, Facebook, YouTube.com and Instagram) and links to search services (Google and Yahoo), the database covered between 20% and 60% of the links posted by members of the US Congress, with a steadily increasing share of links covered over time and no difference in coverage between the parties (Extended Data Fig. 1).

For each tweet, we calculated the belief-speaking and fact-speaking similarity ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$. Figure 3a,b shows the NewsGuard score, ${S}_{{{{\rm{NG}}}}}^{{\prime} }$, rescaled to [0; 1] over the belief-speaking and fact-speaking similarity, respectively, for each tweet posted by a member of Congress. Note that in Fig. 3 the y axis appears truncated to improve legibility. The full data are shown in Extended Data Fig. 2.

**Fig. 3: Relation of information quality with belief speaking and fact speaking.**

To investigate the relationship between ${D}_{{{{\rm{b}}}}}^{{\prime} },{D}_{{{{\rm{f}}}}}^{{\prime} }$ and ${S}_{{{{\rm{NG}}}}}^{{\prime} }$, we fitted a linear mixed-effects model with random slopes and intercepts for every Congress member following equation (1). The lines shown in Fig. 3a,b show ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ predicted by the model depending on ${D}_{{{{\rm{b}}}}}^{{\prime} },{D}_{{{{\rm{f}}}}}^{{\prime} }$, respectively, party P and their interaction terms (see Methods for details).

The analysis conducted with P = Democrat as baseline yielded a significant fixed effect for ${D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(504, 809) = 3.6, P < 0.001, coefficient 0.022, 95% CI = [0.010, 0.033]), P = Republican (t(504, 809) = −29.9, P < 0.001, coefficient −0.069, 95% CI = [−0.074, −0.065]), the interaction between Republican and ${D}_{{{{\rm{b}}}}}^{{\prime} }$ (t(504, 809) = −14.4, P < 0.001, coefficient −0.128, 95% CI = [−0.146, −0.111]), the interaction between Republican and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(504, 809) = 9.6, P < 0.001, coefficient 0.085, 95% CI = [0.068, 0.103]), and the three-way interaction between ${D}_{{{{\rm{b}}}}}^{{\prime} },{D}_{{{{\rm{f}}}}}^{{\prime} }$ and Republican (t(504, 809) = −5.6, P < 0.001, coefficient −0.085, 95% CI = [−0.115, −0.056]). See Extended Data Table 2 for the full regression statistics and Extended Data Fig. 3 for a visualization of the fixed effect of the three-way interaction.

Therefore, an increase in ${D}_{{{{\rm{b}}}}}^{{\prime} }$ of 10% predicted a decrease in NewsGuard score of 12.8, but only for members of the Republican party. An increase in ${D}_{{{{\rm{f}}}}}^{{\prime} }$ of 10% predicted an increase in NewsGuard score of 2.1 for Democrats and of 10.6 for Republicans. For Democrats, we find no significant relationship between S_NG and belief-speaking similarity. Predictions of the NewsGuard score depending on belief-speaking and fact-speaking similarity based on the two-way interactions between honesty components and party are shown as lines in Fig. 3a and Fig. 3b, respectively

In Supplementary Note 9, we explore this pattern further by considering NewsGuard scores and honesty components broken down by state and party. We find that the quality of information being shared by Republicans tends to be lower in southern states (for example, AL, TN, TX, OK and KY) than in the north (for example, NH, AK and ME), although there are also striking exceptions (for example, NY). For Democrats, no clearly discernible pattern across states emerges. We also find that the voting patterns during the 2020 presidential election in their home state did not affect the quality of news being shared by members of Congress.

To exclude a dependence of these results on use of the NewsGuard database, we validated this analysis with an independently collected list of news-outlet reliability from academic and fact-checking sources. The results are reported in Supplementary Note 7 and are consistent with the results reported here. In addition, using the different outlet-reliability database, we also find a significant effect of belief-speaking similarity on the quality of shared information for Democrats that goes in the same direction as the effect for Republicans.

We wanted to know whether the content of belief-speaking and fact-speaking words in the texts found in the websites linked to be the tweets was also indicative of low information quality. To this end, we attempted to scrape the text of all linked websites (Methods). We successfully collected text from about 65% of links. We excluded texts with less than 100 words and only retained one copy of the text in the case when multiple tweets contained links to the same website. In addition, we excluded all articles collected from links that were posted by members of both parties (2,462 texts; 0.91% of articles), such that each link had a unique party designation. This resulted in a total of 261,765 unique news texts.

We investigated the dependence of the NewsGuard score associated with the domain from which the text was scraped on the belief-speaking similarity and the fact-speaking similarity of the article text (rather than in the original tweet). We fitted a linear-regression model to predict the rescaled NewsGuard score ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ depending on party, the belief-speaking and fact-speaking similarities ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ and the two-way interaction terms (see equation (2) and Methods for details).

We show both the data for individual links and the model predictions for ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ in Fig. 3c and Fig. 3d, respectively. Again, we found a significant inverse relationship between P = Republican and ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ (t(261, 765) = −184.8, P < 0.001, coefficient −0.099, 95% CI = [−0.100, −0.098]) and the interaction term between Republican and ${D}_{{{{\rm{b}}}}}^{{\prime} }$ (t(261, 765) = −33.8, P < 0.001, coefficient −0.540, 95% CI = [−0.571, −0.509]). We also confirmed the positive relationship between ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(261, 765) = 2.3, P = 0.003, coefficient 0.026, 95% CI = [0.004, 0.048]), and the interaction term between Republican and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(261, 765) = 6.0, P < 0.001, coefficient 0.110, 95% CI = [0.074, 0.147]), and the three-way interaction term ${{{\rm{party}}}}\times {D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(261, 765) = −14.8, P < 0.001, coefficient −0.594, 95% CI = [−0.673, −0.516]).

In contrast to the analysis using tweet texts, we also find a significant negative relationship for ${D}_{{{{\rm{b}}}}}^{{\prime} }$ for Democrats (t(261, 765) = −6.6, P < 0.001, coefficient −0.065, 95% CI = [−0.084, −0.046]) and a significant interaction term ${D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }$ (t(261, 765) = 2.8, P = 0.006, coefficient 0.067, 95% CI = [0.019, 0.113]). See Extended Data Table 3 for the full regression statistics. Our analysis of article texts therefore reproduces the main results from our analysis of tweet texts.

Discussion

We curated two dictionaries that captured the distinction between an evidence-based conception of honesty (fact speaking) and a conception based on intuition, subjective impressions and feelings (belief speaking). We confirmed the validity and diagnosticity of the dictionaries by soliciting ratings from human participants both for individual keywords and for documents, and by showing that belief speaking prevailed in opinion pieces in the NYT but not in their science section, whereas the reverse occurred for fact speaking.

Applying those dictionaries to public political discourse by members of the US Congress, represented by their tweets, we find a bipartisan increase of the use of both fact-speaking and belief-speaking language over time, in particular from late 2016 onward. The use of fact-speaking and belief-speaking language is particularly intense for controversial topics, and this is also a bipartisan phenomenon.

The parties differ considerably, however, when the quality of information being shared is considered. Overall Republicans tend to share information of lower quality than Democrats (see also ref. ⁴²), and this difference is associated to belief speaking: the more Republicans engage in belief speaking, the more likely they are to share low-quality information. There is no evidence (or little evidence; Supplementary Note 7) for this relationship for Democrats.

Our results have several theoretical and practical implications that deserve to be explored. First, our data cast a new light on several recent analyses of the US public’s information diet that have shown that conservatives are more likely to encounter and share untrustworthy information than their counterparts on the political left^39,42,43,44. Several reasons have been put forward for this apparent asymmetry, for example, that partisans are motivated to share derogatory content towards the political outgroup⁴⁵. As greater negativity towards Democrats is mostly found in lower-quality outlets, conservatives may disproportionately share untrustworthy information because it is satisfying a need for outgroup derogation⁴⁶.

Our analysis offers another potential explanation, namely that the public may be sensitive to cues provided by the political elites that, as we have shown here, also differ considerably in the accuracy of content that they share on social media. Specifically, Republican politicians frequently, although not always, share low-quality information and are thus providing a cue to their partisan followers of the legitimacy of those outlets. Evidence for the sensitivity of the public to leadership cues has been observed in the climate-change arena, where the growing polarization of the public along party lines mainly resulted from the Republican leadership gradually assuming a more hostile stance towards the science of climate change⁴⁷.

Furthermore, our analysis provides evidence that belief speaking could be a ‘gateway’ rhetorical technique for the sharing of low-quality information. The more Republican politicians appeal to beliefs and intuitions, rather than evidence, the more likely they are to share low-quality information. For Democrats, this association was absent in the main analysis using NewsGuard scores, and it was attenuated if an independent source of domain quality was used (Supplementary Note 7). This pattern gives rise to the question why, if belief speaking gives licence to the sharing of misinformation, is it only Republicans (or mainly Republicans) who avail themselves of that option?

A possible answer can be found in the finding that belief speaking is associated with greater negative emotion (Supplementary Note 4). Therefore, belief speaking may result from Republican politicians’ desire to derogate Democrats, as suggested in ref. ⁴⁶. With this view, negative emotional content should be a mediator of the association between belief speaking and low quality of shared content. In contrast, if belief speaking were instrumental in the sharing of low-quality content for other reasons, then it should mediate the association involving negative emotionality. We report two competing mediation models in Supplementary Note 10. Although the models cannot definitively adjudicate between the two possibilities, the analyses suggest the former hypothesis is in a better position to explain the mediating effect on the spread of low-quality news among Republicans. Within this framework, and concordant with ref. ⁴⁶, negative emotion associated with derogation of the opponent is the driving force behind the association between belief speaking and the spread of low-quality content among Republicans. Further indirect support for this possibility is provided by the fact that Republican members of Congress do not exclusively share misinformation. When they engage in fact speaking, Republicans’ accuracy of shared information rises to nearly the same level as that of Democrats.

Finally, we return to the argument advanced at the outset, namely that belief speaking can be a marker of authenticity, which allows partisan followers to consider a politician to be honest despite them promulgating low-quality or false information. We cannot directly test this argument based on the present data because we have no way of ascertaining the perceived honesty of the politicians in our sample. However, we do have state-level electoral data from the 2020 presidential election, which show that belief speaking and the associated sharing of low-quality information is not associated with an electoral penalty (Supplementary Note 9). There is no association between the accuracy of Republicans’ shared information and the vote share for Trump, suggesting that voters were not deterred by belief-speaking-based dissemination of misinformation.

A limitation of our study is that it does not provide causal evidence because the reported observations are purely correlational. In addition, our analysis was limited to communications by the ‘political class’ in the United States and, although the United States is one of the world’s largest democracies, the trends shown here should not be considered in isolation but deserve to be contrasted with observations in other countries and cultures. A recent comparison of the overall accuracy of information shared by members of the US Congress found that their accuracy was lower—even among Democrats—than the information shared by parliamentarians from mainstream parties in the United Kingdom and Germany⁴². Although there were also differences between parties in those two countries, they were small in magnitude and European conservatives were more accurate than US Republicans, underscoring that conservatism is not, per se, necessarily associated with reliance on low-quality information. Another international comparison of populist leaders (Trump in the United States, Modi in India, Farage in the United Kingdom and Wilder in the Netherlands) found some commonalities among those politicians, such as the use of insults against political opponents, but also identified Trump as an outlier in the use of critical language⁴⁸. Therefore, further examinations of belief speaking and fact speaking outside the US context are urgently needed to explore the generality of our findings and to redress the existing global imbalance in research activity⁷.

Future research is also needed to examine the temporal stability of the patterns we observed here. Although our analysis extended to the end of 2022, thus covering 2 months of Twitter activity after it was taken over by Elon Musk, there is no guarantee that the platform will remain stable in the future. Likewise, in the same way that sharing of misinformation mushroomed after 2016 (ref. ⁴²), the long-term trend towards populism may reverse and the sharing of misinformation may become less frequent in the future. Therefore, our analysis is best understood as a historical and contemporary picture of political discourse rather than as a pointer to the future.

Finally, future research should also address the particular role played by social media in our analysis. We de-emphasized this angle because when our analysis was extended to mainstream news articles shared by the members of Congress, we found very similar results compared with the tweets. However, there may be other situations in which social media play a uniquely different role from conventional mainstream media, and those situations remain to be identified and examined.

Methods

US Congress member tweet corpus

A corpus of contemporary political communication in English was created by scraping tweets by members of both houses of the US Congress on 10 February 2023. To build the corpus, lists of Twitter handles of members of Congress were collected for the 114th (from www.socialseer.com), 115th (from www.socialseer.com), 116th (from ref. ⁴⁹), and 117th and 118th (from https://triagecancer.org/congressional-social-media) Congresses. For the 114th and 115th Congresses, only the Twitter handles of senators were available. For the 116th, 117th and 118th Congresses, Twitter handles were available for both houses of Congress. This resulted in a total of 1,278 unique Twitter handles, which included Congress member staff and Congress member campaign accounts. If a politician had multiple accounts, all were included in the dataset. No sampling was involved in collecting the data and the collected dataset is exhaustive.

For each of the Twitter handles, metadata were collected on 10 February 2023 via the Twitter application programming interface (API) v.2 using the Python package twarc (v.2.13.0) (ref. ⁵⁰). Metadata included the account’s handle, username, creation date, location, user description, number of followers, number of accounts followed and tweet count. Of the 1,278 accounts, 220 were not accessible because they had been deleted, suspended or set to ‘private’.

To build the text corpus, all tweets posted by the collected Twitter accounts starting from 6 November 2010 and up to 31 December 2022 were collected, using academic access to the Twitter API. Note that by following this approach, we include all tweets posted by a given account in the given time span, not only tweets that were posted while a politician was in office. Earlier tweets all the way back to 2006 could be retrieved, but we chose 2010 as the earliest date due to changes in the design of retweeting in the Twitter platform at that time. The retweet button was introduced in November 2009 (previously, retweeting was done manually), and it took approximately a year for users to start using it consistently. Furthermore, the prominence of Twitter in US politics emerged later, especially since 2012. The resulting corpus consisted of a total of 5,914,107 tweets, of which 3,463,409 were original tweets, 531,289 were quote tweets, 575,044 were replies and 1,351,346 were retweets. Note that quoting, replying and retweeting are not exclusive categories. We removed retweets from the corpus because they do not constitute original content. The number of tweets consistently increased from around 100,000 in 2011 to over 600,000 in 2020, and then declined to around 500,000 in 2022. We removed exact matches (that is, duplicates) and included only tweets with more than ten words. The final corpus contained 3,897,032 tweets. Next to the tweet text, the corpus contained the tweet creation date and a unique identifier of the account that posted the tweet. The identifier permitted linkage to the metadata collected about the user accounts, such as party affiliation.

We find a large variance in the number of tweets posted by individual accounts, ranging from only 1 tweet in the observed time period to 52,055 tweets, with a median number of 2,876 tweets per account. To exclude a dependence of our results on highly prolific accounts, we also conducted the main analysis reported in Fig. 3 and Extended Data Table 2 using only the latest 3,200 tweets per account. Results from this analysis are highly consistent with the analysis using all available tweets (see Supplementary Note 11 for details). In addition, we show which accounts contribute most to the overall increase of belief speaking and fact speaking (Supplementary Note 12).

In addition to the perspective from individual tweets taken in the analysis presented in ‘Partisan and temporal dynamics of conceptions of honesty’, we also considered the perspective of individual links taken in the analysis presented in ‘Relating honesty components to information trustworthiness’. For this analysis, we only considered tweets that contained at least one link (2,700,539 tweets). As a single tweet can contain more than one link, we expanded the dataset such that each entry referred to a single link, transferring the tweet-level honesty-component labels to the individual links. This resulted in a total of 2,844,901 links. From each link, we extracted the domain to which the link pointed. If the link was shortened using a link-shortening service, such as bit.ly, we followed the link to retrieve the full domain name. The domains were then matched against the NewsGuard domain trustworthiness database and the independently compiled list of trustworthiness labels (described in ‘NewsGuard nutrition labels’ and Supplementary Note 7).

Honesty-component keywords and validation

We relied on keywords to identify the relevant subsets of tweets that involved the presumed distinct conceptions of honesty. Initially, two lists of keywords, one for each honesty component, were generated by the researchers involved in this article. The aim was to capture linguistic cues whose presence might signal that one of the components has been enacted by the speaker. To illustrate, initial keywords for fact speaking included terms such as ‘reality’, ‘assess’ ‘examine’, ‘evidence’, ‘fact’, ‘truth’, ‘proof’ and so on. For belief speaking, initial keywords were terms such as ‘believe’, ‘opinion’, ‘consider’, ‘feel’, ‘intuition’ or ‘common sense’.

The lists were expanded computationally using a combination of the fastText library⁵¹ and colexification networks^52,53. Using the fastText embeddings, we expanded the seed words to include words that have a cosine similarity score above 0.75. Colexification networks connect words in a language based on their common translations to other languages, thus signalling words that can be used to express multiple concepts. For example, the words ‘air’ and ‘breath’ are considered to be colexifications because they both translate into the same word in multiple languages (‘sukdun’ in Manchu, ‘vu:jnas’ in Kildin Sami and ‘jind’in Nenets⁵⁴). Colexification networks have been used recently to study emotion structures in language⁵⁵ and are predictors of word-meaning ratings⁵². Including colexification networks in lexicon expansion gives word lists with a better trade off between precision and recall⁵³ than previous approaches using WordNet or word embeddings, such as empath. We subsequently filtered the expanded lists to remove duplicates, overlapping terms appearing in more than one list and lemma inflections (that is, ‘convey’, ‘conveys’ and ‘conveyed’). The keywords were then used to identify texts relevant to the presumed conceptions of honesty.

To validate the keyword lists, we asked participants in an online survey to score each term on two scales reflecting the honesty components. Data were acquired on 20 September 2022 from 50 individuals (15 men, 34 women and 1 unlisted; mean age 39.5 years, s.d. 15.8 years) using the Prolific survey platform⁵⁶. Participants were asked to score each term on two distinct Likert scales ranging from one to five, which indicated low and high representativeness, respectively, of the word for that honesty component. The instructions provided to participants can be found in Supplementary Note 1. The distributions of ratings collected for each keyword are shown in Supplementary Figs. 1 and 2.

We next performed paired t-tests to see how participants sorted the terms into the two conceptions. The results of the t-tests are shown in Supplementary Table 1. Of 98 keywords, 61 were judged to belong in the category to which we previously assigned them, 24 did not reach the significance threshold (P < 0.05) and were therefore removed and 13 were classified by participants as belonging to the opposite category. We followed the raters’ indications and moved the keywords that were classified as belonging to the opposite category from their original dictionary to the other dictionary. The final list of keywords for both dictionaries is given in Extended Data Table 1.

Identification of honesty components in text

As a first preparatory step, we removed URLs and replaced user handles on Twitter with the word ‘user’. We then split the tweet texts into individual tokens (words). We then created embeddings of each word contained in the honesty-component dictionaries (Extended Data Table 1) with GloVe⁵⁷ trained on 840B tokens from the Common Crawl corpus, following the distributed dictionary representation approach³¹. We note that the word ‘seem’ from the belief-speaking dictionary is included in the list of stopwords of GloVe. Therefore, we removed seem from the stopword list to include it into the dictionary embedding that was calculated using GloVe.

We then averaged the single-word embeddings within every honesty component to create an embedded representation of the entire dictionary. Similarly, we embedded every token contained in a given tweet and calculated an average of all token embeddings to create an embedded representation of the tweet. For every tweet and both components, we then calculated the cosine similarity between the embedded tweet representation and the embedded dictionary representations to arrive at a belief-speaking similarity score D_b and a fact-speaking similarity score D_f for the given tweet. Similarity scores range from −1 (not similar at all) to 1 (perfectly similar).

We find that similarity scores correlate with the length of tweets (number of characters), with Pearson’s r = 0.37 (P < 0.001) for belief speaking and r = 0.42 (P < 0.001) for fact speaking. In addition, the length of tweets systematically increases over the years, particularly after the increase in the tweet character limit from 140 characters to 280 characters in 2017. To remove the trend in similarity scores due to increasing tweet length, we fit two linear models D_b ~ tweet length and D_f ~ tweet length. We then used these linear models to predict D_b and D_f for every tweet based on its length and subtracted this prediction from the measured belief-speaking and fact-speaking similarity, resulting in the centred and length-corrected similarity scores ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$, which we report throughout this article.

To measure belief-speaking and fact-speaking similarity in the text of the articles collected from links posted by Congress members on Twitter (see ‘News article collection’), we followed the same approach as described for the text of the tweets above but measure the length of an article as the number of words it contains instead of the number of characters.

To test the robustness of our results to perturbations of the dictionaries, we recalculated belief-speaking and fact-speaking similarities using versions of the dictionaries where 7 words (20%) were removed from the dictionary at random before embedding the words and calculating dictionary representations. We then re-ran the regression of ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ on ${D}_{{{{\rm{b}}}}}^{{\prime} },{D}_{{{{\rm{f}}}}}^{{\prime} }$, party and the interaction terms (equation (1)), where ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ are the belief-speaking and fact-speaking similarities calculated using the representations of the perturbed dictionaries. The distribution of estimates for the fixed effects of the two-way interaction between party and ${D}_{{{{\rm{b}}}}}^{{\prime} }$, and party and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ over 100 perturbations are shown in Extended Data Fig. 4. Whereas the estimates for the effect of ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ on NewsGuard score vary by about 20% between different perturbed dictionary versions, the effects never change direction and always remain significant (P < 0.001) for Republicans, as reported in the main text.

In addition to GloVe⁵⁷ embeddings, we also calculated ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ using word2vec⁵⁸ and fastText⁵¹ embeddings of both the dictionary keywords and the tweets to exclude a dependence of our results on the choice of embedding. We note that, similar to GloVe, the word seem is included in the stopword list of word2vec and was removed from the stopword list before computing the embeddings. The results of fitting the linear mixed-effects model following equation (1) using the alternative embeddings for the dictionaries and tweet texts are shown in Supplementary Note 13. The results are similar to the results obtained using GloVe embeddings (Extended Data Table 2). This shows that our results do not depend on the algorithm or the corpus (common crawl for GloVe and word2vec versus Google news for fastText) that was used to train the embedding.

Lastly, we also investigated which individual keyword probably contributed the most to the overall increases of belief speaking and fact speaking reported in Fig. 2. We report the results in Supplementary Note 14.

Honesty-component document-level validation

To validate our measures of the belief-speaking and fact-speaking honesty components on the document level, we asked human raters to rate individual tweets with respect to their similarity to the two honesty components. To this end, we sampled 20 tweets from the top belief-speaking and bottom fact-speaking quartile, and 20 tweets from the top fact-speaking and bottom belief-speaking quartile. In addition, we sampled 20 tweets that simultaneously belonged to the bottom belief-speaking and fact-speaking quartiles. Each sample of 20 tweets included 10 tweets from Democrats and 10 from Republicans.

We then created a survey on Prolific⁵⁶ and asked participants (N = 51) to rate each tweet’s representativeness of the two honesty components on two separate Likert scales. We followed exactly the same set-up as described in ‘Honesty component keywords and validation’ above, but presented full tweets instead of singular keywords. The instructions provided to participants can be found in Supplementary Note 1. In addition, we included an attention check in the survey, with the aim of excluding all participants that failed the check. To this end, we asked all participants to select ‘5’ for both categories halfway through the survey. Only one person failed the check. The responses of this person were excluded from the survey, resulting in N = 50 total responses (25 men, 24 women and 1 non-binary person; mean age 37.6 years, s.d. 12.88 years). Data were acquired on 10 February 2023. The distributions of ratings collected for each tweet are shown in Supplementary Note 2.

We then wanted to quantify the performance of our computed similarity scores when used as a classifier. To this end, for each honesty component we coded the 20 tweets that were selected from the top belief-speaking (fact speaking) similarity quartile as belief speaking (fact speaking) and the 40 tweets that were selected from the bottom similarity quartile of that component as ‘not belief speaking’ (‘not fact speaking’). We then classified every tweet for which a majority of human raters selected either a four or a five for how characteristic a tweet was for belief speaking (fact speaking) as belief speaking (fact speaking) to create a ground-truth dataset to compare our classifier against. We obtained receiver operating characteristic curves for belief speaking and fact speaking by varying the threshold for belief-speaking (fact speaking) similarity to categorize a tweet as belief speaking (fact speaking) (akin to varying-response criteria in a behavioural study). The receiver operating characteristic curves are shown in Supplementary Note 3. The area under the curve is high in both cases, with AUC = 0.824 for belief speaking and AUC = 0.772 for fact speaking.

NYT corpus

We retrieved data from the NYT through their archive API (https://developer.nytimes.com/docs/archive-product/1/overview). By iterating over the months since the founding of the newspaper in the 19th century, we retrieved information on every article in the archive. The information returned by the API included the article title, an abstract that summarizes the article content, and additional metadata such as publication date and section of the paper. This approach is different to earlier research that used the NYT API to obtain a number of articles over time that contain certain terms, which does not yield any further text or ways to filter the data⁵⁹. As we needed text to identify honesty components in articles, the archive end point was more suitable than the term search function of the NYT API, despite not giving us the full text of all articles but only returning a summary.

We extracted three distinct categories of content from the NYT corpus based on the sections identified in the metadata: (1) an ‘opinion’ category that comprises opinion pieces such as ‘op-eds’; (2) a ‘politics’ category consisting of articles in the sections United States, Washington and world; and (3) a ‘science’ category that includes health, science, education and climate articles. We chose these three clusters because we expected opinion articles to contain more belief speaking, whereas we expected science articles to contain more fact speaking. We expected articles in the politics cluster to fall in between. We retrieved a total of 809,271 articles consisting of 240,567 opinion articles, 518,123 politics articles and 50,581 science articles.

Word and topic keyness analysis

The scatterplot in the top panel of Fig. 1 was produced using the approach described in Scattertext³⁷, a Python package designed to illustrate words and phrases that are more characteristic of a category, such as political party, than others. To derive how characteristic a word is of a category, we start from raw word frequencies: for each word w_i ∈ W and category c_j ∈ C, we define the precision of the word w_i with respect to the category as

$${{{\rm{prec}}}}(i,j)=\frac{\#({w}_{i},{c}_{j})}{{\sum }_{c\in C}\#({w}_{i},c)}.$$

Here the function #(w_i, c_j) represents the number of times w_i occurs in a document labelled with the category c_j. Therefore, prec(i, j) represents the discriminative power of a given word across categories regardless of its frequency in the given category.

Similarly, we define the frequency a word occurs in a category c_j as

$${{{\rm{freq}}}}(i,j)=\frac{\#({w}_{i},{c}_{j})}{{\sum }_{w\in W}\#(w,{c}_{j})}.$$

To combine prec(i, j) and freq(i, j) into a single score, we scale and standardize both values using a normal cumulative density function Φ(z) and then calculate the harmonic mean (H) between the two contributions (see ref. ³⁷ for details). This yields the scaled F-score (SFS) for every word w_i and category c_j that is defined as

$${{{\rm{SFS}}}}(i,j)={{{\mathcal{H}}}}\left({{\varPhi }}({{{\rm{prec}}}}(i,j)),{{\varPhi }}({{{\rm{freq}}}}(i,j))\right).$$

For our application case, we want to show how representative a word is not only for a single category (such as ‘Republican’) but rather on a spectrum of representativeness that ranges from ‘more Democratic’ to ‘more Republican’. To this end, we need to map the two distinct scores SFS^D for the category Democratic and SFS^R for the category Republican to a single score that ranges from −1 to +1. For two arbitrary categories x and y we therefore define

$${{{\rm{SFS}}}}=2\times \left(-0.5+\left\{\begin{array}{ll}{{{{\rm{SFS}}}}}^{x}\quad &{{{\rm{if}}}}\,{{{{\rm{SFS}}}}}^{x} > {{{{\rm{SFS}}}}}^{y},\\ 1-{{{{\rm{SFS}}}}}^{y}\quad &{{{\rm{if}}}}\,{{{{\rm{SFS}}}}}^{x} < {{{{\rm{SFS}}}}}^{y},\\ 0\quad &{{{\rm{otherwise}}}}\end{array}\right.\right).$$

This maps two SFSs (one for category x and one for category y) that are both defined in the range [0, 1] to a single score in the range [−1, 1]. To this end, SFS^y is mapped to [−1, 0], the SFS with the larger magnitude is selected and is then rescaled to the new range. In our application case, this then yields a single SFS^party that is −1 for more Republican tweets and +1 for more Democratic tweets.

To calculate representativeness along the belief-speaking–fact-speaking dimension, we follow a similar approach. Before we can calculate the SFS for belief speaking and fact speaking, we first need to transform the continuous honesty similarity scores ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ into a binary honesty-component label for each tweet. To this end, we divided the tweets into quantiles according to their belief-speaking (fact speaking) similarity. We then categorized the tweets with a belief-speaking (fact speaking) similarity in the top 20% as belief speaking (fact speaking). If a tweet was part of the upper quantile for both components, then the higher of the two similarity values was used to assign a category to the tweet. We then followed the approach described above to calculate a single SFS^honesty from SFS^b (for belief speaking) and SFS^f (for fact speaking).

As a result, each word had two SFS scores: SFS^party and SFS^honesty. These two scores were used as x and y coordinates for the scatterplot shown in the top panel of Fig. 1. The X-shaped structure of the words in the scatterplot indicates that words that are characteristic for one dimension (for example, party) are probably also characteristic for the other dimension (for example, honesty component). Words that are not characteristic of any category (such as stopwords) cluster in the middle.

NewsGuard nutrition labels

Following precedent^39,40, we used source trustworthiness as an estimator for the trustworthiness of an individual piece of shared information. We used nutrition labels provided by NewsGuard, a company that offers professional fact checking as a service and curates a large database of domains. The trustworthiness of a domain is assessed in nine categories, each of which awards a number of points: does not repeatedly publish false content (up to 22 points); gathers and presents information responsibly (18); regularly corrects or clarifies errors (12.5); handles the difference between news and opinion responsibly (12.5); avoids deceptive headlines (10); website discloses ownership and financing (7.5); clearly labels advertising (7.5); reveals who is in charge, including any possible conflicts of interest (5); and the site provides names of content creators, along with either contact or biographical information (5).

NewsGuard categorizes domains with a score of 60 or higher as ‘generally adheres to basic standards of credibility and transparency’⁴¹. Similar to ref. ⁶⁰, we used this value as a threshold below which we categorized a domain and the link pointing to it as not trustworthy.

After excluding links to other social media platforms (for example, Twitter, Facebook, YouTube and Instagram) and links to search services (Google and Yahoo), the NewsGuard database covers between 20% and 60% of the links posted by members of the US Congress, with a steadily increasing share of links covered over time (Extended Data Figure 1a).

Regression

We performed a range of regression analyses to quantify the relationship between various manifestations of honesty components and information quality. For the predictions shown in Fig. 3a,b, we fitted the following linear mixed-effects model for tweets from members of the US Congress:

$${S}_{{{{\rm{NG}}}}}^{{\prime} } \sim 1+{D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }+{D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }\times P+\Big(1+{D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }| {{{\rm{userID}}}}\Big)$$

(1)

Here, ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ is the NewsGuard nutrition score of a domain to which a Congress member linked in a post on Twitter, rescaled to [0; 1]. ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ are the centred and length-corrected belief-speaking and fact-speaking similarity of the text in the tweet with the link, respectively (see section ‘Identification of honesty components in text’ above). P is the party designation of the account that posted the tweet, which can be Republican or Democrat. We include random slopes and intercepts for every account (userID). We fitted the model using the lmer function from the R library lme4 (ref. ⁶¹). Regression results are reported in Extended Data Table 2. Data distribution was assumed to be normal but this was not formally tested.

For the predictions shown in Fig. 3c,d, we fitted the following model for articles that were linked to by the US Congress members:

$${S}_{{{{\rm{NG}}}}}^{{\prime} } \sim 1+{D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }+{D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }\times P.$$

(2)

Here, ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ are the centred and length-corrected belief-speaking and fact-speaking similarity scores of the article text retrieved from the link. We fitted the model using an ordinary least-squares fitting approach from the Python package statsmodels⁶². Regression results are reported in Extended Data Table 3. Data distribution was assumed to be normal but this was not formally tested. Note that we did not fit a linear mixed-effects model for the statistical analysis of the articles because there is no clear nesting of articles within individual Twitter accounts as a single article can be linked to from multiple accounts.

News article collection

Excluding links to other social media platforms (for example, Twitter, Facebook, YouTube and Instagram) and links to search services (Google and Yahoo), our corpus of tweets contained 1,027,050 unique links to news articles that were shared by members of Congress. Of these links, 462,853 pointed to sites that were indexed by the NewsGuard database (see ‘NewsGuard nutrition labels’). We scraped the text of these sites using Newspaper3k (ref. ⁶³), a Python package for scraping and curating news articles. Some links were broken, restricted or could not be scraped by the package. In addition, we removed all articles that contained less than 100 words or were shared only by independent politicians (that is, not Republican or Democrat). This resulted in 65% of total scraping coverage. When broken down by trustworthiness, the coverage for trustworthy links (N = 291,143) was 65%, and 82% for untrustworthy links with a NewsGuard score < 60 (N = 7,776). We retained only one copy of each news article in case it was shared multiple times, and removed from the main analysis articles that were shared by members of more than one political party (that is, a link was shared either by Republicans or Democrats, but not both). This was done to ensure each article had only a single party designation such that our statistical analysis of articles was comparable to our statistical analysis of tweets. This resulted in the removal of 2,462 articles (0.91% of all remaining articles), which were analysed separately. To provide a marker for apparent bipartisan agreement, we plot the mean and s.d. of honesty-component similarity and ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ for the articles shared by both parties (grey ellipses in Extended Data Fig. 2). Removing these articles left us with a corpus of 261,765 article texts.

The distribution of NewsGuard scores and the belief-speaking and fact-speaking similarity in each article is shown in Extended Data Fig. 2c,d.

Inclusion and ethics

This study is based on publicly available archival Twitter data on US Members of Congress and their official staff and campaign accounts. Only public figures are analysed and only content that was not deleted by the time of data retrieval was considered. All US Members of Congress in curated Twitter account lists are included as long as their Twitter accounts were public by the retrieval data. We focused on the two major parties to have sufficient evidence for statistical analysis, and our results cannot be extended to independent members of congress or members of other parties besides the Democratic and the Republican parties.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The lists of Twitter handles of members of congress used to build the tweet corpus are available from www.socialseer.com (114th and 115th Congress), ref. ⁴⁹ (116th Congress) and https://triagecancer.org/congressional-social-media(117th and 118th Congress). The tweet identifiers of the tweet texts and URLs of the articles analysed in this study are deposited in the Open Science Framework (OSF)⁶⁴. Dictionaries of keywords associated with the different conceptions of honesty are deposited in the OSF⁶⁴. The independently compiled list of domain accuracy and transparency scores is deposited on GitHub⁴². The NewsGuard database used to asses domain trustworthiness is commercially available from NewsGuard and cannot be shared publicly. Aggregated values for information trustworthiness and honesty components for tweets and articles used to produce all figures in this article are deposited in the OSF⁶⁴.

Code availability

Python 3.9.1 and R 4.2 were used to collect the data and perform the data analysis presented in this study. Data collection and analysis code is available under MIT License in a GitHub repository⁶⁵.

References

Freedom in the World 2020. A Leaderless Struggle for Democracy (Freedom House, 2020).
Freedom in the World 2021. Democracy under Siege (Freedom House, 2021).
Hertwig, R. & Engel, C. (eds.) Deliberate Ignorance: Choosing Not to Know (MIT Press, 2020).
Loomba, S., de Figueiredo, A., Piatek, S. J., de Graaf, K. & Larson, H. J. Measuring the impact of COVID-19 vaccine misinformation on vaccination intent in the UK and USA. Nat. Hum. Behav. https://doi.org/10.1038/s41562-021-01056-1 (2021).
Cantarella, M., Fraccaroli, N. & Volpe, R. G. Does fake news affect voting behaviour? Preprint at SSRN https://doi.org/10.2139/ssrn.3629666 (2020).
Müller, K. & Schwarz, C. Fanning the flames of hate: social media and hate crime. J. Euro. Econ. Assoc. 19, 2131–2167 (2021).
Article Google Scholar
Lorenz-Spreen, P., Oswald, L., Lewandowsky, S. & Hertwig, R. A systematic review of worldwide causal and correlational evidence on digital media and democracy. Nat. Hum. Behav. https://doi.org/10.1038/s41562-022-01460-1 (2022).
Lewandowsky, S., Ecker, U. K. H. & Cook, J. Beyond misinformation: understanding and coping with the post-truth era. J. Appl. Res. Mem. Cogn. 6, 353–369 (2017).
Article Google Scholar
Wood, T. & Porter, E. The elusive backfire effect: mass attitudes’ steadfast factual adherence. Political Behav. 41, 135–163 (2018).
Article Google Scholar
Swire, B., Berinsky, A. J., Lewandowsky, S. & Ecker, U. K. H. Processing political misinformation: comprehending the Trump phenomenon. R. Soc. Open Sci. 4, 160802 (2017).
Article PubMed PubMed Central Google Scholar
Swire-Thompson, B., Ecker, U. K. H., Lewandowsky, S. & Berinsky, A. J. They might be a liar but they’re my liar: source evaluation and the prevalence of misinformation. Political Psychol. 41, 21–34 (2020).
Article Google Scholar
Hahl, O., Kim, M. & Sivan, E. W. Z. The authentic appeal of the lying demagogue: proclaiming the deeper truth about political illegitimacy. Am. Sociol. Rev. 83, 1–33 (2018).
Article Google Scholar
Kessler, G., Rizzo, S. & Kelly, M. Trump’s false or misleading claims total 30,573 over 4 years. The Washington Post https://www.washingtonpost.com/politics/2021/01/24/trumps-false-or-misleading-claims-total-30573-over-four-years/ (2022).
Lewandowsky, S., Lloyd, E. A. & Brophy, S. When THUNCing Trumps thinking: what distant alternative worlds can tell us about the real world. Argumenta 3, 217–231 (2018).
Google Scholar
McCright, A. M. & Dunlap, R. E. Combatting misinformation requires recognizing its types and the factors that facilitate its spread and resonance. J. Appl. Res. Mem. Cogn. 6, 389–396 (2017).
Article Google Scholar
Cooper, B., Cohen, T. R., Huppert, E., Levine, E. & Fleeson, W. Honest behavior: truth-seeking, belief-speaking, and fostering understanding of the truth in others. Acad. Manag. Ann. https://doi.org/10.17605/OSF.IO/PCG7M (2023).
Varshizky, A. Alfred Rosenberg: The Nazi weltanschauung as modern gnosis. Politics Relig. Ideol. 13, 311–331 (2012).
Article Google Scholar
Voegelin, E. in The Collected Works of Eric Voegelin Vol. 5 (Wiser, J. L. ed.) 21–73 (Univ. of Missouri Press, 2000).
van Zoonen, L. l-Pistemology: changing truth claims in popular and political culture. Eur. J. Commun. 27, 56–67 (2012).
Article Google Scholar
Edis, T. A revolt against expertise: pseudoscience, right-wing populism, and post-truth politics. Disputatio https://doi.org/10.5281/zenodo.3567166 (2020).
Waisbord, S. The elective affinity between post-truth communication and populist politics. Commun. Res. Pract. 4, 17–34 (2018).
Article Google Scholar
Farrell, H. & Schneier, B. Common-Knowledge Attacks on Democracy (Berkman Klein Center for Internet & Society, 2018).
Uscinski, J. E. & Butler, R. W. The epistemology of fact checking. Crit. Rev. 25, 162–180 (2013).
Article Google Scholar
Amazeen, M. A. Revisiting the epistemology of fact-checking. Crit. Rev. 27, 1–22 (2015).
Article Google Scholar
Jacobson, G. C. Driven to extremes: Donald Trump’s extraordinary impact on the 2020 elections. Pres. Stud. Q. 51, 492–521 (2021).
Article Google Scholar
Williams, B. Truth and Truthfulness: An Essay in Genealogy (Princeton Univ. Press, 2002); https://doi.org/10.1515/9781400825141
Graham, M. H. & Svolik, M. W. Democracy in America? Partisanship, polarization and the robustness of support for democracy in the United States. Am. Political Sci. Rev. 114, 392–409 (2020).
Article Google Scholar
Barberá, P. et al. Who leads? Who follows? Measuring issue attention and agenda setting by legislators and the mass public using social media data. Am. Political Sci. Rev. 113, 883–901 (2019).
Article Google Scholar
Lewandowsky, S., Jetter, M. & Ecker, U. K. Using the president’s tweets to understand political diversion in the age of social media. Nat. Commun. 11, 5764 (2020).
Article CAS PubMed PubMed Central Google Scholar
Nelson, L. K. Computational grounded theory: a methodological framework. Sociol. Methods Res. 49, 3–42 (2020).
Article Google Scholar
Garten, J. et al. Dictionaries and distributions: combining expert knowledge and large scale textual data content analysis. Behav. Res. Methods 50, 344–361 (2018).
Article PubMed Google Scholar
Newman, M. L., Pennebaker, J. W., Berry, D. S. & Richards, J. M. Lying words: predicting deception from linguistic styles. Pers. Soc. Psychol. Bull. 29, 665–675 (2003).
Article PubMed Google Scholar
Pennebaker, J. W., Chung, C. K., Frazee, J., Lavergne, G. M. & Beaver, D. I. When small words foretell academic success: the case of college admissions essays. PLoS ONE 9, e115844 (2014).
Article PubMed PubMed Central Google Scholar
Brady, W. J., Crockett, M. J. & Bavel, J. J. V. The MAD model of moral contagion: the role of motivation, attention and design in the spread of moralized content online. Perspect. Psychol. Sci. 15, 978–1010 (2020).
Article PubMed Google Scholar
Boyd, R. L., Ashokkumar, A., Seraj, S. & Pennebaker, J. W. The Development and Psychometric Properties of LIWC-22 (Univ. of Texas at Austin, 2022); https://www.liwc.app
Hutto, C. & Gilbert, E. VADER: a parsimonious rule-based model for sentiment analysis of social media text. Proc. Int. AAAI Conf. Weblogs Soc. Media 8, 216–225 (2014).
Article Google Scholar
Kessler, J. Scattertext: a browser-based tool for visualizing how corpora differ. In Proc. Association for Computational Linguistics 2017, System Demonstrations, 85–90 (Association for Computational Linguistics, 2017).
Kozyreva, A., Lewandowsky, S. & Hertwig, R. Citizens versus the internet: confronting digital challenges with cognitive tools. Psychol. Sci. Public Interest 21, 103–156 (2020).
Article PubMed PubMed Central Google Scholar
Grinberg, N., Joseph, K., Friedland, L., Swire-Thompson, B. & Lazer, D. Fake news on Twitter during the 2016 US presidential election. Science 363, 374–378 (2019).
Article CAS PubMed Google Scholar
Pennycook, G. et al. Shifting attention to accuracy can reduce misinformation online. Nature 592, 590–595 (2021).
Article CAS PubMed Google Scholar
Rating process and criteria. NewsGuard https://www.newsguardtech.com/ratings/rating-process-criteria/ (2020).
Lasser, J. List of domain accuracy and transparency scores v1.1. Zenodo https://doi.org/10.5281/ZENODO.6536692 (2022).
Guess, A. M., Nyhan, B. & Reifler, J. Exposure to untrustworthy websites in the 2016 US election. Nat. Hum. Behav. 4, 472–480 (2020).
Article PubMed PubMed Central Google Scholar
Guess, A. M., Nagler, J. & Tucker, J. Less than you think: prevalence and predictors of fake news dissemination on Facebook. Sci. Adv. 5, eaau4586 (2019).
Article PubMed PubMed Central Google Scholar
Rathje, S., Van Bavel, J. J. & van der Linden, S. Out-group animosity drives engagement on social media. Proc. Natl Acad. Sci. USA https://doi.org/10.1073/pnas.202429211 (2021).
Osmundsen, M., Bor, A., Vahlstrup, P. B., Bechmann, A. & Petersen, M. B. Partisan polarization is the primary psychological motivation behind political fake news sharing on Twitter. Am. Political Sci. Rev. 115, 999–1015 (2021).
Article Google Scholar
Brulle, R. J., Carmichael, J. & Jenkins, J. C. Shifting public opinion on climate change: an empirical assessment of factors influencing concern over climate change in the U.S., 2002–2010. Clim. Change 114, 169–188 (2012).
Article Google Scholar
Gonawela, A. et al. Speaking their mind: populist style and antagonistic messaging in the tweets of Donald Trump, Narendra Modi, Nigel Farage, and Geert Wilders. Comput. Support. Coop. Work 27, 293–326 (2018).
Article Google Scholar
Wrubel, L. & Kerchner, D. 116th U.S. Congress tweet IDs. Harvard Dataverse https://doi.org/10.7910/DVN/MBOJNS (2020).
Summers, E. et al. Shawn DocNow/twarc: v2.10.4. Zenodo https://doi.org/10.5281/ZENODO.6503180 (2022).
Bojanowski, P., Grave, E., Joulin, A. & Mikolov, T. Enriching word vectors with subword information. Trans. Assoc. Comput. Linguist. 5, 135–146 (2017).
Article Google Scholar
Di Natale, A., Pellert, M. & Garcia, D. Colexification networks encode affective meaning. Affect. Sci. 2, 99–111 (2021).
Article PubMed PubMed Central Google Scholar
Di Natale, A. & Garcia, D. LEXpander: applying colexification networks to automated lexicon expansion. Behav. Res. Methods https://doi.org/10.3758/s13428-023-02063-y (2023).
François, A. in From Polysemy to Semantic Change: Towards a Typology of Lexical Semantic Associations (Vanhove, M. ed) 163 (John Benjamins, 2008); https://doi.org/10.1075/slcs.106.09fra
Jackson, J. C. et al. Emotion semantics show both cultural variation and universal structure. Science 366, 1517–1522 (2019).
Article CAS PubMed Google Scholar
Palan, S. & Schitter, C. Prolific.ac—a subject pool for online experiments. J. Behav. Exp. Finance 17, 22–27 (2018).
Article Google Scholar
Pennington, J., Socher, R. & Manning, C. D. GloVe: global vectors for word representation. In Proc. 2014 Conference on Empirical Methods in Natural Language Processing 1532–1543 (Association for Computational Linguistics, 2014); https://doi.org/10.3115/v1/D14-1162
Mikolov, T., Chen, K., Corrado, G. & Dean, J. Efficient estimation of word representations in vector space. Preprint at https://arxiv.org/abs/1301.3781 (2013).
Scheffer, M., van de Leemput, I., Weinans, E. & Bollen, J. The rise and fall of rationality in language. Proc. Natl Acad. Sci. USA https://doi.org/10.1073/pnas.2107848118 (2021).
Bhadani, S. et al. Political audience diversity and news reliability in algorithmic ranking. Nat. Hum. Behav. 6, 495–505 (2022).
Article PubMed Google Scholar
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2015).
Article Google Scholar
Seabold, S. & Perktold, J. Statsmodels: econometric and modeling with Python. 9th Python in Science Conference, Austin, 28 June–3 July, 2010 57–61 (SciPy, 2010); https://doi.org/10.25080/Majora-92bf1922-011
Ou-Yang, L. Newspaper3k: article scraping & curation. Github https://github.com/codelucas/newspaper (2020).
Lasser, J. et al. From alternative conceptions of honesty to alternative facts in communications by US politicians. OSF https://doi.org/10.17605/OSF.IO/VNY8K (2022).
Lasser, J. et al. New-ontology-of-truth: code freeze for revision round 2. https://doi.org/10.5281/zenodo.7723109 (2022).

Download references

Acknowledgements

This report was partly funded by the Templeton Foundation through a grant awarded to Wake Forest University for the Honesty Project. S.L. was also supported by funding from the Humboldt Foundation in Germany, and S.L. and D.G. are beneficiaries of the ERC Advanced Grant PRODEMINFO (101020961). J.L. was supported by the Marie Skłodowska-Curie grant number 101026507. The funders had no role in study design, data collection and analysis, decision to publish or preparation of the article. We acknowledge Travis Coan for helpful feedback on the manuscript.

Author information

Authors and Affiliations

Graz University of Technology, Graz, Austria
Jana Lasser, Segun T. Aroyehun & David Garcia
Complexity Science Hub Vienna, Vienna, Austria
Jana Lasser & David Garcia
University of Konstanz, Konstanz, Germany
Segun T. Aroyehun & David Garcia
University of Bristol, Bristol, UK
Fabio Carrella, Almog Simchon & Stephan Lewandowsky
University of Western Australia, Crawley, Western Australia, Australia
Stephan Lewandowsky
University of Potsdam, Potsdam, Germany
Stephan Lewandowsky

Authors

Jana Lasser
View author publications
You can also search for this author in PubMed Google Scholar
Segun T. Aroyehun
View author publications
You can also search for this author in PubMed Google Scholar
Fabio Carrella
View author publications
You can also search for this author in PubMed Google Scholar
Almog Simchon
View author publications
You can also search for this author in PubMed Google Scholar
David Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Stephan Lewandowsky
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.L., D.G. and J.L. conceptualized the research. S.T.A., F.C., J.L. and A.S. developed the methodology and statistical models. F.C. performed the validation. J.L., S.T.A., A.S. and F.C. performed computational and statistical analyses. J.L. and S.T.A. collected and curated the data. J.L. prepared the visualizations. J.L. administrated the project. S.L. and D.G. acquired funding and supervised the project. S.L. and J.L. wrote the original draft of the article. All authors contributed to editing the original draft of the article.

Corresponding author

Correspondence to Stephan Lewandowsky.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Human Behaviour thanks Aparup Khatua, Niels Mede and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Share of links posted by accounts belonging to members of the U.S. Congress.

Share of links pointing to domains indexed in a the NewsGuard data base and b the independently compiled list.

Extended Data Fig. 2 Relation of information quality with belief-speaking and fact-speaking (full data).

a and b rescaled NewsGuard score ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ of links shared in tweets by members of the U.S. congress over belief-speaking similarity ${D}_{{{{\rm{b}}}}}^{{\prime} }$. Red and blue dots denote tweets by Democrats and Republicans, respectively. b shows ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ over fact-speaking similarity ${D}_{{{{\rm{f}}}}}^{{\prime} }$ in tweets. c and d show the same information but with ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${{{{\rm{D}}}}}_{{{{\rm{f}}}}}^{{\prime} }$ calculated using the text of the articles that were linked instead of the tweet texts. The grey ellipses indicate the mean and standard deviation of the honesty component similarity and ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ for articles shared by members of both parties. These articles were excluded in the regression analysis.

Extended Data Fig. 3 Three-way interaction between party, belief-speaking and fact-speaking.

Prediction of rescaled NewsGuard score ${S}_{{{{\rm{NG}}}}}^{{\prime} }$ for different values of belief-speaking similarity ${{{{\rm{D}}}}}_{{{{\rm{b}}}}}^{{\prime} }$ and different levels (-1 SD, mean, +1 SD) of fact-speaking similarity ${{{{\rm{D}}}}}_{{{{\rm{f}}}}}^{{\prime} }$ based on the fixed effect estimate of the three-way interaction $P\times {D}_{{{{\rm{b}}}}}^{{\prime} }\times {D}_{{{{\rm{f}}}}}^{{\prime} }$ (see linear mixed effects model in Eq. (1)) for tweets from a Democrat and b Republican members of the U.S. Congress.

Extended Data Fig. 4 Dictionary robustness analysis.

a and b show the distribution of estimates of the effect of belief-speaking and fact-speaking similarity ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and fact-speaking similarity ${D}_{{{{\rm{f}}}}}^{{\prime} }$ for Republicans from the linear mixed model (see Eq. (1)), where ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ were calculated with a perturbed dictionary for every tweet, respectively. c and d show the distribution of estimates of the effect of ${D}_{{{{\rm{b}}}}}^{{\prime} }$ and ${D}_{{{{\rm{f}}}}}^{{\prime} }$ for Democrats, respectively. Distributions were calculated from 100 dictionary perturbation iterations.

Extended Data Table 1 Keyword lists

Full size table

Extended Data Table 2 Dependence of NewsGuard score on belief-speaking and fact-speaking measured in tweets

Full size table

Extended Data Table 3 Dependence of NewsGuard score on belief-speaking and fact-speaking measured in articles

Full size table

Supplementary information

Supplementary Information

Supplementary Notes 1–14, Figs. 1–11 and Tables 1–13.

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lasser, J., Aroyehun, S.T., Carrella, F. et al. From alternative conceptions of honesty to alternative facts in communications by US politicians. Nat Hum Behav 7, 2140–2151 (2023). https://doi.org/10.1038/s41562-023-01691-w

Download citation

Received: 22 July 2022
Accepted: 01 August 2023
Published: 25 September 2023
Issue Date: December 2023
DOI: https://doi.org/10.1038/s41562-023-01691-w
Springer Nature Limited

From alternative conceptions of honesty to alternative facts in communications by US politicians

Abstract

Similar content being viewed by others

Explore related subjects

Main

Identifying conceptions of honesty in political speech

Partisan and temporal dynamics of conceptions of honesty

Relating honesty components to information trustworthiness

Discussion

Methods

US Congress member tweet corpus

Honesty-component keywords and validation

Identification of honesty components in text

Honesty-component document-level validation

NYT corpus

Word and topic keyness analysis

NewsGuard nutrition labels

Regression

News article collection

Inclusion and ethics

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation