Several authors have observed that mathematics is typically written using the present tense (Ganesalingam 2013; Pimm 2005). For instance, Solomon and O’Neill (1998) studied an extract from the personal notebook of mathematician William Rowan Hamilton. They noticed that Hamilton’s personal remarks tended to be written in the past tense, whereas the mathematical content was relayed “in the timeless present” (p. 216). Ganesalingam (2013) attributed this to mathematicians’ beliefs about the philosophy of mathematics, and suggested that it is almost universal:

Working mathematicians treat mathematical objects as if they were Platonic ideals, timeless objects existing independently of the physical world. The interactions and properties of these objects are also seen as frozen, timeless quantities. Mathematical assertions are therefore almost always written as generic (gnomic) sentences in the present simple tense. (p. 21).

Some mathematics educators have noted that the tendency to discuss mathematics in the present tense could have implications for how students see mathematics. In particular, Pimm (2005) argued this style may cause students to view mathematics as being a-temporal rather than historical and contingent. Obscuring the role of human agency in mathematics in this manner may, it is suggested, make it “harder for students to construe a present or future role for themselves” within the discipline (Morgan 2016, p. 138).

While the relative infrequency of the past tense in written mathematics has often been commented upon (Burton and Morgan 2000; Ganesalingam 2013; Morgan 2016; Solomon and O’Neill 1998), to our knowledge no one has, to date, produced an estimate of the size of this effect or investigated if it is genuinely universal across mathematics. Our goal in this short note is to provide relevant evidence on these points.

To this end, we used Python (version 3.8.12) and the Natural Language Toolkit (NLTK, version 3.6.7) to perform part-of-speech (POS) tagging (Bird et al. 2009). POS tagging is a process by which the words in corpora are categorised as being particular parts of speech, depending on the definition of the word and its context. For instance, consider the following two sentences, both taken from the Lancaster-Oslo/Bergen (LOB) corpus (Johansson et al. 1978): (i) “Most stories of Miss Nightingale begin and end with her work in the Crimea”; (ii) “if you are 65 (60 for a woman) you work for an employer and are paid more than £9 in any week you will (unless you are contracted out) pay graduated contributions”. In the first sentence “work” can be tagged as a noun, whereas the surrounding context in the second sentence allows the same word to be tagged as a verb.

We used Punkt, a pre-trained model, to tokenize (separate into sentences) our corpora. We then used the default NLTK tagger (Averaged Perceptron Tagger) to tag the tokenized texts according to the Penn Treebank tag set (Taylor et al. 2003). The code used for this analysis can be found at https://github.com/jda5/tense-analysis. Importantly, the Averaged Perceptron Tagger does not achieve perfect accuracy in its tagging: Banga and Mehndiratta (2017) assessed its accuracy to be 88.7%.

The two corpora we used were:

  • A corpus created from academic mathematics papers uploaded to the ArXiv. The ArXiv is a preprint server that is widely used by mathematicians to self-archive their academic output. The corpus was created by downloading raw TeX sources and stripping out TeX commands and inline mathematics (so that formulae, diagrams and so on were removed). We including all 5087 papers from the first four months of 2009 that could be successfully processed. This left a corpus of approximately 27 million words (for a full description of the process by which this corpus was created, see Mejía-Ramos et al.'s (2019) description).

  • The Brown and LOB corpora (Kucera and Francis 1967; Johansson et al. 1978) of general English. The Brown corpus contains 500 text samples, each of 2000 words, of representative American English. The LOB corpus is a British English version of the Brown corpus, created using identical sampling principles. We combined the two corpora to create a single 2 million word corpus.

The frequencies of different verb forms (base, past tense, gerund or present participle, past participle, non-third person singular present, third person singular present, modal verb) were calculated, and then normalised, yielding a frequency per 1000 words for each form. These are shown in Table 1.

Table 1 Verb form frequencies per 1000 words in the ArXiv and Brown/LOB corpora

Next we calculated the total number of past-tense and present-tense verbs (with these terms operationalised as shown in Table 1). To be conservative we omitted verb base forms from this calculation as, although these forms are usually in the present tense (“I like philosophy”), they can be part of phrases in other tenses (“I used to like philosophy”). We found that the ratio of present to past tense differed significantly between the corpora, \(\chi ^2(1)=146820.25\), \(p<.001\). The present:past ratio was approximately 3:1 in the ArXiv corpus and 5:7 in the Brown/LOB corpus. An even more conservative analysis would be to ignore participles and gerunds, and simply compare just the frequency of past tense verbs between the corpora. As shown in Table 1, the Brown/LOB corpus had nearly ten times as many past tense verbs (VBD tags) per 1000 words than the ArXiv corpus.

Next we explored the level of agreement/disagreement between mathematical authors concerning the use of tense. For each of the 5087 papers in our corpus we calculated the frequency with which verbs were expressed in the present and past tense. For 5004 articles (98.4%), the frequency of present tense verbs was higher. This is perhaps surprising given the large differences in style that can be seen across different mathematical fields. In just 26 articles (0.5%) was the frequency of past tense verbs higher. An inspection of these 26 articles revealed that 2 focused on the history of mathematics, 3 were written in a language other than English (but with English abstracts), and 15 had been replaced by a withdrawal notice (which explained that an error had been found in the original manuscript). The remaining 57 papers were made up of 53 empty articles (i.e. the papers had been withdrawn by the authors without being replaced by a withdrawal notice) and 4 articles where the withdrawal notice contained the same number of past- and present-tense verb forms.

In sum, in line with (Ganesalingam 2013) assertion, we found that mathematicians share an almost universal preference for writing mathematics using the present tense.