How diverse is your reading list? Exploring issues of representation and decolonisation in the UK

There is a growing impetus, from university students and administrations, to decolonise the curriculum and develop diverse reading lists. Yet, there is limited theoretical or empirical analysis of the authorship of current reading lists to justify this imperative. The present study developed and applied a method for auditing the authorship on reading lists of two modules, one from science and one from social science, in a research-intensive British university. The paper explores whether these reading lists can be considered descriptively representative of the student body or scholarly community. The analysis of reading lists found empirical basis for concerns that university curricula are dominated by white, male and Eurocentric authors, with some exceptions. The reading lists did not represent the diverse local student body but came closer to representing the demographic profile of academic staff. To interpret these findings, the paper argues that reading lists have a role to play in decolonising our universities, and offer opportunities to critique and deconstruct disciplinary boundaries. Further research is required to explore student and staff views of reading lists and the meaning of diversity, to evaluate existing tools, and address barriers to decolonising our curricula on a global scale.

Africa, reading lists have been reviewed as part of a wider effort to decolonise the university (Heleta 2016;Luckett and Shay 2017;Le Grange 2016). In this context, the university curriculum constitutes a fruitful site, but not the only one (Pillay 2018), for challenging entrenched colonial structures. Calls for decolonisation highlight the dominance of Eurocentric curricula which 'reinforce white and Western dominance and privilege while at the same time being full of stereotypes, prejudices and patronising views about Africa and its people' (Heleta 2016: 2). Whilst these demands have a long history, it is recent student protests, such as #RhodesMustFall, that have reignited these debates (Le Grange 2016; Mngomezulu and Hadebe 2018). In this setting, the university curriculum represents a necessary opportunity to deliver change in higher education and beyond, as 'a microcosm of and impetus for broader societal transformation' (Le Grange 2016: 3. See also Heleta 2016;Luckett and Shay 2017).
In the UK, calls for an inclusive curriculum were initially driven by the growing internationalisation of the university body (Hubble and Bolton 2018) and the widening participation agenda. Initiatives seeking to increase the numbers of UK domiciled students from underrepresented groups have been expansive (Donnelly and Evans 2018) and have had some, albeit uneven, success (Whitty et al. 2015;Equality Challenge Unit 2017b). Within this context, an inclusive curriculum was considered central to delivering equality for, and representation of, a diverse and international student body (HEA/ECU 2008;Hockings 2010). A diverse curriculum has also been aligned with attempts to address the attainment gap between Black, Asian and Minority Ethnic (BAME) students and their white peers (Stevenson 2012) (BAME students are less likely to attain a 'good', first or upper second-class, degree compared to their white peers. See Equality Challenge Unit 2017b). It is only more recently that the interrogation of curricula in the UK has been linked to the broader movement to decolonise the university and transform the knowledge base, rather than adding token diversity (Andrews 2019). Student protests, inspired by campaigns in other parts of the world, have questioned the whiteness of the curriculum and dominance of Eurocentric, Western thought, e.g. Rhodes must fall Oxford; 'Why is my curriculum white?', #LiberateMyDegree (Bhambra et al. 2018). Based on such movements, students have developed their own tools to broaden the range of perspectives on their reading lists (see the Alternative Reading List Project https://thealternativereadinglistproject.wordpress.com/). Situated within these debates, UK universities are beginning to recognise that 'the content of university knowledge remains principally governed by the West for the West' (Bhambra et al. 2018: 6) with reading materials constituting a pertinent focus for reform (Stevenson 2012: 18). Yet, the sector does not have data about the authors included on programme/module reading lists as such empirical analysis is extremely rare. This article aims to ascertain whether there is empirical evidence to support claims that UK universities need to diversify their reading lists. In doing so, the article considers the meaning of diversification by reflecting on who and/or what requires better representation in our reading lists. Two core subject areas, one in the sciences and one in the social sciences, in a leading UK university, will be interrogated in an attempt to establish whether white, male and Eurocentric authors dominate university reading lists. This type of enquiry represents a crucial step towards developing and transforming our curricula in response to the global decolonisation agenda.

Reading lists and/as representation
Reading lists in higher education are representation devices, serving to reflect particular perspectives and knowledge. Pitkin's (1967) concept of descriptive representation is helpful for unpacking this representative function and the normative justifications for an inclusive and diverse reading list. Descriptive representation refers to the premise that a representational artefact should resemble the community from which it is drawn. The representational function 'must be intentional' (Pitkin 1967: 68), although not necessarily accurate, where the representative 'stands for' the wider community.

Descriptive representation of the student body
Within both Global South and North settings, critiques of the curriculum have highlighted the failure of reading lists to reflect or resemble the student body. The colonial curriculum in Africa, for example, is considered 'disconnected from African realities' (Heleta 2016: 4) and BAME students in the UK feel 'under-represented' as 'their histories and ancestral narrative [are] omitted from mainstream discourse' (Abou El Magd 2016). Seen through a lens of descriptive representation, the reading list becomes a tool to represent student perspectives, histories and interests. This would respond to concerns identified above (Abou El Magd 2016; Heleta 2016; Wolff 2016) and facilitate student belonging and engagement (Burke 2018;Hoffman et al. 2002). More broadly, a reading list that directly connects with student histories and experiences could be aligned with pedagogical shifts towards student-centred education where 'learning dwells within the student' (Neumann 2013: 164). As learning contexts become centred in students, reading lists could arguably serve to instil and represent the perspectives of the student body.
Defining which student body should be reflected in our reading lists, however, is challenging. Representation of specific local student profiles would require each course to resemble the student cohort. The growing body of female students in STEM subjects in the US/UK, for example, would require collections that 'adequately reflect the needs and populations of their campuses' (Blackburn 2017). A reading list that responds to local students could challenge the colonial curriculum and enable indigenous voices to be represented (Phillips and Archer-Lean 2018;Le Grange 2016). However, student intake of 'home' students at elite universities (such as the 'Russell Group' in the UK) and particular courses (such as Physical sciences) are relatively homogenous (Equality Challenge Unit 2017b) and so the reading lists in these contexts would remain unchallenged. This could exacerbate differences between elite and nonelite universities at a national level and mark the knowledge divide between centre and peripheral academic systems globally (Altbach 2016;Tickner 2013). An alternative form of descriptive representation could use indicators of the national or international student body of a given discipline to inform reading lists (e.g. in UK see Equality Challenge Unit 2017b; HESA 2019). However, comparing reading lists to international or national student cohorts would reiterate concerns about recognising regional and local perspectives in a global education system (Cheung 2012). The case for descriptively representing the student body in the reading list is complex and justifiable primarily within student-centred education. It is in such settings that students have autonomy and choice in their curricula (O'Neill and McMahon 2005). However, disciplinary differences remain as student-centred learning is more common in 'soft' disciplines (such as social science) compared to those described as 'hard' (such as physical science) (Lueddeke 2003;Lindblom-Ylänne et al. 2006), and there are debates about the transferability of the concept to diverse international contexts (Jordan et al. 2013). There is a dissonance between the extensive rhetoric of student-centred learning and its implementation in higher education, where there are various barriers to student-staff co-creation of the curriculum (Bovill et al. 2016;O'Neill and McMahon 2005).

Descriptive representation of the scholarly community
Research on reading lists, although scarce (Stokes and Martin 2008;Piscioneri and Hlavac 2013), suggests that staff identify the list as 'a device, a tool and an unproblematic given' (Stokes and Martin 2008: 124) that offers 'students a "sense of direction" in relation to writing on a given field' (Stokes and Martin 2008: 116). Students view the reading list as an important instrument for learning, providing them with the 'key' or 'main' texts in the subject area (Siddall 2016;Siddall and Rose 2014;Brewerton 2014;Stokes and Martin 2008). The reading list is intended to represent the subject content, discipline and scholarly community from which it is drawn. There are standard collections of works, i.e. the canon, which have historically shaped our disciplines and populated reading lists. Moreover, there are regulations that direct the nature of study and academic standards expected in our disciplines (e.g. Subject Benchmark Statements as part of the UK Quality Code for Higher Education). Yet, existing research highlights that published or highly cited works do not necessarily reflect the discipline as a whole (Schucan Bird 2011;Dickersin 2015) and curricula can present a distorted view of a field with a more pronounced tendency to list particular publications, for example, by male, white scholars (Colgan 2017;Hagmann and Biersteker 2014;Romero 2017;Sumner 2018). On a global scale, curricula templates are often exported from North to South (Altbach 2016: 94;Shahjahan et al. 2015) and the pool of research from which reading lists are drawn favours the Global North. The global economy of knowledge marginalizes peripheral voices/ perspectives often situated in the Global South (Altbach 2016;Connell 2018;Tickner 2013).
Shifts towards inclusive curricula therefore offer transformative potential (Luckett and Shay 2017) that 're-conceptualizes the field in light of new knowledge, scholarship, and new ways of knowing' (Kitano cited in Cohn and Mullenix 2007: 16). The development of a diverse reading list would necessarily build on and critique the canon (Wolff 2016). In post-colonial contexts, this requires greater representation of perspectives, histories and approaches of scholars of the country (Le Grange 2016; Mngomezulu and Hadebe 2018) to 'build a knowledge base that takes learners and teachers beyond colonialist habit' (Phillips and Archer-Lean 2018). In other settings, calls for an inclusive reading list seek to recognise the contributions of historically marginalised groups by granting, for example, 'access to historic and contemporary narratives about women in STEM' (Blackburn 2017).
Seeking to develop a reading list that is descriptively representative of the scholarly community, however, is challenging. On the one hand, curricula could aspire to resemble the scholarly profile of each discipline as represented through professional disciplinary associations e.g. American Sociological Association (Bonilla-Silva 2017) or teaching departments in universities (e.g. Colgan 2017). Yet, disciplines vary according to how homogenous they have become (Altbach 2016) so descriptive representation may not be desirable in subfields with low diversity or female representation (Sumner 2018). On the other hand, drawing the reading list from the global scholarly community may provide a more holistic picture of the discipline but lack relevance to local scholars and their specialisms.

Methodology
Two small projects were undertaken to examine whether there is empirical evidence that white, male and Eurocentric/Western authors dominate reading lists in the UK context. This provides a lens through which to reflect on the meaning of a diverse curriculum and examine whether reading lists are descriptively representative of staff or students. The projects aimed to develop a method for auditing the authors on reading lists and apply this to two case studies.

The case studies
Each of the two projects undertook a case study so that, altogether, two module reading lists were analysed. Whilst work to decolonise reading lists has focused predominantly on the humanities and social sciences, the readings in science have remained almost entirely unexamined, and their perceived neutrality is less likely to be challenged (Last 2018). Therefore, an undergraduate science module on genetics methodology and a postgraduate social science module on research methods were selected for analysis. The setting of the work was a research intensive university in the UK, with the projects constituting part of, and funded by, the local 'Liberating the Curriculum' initiative (a staff and student group working to advance the inclusive curriculum). The two case studies were carried out separately, each by project teams comprising one British member of staff and one international student. As white women, the staff members were mindful of their own and others' racialized, gendered and ranked positionality in order to anticipate dangers that may emerge in the research process. In practice, this meant engaging in open reflection with colleagues and students to explore different interpretations of the data during its collection and analysis (Milner 2007).
The projects were undertaken in two inter-related stages. The first stage involved searching for literature to identify existing approaches for analysing reading lists. The second stage drew upon this literature, where possible, to develop a method of data collection and analysis.

Stage 1
Literature reviews were undertaken to identify existing studies, in any discipline, that have analysed authorship of academic research publications and/or university reading lists. The separate studies were identified and integrated across the projects. The search strategy included Google Scholar, pearl growing techniques (using key words drawn from relevant publications to search for further references), and citation checking (identifying references that cite, of have been cited by, key publications).

Stage 2
Informed by the literature identified in stage 1, methods were developed to examine the reading lists. The process of data collection for each project began with the creation of a spreadsheet that contained all of the references from each reading list. Each author was given an individual ID and for each multi-authored item, their place in the order was noted. The gender, ethnicity and geographical/institutional affiliation of each author was identified using information available about the author on institutional Web sites, profiles on other online platforms (such as LinkedIn) and biographical details included on the publication/journal article itself. Authors were grouped according to the categories listed in Table 1. In line with previous studies (Schucan Bird 2011;Dickersin et al. 1998;Eigenberg and Whalley 2015), the gender of the author was categorised as female, male or unclear and deduced from the first name, pronouns or photos. To identify the ethnicity/race of the author, the case studies experimented with different methods. The social science project used broad categories, based on the UK Institute of Race Relations definition of Black or Minority Ethnic (BME): 'terminology normally used in the UK to describe people of non-white descent'. The science case recorded the information as used in the data source and then grouped according to the categories afterwards. To assign a geographical area to each author, the institutional affiliation was identified by examining the details reported on the publication or the institutional Web site of the author. Although all data was from publicly accessible sources, the completed spreadsheets included assumptions about named individuals and were therefore held securely according to data protection principles.
Data analysis was undertaken in multiple stages. First, a descriptive overview of the gender, ethnicity/ race and geographical affiliation of all authors was created for each reading list. Second, single and multiple authored publications were analysed separately. The position of authors, and their demographic characteristics, in multiple authored papers was examined. Whilst there are different approaches to attributing credit to authors on publications, typically the most significant contributor is the author listed first in social science (Endersby 1996) and last in science (Tscharntke et al. 2007). Third, the findings were compared to local, national and international data on the student and staff bodies of the disciplines. This served to consider whether the reading lists were descriptively representative.

Stage 1
The review found that most research on authorship of published works focused on disciplines/set of journals, in a range of fields, rather than reading lists in higher education. The vast majority of the studies examined the gender of the authors (Schucan Bird 2011; Boschini and Sjögren 2007;Crow and Smykla 2015;Dickersin et al. 1998;Eigenberg and Whalley 2015;Karimi et al. 2016;Mauleón and Bordons 2006;West et al. 2013), with few studies focusing on ethnicity (Freeman and Huang 2015) and/or institutional affiliation (Crow and Smykla 2015;Hinnant et al. 2012). The methods used in these studies were diverse, ranging from manually reviewing publications to inferring an author's sex from their name/photograph (Schucan Bird 2011; Dickersin et al. 1998;Eigenberg and Whalley 2015) to using software to assign demographic categories on the basis of authors' names (Freeman and Huang 2015;West et al. 2013). One study evaluated the precision of different gender detection approaches reporting that mixed methods are optimal, with name-based approaches performing particularly weakly in emerging countries (Karimi et al. 2016). On this basis, the projects used image-based gender detection methods to supplement name-focused approaches although, as considered in the discussion below, this is still problematic.

Social science
The reading list comprised 57 journal articles, book chapters or online reports specified for required and further reading. There were 144 different authors (see Table 2) with the number of authors ranging from one to 15 per publication. Teams of two or more authors wrote the majority (83%) of the publications on the reading list.

Gender
There were an equal proportion of female and male authors. The majority of co-authored papers (75%) included teams of men and women but the lead author was more likely to be male (55% led by men compared to 43% led by women). Further, teams of male authors were three times as common as female collaborations. Only one of the single author publications (10%) was written by a woman.
The reading list provides descriptive representation of female academic staff at levels greater than their proportion in the field of social science. The percentage of female authors listed on the reading list (50%) is higher than local (38%, University Staff statistics 2018) and national levels of female academic staff (46%, Equality Challenge Unit 2017a). The postgraduate student population, however, is not as well represented in terms of the national level population of 'social studies' at 66% (HESA 2019) or the local department of social sciences, at 73%.

Ethnicity
The majority of authors (90%) were identified as non-BME with four authors coded as unclear.
None of the single author publications were written by a BME author. Of the co-authored publications, only 2% (n = 1) had a BAME lead author. A small proportion of the co-authored reports, 13%, included a BME author so the majority were written by teams of non-BME authors. Three percent of the total authors were BME and female, writing four publications (2%) on the reading list.
Compared to the proportion of BME academic staff in the social sciences, the reading list is descriptively representative. With 7% of authors coded as BME, the list mirrors levels of staff in these disciplines both locally (9% University Staff Statistics 2018) and nationally (10%, Equality Challenge Unit 2017a). Based on university wide statistics (2018/19), rather than specific social sciences data (which is not available), the proportion of BAME students ranges from 39% (UK domiciled) to 75% (non-UK domiciled) suggesting that the reading list fails to represent this student body.

Geography
The vast majority of the authors, 99%, were affiliated to universities in Europe, North American or Australasia. There were only two authors affiliated to institutions outside of these geographical areas, representing Brazil and Lebanon. These were listed in co-authored articles, as fourth author on a list of six (Brazil) and 13th on a publication with 14 authors (Lebanon). This meant that only 4% of all publications on the reading list included an author from the Global South.
There is no known data on the geographical affiliations of social science scholars around the world. Although limited in scope, there is some data on the nationalities of academics where 14% of staff in the UK report as non-EU nationals (UUK report). With 16% of authors affiliated to non-European universities, the reading list would arguably be descriptively representative of academic staff. The postgraduate student body in social science, however, is composed of 67% of 'overseas' (i.e. non-UK), of which 69% are from non-European countries (University Student Statistics 2018/19).

Science
The reading list comprised 68 individual articles, book chapters and Web-based data sets, with 146 authors. The number of authors per publication ranged from one to 23. The majority of the publications, 65% (n = 44), were written by teams of two or more authors. This meant that just over a third of the reading list (35%, n = 24) was written by a single author.

Gender
The majority of the authors on the reading list, 70%, were male. The gender of the authors of single-author publications (n = 24) broadly reflected this ratio (62% male and 21% female). However, of the co-authored publications (n = 44), 89% were led by male authors (n = 39). This included 12 publications written by the same team of male biologists.
In terms of descriptive representation, the proportion of female authors on the reading list (30%) fell short of the female student population at institutional level which is 67% for the biosciences. Nationally the proportion of female students is almost as high (Universities UK 2018). The reading list more accurately reflects the gender balance of staff in the local department, where women constitute 40%. There is a lack of data on the gender of academic staff in the biosciences nationally and internationally, but there is evidence of a steep decline in the number of women between researcher level and principal investigator in the life sciences (Lerchenmueller and Sorenson 2018).

Ethnicity
The majority of authors, 65%, were classified as white. Of the remaining authors, 18% were identified as Asian and 17% were unclear. Similarly, white authors (75%) wrote the majority of single author publications. Of the co-authored papers, 55% of the teams included at least one Asian author and 39% were led by an Asian author.
The high proportion of unclear information about ethnicity means that it is difficult to examine whether the list is descriptively representative.

Geography
The majority of authors, 90%, were from European, Australian or North American universities. Many of these authors were affiliated to universities in North America (n = 91) or Europe (n = 35), with the latter group representing ten different European countries. Three countries in Asia (China, Singapore and The Philippines) were represented by ten authors who constituted 7% of authors. Of the publications on the reading list, 24% included an author from the Global South.
As with social science, there is no known data about the geographical affiliations of scholars. 32% of students at the local university are from outside the European Union.

Science and social science compared
The number of authors in both Social Science and Science reading lists was similar (N = 144; N = 146). Science had a lower proportion of female authors than social science with mixed findings regarding women's role in authorship. There were a higher proportion of female authors of single publications in science yet there were fewer female led co-authored papers compared to social science. Across both case studies, the majority of authors were identified as non-BAME/white and few authors were from Global South contexts (ranging from 4 to 7%). The number of individual authors on the science list for whom it was not possible to find gender and/or ethnicity data was significantly higher than for the social science list. Both lists were more representative of the local academic community than they were of the student body, which is much more diverse ethnically and more predominantly female.

Discussion
The findings suggest that there is empirical basis for claims that university reading lists in the UK context are dominated by white, male and Eurocentric authors. These conclusions are qualified by a couple of findings: the equal proportion of male/female authors on the social science reading list, and almost a third of authors on the science list identified as Asian.
Analysis indicates that whilst these authors are descriptively representative of the local/ national scholarly community, as far as we can know it, the demographic characteristics of the authors diverge from those of the student body. This discussion will consider how to interpret these findings. Firstly, reflections will be made on why descriptively representative reading lists matter. Secondly, the discussion will consider the conceptual and methodological challenges that hinder and potentially undermine the interrogation of reading lists. Thirdly, a set of recommendations for future research will be identified.
Descriptively representative reading lists: why do they matter?
Should we applaud the reading lists from the case studies for descriptively representing the scholarly community, at a local/national level? These lists arguably reflect the changing composition of the disciplines and provide recognition of some of the historically underrepresented groups, such as women. Such authors/lists can also provide a symbolic function to inspire under-represented groups to engage with the field (Bettinger and Long 2005). The transformative potential of descriptive representation, however, arguably rests on the view that scholars' identities influence their approach to research (Mügge et al. 2018). This would suggest that particular groups of authors, e.g. women, bring distinctive skills, values and knowledge to the field of study. Situated within Pitkin's (1967) framework, this links descriptive representation to substantive representation and the idea that representatives' actions reflect their demographic profile. The historical development and critical analyses of our disciplines provide some support for this claim. Women/feminist authors within social science (and to some extent science) have sought to challenge the male and masculinist agendas/ perspectives of traditional research, and question definitions and practices of knowledge itself (Pearse et al. 2018;Roberts 1981;Schiebinger 2000). Such efforts continue today, as scholars recognise the contribution of women to the fields of science/social science (Dickersin 2015;Harding 2008). Similar efforts have been undertaken to recognise the contributions of scholars of colour and from the Global South (Bhambra 2014;Connell 2018;Harding 2008). The geographical affiliation of the author is seen to determine the nature and relevance of research, presenting 'local knowledge' tied to the communities and their needs (Connell 2018). Such knowledge is useful for generating 'actionable' inferences whether in health research (Goyet et al. 2015) or climate science (Borland et al. 2018). Framed in these arguments, there is some evidence to support a descriptively representative reading list, locally, on the basis of claims for substantive representation.
Descriptive representation of local scholars, however, arguably remains problematic. The homogeneity of particular disciplines remains unchallenged and may not be compatible with local efforts to diversify the curriculum. Further, within the wider decolonisation agenda, descriptively representative lists need to acknowledge the global reach of the discipline which, due to lack of data, currently remains unknowable. Going forward, reading lists, as representational artefacts, can serve to highlight the constructed and partial nature of our disciplines. The boundaries of disciplines are, after all, contested and/ or fluid at national and international levels (e.g. see Leftwich 2004 for a discussion of political science). The creation of a reading list therefore enables lecturers to 'imprint and construct their values and identity and beliefs into a given programme' (Stokes and Martin 2008: 121). In this sense, aspiring to be descriptively representative will inevitably entail a device that 'presents a part of the world as being or looking a certain way' (Pitkin 1967: 70). This should serve to acknowledge inequalities in global knowledge production (Altbach 2016) and offer opportunities for staff and students to critique and deconstruct disciplinary boundaries (Luckett and Shay 2017). This would serve to counter the distinctively unrepresentative nature of the reading lists in relation to the student body. Whilst reading lists do not necessarily need to mirror student experiences to be legitimate (Wolff 2016), representation and recognition of student perspectives are essential for genuine parity of participation in higher education (Burke 2018;Luckett and Shay 2017).

Conceptual and methodological challenges
In the process of developing and applying a method for interrogating reading lists, this project unearthed a range of potential conceptual and methodological pitfalls for such an endeavour. These are considered below.

Categories of diversity
This project principally used three dimensions, gender, ethnicity, and geographical affiliation, to assess diversity within the readings lists for two subjects. These dimensions of difference seem to be most highly considered in the literature, i.e. 'decolonizing' or 'gendering' the curriculum. Whilst this represents a useful starting point, there remains a range of other categories that could be potentially interrogated. The Equality Act (2010) in England and Wales identified eight characteristics that warrant protection from discrimination and harassment in higher education (age, disability, gender reassignment, pregnancy and maternity, race, religion or belief, sex, sexual orientation). Should the diversity of a curriculum/ reading list be assessed against all eight characteristics? Would it be possible to access such data? There are examples within the literature that suggest that perspectives of these groups require particular attention too. Critical disability studies (Goodley 2013) and queer theorists argue for a more inclusive framing of our disciplines (e.g. Smith and Lee 2015). There could be further characteristics, such as social class, that also warrant exploration. A robust theoretical framework, drawing on social and educational theory, is needed to justify the selection of particular characteristics. Such considerations seem lacking in the current moves to decolonize the curriculum.
Even with a strong rationale for selecting particular categories of difference, there are difficulties relating to their operationalization within the research process. The classification of ethnicity, for example, was distinguished in two different ways: common groupings (such as 'white' or 'Asian' in science) and broad categories ('non-BME' and 'BME' in social science). Both approaches suffer drawbacks, as there is no consensus on the appropriate terms to use for the scientific study of ethnicity and race (see Bhopal 2004). On the one hand, the use of 'BME' is seen as a somewhat 'blunt' instrument that attempts to be inclusive without taking into consideration the nuanced, locally defined populations (Bhopal 2004;Song 2018). On the other, large categories such as 'white' conceal heterogeneity, shifting demographic patterns and growing numbers of multiracial people (Song 2018). Further, there is virulent debate about whether a terminology of 'race' should also be used to meaningfully measure difference (Roth 2016;Song 2018). Similar debates about identity and classification continue to take place with regard to other variables such as gender and sexuality (e.g. Hines 2006). Again, deeper reflection is encouraged to create a framework for analysing reading lists that can be meaningful and theoretically justified.
In focusing on author demographics as static characteristics, projects to decolonize the curriculum arguably re-inscribe categories of difference. Concurrently, sociological theories recognize that, for example, 'ethnicity is imprecise and fluid' (Bhopal 2004;Roth 2016) that changes over time and context. Whilst seeking to incorporate fluid understandings of identity categories, these projects inadvertently re-affirm rigid and static categories of difference.

Data collection
This project used information available in the publications together with internet sources to collect data about authors on the reading lists. This presented a number of challenges including an absence of information and implicit/hidden forms of data. Researchers were required to extrapolate and draw inferences about author demographics, often based on photographic images. Whilst extrapolating demographics from photographs is relatively unproblematized in other studies (e.g. Eigenberg and Whalley 2015), this approach meant that the findings were mainly based on demographic proxies, with limited possibilities for ensuring accuracy (Sloan 2017;Mügge et al. 2018). The intersection of such proxies can generate further uncertainty unless, for example, gender detection methods are tailored to geographic and ethnic contexts (Karimi et al. 2016). Triangulating proxies with other methods may serve to validate the data by asking the individuals themselves (Sloan 2017) or asking scholars in the same field (Dickersin et al. 1998). Some familiarity with the scholars in the fields of study was helpful but validation of data was not systematically carried out and the accuracy of the data remains problematic.

Data analysis
The nature of the data collection presented a number of opportunities and challenges for the data analysis. First, data was collected at two different levels: for each author and per publication. The analysis needed to interrogate and draw inferences from both metrics. Whilst there are examples of studies that have integrated both levels of data (e.g. Crow and Smykla 2015), greater consideration is required to recognize the relative contribution of each level of analysis to the overall conclusions. Second, the analysis aimed to acknowledge and explore demographic characteristics and how they related to sole/multiple authorship. This meant, for example, examining whether women were evenly represented across the author positions (as sole or co-authors) (West et al. 2013). The practice of authoring papers and attributing authorship, however, varies across disciplines and so comparison remains difficult (Helgesson and Eriksson 2018). Further, in multiple author publications, authorship and author order remains a 'vexed issue' (McCann and Polacsek 2018) with no consensus on how scientific merit is attributed to individuals (Helgesson and Eriksson 2018). Therefore, without clear guidance on how to interpret authorship order, it remains difficult to draw inferences on the intersection of author contribution and demographics. Third, data was collected on a number of demographic factors which provided an opportunity to analyse intersectionality and authorship. The intersectional analysis, however, was limited by the small numbers of authors for each category of difference and an inability to address the normative underpinnings of such analysis (Hancock 2007). Fourth, attempts to compare the demographic characteristics of the authors on the reading lists with wider student and staff bodies, i.e. analyse descriptive representation, were hampered by difficulties in identifying and accessing data on these populations.

Ethical issues and data protection
Collecting personal data from online sources generates a number of ethical and legal concerns pertaining to informed consent, privacy and anonymity (Wilkinson and Thelwall 2011). Informed consent for research participants is a fundamental element of science/social science research but is this necessary when interrogating author demographics from module reading lists? On the one hand, the researcher is collecting personal data and making judgments about individuals. On the other hand, there are conditions in which informed consent may not be necessary for documents, text or observations taking place in public online space (Hudson and Bruckman 2004;Wilkinson and Thelwall 2011). As Convery and Cox (2012) propose, a form of 'negotiated ethics' may be necessary which means that particular types of data collection, such as author's geographical affiliation, may not require informed consent because such information is intentionally public (Willis 2017). Categories of gender and ethnicity, however, are not explicitly presented as public data and so may arguably require consent from the participants for research use and analysis.
The issues of privacy and anonymity are complex in this project given the central role played by identity in the data collection and analysis. Nonintrusive research may not raise privacy concerns but the collection and analysis of implicit data requires further consideration. As Wilkinson and Thelwall (2011) suggest, revealing the identity of an individual may simply involve the transference of identity from one public space (online) to another (academic publication). This may, or may not, be potentially sensitive so good practice procedures for data protection should be followed to ensure that data about individuals remained confidential. The legal framework in the UK changed after the research was carried out, and the GDPR needs to apply to all future work (UKRI 2018). This may influence requirements for consent and anonymity.

Recommendations for future research
The paucity of research examining authorship and reading lists (Piscioneri and Hlavac 2013) suggest that there are key strands of work that need to be progressed in order to understand and develop inclusive reading lists in universities. First, there is a lack of research examining student views of reading lists. Qualitative research is required to uncover student expectations of reading lists and their reactions to diversity in the curriculum (Wolff 2016). This should necessarily consider students from different disciplines, where expectations and views are likely to differ. Second, more research is needed to identify staff views of reading lists and to re-examine the role of reading lists in higher education (Stokes and Martin 2008). Third, normative and empirical work is required to fully unpack the conceptual and methodological issues involved in reviewing the diversity of reading lists. With the university sector prompting their staff to develop more 'inclusive' reading lists (Stevenson 2012), further work is needed to reflect on the challenges of doing so and then provide ethical and rigorous tools to perform such a task. Fourth, further research needs to explore reading lists as artefacts of substantive representation: do author identities influence the type and content of their research? Finally, more extensive and global data is required to detail the number and characteristics of staff and students in our disciplines. Current knowledge is incomplete.

Conclusions
This article provides empirical support for diversification of reading lists in higher education but also highlights a number of challenges that need careful consideration so as not to undermine the endeavour. There are a range of implications.
First, universities need to engage in a transparent and informed discussion about what a 'diverse/inclusive/decolonised' reading list actually looks like. This process needs to engage with a range of stakeholders, including staff and students, to ensure clarity and commitment to the agenda (Whitsed and Green 2016). These discussions should be context-sensitive (Wolff 2016) and be founded on an understanding of the relevant theoretical, methodological and ethical debates (such as those discussions raised in this article).
Second, higher education institutions and agencies (such as Advance HE in the UK) need to develop coherent and consistent strategies that can move the sector towards more inclusive and diverse programmes of study. Current approaches focus on individual responses in a single classroom or module (Sleeter 2016) but a strategic, sector-wide response is required. To meet the aims of the global decolonisation agenda and strive for genuine inclusivity of students, these approaches need to be 'transformative' (Luckett and Shay 2017;Kitano cited in Cohn and Mullenix 2007;Sleeter 2016) where 'universities must completely rethink, reframe and reconstruct the curriculum' (Heleta 2016: 5).
Third, academics and lecturers should not wait for institutional responses but engage with the debates raised here and begin to critically reflect on their own reading lists (Heleta 2016). However, further analytical and methodological work needs to be undertaken to develop a toolkit for teachers to review their own curriculums. Universities need to invest in research to develop and trial methods for interrogating diversity in module reading lists. Some tools are beginning to emerge (e.g. UCL Inclusive Curriculum Healthcheck 2018; Sumner 2018), but there is a lack of research evaluating their relevance, implementation or outcome. More detailed, step-by-step instruction, based on evidence, would help to support teachers in higher education to start the process of analysing their modules through a critical, 'transformative' lens (see Cohn and Mullenix 2007).
Fourth, sector-wide strategies should recognise the structural challenges that face the global agenda to decolonise/diversify reading lists. The nature of academic publishing, for example, means that the dominance of the Global North necessarily shapes the landscape for research and knowledge production (Collyer 2016). The development of more diverse reading lists will therefore need to be accompanied by a range of initiatives to challenge inequalities in knowledge production and bring marginalised perspectives into the centre (Altbach 2016;Tickner 2013).