Did You Find It on the Internet? Ethical Complexities of Search Engine Rankings

Search engines play a crucial role in our access to information. Their search ranking can amplify certain information while making others virtually invis-ible. Ethical issues arise regarding the criteria that the ranking is based on, the structure of the resulting ranking, and its implications. Critics often put forth a collection of commonly held values and principles, arguing that these provide the needed guidance for ethical search engines. However, these values and principles are often in tension with one another and lead us to incompatible criteria and results, as I show in this short chapter. We need a more rigorous public debate that goes beyond principles and engages with necessary value trade-offs


Introduction
In our digitalized life, search engines function as the gatekeepers and the main interface for information.Ethical aspects of search engines are discussed at length in the academic discourse.One such aspect is the search engine bias, an issue that encompasses ethical concerns about search engine rankings being value-laden, favoring certain results, and using non-objective criteria (Tavani 2020).The academic ethics debate has not (yet) converged to a widely accepted resolution of this complex issue.Meanwhile, the mainstream public debate mostly ignored the hard ethical trade-offs, opting instead for a collection of commonly held principles and values such as accuracy, equality, efficiency, and democracy.In this short chapter, I explain why this approach leads to unresolvable conflicts and thus ultimately a dead end.
This chapter builds on the Mapping workshop (https://aiethicslab.com/the-mapping/) I designed and developed together with Laura Haaber Ihle at AI Ethics Lab.I thank Laura also for her insights and valuable feedback on this chapter.

Value of and Value Within Search Engines
Search engines are invaluable.Without them, it is impossible to navigate the massive amounts of information available in the digital world.They are our main mediator of information (CoE 2018).Every day over 7 billion searches are conducted on Google alone,1 accounting for over 85% of all searches worldwide.2In addition to that, various searches are conducted on specialized search engines such as Amazon and YouTube.Even academic and scientific research relies on Google Scholar, PubMed, JSTOR, and similar specialized search engines-meaning not only we rely on search engines to access information but we also rely on them while creating further knowledge.
We defer to search engines' ranking of information so much that most people do not even check the second page of the search results. 3On Google, 28.5% of users click the first result, while 2.5% click on the tenth result, and fewer than 1% click results on the second page. 4This means that the ranking has a great effect on what people see and learn.
Search engine ranking is never value-neutral or free from human interference.It is optimized to reach a goal.Even if we can agree that this goal is relevance to the user, defining what is relevant involves guesswork and value judgments.By definition, in most search queries, users do not know what result they are looking for.On top of that, the search engine has to guess from the search keywords what kind of information the user is interested in.For example, a user searching for "corona vaccination" could be looking for vaccine options, vaccine safety information, anti-vaccine opinions, vaccination rates, or celebrities who are vaccinated, and they might be looking for these on a global or local scale.More importantly, they might be equally satisfied with well-explained vaccine safety information or anti-vaccine opinions since they might not have prior reasons to differentiate these two opposing results.Here, the value judgment comes into play in designing the system.Should the system first show vaccine safety information to ensure that the user is well-informed or the anti-vaccine opinions since they are often more intriguing and engaging?Should the system make results depending on user profiles (e.g., being scientifically or conspiracy oriented)?Should it sort the results by click rates globally or locally, by personal preferences of the user, by accuracy of the information, by balancing opinions, or by public safety?Deciding which ranking to present embeds a value judgment into the search engine.And this decision cannot fully rely on evaluating user satisfaction about a search query, because the user does not know the full range of information they could have been shown.Moreover, user satisfaction might still lead to unethical outcomes.

Ethical Importance of Search Engine Rankings
Imagine basing your decision whether to get vaccinated on false information because that is what came up on your search (Ghezzi et al. 2020; Johnson et al. 2020).Imagine deciding whom to vote for based on conspiracy theories (Epstein and  Robertson 2015).Imagine having your perception of other races and genders pushed to negative extremes because that is the stereotype you are presented with online (Noble 2018).Imagine searching for a job but not seeing any high pay and high power positions because of the search engine's knowledge or estimate of your race, gender, or socio-economic background.5 Imagine having your CV never appear to the employers for those jobs that you easily qualify for because of search engine profiling (Deshpande et al. 2020).These are all ethical issues.They stem from value judgments embedded in search engine processing and, as a result, impacting individual autonomy, well-being, and social justice.
We base our decisions on what we know.By selecting which information to present in what order, search engines affect and shape our perception, knowledge, decision, and behavior both in the digital and physical sphere.As a result, they can manipulate or nudge individuals' decisions, interfere with their autonomy, and affect their well-being.
By sorting and ranking what is out there in the digital world, search engines also impact how benefits and burdens are distributed within the society.In a world where we search online for jobs, employees, bank credits, schools, houses, plane tickets, and products, search engines play a significant role in the distribution of opportunities and resources.When certain goods or opportunities are systematically concealed from some of us, there is an obvious problem of equality, equal opportunity and resources, and, more generally, fairness.More to the point, once certain information is not accessible to some, they often do not even know that it exists.If we cannot realize the injustice, we also cannot demand a fair treatment.

Do You See Female Professors?
A running example for search engine bias has been the image search results for the term "professor." 6When searching for "professor," search engines present a set of overwhelmingly white male images.In the USA, for instance, women make up 42% of all tenured or tenure-track faculty. 7In search engine results, the ratio of female images has been about 10-15% and only recently went up to 20-22%. 8When searching specifically for "female professors," the image results are accompanied by unflattering suggestions: Google's first suggested term is "clipart" (Fig. 1), whereas Bing's top four suggestions include "crazy," "cartoon," and "clipart" female professors (Fig. 2). 9hy is this an ethical problem?Studies show that girls are more likely to choose a field of study if they have female role models (Lockwood 2006; Porter and Serra  2020).Studies also show that gender stereotypes have a negative effect on hiring women for positions that are traditionally held by men (Isaac et al. 2009; Rice and  Barth 2016).By amplifying existing stereotypes, search engine results contribute to the existing gender imbalance in high-powered positions.It is reasonable to think that this gender imbalance in real life has its roots in unjustified discrimination against women in the workplace as well as discriminatory social structures, both of which do not allow female talent to climb the career ladder.The search engine bias can contribute to the perpetuation of this gender imbalance.
In fact, this is not special to image search results for "professor."Women are underrepresented across job images and especially in high-powered positions (Kay  et al. 2015; Lam et al. 2018).Take one step further in this problem, and we end up with issues such as LinkedIn-a platform for professional networkingautocorrecting female names to male ones in its search function10 and Google showing much fewer prestigious job ads to women than to men. 11ost mainstream reactions criticize search engine rankings from commonly held values and principles: search engines should reflect the truth and be fair; they should promote equality and be useful; they should allow users to exercise agency and prevent harm; and so on.On close inspection, however, these commonly held values and principles fail to provide guidance and may even conflict with one another.In the next paragraphs, I briefly go over three values-accuracy, equality, and agency-to show how such simple guidance is inadequate for responding to this complex problem.
Accuracy One could argue that search engines, being a platform for information, should accurately reflect the world as it is.This would imply that the image results

Fig. 1 Female professors and Google
Did You Find It on the Internet?Ethical Complexities of Search Engine. . .

Fig. 2
Female professors and Bing should be revised and continuously updated to reflect the real-life gender ratio of professors.In contrast, the current search results portray the social perception about gender roles and prestigious jobs.Note that implementing accuracy in search results would require determining the scope of information: Should the results accurately reflect local or global ratio?And what other variables-such as race, ability, age, and socio-economic background-and their ratio should the results reflect accurately?
Equality Contesting prioritization of accuracy, one could argue that search engines should promote equality because they shape perception while presenting information, and simply showing the status quo despite the embedded inequalities would be unfair.If we interpret equality as equal representation, this would imply showing equal number of male and female professors.Implementing equal representation in search results would also require taking into account other abovementioned variables-such as race, ability, age, and socio-economic background-and equally representing all their combinations.A crucial question would then be, what would the search results look like if all possible identities are represented and would these results still be relevant, useful, or informative for the user?
Agency Contesting both accuracy and equality, one could argue that the system should prioritize user agency and choice by ranking the most clicked results at the top.This is not a strong argument.When conducting a search, users do not convey an informed and intentional choice through their various clicks.One could, however, incorporate user settings to the search engine interface to encourage user agency and provide them with a catalogue of setting options for ranking.Yet, since most people have psychological inertia and status quo bias, most users would still end up using the default (or easiest) setting-which brings us back to the initial question: What should the default ranking be?
An additional consideration must be the content of web pages that these images come from.It is not sufficient to have an ethically justifiable combination of images in the search results.It is also important that the web pages that link to these images follow a similar ethical framework.If, for example, search results show equal number of female and male images but the pages with female images dispute gender equality, this would not be a triumph of the principle of equality.
We could continue with other values such as well-being, efficiency, and democracy.They would yield even more different and conflicting ranking outcomes.Therein lies the problem.These important and commonly held values do not provide a straightforward answer.They are often in tension with each other.We all want to promote our commonly cherished and widely agreed-upon values.This is apparent from the Universal Declaration of Human Rights, from the principlism framework, and from the common threads within various sets of AI principles published around the world. 12But simply putting forth some or all of these values and principles and demanding them to be fulfilled is an unreasonable and impossible request, which do not get us very far in most cases (Canca 2019, 2020).

The Process and the End Product
Thus far I focused on the end product.What is the ethically justified composition for search engine results?But we also need to focus on the process: how did we end up with the current results, and what changes can or should be implemented?
Search engines use a combination of proxies for "relevance."In addition to the keyword match, this might include, for example, click rates, tags and indexing, page date, placing of the term on the website, page links, expertise of the source, and user location and settings. 13Search is a complex task, and the way that these proxies fit together changes all the time.Google reports that in 2020 they conducted over 600,000 experiments to improve their search engine and made more than 4500 changes.14Since search engine companies compete in providing the most satisfactory user experience, they do not disclose their algorithms.15Returning to our example, when we compare image search results for "professor" in Google, Bing, and DuckDuckGo, we see that while Google prioritizes Wikipedia image as the top result, Bing and DuckDuckGo refer to news outlet and blog images for their first ten results, excluding the image from the Wikipedia page. 16alue judgments occur in deciding which proxies to use and how to weigh them.Ensuring that the algorithm does not fall into ethical pitfalls requires navigating existing and expected ethical issues.Going back to our example, content uploaded and tagged by users or developers is likely to carry their implicit gender biases.Therefore, it is reasonable to assume that the pool of images tagged as "professor" would be highly white-male oriented to start with.Without any intervention, this imbalance is likely to get worse when users click on the results that match their implicit bias and/or when an algorithm tries predicting user choice and, thereby, user bias.

Conclusions
A comprehensive ethical approach to search engine rankings must take into account search engines' impact on individuals and society.The question is how to mitigate existing ethical issues in search engine processing and prevent amplifying them or creating new ones through user behavior and/or system structure.In doing so, values and principles can help us formulate and clarify the ethical problems at hand, but they cannot guide us to a solution.For that, we need to engage in deeper ethics analyses which provide insights to the value trade-offs and competing demands that we must navigate to implement any solutions to these problems.These ethics analyses should then feed into the public debate so that the discussion can go beyond initial arguments, reveal society's value trade-offs, and be informative for decisionmakers.