Skip to main content
Log in

Epistemology and anomaly detection in astrobiology

  • Published:
Biology & Philosophy Aims and scope Submit manuscript

Abstract

We examine the epistemological foundations of a leading technique in the search for evidence of life on exosolar planets. Specifically, we consider the “transit method” for spectroscopic analysis of exoplanet atmospheres, and the practice of treating anomalous chemical compositions of the atmospheres of exosolar planets as indicators of the potential presence of life. We propose a methodology for ranking the anomalousness of atmospheres that uses the mathematical apparatus of support vector machines, and which aims to be agnostic with respect to the particular chemical biosignatures of life. We argue that our approach is justified by an appeal to the “hinge” model of epistemic justification first proposed by as reported by Wittgenstein (On certainty, Blackwell, Oxford, 1969). We then compare our approach to previous work due to Walker et al. (Astrobiology 18(6):779–824, 2018) and Cleland (Astrobiology 19(6):722–729, 2019a; Does ‘life’ have a definition?, Cambridge University Press, New York, 2019b).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1

Similar content being viewed by others

Notes

  1. There is a rich recent tradition of philosophical commentary on astrobiology, not all of which we are able to comment on here; see for instance Fry (2000), Chela-Flores (2011), and Dick (2012).

  2. In taking this aspect of Seager and Bains’ approach as a scientific basis for a metabolism-based epistemology of life detection, we follow Knuuttila and Loettgers (2017).

  3. Having said this, some of our assumptions could, with more data on the chemical compositions of exosolar atmospheres, undergo a “sanity check” by ensuring that Earth is classified as having an anomalous chemical composition in its atmosphere, thereby identifying it as a potential location for life.

  4. That said, we do take the following quote from On Certainty to indicate that the hinge epistemological method is meant to be relevant to the assumptions used in science: “All testing, all confirmation and disconfirmation of a hypothesis takes place already within a system. And this system is not a more or less arbitrary and doubtful point of departure for all our arguments: no, it belongs to the essence of what we call an argument. The system is not so much the point of departure, as the element in which arguments have their life” (p. 107).

References

  • Achinstein P (2018) Speculation: within and about science. Oxford University Press, Oxford

    Book  Google Scholar 

  • Anet FA (2004) The place of metabolism in the origin of life. Curr opini chem Biol 8(6):654–659

    Article  Google Scholar 

  • Banerjee A, Burlina P, Diehl C (2006) A support vector method for anomaly detection in hyperspectral imagery. IEEE Trans Geosci Remote Sens 44(8):2282–2291

    Article  Google Scholar 

  • Bich L, Green S (2018) Is defining life pointless? operational definitions at the frontiers of biology. Synthese 195(9):3919–3946

    Article  Google Scholar 

  • Burrows AS (2014) Spectra as windows into exoplanet atmospheres. Proc Natl Acad Sci 111(35):12601–12609

    Article  Google Scholar 

  • Chela-Flores J (2011) The science of astrobiology: a personal view on learning to read the book of life, vol 20. Springer Science & Business Media, New York

    Book  Google Scholar 

  • Cleland CE (2019) Moving beyond definitions in the search for extraterrestrial life. Astrobiology 19(6):722–729

    Article  Google Scholar 

  • Cleland CE (2019) The quest for a universal theory of life: searching for life as we don’t know it, vol 11. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Cleland CE, Chyba C (2002) Defining ‘life’. Orig Life Evol Biosph 32(4):387–393

    Article  Google Scholar 

  • Cleland CE, Chyba C (2007) Does ‘life’ have a definition? In The nature of life: classical and contemporary perspectives from philosophy and science. Cambridge University Press, New York

    Google Scholar 

  • Currie A (2018) Rock, bone, and ruin: An optimist’s guide to the historical sciences. MIT Press, Cambridge

    Book  Google Scholar 

  • Currie A (2019) Epistemic optimism, speculation, and the historical sciences. Philos Theor Pract Biol 11:7

    Google Scholar 

  • Currie A (2021) Science & speculation. Erkenntnis. https://doi.org/10.1007/s10670-020-00370-w

    Article  Google Scholar 

  • Currie A, Sterelny K (2017) In defence of story-telling. Stud Hist Philos Sci Part A 62:14–21

    Article  Google Scholar 

  • Dick SJ (2012) Critical issues in the history, philosophy, and sociology of astrobiology. Astrobiology 12(10):906–927

    Article  Google Scholar 

  • Dyson F (1999) Origins of life. Cambridge University Press, Cambridge

    Book  Google Scholar 

  • Fry I (2000) The emergence of life on earth: a historical and scientific overview. Rutgers University Press, New Brunswick

    Google Scholar 

  • Ghafoori Z, Erfani SM, Rajasegarar S, Bezdek JC, Karunasekera S, Leckie C (2018) Efficient unsupervised parameter estimation for one-class support vector machines. IEEE Trans Neural Netw Learn Syst 29(10):5057–5070

    Article  Google Scholar 

  • Hörst SM (2017) Titan’s atmosphere and climate. J Geophys Res: Planets 122(3):432–482

    Article  Google Scholar 

  • Hörst SM, Vuitton V, Yelle RV (2008) Origin of oxygen species in titan’s atmosphere. J Geophys Res: Planets 113:E10

    Google Scholar 

  • Kauffmann S (1993) The origins of order. Oxford University Press, Oxford

    Google Scholar 

  • Kempes CP, Krakauer DC (2021) The multiple paths to multiple life. J Mol Evol 89(7):415–426

    Article  Google Scholar 

  • Kishimoto A, Buesser B, Botea A (2018) Ai meets chemistry. In: Thirty-Second AAAI Conference on Artificial Intelligence

  • Knuuttila T, Loettgers A (2017) What are definitions of life good for? transdisciplinary and other definitions in astrobiology. Biol Philos 32(6):1185–1203

    Article  Google Scholar 

  • Kolmogorov AN (1933) Foundations of the theory of probability: second, English. Courier Dover Publications, New York

    Google Scholar 

  • Laudan L (1981) A confutation of scientific realism. Philos Sci 48:19–49

    Article  Google Scholar 

  • Léger A, Pirre M, Marceau F (1993) Search for primitive life on a distant planet: relevance of 02 and 03 detections. Astron Astrophys 277:309

    Google Scholar 

  • Lv K-P, Norman L, Li Y-L (2017) Oxygen-free biochemistry: the putative CHN foundation for exotic life in a hydrocarbon world? Astrobiology 17(11):1173–1181

    Article  Google Scholar 

  • Machery E (2012) Why i stopped worrying about the definition of life... and why you should as well. Synthese 185(1):145–164

    Article  Google Scholar 

  • Narita N, Enomoto T, Masaoka S, Kusakabe N (2015) Titania may produce abiotic oxygen atmospheres on habitable exoplanets. Sci Rep 5:13977

    Article  Google Scholar 

  • Patel BH, Percivalle C, Ritson DJ, Duffy CD, Sutherland JD (2015) Common origins of RNA, protein and lipid precursors in a cyanosulfidic protometabolism. Nat Chem 7(4):301

    Article  Google Scholar 

  • Pritchard D (2017) Wittgenstein on hinge commitments and radical scepticism in on certainty. A companion to Wittgenstein. Blackwell, Oxford, pp 563–575

    Google Scholar 

  • Ralser M (2018) An appeal to magic? the discovery of a non-enzymatic metabolism and its role in the origins of life. Biochem J 475(16):2577–2592

    Article  Google Scholar 

  • Reichenbach H (1938) Experience and prediction: an analysis of the foundations and the structure of knowledge. University of Chicago Press, Chicago

    Google Scholar 

  • Schölkopf B, Platt JC, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13(7):1443–1471

    Article  Google Scholar 

  • Schrödinger E (1944) What is life? Cambridge University Press, Cambridge

    Google Scholar 

  • Seager S (2014) The future of spectroscopic life detection on exoplanets. Proc Natl Acad Sci 111(35):12634–12640

    Article  Google Scholar 

  • Seager S, Bains W (2015) The search for signs of life on exoplanets at the interface of chemistry and planetary science. Sci Adv 1(2):e1500047

    Article  Google Scholar 

  • Seager S, Bains W, Petkowski J (2016) Toward a list of molecules as potential biosignature gases for the search for life on exoplanets and applications to terrestrial biochemistry. Astrobiology 16(6):465–485

    Article  Google Scholar 

  • Spiegel DS, Turner EL (2012) Bayesian analysis of the astrobiological implications of life’s early emergence on earth. Proc Natl Acad Sci 109(2):395–400

    Article  Google Scholar 

  • Swain M (2010a) Exoplanet spectroscopy: a bright present, a brilliant future. In: EGU General Assembly Conference Abstracts, vol 12, p 7631

  • Swain MR (2010b) Finesse-a new mission concept for exoplanet spectroscopy. In: Bulletin of the American Astronomical Society, vol 42, p 1064

  • Vance DB, Jacobs JA (2005) Water, bacteria, life on mars, and microbial diversity. Water Encycl 4:746–748

    Google Scholar 

  • Walker SI, Bains W, Cronin L, DasSarma S, Danielache S, Domagal-Goldman S, Kacar B, Kiang NY, Lenardic A, Reinhard CT et al (2018) Exoplanet biosignatures: future directions. Astrobiology 18(6):779–824

    Article  Google Scholar 

  • Wittgenstein L (1969) On certainty, vol 174. Blackwell, Oxford

    Google Scholar 

  • Wright C (2004) Warrant for nothing (and foundations for free)? Aristotelian Society Supplementary Volume, vol 78. Wiley Online Library, New York, pp 167–212

    Google Scholar 

Download references

Acknowledgements

We are very grateful to Artemy Kolchinsky for detailed feedback on an earlier draft of this paper, to David Wolpert and Natalie Grefenstette for helpful discussions, and to audiences at Philosophy of Biology at the Mountains hosted by the University of Utah, an online audience at the meeting of the European Philosophy of Science Association, and the Uncovering the Laws of Life Workshop in Grindavik, Iceland. Christopher Kempes’ work on this project was supported by the National Aeronautics and Space Administration (Grant No. 80NSSC18K1140) and by Toby Shannan and Charities Aid Foundation of Canada (CAF).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Kinney.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A

Our measure of anomalousness is defined precisely as follows. Let \(\vec {w}_{i}\) be a vector with n entries, where each entry \(w_{ij}\) is the abundance of gas j from some subset of Seager et al.’s list in planet i’s atmosphere (namely, some subset of the set of gases that can be measured in a given context). Let the set of observed composition vectors be \(\mathbf {W}\). Our proposed anomalousness measure draws on support vector machine (SVM) based approaches to anomaly detection in high-dimensional space, as proposed by Banerjee et al. (2006). Specifically, we propose to measure the anomalousness \(\mathcal {A}(\vec {w}_{i},\mathbf {W})\) of the atmospheric composition of a given planet \(W_{i}\), in the context of a set of planets \(\mathbf {W}\) with cardinality m, using the following equation:

$$\begin{aligned} \mathcal {A}(\vec {w}_{i},\mathbf {W})=1-2\sum _{j|j\ne i}^{m}\alpha _{j}\exp \bigg (\frac{-||\vec {w}_{i}-\vec {w}_{j}||^{2}}{\sigma ^{2}}\bigg ) + \sum _{j|j\ne i}^{m}\sum _{k|k\ne i}^{m}\alpha _{j}\alpha _{k}\exp \bigg (\frac{-||\vec {w}_{j}-\vec {w}_{k}||^{2}}{\sigma ^{2}}\bigg ) \end{aligned}$$
(6)

See Banerjee et al. (2006) for a detailed derivation of this particular measure of anomalousness. Each \(\alpha _{j}\) and \(\alpha _{k}\) is a positive, real-number entry in a vector \(\vec {\alpha }\) in which each entry is a specific weighting parameter for each planet. The norm \(||\vec {w}_{i}-\vec {w}_{j}||\) denotes the Euclidean distance between the vectors \(\vec {w}_{i}\) and \(\vec {w}_{j}\). The scale parameter \(\sigma\) sets the shape of the distribution of the anomalousness of the planets in the dataset. Thus, if \(\vec {w}_{i}\) is an abnormally large distance from all of the other data vectors in the set, then the first term of the equation will be larger and the second term will be smaller, and so \(\mathcal {A}(\vec {w}_{i},\mathbf {W})\) will be larger, all else being equal. This justifies the claim that higher the value of \(\mathcal {A}(\vec {w}_{i},\mathbf {W})\), the greater the anomalousness of the atmospheric composition of the atmosphere of the planet \(W_{i}\).

The scale parameter \(\sigma\) and the weight vector \(\vec {\alpha }\) can be estimated from the available data. To estimate \(\sigma\), we use a technique from Ghafoori et al. (2018). Let \(\delta _{min}\) be the minimal Euclidean distance of any vector \(\vec {w_{i}}\) from its nearest neighbor in the data set \(\mathbf {W}\). Let \(\delta _{avg}\) be the average Euclidean distance of all vectors in \(\mathbf {W}\) from their nearest neighbor, excluding the vector that is \(\delta _{min}\) from its nearest neighbor. The scale parameter \(\sigma\) is estimated as follows:

$$\begin{aligned} \sigma = \frac{-\ln (\delta _{min}/\delta _{avg})}{\delta _{avg}^{2}-\delta _{min}^{2}} \end{aligned}$$
(7)

This method for estimating \(\sigma\) has proven successful at detecting anomalies in a series of empirical tests, using standard machine-learning data sets (Ghafoori et al., 2018, pp. 5064-5069). Following Schölkopf et al. (2001), we can set the weight vector \(\vec {\alpha }\) by solving the following optimization problem:

$$\begin{aligned} \min _{\vec {\alpha }}\sum _{j=1}^{m}\sum _{k=1}^{m}\alpha _{j}\alpha _{k}\exp \bigg (\frac{-||\vec {w}_{j}-\vec {w}_{k}||^{2}}{\sigma ^{2}}\bigg )\text {, s.t.} \ \sum _{i=1}^{m}\alpha _{i}=1 \end{aligned}$$
(8)

This ensures that \(\mathcal {A}(\vec {w}_{i},\mathbf {W})\) assigns as high anomalousness as possible to each planet \(W_{i}\).

Appendix B

To demonstrate the potential applicability of our proposed anomaly-detection method, we tested our method on simulated data. The simulation proceeded in the following steps:

  1. 1.

    Generate 90 gas abundance vectors by repeatedly sampling from a Dirichlet distribution \(\text {Dir}(1000,\vec {u})\), where \(\vec {u}=[u_{1},\dots ,u_{1000}]\) is a concentration parameter such that \(u_{i}=10\) for \(i\le 100\) and \(u_{i}=1\) for \(i>100\). This ensures that abundance vectors sampled from this distribution are expected to have abundances concentrated near the first 100 entries. This is meant to simulate observations from a typical, lifeless planet, on which certain gases are much more likely to be common than others. Call this sample of abundance vectors \(\mathbf {W_{A}}\).

  2. 2.

    Generate 10 gas abundance vectors by repeatedly sampling from a Dirichlet distribution \(\text {Dir}(1000,\vec {v})\), where \(\vec {v}=[v_{1},\dots ,v_{1000}]\) is a concentration parameter such that \(v_{i}=x\) for \(i\le 100\) and \(v_{i}=1\) for \(i>100\). For higher values of x, abundance vectors sampled from this distribution are expected to have abundances concentrated near the first 100 entries. This is meant to simulate observations from a typical, lifeless planet, on which certain gases are much more likely to be common than others. For lower values of x, all possible abundance vectors become increasingly likely. This is meant to simulate observations from planets that are possible hosts of life, where many more combinations of gas abundances are assumed to be possible. Call this sample of abundance vectors \(\mathbf {W_{B}}\).

  3. 3.

    Form the full data set \(\mathbf {W}=\mathbf {W_{A}}\cup \mathbf {W_{B}}\).

  4. 4.

    Compute the parameters \(\sigma\) and \(\vec {\alpha }\) of the anomalousness measure for the data set \(\mathbf {W}\).

  5. 5.

    Calculate the anomalousness measure \(\mathcal {A}(\vec {w},\mathbf {W})\) for each element of \(\mathbf {W}\).

  6. 6.

    Identify the ten most anomalous abundance vectors in \(\mathbf {W}\).

  7. 7.

    Record as y the proportion of the ten most anomalous vectors that are elements of \(\mathbf {W_{B}}\).

  8. 8.

    Repeat steps 1-7 for all possible values of x in the interval (1, 10), in increments of .2.

  9. 9.

    Repeat steps 1-8 fifty times, for a total of 460 data points.

Recall that lower values of the parameter x, which determines the concentration parameter for the distribution from which elements of \(\mathbf {W_{B}}\) are sampled, are meant to simulate cases in which the elements of \(\mathbf {W_{B}}\) represent planets that are possible hosts of life. Thus, we expect that as x increases, the proportion of abundance vectors with top-ten anomalousness in \(\mathbf {W}\) that are also elements of \(\mathbf {W_{B}}\) should decrease.

Fig. 2
figure 2

Plot showing the relationship between the value of x, which determines the concentration parameter for the distribution from which elements of \(\mathbf {W_{B}}\) are sampled, and the proportion of abundance vectors with top-ten anomalousness in \(\mathbf {W}\) that are also elements of \(\mathbf {W_{B}}\)

Indeed, this is what we observe in the results of our simulation. Figure 2 shows the proportion of top-ten anomalies from the set \(\mathbf {W}\) that are also elements of the set \(\mathbf {W_{B}}\), for each value of x in each simulation. For values of x lower than or equal to 6, the proportion of top-ten anomalous vectors that are elements of \(\mathbf {W_{B}}\) is tightly clustered around 1. For values of x greater than 6, this same proportion is declining linearly in x. We take this to be positive evidence for the in-principle utility of our measure for detecting the kinds of anomalies that we are interested in. It should be noted that while there is a fairly sharp transition in correctly identifying \(W_B\), knowing the value of x (or the analogous parameter(s) in a given setting) at which this transition occurs in an applied setting will require careful thought. As pointed out above, correct identification depends on the degree of anomalousness and thus, such methods are expected to fail when life is not very anomalous. Code for this simulation is available at https://github.com/davidbkinney/astrobiologyappendixB.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kinney, D., Kempes, C. Epistemology and anomaly detection in astrobiology. Biol Philos 37, 22 (2022). https://doi.org/10.1007/s10539-022-09859-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10539-022-09859-w

Navigation