1 Introduction

The discovery of a diffuse flux of astrophysical neutrinos, using High-Energy Starting Events (HESE) observed by IceCube [1] opened the possibility to study the Universe’s most powerful cosmic accelerators [2, 3]. HESE is an all-flavor, all-sky selection of events of predominantly astrophysical origin, with an analysis region above 60 TeV in deposited electromagnetic-equivalent energy in the detector. Tau neutrinos are expected to be produced only in tiny fractions at neutrino sources, but emerge due to neutrino oscillations over cosmic baselines [4]. For neutrinos from distant sources, the probability of a neutrino created with flavor \(\nu _{\alpha }\) to reach the detector as \(\nu _{\beta }\) is \(\langle P_{\nu _{\alpha } \rightarrow \nu _{\beta }} \rangle = \sum _i \vert U_{\alpha i}\vert ^2 \vert U_{\beta i} \vert ^2 \) [5, 6]. Thus, the neutrino flavor composition at Earth depends on the neutrino mixing matrix elements, \(U_{\alpha i}\), and the source flavor composition. For neutrinos from the decay of charged pions produced in hadronic interactions, with a source flavor composition of \(\nu _{e}{:}\nu _{\mu }{:}\nu _{\tau }=1/3{:}2/3{:}0\), we expect \(\nu _{e}{:}\nu _{\mu }{:}\nu _{\tau }=0.30{:} 0.36{:} 0.34\) at Earth (using the oscillation parameters from [7]), i.e., very close to equipartition (1/3:1/3:1/3). However, the environment at the neutrino production sites may influence the flavor composition, due to cooling or interactions of the charged particles produced in the hadronic interactions [8,9,10,11]. Therefore, the flavor composition of astrophysical neutrinos is a powerful probe of the environments of cosmic accelerators and can help constrain the source populations contributing to the observed neutrino flux. The neutrino flavor composition on Earth is also a sensitive probe of physics beyond the Standard Model (BSM) affecting neutrino propagation and modifying the flavor composition [12,13,14,15,16]; see [17] for BSM-constraints derived using the HESE selection.

Atmospheric neutrinos are a background to astrophysical neutrino searches. As atmospheric neutrinos are accompanied by muons born in the same cosmic-ray-induced shower, their contribution to a sample can be suppressed by muon-rejecting event selection criteria, e.g. by using the outer parts of the detector as a vetoing region. This effect, called atmospheric neutrino self-veto [18], is used in HESE [19]. Conventional atmospheric neutrinos are \(\nu _{e}\) and \(\nu _{\mu }\) from the decay of \(\pi ^{\pm }\) and \(K^{0,\pm }\) produced in the atmosphere by cosmic-ray interactions. At energies above \(\sim 100\) TeV, the atmospheric flux is expected to be increasingly dominated by the prompt component, originating from the decays of charmed hadrons (e.g. [20]). Tau neutrinos, produced from rare decays of \(D_s\) and \(D^{0,\pm }\), contribute only up to 5% to the yet unobserved prompt atmospheric neutrino component [21, 22]. This makes the observation of high-energy tau neutrinos a smoking-gun signature of cosmic neutrinos, but so far, none have been identified [23,24,25]. Previous flavor studies only separated the charged-current \(\nu _\mu \) contribution from other flavors, leading to a significant degeneracy between the \(\nu _e\) and \(\nu _{\tau }\) flavors [26,27,28]. Here, we present a new flavor composition measurement of astrophysical neutrinos with direct sensitivity to each of the neutrino flavors, performed on the HESE sample. A detailed description of the characteristics of the HESE sample and spectral fits to a diffuse astrophysical neutrino spectrum assuming flavor equipartition, as well as a detailed description of systematic uncertainties and their treatment are provided in [19]. There, the astrophysical neutrino spectrum was fit as a single power law,

$$\begin{aligned} \frac{\textrm{d}\varPhi _{\nu }}{\textrm{d}E} = \phi _{\nu } \cdot \left( \frac{E}{E_0} \right) ^{\gamma _{\text {astro}}}, \end{aligned}$$
(1)

where \(\phi _{\nu }\) is the all-flavor \(\nu +\overline{\nu }\) flux at \(E_0=100\) TeV and \(\gamma _{\text {astro}}\) is the spectral index. Their best fit values are \(\phi _{\nu }=6.4^{+1.5}_{-1.6} \, \times \, 10^{-18} \mathrm {\, GeV^{-1} \, cm}^{-2} \mathrm {\, s}^{-1} \mathrm {\, sr}^{-1}\), and \(\gamma _{\text {astro}}=-2.87^{+0.21}_{-0.19}\). The sample and associated results have been made available publicly through a dedicated data release [29].

This manuscript is structured as follows: Sect. 2 describes the signatures of neutrino interactions detected in IceCube and how they map to neutrino flavors; Sect. 3 illustrates the selection and classification of the detected events according to these various signatures; Sect. 4 summarizes the outcome of the classification and the characteristics of the found \(\nu _{\tau }\) candidates; in Sect. 5 the flavor composition constraints from this analysis are derived.

2 Neutrino signatures in the detector

In IceCube, neutrinos are detected by collecting the Cherenkov light emitted by charged secondary particles created in neutrino interactions. All neutral-current (NC) interactions produce showers of hadrons and are indistinguishable between flavors. In a charged-current (CC) interaction, the neutrino flavor can be inferred from the distinct Cherenkov light pattern produced by each flavor of charged lepton. Light depositions from a muon traversing the detector are called tracks and stem from \(\nu _{\mu }\) CC interactions, atmospheric muons, and \(\nu _{\tau }\) CC interactions where the tau decays to a muon (17% branching ratio). Single cascades consist of energy depositions at a single vertex and are produced by \(\nu _{e}\) CC and NC interactions of all flavors. At PeV energies, both tracks and single cascades can also emerge from the decays of W-bosons produced in resonant neutrino-electron scattering [30]. Double cascades are two energy depositions connected by a track of comparatively low light emission. They are produced by \(\nu _{\tau }\) CC interactions where the first cascade originates from the hadronic interaction of the \(\nu _{\tau }\) producing a tau, and the second cascade stems from the tau decaying to a hadronic or electromagnetic cascade (83% branching ratio) [4]. Due to their short livetime, taus have a short decay length of \(\langle L_{\tau } \rangle \sim 50\) m \(\cdot \, E_{\tau }\) / PeV, where \( E_{\tau }\) is the tau energy. This makes the distinction between single and double cascades challenging in IceCube, where the mean horizontal distance between light sensors, called Digital Optical Modules (DOMs), is 125 m. The HESE analysis defines a lower threshold on the deposited electromagnetic-equivalent energy in the detector of events, \(E_{\text {tot}}\), of 60 TeV (see below). Above this threshold it is possible to identify some of the \(\nu _{\tau }\) events as double cascades, if \(L_{\tau } > rsim 10\) m, breaking the degeneracy between \(\nu _e\) and \(\nu _{\tau }\) flavors present at lower energies.Footnote 1

3 Event selection and classification

Using the HESE selection, we have performed a new analysis of the IceCube data that incorporates major improvements with respect to previous publications [2, 3] in our understanding of the detector and the modelling of atmospheric backgrounds. The HESE selection is described in [2]. To pass, an event has to (1) start inside of the outermost layer of DOMs making up the “veto” layer, and (2) deposit more than 6000 photoelectrons in the detector. Muons radiate away energy throughout their passage through the ice, with the amount of light deposited increasing with increasing muon energy. It is thus extremely unlikely for atmospheric muons to pass the HESE selection criteria. Due to the atmospheric self-veto ([18], see also Sect. 1), accompanying muons also greatly reduce the number of downgoing atmospheric neutrinos present in the sample. To further enhance the fraction of astrophysical neutrinos in the sample, the analysis is restricted to events with a reconstructed total deposited energy \(E_{\text {tot}}\) above 60 TeV. Data collected between 2010 to 2017 using the original HESE selection [2], with a total livetime of 2635 days, have been reprocessed using a new and improved detector calibration. An improved model of the optical properties of the South Pole ice sheet [32], critical to the reconstruction of event properties, has been incorporated into the simulation and reconstruction, and an updated calculation of the atmospheric neutrino self-veto [33] is used. This new HESE sample has 60 events in the analysis region, i.e. with \(E_{\text {tot}}> 60\) TeV, and is described in detail in [19].

Fig. 1
figure 1

Length resolution for simulated tau neutrinos classified as double cascades for the best-fit spectrum [19]. With increasing length, the resolution improves while the event expectation drops. The inlay shows a schematic of a \(\nu _{\tau }\) CC interaction producing a double cascade with associated reconstruction parameters

Table 1 Steps for the ternary topological classification in order of precedence. For events failing the “preselection”, the likelihoods of the track and single-cascade fits are compared, and the topology with the higher likelihood fit is chosen

We use a classification algorithm developed on Monte Carlo (MC) simulated events and first applied to the 6-year HESE sample [25, 34] to classify the 60 events as single cascades, double cascades, or tracks (ternary event classification). It was developed with the goal of achieving a high \(\nu _{\tau }\) purity for the events assigned a double-cascade topology, while keeping misclassification fractions low for all topologies [35]. All events are reconstructed using maximum likelihood fits with different hypotheses: single cascade [36], track [37], and double cascade [36, 38]. For the fits, the timing and spatial information of the light collected in an event is used. The parameters of the double-cascade fit are (see also inset of Fig. 1): the energies of the interaction and decay cascade, \(E_1\) and \(E_2\) respectively; the spatial separation between them (called double-cascade length \(L_{\text {dc}}\) hereafter); the direction and vertex of the first cascade. The total energy \(E_{\text {tot}}\) of the event is the sum of all energy depositions obtained from a track energy unfolding; for double cascades this equals \(E_1+E_2\). The two cascades are assumed to be co-linear due to the large Lorentz boost.

A preselection removes events with a failed double-cascade fit from being further considered as double cascades. After preselection, three event properties are used to classify each event: double-cascade length, energy asymmetry, and energy confinement. The double-cascade length is a proxy for the tau lepton’s decay length with an average resolution of \(\sim 2\) m over the entire analysis range at the best-fit spectrum with flavor equipartition [19]. Figure 1 shows the reconstructed double cascade length as a function of the true double cascade length; the length resolution improves with increasing length as the cascades get better separated. The energy asymmetry is defined as \(A_E=(E_1-E_2)/(E_1+E_2)\). It can take values \(-1 \le A_E \le 1\), with the boundary values corresponding to single cascades. The energy confinement is defined as \(E_C=(E_{C1}+E_{C2})/E_{\textrm{tot}}\), where \(E_{Ci}\) are the energy depositions within 40 m of the i-th cascade vertex position. For the purpose of this calculation the vertices of the two cascades are taken directly from the double-cascade reconstruction, while the energy depositions are obtained through a track energy unfolding algorithm [36], and thus the confinement can take values \(0< E_{C} < 1\). Thus, for double cascades separated by \(\lesssim 80\) m the relation \(E_{\text {tot}}= E_1+E_2 = E_{C1}+E_{C2}\) holds. Events passing the requirements shown in the second column of Table 1 are classified as double cascades.

True single cascades typically have a small reconstructed double-cascade length and a large, positive energy asymmetry. True tracks typically have energy depositions all along their tracks, leading to low energy confinement values. True double cascades have \(E_C\) values very close to 1 even for separation lengths in excess of 80 m, due to the low relative brightness of the tau. By choosing a conservative requirement of \(E_C > 0.99\) for double cascades, the performed analysis does not lose sensitivity towards higher-energy \(\nu _{\tau }\) producing longer-lived \(\tau \) leptons. True double cascades show a flat distribution in \(A_E\) with a resolution of \(\sim 0.1\) at negative values of \(A_E\) and worsening towards positive values. Their double-cascade length is correlated to their total deposited energy and follows the exponentially falling distribution seen in the energy spectrum. Events failing the double-cascade requirements are classified according to the procedure shown in the last column of Table 1.

Note that the requirement of \(L_{\text {dc}}\ge 10\) m for double cascades leads to the majority of \(\nu _{\tau }\) induced events to be classified as single cascades. At the best-fit spectrum with flavor equipartition [19], we expect \(\sim 15\) \(\nu _{\tau }\) events, of which \(\sim 12\) interact via the double cascade channel. But only \(\sim 3\, (22.7\%)\) of those are expected to produce a tau that travels at least 10 m before decaying. \(42.3 \%\) of simulated double cascades with tau decay lengths above 10 m pass the double cascade requirements in Table 1. The total efficiency of the ternary topological classification chain for double cascades is \(12.2\%\). \(1.9\%\) of all \(\nu _{e}\) and \(\nu _{\mu }\) induced events are expected to be misclassified as double cascades.

Glacial ice at South pole flows at a rate of \(\sim 10\) m per year. It has recently been observed [39] that the optical properties of glacial ice at South Pole vary as a function of the direction with respect to the flow of the glacial ice. This ice anisotropy is one of the limiting factors on the selection of double cascades: directional distortions of Cherenkov light patterns can lead to a misclassification of single cascades as double cascades. See Appendix C for details on the ice anisotropy treatment.

4 Results of the topological classification

The 60 events are classified into 41 single cascades, 17 tracks, and 2 double cascades. These are the first double cascades in the signal region and the first astrophysical tau neutrino candidate events. The reconstructed properties of the double cascades are shown in Table 2. As the average tau decay length scales with the tau energy \(\langle L_{\tau } \rangle \sim 50\) m \(\cdot \, E_{\tau }\) / PeV, which depends on the energy of the incoming \(\nu _{\tau }\) as \(\langle E_{\tau } \rangle \sim 0.7\cdot \, E_{\nu _{\tau }}\), the double cascades length \(L_{\text {dc}}\) scales with the total deposited energy \(E_{\text {tot}}\). Two-dimensional MC probability distribution functions (PDFs) of reconstructed total deposited energy \(E_{\text {tot}}\) versus reconstructed double cascade length \(L_{\text {dc}}\) for signal and background contributions to events classified as double cascades are shown in Fig. 2 with the data events overlaid as white circles. For \(\nu _{\tau }\)-induced double cascades (top panel), a clear correlation between \(E_{\text {tot}}\) and \(L_{\text {dc}}\) can be seen. Background events (bottom panel) cluster at the thresholds in \(E_{\text {tot}}\) due to the falling spectrum and in \(L_{\text {dc}}\) since single cascades typically have very small reconstructed \(L_{\text {dc}}\). The regions containing 68%, 90%, and 95% of true single cascades misclassified as double cascades are marked by vertical white lines, i.e. 68% of the true single cascades misclassified as double cascades have \(L_{\text {dc}}<14.4\) m, while 90% have \(L_{\text {dc}}<20.4\) m. The tilted white lines show the region within which 95% of the signal are contained. Few events are expected in the parameter space of event #1, while there are contributions expected from both signal and background in the parameter space of event #2. For single cascades and tracks, the properties total deposited energy, \(E_{\text {tot}}\), and cosine of the zenith angle in detector coordinates, \( \cos (\theta _z)\), are used to distinguish atmospheric and astrophysical contributions. The PDFs shown in Fig. 2 and the corresponding PDFs for single cascades and tracks described above are used in the all-flavor analyses presented in [19].

Table 2 Reconstructed properties of the two double cascades. Uncertainties are \(\sim \) 10% for the deposited energy and \(\sim \) 2 m for the double-cascade length
Fig. 2
figure 2

Two-dimensional MC PDFs showing total reconstructed energy versus reconstructed double-cascade length for the double-cascade subsample with data points, using the best fit to the atmospheric and astrophysical components with the flavor composition of astrophysical neutrinos fixed to 1:1:1 [19]. In the signal (\(\nu _{\tau }\)-induced double-cascade events) histogram (top), the region containing \(95\%\) of the expected signal is indicated with white dotted lines. In the background (all remaining events) histogram (bottom), the white vertical dotted lines mark the regions containing \(68\%,\,90\%,\) and \(95\%\) of the single-cascade induced background. In both histograms the two tau neutrino candidates are overlaid as white circles

Fig. 3
figure 3

Double-cascade event #1 (2012). The reconstructed double-cascade vertex positions are indicated as grey circles, the direction indicated with a grey arrow. The size of the circles illustrates the relative deposited energy, the color encodes relative time (from red to blue). Bright and saturated DOMs are excluded from this analysis

4.1 Double-cascade event characteristics

An event view of event #1, observed in 2012 and nicknamed “Big Bird” [3], is shown in Fig. 3. For several DOMs, the photon counts as a function of time are displayed alongside the predicted photon count distributions for single- and double-cascade hypotheses. The double-cascade hypothesis fits the observed data better than the single-cascade hypothesis. However, this event has several saturated and bright DOMs that were excluded from the analysis, a standard procedure for high-energy IceCube analyses [40, 41]. A DOM is called saturated if the signal in the PMT exceeds the dynamic range of the readout electronics. A DOM is called bright if it has collected ten times more light than the average DOM for an event. Only statistical uncertainties on photon count rates are included in the likelihoods of the reconstruction algorithms [36,37,38]. At the highest observed energies, bright DOM signals have very small statistic uncertainties and can therefore lead to misreconstructions due to the lack of proper systematic uncertainty terms in the likelihood. For comparison of predicted photon counts for each hypothesis, the bright DOMs are displayed in Fig. 3.

An event view of event #2, observed in 2014 and nicknamed “Double Double,” is shown in Fig. 4. The two vertices of the cascades cannot be spatially resolved by eye, highlighting the need for the algorithmic topological classification employed in this work. Analogous to Fig. 3, collected photon counts as a function of time are displayed together with the predicted photon count distributions for single- and double-cascade hypotheses. The predicted photon count PDFs differ remarkably between the single- and double-cascade hypothesis, with the single-cascade hypothesis disfavored.

Fig. 4
figure 4

Double-cascade event #2 (2014). The reconstructed double-cascade vertex positions are indicated as grey circles, the direction indicated with a grey arrow. The size of the circles illustrates the relative deposited energy, the color encodes relative time (from red to blue). Bright DOMs are excluded from this analysis

Data from DOMs labeled as bright were excluded from the analysis , but are used for the comparison of predicted photon count PDFs in Fig. 4.

Fig. 5
figure 5

Distribution of the ratio of double-cascade length to reconstructed decay-cascade energy (top), and of the reconstructed energy asymmetry (bottom) in the double-cascade subsample split by flavor content for the best-fit astrophysical and atmospheric spectra assuming flavor equipartition [19]. The values of the two double cascades are shown. Regions outside of the energy asymmetry values required for double cascades are marked in grey

Figure 5 shows the distribution of the ratio of the double-cascade length \(L_{\text {dc}}\) to reconstructed decay-cascade energy \(E_2\) (top panel) and the energy asymmetry \(A_E\) (bottom panel) of simulated events and data for the best-fit spectrum given in [19]. The distributions were not part of the topological classification chain. While the correlation between \(L_{\text {dc}}\) and \(E_{\text {tot}}\) is clear on average, there are large fluctuations in energy transfer from parent to daughter particle. Therefore, on the per-event basis, the more direct correlation between the double-cascade length \(L_{\text {dc}}\) and the decay-cascade energy \(E_2\) proves more informative. Event #1 has a length-to-energy ratio in a region where the \(\nu _{\tau }\) contribution is larger than the background contribution, but outside of 90% of the simulated \(\nu _{\tau }\)-induced double cascades. Its high energy asymmetry is in a region with a background expectation which is on the order of the signal expectation. Event #2 has a length-to-energy ratio at the peak of the distribution for \(\nu _{\tau }\)-induced double cascades and an energy asymmetry value in a highly signal-dominated region. None of the classified double cascades are in a phase space greatly affected by the ice anisotropy.

Table 3 Parameter space for resimulated events. The upper value of the primary energy depends on the interaction type, reflecting the spread of visible energy losses typical of that interaction. The visible energy is the energy transformed into light, it equals the total energy deposited in the detector for electromagnetic showers and is lower for hadronic showers and events with final-state muons or neutrinos. \(r-r_{\textrm{evt}}\) is the two-dimensional distance in the xy-plane. The values in parentheses are for \(\nu _{\mu }\) CC events

4.2 Tau neutrino probability assessment

To quantify the compatibility with a background hypothesis (i.e., not \(\nu _{\tau }\)-induced) for the actual \(\nu _{\tau }\) candidate events observed, a targeted MC simulation for each event was performed, consisting of simulation of \(\nu _{e}\), \(\nu _{\mu }\), and \(\nu _{\tau }\) interactions. In addition, for “Double Double,” also atmospheric muons were simulated. However, none of the \(1.2 \cdot 10^{10}\) generated muons passed the HESE veto undetected. See Table 3 for details on the restricted parameter space and Appendix A for a description of how this parameter space was chosen. Using targeted MC simulation for the analysis of exceptional events is a method often employed in IceCube [3, 30, 42, 43]. These new MC events were filtered and reconstructed in the same way as the initial MC and data events. In total, \(\sim 2 \cdot 10^7\) “Double-Double”-like events and \(\sim 1 \cdot 10^6\) “Big-Bird”-like events from the targeted simulation pass the HESE selection criteria. A breakdown of simulated event types and their fractions passing the HESE double cascade selection criteria can be found in Appendix A.

We define the tauness, \(P_{\tau }\), as the posterior probability for each event to have originated from a \(\nu _{\tau }\) interaction, which can be obtained with Bayes’ theorem:

(2)

In the first line we have simply split the total probability of an event at the observed parameter space \(\mathbf {\eta }_{\textrm{evt}}\) into its \(\nu _{\tau }\) and non- \(\nu _{\tau }\) (written ) components in Bayes’ theorem. In the second line we identify \(P(\mathbf {\eta }_{\textrm{evt}}| \nu _{\tau })\) with the PDFs for \(\nu _{\tau }\), and express the prior probability \()\) as the fraction of expected \(\nu _{\tau }\) events evaluated at the observed parameter space of each event, \(\mathbf {\eta }_{\textrm{evt}}\), obtaining the differential number of expected events \(N_{\nu _{\tau }} P_{\nu _{\tau }}(\mathbf {\eta }_{\textrm{evt}})\) (and analogous for the non- \(\nu _{\tau }\) components indicated as ).

For each tau neutrino candidate, the differential expected number of events at the point \(\mathbf {\eta }_\mathrm{{evt}}\), \(N_{\nu _{\tau }} P_{\nu _{\tau }}(\mathbf {\eta }_\mathrm{{evt}})\) and is approximated from the targeted simulation sets using a multidimensional kernel density estimator (KDE) with a gaussian kernel and the Regularization Of Derivative Expectation Operator (rodeo) algorithm [44]. The rodeo algorithm provides an unbiased and computationally efficient way to find the optimal bandwidth in d dimensions for a d-dimensional set of n events. In the rodeo the bandwidth is reduced as long a the derivative of the kernel density estimate with respect to its bandwidth is large compared to its variance. The obtained optimal bandwidth for each considered dimension balances the relevance of the variable with the sparsity of the dataset at the evaluated point. The eight dimensions used in evaluating the tauness include the six dimensions (\(d=6\)) of the restricted parameter space that the resimulation was carried out in: total deposited energy \(E_{\text {tot}}\), vertex position (x, y, z) and direction (\(\theta , \phi \)). Further, a region of interest is defined in the parameters not restricted during resimulation but used in the double-cascade classification: double-cascade length \(L_{\text {dc}}\) and energy asymmetry \(A_E\) [45]. The region of interest is obtained by slowly decreasing a two-dimensional box around the observed parameters as long as the statistical errors from the limited targeted MC stay below 10%. This procedure was established using the produced MC in a sideband region.

Having defined \(\mathbf {\eta }_{\textrm{evt}}= (E_{\text {tot}}, x, y, z, \theta , \phi , L_{\text {dc}}, A_E)\), and approximating

$$\begin{aligned} N_{\nu _{\tau }} P_{\nu _{\tau }}(\mathbf {\eta }_{\textrm{evt}}) \approx \hat{f}_{\nu _{\tau }}(\mathbf {\eta }_{\textrm{evt}}, \hat{h}_{\nu _{\tau }}) \end{aligned}$$
(3)

and

(4)

one obtains the tauness

$$\begin{aligned} P_{\tau }=\frac{\hat{f}_{\nu _{\tau }}(\mathbf {\eta }_{\textrm{evt}}, \hat{h}_{\nu _{\tau }})}{\sum _{\alpha = e, \mu , \tau } \hat{f}_{\nu _{\alpha }}(\mathbf {\eta }_{\textrm{evt}}, \hat{h}_{\nu _{\alpha }})}. \end{aligned}$$
(5)

Here, \(\hat{f}_{\nu _{\alpha }}(\mathbf {\eta }_{\textrm{evt}}, \hat{h}_{\nu _{\alpha }})\) is the density of \(\nu _{\alpha }\) for the optimal bandwidth \(\hat{h}_{\nu _{\alpha }}\) determined by the rodeo algorithm in the region of interest. Originally developed for unweighted events, we extend the rodeo formalism to weighted events according to the procedure in [46]: Each of the n simulated events has a weight \(w_i\), with \(i = {1,\ldots ,n}\). We use the effective number of events \(n_{\textrm{Eff}}=(\sum _i w_i)^2 / \sum _i(w_i^2) \), and their effective weight \(w_{\textrm{Eff}}=\sum _i w_i^2 / \sum _i w_i\).

Note that the tauness is always evaluated under certain assumptions for the flux parameters. Computing the tauness for each of the events to originate from a \(\nu _{\tau }\) interaction for the best-fit spectrum given in [19] with a 1/3:1/3:1/3 flavor composition yields \(P_{\tau \, \mathrm {best\,fit}}^{\textrm{BB}} \approx 75\%\) for “Big Bird,” and \(P_{\tau \, \mathrm {best\,fit}}^{\textrm{DD}} > rsim 97\%\) for “Double Double.” For “Double Double,” the statistics of the generated MC are not sufficient to evaluate the tauness to a higher precision. The tauness weakly depends on the astrophysical spectral index and decreases by \(\sim 1\%\) for a softening of \(\gamma _{\text {astro}}\) by one unit.

We sample the posterior probability in the flavor composition, obtained by leaving the source flavor composition unconstrained and taking the uncertainties in the neutrino mixing parameters into account. When using the best-fit spectra given in [19] but varying the source flavor composition over the entire parameter space (i.e. \(\nu _{e}{:}\nu _{\mu }{:}\nu _{\tau }=a{:}b{:}1-a-b\) with \(0\le a,b \le 1\) and \(a+b\le 1\) at source), and the mixing parameters in the global fit NuFit4.1 [7] \(3\sigma \) allowed range, the tauness is \((97.5_{-0.6}^{+0.3})\%\) for “Double Double” and \((76_{-7}^{+5})\%\) for “Big Bird.”

“Double Double” is also identified as a candidate tau neutrino event in two complementary analyses using the double pulse method to search for tau neutrinos that have been performed while this analysis was ongoing [47, 48].

5 Flavor composition analysis

A multi-component maximum likelihood fit is performed on the three topological subsamples using PDFs obtained from MC simulations. We account for the uncertainty due to limited MC statistics by using a variant of the effective likelihood \(\mathcal {L}_\textrm{Eff}\), a generalized Poisson likelihood, presented in [46] and employed in [19]. This joint likelihood is composed of the contributions from the independent subsamples single cascades, double cascades, and tracks (\(\text {SC, DC, and T}\), respectively):

$$\begin{aligned} \mathcal {L}_\textrm{Eff}(\mathbf {\theta }) = \prod _{t} \prod _{j} \mathcal {L}_\textrm{Eff}^t \left( \mu _j(\mathbf {\theta });\sigma _j(\mathbf {\theta });d_j\right) , \end{aligned}$$
(6)

where \(\mathbf {\theta }\) are the model parameters, j are the analysis bins, \(\mu _j\) is the expected number of events and the variance in the j-th bin with statistical uncertainty \(\sigma _j\), \(d_j\) is the observed number of events in the j-th bin, and \(t=({\textrm{SC, DC, T}})\) are the event topologies. Each simulated event i has a weight \(w_i\) which depends on the model parameters \(\mathbf {\theta }\). The expected number of events is a product of the effective number of simulated events \(n_{\textrm{Eff}}\) and the effective weight, \(w_{\textrm{Eff}}\) introduced in Sect. 4.2: \(\mu =w_{\textrm{Eff}}n_{\textrm{Eff}}\).

For all topologies, the contributions from atmospheric and astrophysical neutrinos as well as atmospheric muons are taken into account in the likelihood analysis. The conventional atmospheric neutrino component is modeled according to the HKKMS calculation [49, 50], the prompt atmospheric neutrino component is modeled following the BERSS [20] (for \(\nu _{e}, \nu _{\mu }\)) and MCEq [22] (for \(\nu _{\tau }\)) calculations. MCEq is using the SIBYLL-2.3c [51] model. The muon component is simulated using MUONGUN [52] which samples single muons from templates generated by CORSIKA [53] weighted to the Hillas–Gaisser-H4a cosmic-ray model [54] and employing the SIBYLL-2.1 hadronic interaction model [55] in the shower development. For the spectrum of the astrophysical neutrino flux \(\varPhi _{\text {astro}}\), a single power law with a common spectral index \(\gamma _{\text {astro}}\) for all flavors is used,

$$\begin{aligned} \frac{\textrm{d}\varPhi _{\text {astro}}}{\textrm{d}E} = \sum _{\alpha } \, \phi _{\nu _{\alpha }} \cdot \left( \frac{E}{E_0} \right) ^{\gamma _{\text {astro}}}, \end{aligned}$$
(7)

where \(\phi _{\nu _{\alpha }}\) is the astrophysical normalization of the \(\nu +\overline{\nu }\) flux of flavor \(\alpha \) at \(E_0=100\) TeV.

While for single cascades and tracks, atmospheric contributions pose the main background to the astrophysical signal, the main background to \(\nu _{\tau }\)-induced double cascades arises from misclassified astrophysical \(\nu _e\) and \(\nu _{\mu }\). The background contributions from atmospheric neutrinos are small (0.2 events in 7.5 years expected), while those from penetrating atmospheric muons and prompt atmospheric \(\nu _{\tau }\) are negligible (0 and 0.04 events in 7.5 years expected, respectively).

The systematic uncertainties are given in Table 5 found in Appendix C (reproduced from Table V of [19]), and are included in this analysis in the same way as in [19]. The main systematic uncertainty affecting the double-cascade reconstruction is the anisotropy of the light propagation in the ice [32, 56].

While in [19, 57,58,59], the total likelihood is maximized assuming flavor equipartition, here we fit the three flavors’ fractions \(f_{\alpha }\) of the overall astrophysical normalization \(\varPhi _{\text {astro}}\), \(f_{\alpha }=\varPhi _{\nu _\alpha }/ \varPhi _{\text {astro}}\), with the constraint \(f_e +f_{\mu } +f_{\tau } =1\). To perform the flavor composition measurement using the multidimensional KDE, the likelihood is modified compared to the analyses in [19]. In the joint likelihood for the three topologies, \(\mathcal {L}_\textrm{Eff}= \mathcal {L}_\textrm{Eff}^\textrm{SC} \mathcal {L}_\textrm{Eff}^\textrm{T} \mathcal {L}_\textrm{Eff}^\textrm{DC}\) [19], \(\mathcal {L}_\textrm{Eff}^\textrm{DC}\) is replaced by the extended unbinned likelihood for the double-cascade events,

$$\begin{aligned} \mathcal {L_\textrm{Rodeo}^\textrm{DC}} = e^{-\sum _c N_c} \prod _{\textrm{evt}}\left( \sum _c N_c P_c(\mathbf {\eta }_{\textrm{evt}})\right) , \end{aligned}$$
(8)

where c are the flux components used in the fit, \(c = \nu _{\text {astro},\alpha }, \nu _{\text {conv}, \alpha }, \nu _{\text {prompt}, \alpha }, \mu _{\text {atm}}\) for the flavors \(\alpha =e, \mu , \tau \). \(N_c P_c(\mathbf {\eta }_{\textrm{evt}})\) is computed using the rodeo algorithm introduced in Sect. 4.2. The aforementioned slight dependence on \(\gamma _{\text {astro}}\) is parametrized in the extended double-cascade likelihood \(\mathcal {L}^\textrm{DC}_\textrm{Rodeo}\) by evaluating \(N_c P_c(\mathbf {\eta }_{\textrm{evt}})\) as a function of \(\gamma _{\text {astro}}\).

Fig. 6
figure 6

Measured flavor composition of IceCube HESE events with ternary topology ID and extended multi-dimensional analysis of the double cascades (black). Contours show the \(1\sigma \) and \(2\sigma \) confidence intervals assuming Wilks’ theorem [60] holds. The shaded regions show previously published results [27, 61] without direct sensitivity to the tau neutrino component. Flavor compositions at source and after propagation expected from various astrophysical neutrino production mechanisms (see, e.g., [9]) are marked, and the entire accessible range of flavor compositions assuming standard 3-flavor mixing is shown

The result of the flavor composition measurement is shown in Fig. 6. The fit yields

$$\begin{aligned} \frac{\textrm{d}\varPhi _{\text {astro}}}{\textrm{d}E}= & {} 7.4_{-2.1}^{+2.4} \cdot \left( \frac{E}{100\text {~TeV}} \right) ^{-2.87[-0.20,+0.21]}\nonumber \\ {}{} & {} \cdot 10^{-18} \cdot \mathrm {\, GeV^{-1} \, cm}^{-2} \mathrm {\, s}^{-1} \mathrm {\, sr}^{-1}, \end{aligned}$$
(9)

with a best-fit flavor composition of

$$\begin{aligned} \nu _{e}{:}\nu _{\mu }{:}\nu _{\tau }=0.20{:}0.39{:}0.42. \end{aligned}$$
(10)

Comparing this result with previously published results of the flavor composition also shown in Fig. 6 clearly shows the advantages of the ternary topological classification. The best-fit point is non-zero in all flavor components for the first time, and the degeneracy between the \(\nu _{e}\) and \(\nu _{\tau }\) fraction is broken. The small sample size of 60 events in this analysis and the lower sensitivity of the HESE sample to \(\nu _{\mu }\) than to \(\nu _{e}\) and \(\nu _{\tau }\) flavors both lead to an increased uncertainty on the \(\nu _{\mu }\) fraction as compared to [27, 61].

The test statistic \(\text {TS}= -2 \left( \ln \mathcal {L}(\phi _{\nu _{\tau }}^0) - \ln \mathcal {L}(\phi _{\nu _{\tau }}^{\text {b.f.}}) \right) \) compares the likelihood of a fit with a \(\nu _{\tau }\) flux normalization fixed at a value \(\phi _{\nu _{\tau }}^0\) to the free fit where \(\phi _{\nu _{\tau }}\) assumes its best-fit value, \(\phi _{\nu _{\tau }}^{\text {b.f.}}\). Evaluated at \(\phi _{\nu _{\tau }}^0=0\) and using Wilks’ theorem, it gives the significance at which a vanishing astrophysical tau neutrino flux can be disfavored. The test statistic is expected to follow a half-\(\chi ^2_{k}\) distribution with \(k=1\) degree of freedom [62]. The validity of Wilks’ theorem was tested with pseudo-MC trials as described in Appendix B. The observed test statistic is TS \(=6.5\), which translates to a significance of \(2.8\sigma \), or a p-value of 0.005. A one-dimensional scan of the astrophysical \(\nu _{\tau }\) flux normalization is performed with all other components of the fit profiled over. The \(1 \sigma \) confidence intervals are defined by TS \(\le 1\), and the astrophysical tau neutrino flux normalization is measured to

$$\begin{aligned} \phi _{\nu _{\tau }}=3.0_{-1.8}^{+2.2} \cdot 10^{-18} \mathrm {\, GeV^{-1} \, cm}^{-2} \mathrm {\, s}^{-1} \mathrm {\, sr}^{-1}. \end{aligned}$$
(11)

This constitutes the first indication for tau neutrinos in the astrophysical neutrino flux.

6 Summary and outlook

Seven and a half years of HESE events were analyzed with new analysis tools. The previously shown data set was reprocessed with improved detector calibration. A flavor composition measurement was performed using a ternary topological classification directly sensitive to tau neutrinos, which breaks the degeneracy between \(\nu _{e}\) and \(\nu _{\tau }\) events that is present in a binary classification scheme (into tracks and cascades). This analysis found the first two double cascades, indicative of \(\nu _{\tau }\) interactions, with an expectation of 1.5 \(\nu _{\tau }\)-induced signal events and 0.8 \(\nu _{e,\mu }\)-induced background events for the best-fit single-power-law spectrum with flavor equipartition [19]. The first event, “Big Bird,” has an energy asymmetry at the boundary of the selected interval for double cascades. For the second event, “Double Double,” the photon arrival pattern is well described with a double-cascade hypothesis, but not with a single-cascade hypothesis. A dedicated a posteriori analysis was performed to determine the compatibility of each of the events with a background hypothesis, based on targeted MC. The analysis confirms the compatibility of “Big Bird” with a single cascade, induced by a \(\nu _e\) interaction, at the 25% level. A “Big Bird”-like event is \(\sim 3\) (15) times more likely to be induced by a \(\nu _{\tau }\) than a \(\nu _e\) (\(\nu _{\mu }\)), the result being only weakly dependent on the astrophysical spectral index. “Double Double” is \(\sim 80\) times more likely to be induced by a \(\nu _{\tau }\) than either a \(\nu _e\) or a \(\nu _{\mu }\). All background interactions have a combined probability of \(\sim 2\%\), almost independent of the spectral index of the astrophysical neutrino flux.

Using a novel extended likelihood for double cascades, which allows for the incorporation of a multi-dimensional PDF as evaluated by a kernel density estimator, the flavor composition was measured. The best fit is \(\nu _e {:} \nu _{\mu } {:} \nu _{\tau } = 0.20{:}0.39{:}0.42\), consistent with all previously published results by IceCube [27, 61], as well as with the expectation for astrophysical neutrinos assuming standard 3-flavor mixing. The astrophysical tau neutrino flux is measured to:

$$\begin{aligned} \frac{\textrm{d}\varPhi _{\nu _{\tau }}}{\textrm{d}E}= & {} 3.0_{-1.8}^{+2.2} \left( \frac{E}{100\text {~TeV}} \right) ^{-2.87[-0.20,+0.21]}\nonumber \\ {}{} & {} \cdot 10^{-18} \cdot \mathrm {\, GeV^{-1} \, cm}^{-2} \mathrm {\, s}^{-1} \mathrm {\, sr}^{-1}. \end{aligned}$$
(12)

A zero \(\nu _{\tau }\) flux is disfavored with a significance of \(2.8 \sigma \), or, \(p = 0.005\).

A limitation of the analysis presented here is the small sample size of 60 events. Merging the HESE selection with the contained cascades event selection [40] is expected to enhance the number of identifiable \(\nu _{\tau }\) events by \(\sim 40 \%\) [63]. Due to the small effective volume for \(\nu _{\mu }\)-CC interactions of HESE, the \(\nu _{\mu }\) fraction of the astrophysical neutrinos has large uncertainties. Work on updating the joint analysis of multiple event selections [27] is ongoing, where the strongest contribution to constraining the \(\nu _{\mu }\) fraction is expected from through-going muons [41, 64]. A few years from now, the IceCube Upgrade [65] will greatly improve our knowledge and modeling of the optical properties of the South Pole ice sheet, which the \(\nu _{\tau }\)-identification, via the double-cascade method, is sensitive to. The better modeling is expected to lead to a better distinction between single and double cascades around and below the length threshold of 10 m applied in this analysis. The planned IceCube-Gen2 facility [66] will provide an order-of-magnitude larger sample of astrophysical neutrinos and enable a precise measurement of their flavor composition, allowing to distinguish between neutrino production mechanisms [8,9,10,11] with high confidence.