Introduction

Applications of single-photon detectors, which focus on the timing of a very weak optical signal, mostly use single-photon avalanche diodes (SPADs) operated above their breakdown voltage in Geiger mode. Such as in photonics, quantum processing tasks, laser ranging, fluorescence microscopy, neural imaging with blood flow tomography, contrast-enhanced MRI, two-photon luminescence imaging, astronomical telescopes, etc.1,2,3,4,5,6,7. They are widely manufactured and sold by many different companies and as such exhibit different properties/behavior. To efficiently and accurately perform these experiments we must account for all non-ideal behavior of the detectors used8,9. This is especially true for sensitive applications like quantum communication, because the security of any real world implementation (i.e. with a high transmission loss) depends on the devices used. Hence the precise modeling/characterization of the non-ideal behavior of single photon detectors (and all other components of the quantum communication device) are critical for practical security proofs.

An ideal single-photon detector generates one and only one electric pulse for every incident photon. However, in a real detector, it is possible that a single incident photon results in more than one electrical pulse per incident photon. This is known as afterpulsing8. In this work we, approach this problem from the perspective of an end user. Consequently, we use an ipso-facto definition of an “afterpulse” as any pulse in addition to and following an isolated detection event and its subsequent dead time, regardless of its etiology. This behavior has been extensively studied due to its importance in semiconductor physics in general. It has been suggested that afterpulsing can be linked to charges trapped in the deep levels of the semiconductor’s band structure and released at a later time10 as well as to an other causes (such as those described in refs11,12,13,14.

Afterpulsing has different implications, depending on the application of the detector used. It can result in an overestimation of the total count rate by up to 10% as well as a reduction of the duty cycle and detection efficiency due to the increased dead times. In fluorescence microscopy it could lead to an overestimation of the concentration of fluorophores. In quantum communication the overestimation of coincidence events leads to a larger Quantum Bit Error Rate (QBER). Afterpulsing can also adversely affect security15. Afterpulsing also poses a significant problem to the measurement of photon arrival times both in quantum communication protocols as well as for laser ranging. Hence a proper characterization of the afterpulse behaviour could improve the accuracy of several measurements in metrology in general.

Several previous works have attempted to characterize the afterpulse behavior and fit the results to various models10,16,17,18,19,20,21. Worryingly, these papers do not agree on the most suitable model describing the statistics of the arrival times. Most of these studies have focused on one make/manufacture of detector at a time. In this work we take a more comprehensive approach and perform a comparative study on 3 different makes of detectors (for each make we compare up to 12 individual detectors with almost consecutive serial numbers). We test the universality of the various theoretical models and attempt to simply resolve the conflicts raised by previous works. To do so we eschew the common practice of studying the timing auto-correlation of the detector signals in favor of the more comprehensive timing cross-correlation between the emission of a photon and all the following detector signals. We describe this procedure in Section 3. We then fit to the various standard models described in Section 2 and compare these fits in Section 4.1. Due to our use of cross-correlations we were also able to detect higher orders of afterpulses which we present in Section 4.2 and finally in Section 4.3 we discuss the corrections that can be applied.

Canonical Models of Afterpulsing

Much of the previous work on afterpulsing in SPADs focused on modeling their behavior. In this section we present the three most common models. We go on to show the inadequacy of all of these models in Section 4.1. The characteristic decay of the afterpulse probability was sometimes thought to depend on the deep level in which the charge is trapped. Initial models10,16,18,22,23 considered the decay from several distinct deep levels and proposed the “multiple exponential model” where the afterpulsing probability (P exp (t)) at a time t is given by:

$${P}_{exp}(t)=\sum _{k}{A}_{k}{e}^{-\frac{t}{{\tau }_{k}}}+d,$$
(1)

where τ k and A k is the de-trapping lifetime and amplitude factor for the kth deep level, and d is the offset due to noise counts.

Furthermore17,24 considered a continuum of deep levels, in InGaAs/InP detectors, that could trap a charge and found that the power law model was a good empirical fit. Here the afterpulse probability (P pow (t)) is given by:

$${{P}}_{{pow}}({t})={A}.{{t}}^{-{\lambda }}+{d},$$
(2)

where λ is a effective decay constant and A is the initial afterpulse probability.

In an another attempt to create a more physically meaningful model ref.19 derived the “hyperbolic Sinc model” from the Arrhenius law (once more assuming a continuum of levels), where the afterpulse probability (P sinc (t)) is given by:

$${P}_{sinc}(t\mathrm{)=2}\cdot A\cdot \frac{\sinh ({\rm{\Delta }}\cdot t)}{t}\cdot {e}^{-\gamma \cdot t}+d,$$
(3)

where Δ and γ are both functions of the minimum and maximum energies of the deep levels in which charges may be trapped.

In this paper, we compare the above three standard mathematical models (see Eqs 1, 2 and 3) with the measured behavior of several different single photon counting detectors (SPCM-AQ4C from PerkinElmer, SPCM-NIR from Excelitas and τ-SPAD-fast from Laser Components as depicted in Section 4.1. We show that none of these canonical models are universal and vary between individual detectors as well as makes of detectors. We also report on the presence of several higher order afterpulses. An investigation of the region of reduced detection efficiency called the “dead-time” found between the signal pulse and the afterpulse shows yet another characteristic variation between makes of detectors, which is not part of the models used so far.

Experimental Setup

The general scheme of the setup used for the experiments is shown in Fig. 1. A function-generator triggers a 798 nm laser with a repetition rate of 0.25 to 1 MHz. The emitted light pulse of ≈1 ns length gets attenuated in a neutral attenuator to <0.1 photons per pulse on average and arrives at the active area of the respective detector in use. The arrival time at the detector and the trigger from the function generator are individually sent to a time-tagging-module (TTM). The TTM has a dead time of 5.5 ns for consecutive pulses to the same channel and ≪300 ps for signals arriving at different channels. We then calculated the temporal cross-correlation histogram (g(2)) between those two signals arriving in channels 1 and 2, using a bin width of 1 ns. We normalize to the number of detected laser pulses to obtain the pulse probability per bin. To ensure that our results are not skewed by an individual “faulty” detector we tested at least 2 detectors of each type (same manufacturer’s model number with nearly consecutive serial numbers) all of similar ages.

Figure 1
figure 1

A 798 nm laser is used to generate pulses with a 1 ns pulse width and various repetition rates between 0.25 to 1 MHz. We use Neutral Density (ND) filters to attenuate the laser pulse, incident on the active area of the SPADs, until we have ≪1 detected photon per laser pulse (approximately corresponding to a mean photon number of 0.1 per pulse). We use a time tagging module (TTM8000) with a resolution of 82.3 ps to record the trigger for the laser pulse (Channel 1) and the photon detection event (Channel 2). This data is stored on a computer and a software computes the temporal cross-correlation histogram (g(2)) with 1 ns bin width. The LED shown was to control the background noise level and was only used to obtain the results shown in Section 4.3.

We consider the first event arriving at the TTM, after a given laser pulse to be the “detection pulse” which stems from the real photon impinging on the active area of the detector. Any detection event after that, but before the next laser pulse, should be due to either afterpulsing or noise. From an end user perspective it is impossible to distinguish a true afterpulse from another count that just happened to occur at a similar time delay. Consequently, we do not differentiate between an afterpulse due to processes in the semiconductor diode from those due to secondary avalanches caused by any other means (such as faults in the quenching circuit). This is typical of most applications and previous studies of these detectors ranging from (quantum) optics experiments to fluorescence microscopy.

We fit each of the standard models (see Eqs 1, 2 and 3) to the tail of the afterpulse as seen in the g(2) obtained for each different SPAD detectors. Figure 2 shows the resultant fits as well as the residuals.

Figure 2
figure 2

Afterpulse peak’s tail fitted with Power law model, Hyperbolic-sinc model and multiple exponential model for SPCM-AQ4C (PerkinElmer), SPCM-NIR (Excelitas) and τ-SPAD-fast (Laser Components) - Comparison of residuals for all detectors and models (Power law model (blue), Hyperbolic sinc model (cyan) and Multiple exponential model (green)) and the red dashed lines are ±2 standard deviation limits for statistical fluctuations (2σ). We analysed > 0.14 × 106, 2.7 × 106 and 3 × 106 afterpulse events for each SPCM-AQ4C, SPCM-NIR, and τ-SPAD-fast detector.

All current models of afterpulsing discussed, concern the timing structure of the afterpulse tail. An ideal model of a detector should include its entire behaviour including the duration of the dead time, the jitter and the rising edge of the afterpulse peak. As in most previous previous studies, we then isolate the peak (due to afterpulses) and its tail in time. The tail of the afterpulse of all detectors are shown in Fig. 2 after normalization and accidentals correction have been applied. The accidental rate is estimated using a region well after the end of the afterpulse tail and before the next incident photon (this method of correction has been empirically verified in Section 4.3). We define the total afterpulse probability to be the sum total of the afterpulse probability in each bin of the corrected g(2) histogram for a duration of 900 ns (We chose a value of 900 ns such that we include a significant amount of the afterpulse tail while avoiding any potential effects due to the earliest possible subsequent pulse 1 μs later. In silicon based SPADs the afterpulse tail dies off some hundred nanoseconds, whereas in InGaAs detectors it can last for several μs)18,19. Regardless of the model, in order to obtain a good fit we had to ignore the first two points of the afterpulse tail.

Results

Comparison of afterpulsing models

We observed a wide variation between different tested detectors. We tested at least 2 and up to 12 identical detectors (in terms of make, model number and year of purchase) from each of three manufacturers. We saw a drastic change in the total afterpulse probability between otherwise identical detectors (see Table 1).

Table 1 Table comparing various makes of commercially available single photon detectors we tested. The total after pulse probility shows a piece to piece variability as expected. More importantly, the piece to piece variation extends to the best fitting theoretical model. This should not happen for theories based on fundamental properties of semiconductors. Clearly the applicability of all 3 mathematical models discussed here is very limited and ill defined

On the whole, the power law model had the most consistent good fit and the hyperbolic Sinc model the worst. Unlike the hyperbolic Sinc model, the power law model approximates the experimental data more accurately independent of the detector used AQ4C (see Fig. 2a), SPCM-NIR (see Fig. 2c), τ-SPAD-fast (see Fig. 2e). Ref.19 showed that the hyperbolic sinc model was better than the power law model using id100-MMF50 SPAD module from idQuantique, we were able to duplicate those results only with 6 of the 16 detectors we tested.

Figure 2 shows sample fits of the three models for one detector from each manufacturer. In this case the power law model is fitting the entire range of the tail better than the Hyperbolic sinc model. The fits can be quantitatively compared using the residuals seen in Fig. 2 which are well within the ±2 standard deviation range (shown in red) for the power law model but not for the hyperbolic sinc model.

Power law model as a simple analytic model function leads to some good results and proved to be the best model for 4 of the 16 detectors tested. The power law model was primarily shown to be effective for InGaAS/InP detectors17,24,25 and we now show that it is a reasonably good model for some Si detectors. However, it’s estimated parameters gives a rather loose description of the physical origin of afterpulses. A truly good model should be able to predict the behavior of the entire tail region. The power law model only provides a suitable fit if the beginning two points of the afterpulse tail are ignored.

The multiple exponential model provides a good fit when k ≫ 5 and is slightly better than the power law model for the SPCM-AQ4C detector. It proved to be the best fit for the behaviour of 6 of the 16 tested detectors. It is nearly commensurate with the power law model for the τ-SPAD. For the SPCM-NIR, the exponential model gives a good fit only when we ignore the higher order afterpulse peaks (which are discussed in Section 4.2). We observed that it was possible to get different fits of almost the same quality with different characteristic times (τ k also called the de-trapping times) depending on the number of exponentials used. Thus, upon examining the fitting procedure we concur with17 who state that “It is evident from this whole fitting procedure that the extracted values for the de-trapping times depend entirely on number of exponentials in the model function and the range of hold-off times used in the data set.”

In case of the hyperbolic sinc model, we consistently obtain unsuitable fits to all but 6 detectors. In Fig. 2 we can see that the residuals for several fits are much larger than those for other models. However, for 6 detectors, the hyperbolic sinc model was found to be the most suitable.

The total probability of obtaining an afterpulse (PAP) for each detector is shown in Table 1. Clearly, afterpulse behavior and probabilities vary drastically between brands and between individual specimens of the same brand.Various detectors of the same make from the same company (even if manufactured 1 week appart) provide evidence in support of different and contradictory mathematical models. As do detectors from different brands. Clearly a universal model to describe afterpulsing behaviour does not exist and every detector needs to be characterized individually.

Higher order afterpulses

Any detection event may cause an afterpulse which, being a detection event, may induce secondary and further afterpulses called “higher order afterpulses”. Usually afterpulsing is a small effect and the afterpulsing probability distribution function is smeared such that the higher order afterpulses are improbable. However, in the presence of strong enough twilighting12, photon detections and afterpulses accumulate in a narrow peak that appear just after the dead time. We note this behavior in SPCM-NIR and to a lesser extent in SPCM-AQ4C, as can be seen in Fig. 3. The higher order afterpulses are clearly visible as a series of peaks after the main peak with a period exactly equal to the dead time of the detector. This holds for each of the 14 detectors where higher order afterpulsing was observed despite differences in their dead time. The presence of such higher order afterpulsing was first speculated in ref.17 but, to the best of our knowledge, never reported.

Figure 3
figure 3

g(2) Histograms for various detectors exhibit distinct afterpulse behavior (each afterpulse peak is marked by an arrow). For example, the τ-SPAD-fast displays an unusually gradual decay while the SPCM-NIR is the only detector make to exhibit higher order afterpulses. The inset shows the higher order afterpulses occurring at intervals equal to the dead time.

For the SPCM-NIR, the signal pulse and the first order of afterpulse are 22 ns apart, which exactly corresponds to the duration of the dead time of this particular detector module. The measured time intervals between all following higher orders of afterpulses (as seen in the inset of the Fig. 3) have the same time delay of 22 ns. We have obtained a similar plot for three detectors with the same model number: in each case, peaks appear separated by the dead time of the particular detector. Such a behavior is clearly undesirable, notably in time-resolved spectroscopy where higher order peaks could be mistaken for, or mask the true signal.

We verified that the higher order afterpulses seen are not the result of stray light, the shape of the laser pulse, electronic noise, impedance mismatch or optical reflections. We also processed the data with different bin widths and laser pulse frequencies to ensure that the observed higher order afterpulse peaks are not due to digitization noise.

Further, we note that it is possible to explain the area of the nth higher order peak based on the probability of the first afterpulse (PAP) as \({{\rm{P}}}_{{\rm{AP}}}^{(n+\mathrm{1)}}+\) the probability of an afterpulse in the bin just before the nth higher order peak × the number of bins in the peak. This geometric progression agrees to within 4 to 6% for the 2nd to 5th order afterpulses.

Background and accidentals corrections during the dead time

In this subsection we discuss the corrections that we can apply to the cross correlated histograms we used in the previous subsections. In typical quantum optics experiments, there is a probability that a coincidence is detected between two different detectors erroneously, we call these coincidences “accidentals”. Typically, they can be estimated from Poissonian statistics as: r acc  = r1r2t c , where r acc is the rate of the accidental counts, r1 and r2 are the count rates of the individual detectors and t c is the coincidence time window used. By introducing a controlled amount of continuous background illumination we experimentally verify this standard practice. In the duration well after a detection event, this provides a very good estimate (with a maximum variation of ≪3%) of the behavior (see Fig. 4a and b).

Figure 4
figure 4

g(2) Histograms for different background count levels, clearly showing the probability of detection events during dead time region during strong background illumination (between the large detection peak and time Δt = 0). We observe a linear scaling of the accidental coincidences as we vary the background count rate. Insets: The g(2) histograms corrected for background counts.

An interesting consequence of using the cross-correlation technique described above is the non-zero probability of a count during the “dead-time”. We attribute these detection events to the probability that the detector did not click because of the laser pulse but did click within the dead-time region.

When used for some tasks (like quantum communication), the end user may not be able to correct for the accidentals obtained while computing the cross correlation histogram. In these cases it is important, for a complete model of the detector’s behavior, that these accidentals during the dead-time be correctly accounted for.

We use a continuous wave battery powered LED as a steady and controllable source of background illumination in addition to the attenuated laser pulses (see Fig. 1). We then measured the g(2) histograms for different background count rates. Figure 4a shows these histograms for the SPCM-NIR while Fig. 4b shows the same for the τ-SPAD-fast modules (We could not illuminate the SPCM-AQ4C fiber coupled detector in the same manner as the other free space coupled detectors, so we have left out these results for consistency). We show that the increase in click probability during the dead-time scales linearly with the amount of additional background illumination (see Fig. 4a and b).

In the regions before the detection event, well after the detection event and even during the afterpulse, the measured accidental corrections agreed with the value calculated assuming Poissonian statistics.

Conclusion

We have clearly demonstrated that different standard models are required to appropriately describe the distribution of electrical signals generated by different detectors. This explains the conflicting nature of several previous studies; for example refs17,24,25 show strong evidence for the power law model while refs10,16 show equally compelling evidence for the multiple exponential model and ref.19 provides evidence for the hyperbolic sinc model. Recently26 showed that the afterpulse probability is dependent on past events–a property not considered in the exponential, power, or hyperbolic sinc models. By comparing previously reported results to our own, we realize that there is a large variation between the different commonly used manufactures/makes of detectors and between individual detectors. It is natural to expect that the modulation/attenuation of trapped carriers and hence the resulting mathematical models behind afterpulsing behaviour are a property fundamental to the semiconductor junction. However, we have demonstrated that the mathematical models of afterpulsing (and not just its amplitude or time constants) differ from detector to detector. Further, studies have shown that afterpulsing can be suppressed by different quenching methods1. This leads us to believe that the afterpulsing behavior is more dependent on the electronic quenching circuit used rather than the properties of the semiconductor (such as the presence and distribution of discrete/continuous/quasi-continuous deep levels). Our data clearly proves that none of the current theoretical models are universal which makes it hard to draw conclusions about the underlying mechanism based on fundamental semiconductor physics.

As seen in Table 1 supposedly identical detectors manufactured within a week of each other, provide evidence to support one model over the other. Several previous studies have shown that one or the other mathematical model fits the afterpulsing behaviour of a few individual detectors. Further many of these studies use such evidence to confirm or disprove hypotheses/assumptions about semiconductor physics. Clearly, with such extreme individual variation no conclusion about the physics behind these mathematical models can be drawn from any similar test. Especially tests with a small number of individual detectors or with detectors of only one kind.

We also report on the presence of higher order afterpulses in one of the tested detector models. To make this possible, unlike in several previous measurements18,19,21 who used an auto-correlation signal, we use a cross-correlation histogram between the detector and the trigger. This allows us to look for both higher order afterpulses as well as the behavior during the dead time. These higher order afterpulses can cause large errors in measurements of the arrival times of photons and must be carefully accounted for. Under certain conditions, such as a short dead time, the quenching circuit may be prone to oscillation, thus adding fake detections that have no origin in the physics of the SPAD. We were able to exclude frequency dependence in our detectors since we repeated all the experiments described above for several different repetition rates (all far from detector saturation) of the laser pulses ranging from 10 kHz to 1.2 MHz and found no significant variation.

Further, the total afterpulsing probability for several individual detectors all of the same manufacturer and part number varied drastically (in some cases by a factor of 460!) implies that every individual detector needs to be calibrated for all applications that need to accurately measure count rates or arrival times of photons.

For many years, afterpulsing has been extensively studied from a semiconductor physics based perspective, where it is important to understanding how trapped charges/energy levels decay. However, to correctly study this, one must separate the effects due to the behaviour of the diode or seimiconductor junction from the effects due to the electronics and quenching circuits. For example, based on modeling the semiconductor junction, a longer dead time is thought to lead to a lower afterpulsing probability18. While this conclusion has been drawn in certain studies of particular SPADs and quenching circuits11,27,28,29, it does not hold in general. Namely, as seen in Table 1 and Fig. 3 both the SPCM-AQ4C and the τ-SPAD-fast have a significantly longer dead time than the SPCM-NIR but the former has a lower afterpulse probability while the latter has a larger one. Further, even among the 12 SPCM-NIR detectors, those with a larger dead time can have increased afterpulsing. In these cases it is impossible to draw a conclusion about the relationship between dead time and afterpulse probability without carefully considering the quenching circuits used. This type of analysis requires proprietary and confidential information about the diode and circuits used by the manufacturer; which is inaccessible to a typical end-user. Instead of drawing potentially erroneous conclusions about semiconductor behavior based on which model had a better fit, we focus on an application oriented perspective. For most practical purposes, it is sufficient to understand the statistical nature of the afterpulses rather than their causal mechanisms.

We also would like to mention, that we did not consider a possible aging effect of the detectors. All measurements were performed in the time span of only a few months. Commercial products, may have to be characterized repeatedly during their long operational lifetimes. The study of any aging effects on the statistical behaviour of SPAD’s in general is an interesting avenue for further exploration.

Most applications of SPADs are hindered by afterpulsing, in many cases these ill effects can be corrected for if each individual detector is properly characterized. The individual characterization is necessary due to the large variation in both the total afterpulse probability and the mathematical form of the probability distribution between detectors of the same model number, age and manufacturer under nearly identical laboratory test conditions. It is possible, although inadvisable, to ignore all clicks for several hundred ns after any detection event. This effectively increases the dead time and avoids the bulk of afterpulses but this severely limits the maximum count rates and detection efficiency due to saturation-like effects. To correctly account for the ill effects of afterpulses it is sufficient to characterise each individual detector prior to/during use, using the method described in this work.

From a quantum communication perspective, our method of characterization of every individual detector can be included into the overall device dependent security analysis30. These characterizations can also be used to improve the accuracy of results in quantum meterology.

OCIS codes

(270.0270) Quantum optics; (270.5568) Quantum cryptography; (060.5565) Quantum communications; (250.1345) Avalanche photodiodes; (040.6070) Solid state detectors; (120.4800) Optical standards and testing; (120.4640) Optical instruments; (120.6200) Spectrometers and spectroscopic; instrumentation; (180.2520) Fluorescence microscopy; (170.0170) Medical optics and biotechnology.