Pattern Recognition: General Considerations
The goal of pattern recognition is to identify groups of detector hits to form tracks. Track trajectories are parameterized using the following five parameters: \(d_0\), \(z_0\), \(\phi _0\), \(\cot {\theta },\) and \(q/p_T\).Footnote 1 The transverse impact parameter, \(d_0\), is the distance of closest approach of the helix to the chosen reference point (e.g., the primary vertex) in the x-y plane. The longitudinal impact parameter, \(z_0\), is the z coordinate of the track at the point of closest approach. The azimuthal angle, \(\phi _0\), is the angle of the track in the x-y at the point of closest approach. The polar angle, \(\cot {\theta }\) is the inverse slope of the track in the r-z plane. The curvature, \(q/p_T\), is the inverse of the transverse momentum with the sign determined by the charge of the particle.
Neglecting noise and multiple scattering, most particle tracks of physics interest, particularly those with high \(p_T\), exhibit the following properties:
-
The hits follow an arc of a helix in the x-y plane with a large radius of curvature or small \(q/p_T\);
-
The hits follow a straight line in the r-z plane;
-
Most hits lie on consecutive layers: there are few to no missing hits (holes).
Track candidates with fewer than five hits are predominantly fake tracks, which do not correspond to a true particle trajectory. While tracks can share hits, we impose the constraint from Ref. [10] that any one hit can belong to at most one track.
Algorithm Goals
The algorithm presented in this paper encodes a classification problem. Following Ref. [5], tracks are constructed from n consecutive hits, leading to \(n-1\) doublets. Given the large set of potential doublets from hits in the detector, the goal of the algorithm is to determine which subset belongs to the trajectories of charged particles. The algorithm aims to preserve the efficiency, but improve the purity of the input doublet set.
Triplets and Quadruplets
We follow a similar approach to Ref. [5], but use triplets instead of doublets. In addition to improving the performance at high multiplicity, this allows us to calculate and use track properties.
A triplet, denoted \(T^{abc}\), is a set of three hits (a, b, c) or a pair of consecutive doublets (a, b and b, c), ordered by increasing transverse radius (R). Two triplets \(T^{abc}\) (of hits a, b, c) and \(T^{def}\) (of hits d, e, f), can be combined to form a quadruplet if \(b=d \wedge c=e\) or a quintet if \(c=d\). If they share any other hit, the triplets are marked as being in conflict. A set of n consecutive hits will result in \(n-2\) triplets and \(n-3\) quadruplets.
Key triplet \(T_{i}^{abc}\) properties are the number of holes \(H_i\); the curvature, \(q/p_T\); and \(\delta \theta\) the difference in polar angle between the doublets.
The strength S quantifies the compatibility of the track parameters between the two triplets in a quadruplet \((T_i, T_j)\):
$$\begin{aligned} S(T_i, T_j)&= {z_1} \frac{ {z_2} \big (1-|\delta (q/p_{Ti}, q/p_{Tj})|\big )^{z_3} }{ (1+H_i+H_j)^{z_5}} \end{aligned}$$
(1)
$$\begin{aligned}&\quad + \frac{(1-z_2) \big (1-\mathrm {max}(\delta \theta _i, \delta \theta _j)\big )^{z_4} }{ (1+H_i+H_j)^{z_5} } \end{aligned}$$
(2)
where \(z_2\) encodes the relative importance of the curvature with respect to \(\delta \theta\). The other parameters (\(z_1, z_3, z_4, z_5\)) are unbounded constants that require problem-specific tuning. The parameters are set to favor high \(p_T\) tracks. In its simplest form, we have \(z_2 = 0.5\) (equal weights), \(z_5 = 2\), and all other constants set to 1:
$$\begin{aligned} S(T_i, T_j)&= \frac{1 - \frac{1}{2}(|\delta (q/p_{Ti}, q/p_{Tj})| + \mathrm {max}(\delta \theta _i, \delta \theta _j))}{(1+H_i+H_j)^2} \end{aligned}$$
(3)
Definition of the Quadratic Unconstrained Binary Optimization
The QUBO is configured to identify the best pairs of triplets. It has two components: a linear term that weighs the quality of individual triplets and a quadratic term used to express relationships between pairs of triplets. In our case, the objective function to minimize becomes:
$$\begin{aligned} O(a,b,T) = \sum _{i=1}^N{a_i T_i} + \sum _{i}^N\sum _{j < i}^N{b_{ij} T_i T_j} \quad T_i, T_j \in \{0,1\} \end{aligned}$$
(4)
where T are all potential triplets, \(a_i\) are the bias weights, and \(b_{ij}\) the coupling strengths computed from the relation between the triplets \(T_i\) and \(T_j\). The bias weights and the coupling strengths define the Hamiltonian. Minimizing the QUBO is equivalent to finding the ground state of the Hamiltonian.
All bias weights are set to be identical \(a_i=\alpha\) , which means all triplets have equal a priori probability to belong to a particle track. Our objective function therefore depends solelyFootnote 2 on the triplet–triplet coupling strength \(b_{ij}\). If the triplets form a valid quadruplet, the coupling strength is negative and equal to the quadruplet quality \(S(T_i, T_j)\) (Eq. 3). If the two triplets are in conflict, the coupling is a positive constant \(b_{ij}=\zeta\) that disfavors a solution with \(T_i=T_j=1\). Finally, if the triplets have no relationship (meaning, no shared hits), the coupling is set to zero. This is illustrated in Fig. 1 and represented in Eq. 5.
$$\begin{aligned} b_{ij} = {\left\{ \begin{array}{ll} -S(Ti, Tj), &{} {\text{ if } } (T_i, T_j) { \text{ form } \text{ a } \text{ quadruplet }}, \\ \zeta &{} {\text{ if } } (T_i, T_j) { \text{ are } \text{ in } \text{ conflict }}, \\ 0 &{} {\text{ otherwise. }} \end{array}\right. } \end{aligned}$$
(5)
As is clear from Eq. 5, the choice of constants in Eq. 1 determines the functional behavior of \(b_{ij}\). The larger the conflict strength \(\zeta\) the lower the number of conflicts, but too large values risk discontinuities in the energy landscape, increasing time to convergence. Furthermore, the D-Wave machines limit the value of \(b_{ij}\), and thus \(\zeta\), to between \(-2\) and 2 (with a restricted precision, so scaling is not a fix either).
Dataset Selection
By design, the algorithm does not favor any particular momentum range. However, to limit the size of the QUBO, we focus on high \(p_T\) tracks (\(p_T \ge 1\) GeV), which are the most relevant for physics analysis at the HL-LHC.
A triplet \(T_i\) is created if and only if:
$$\begin{aligned} H_i\le & {} 1 \\ |(q/p_T)_i |\le & {} 8 \times 10^{-4} \ {\text {GeV}}^{-1} , \\ \delta \theta _i\le & {} 0.1 \ {\text {rad}} \end{aligned}$$
And a quadruplet \((T_i, T_j)\) is created if and only if:
$$\begin{aligned} |\delta ((q/p_T)_i, (q/p_T)_j)|\le & {} 1\times 10^{-4} \ {\text {GeV}}^{-1} , \\ S(T_i, T_j)> & {} 0.2 \end{aligned}$$
Triplets that are not part of any quadruplet or whose longest potential track has less than five hits are not considered.