1 Introduction

In the 1990s, an optimization algorithm called quantum annealing (QA) was proposed with the aim of providing a fast heuristic for solving combinatorial optimization problems (Kadowaki and Nishimori 1998; Ray et al. 1989; Finnila et al. 1994; Farhi et al. 2001; Santoro et al. 2002). At a high level, QA is an analog quantum algorithm that leverages the non-classical properties of quantum systems and continuous time evolution to minimize a discrete function. Annealing is the process that steers the dynamics of the quantum system into an a priori unknown minimizing variable assignment of that function. Under suitable conditions, theoretical results have shown that QA can arrive at a global optimum of the desired function (Born and Fock 1928; Kato 1950; Jansen et al. 2007). These results have motivated the study of using this algorithm for combinatorial optimization over the past thirty years.

Due to the computational difficulty of simulating quantum systems (Feynman 1982), the study of QA remained a theoretical pursuit until 2011, when D-Wave Systems produced a quantum hardware implementation of the QA algorithm (Berkley et al. 2010; Johnson et al. 2010; Harris et al. 2010; Johnson et al. 2011). This represented the first time that QA could be studied on optimization problems with more than a few dozen decision variables and spurred significant interest in developing a better understanding of the QA computing model (Job and Lidar 2018; Hauke et al. 2020; Crosson and Lidar 2021).

The release of D-Wave Systems’ QA hardware platform also generated expectations that this new technology would quickly outperform state-of-the-art classical methods for solving well-suited combinatorial optimization problems (Farhi et al. 2001, 2002; Santoro et al. 2002). The initial interest from the operations research community was significant. However, through careful comparison with both complete search solvers (McGeoch and Wang 2013; Puget 2013; Dash 2013) and specialized heuristics (Selby 2014; Boixo et al. 2014; Selby 2013; Mandrà et al. 2016; Mandrà and Katzgraber 2018; Rønnow et al. 2014; Hen et al. 2015; Albash and Lidar 2018), it was determined that the available QA hardware was a far cry from state-of-the-art optimization methods. These results tempered the excitement around the QA computing model and reduced interest from the operations research community. Since then, QA hardware has steadily improved: it now features better noise characteristics (Vuffray et al. 2022; Zaborniak and de Sousa 2021; King et al. 2022) and can solve optimization problems more than fifty times larger than what was possible in 2013 (McGeoch and Farre 2020).

Since 2017, we have been using the benchmarking practices of operations research to track the performance of QA hardware platforms and compare the results with established optimization algorithms (Coffrin et al. 2019; Pang et al. 2021). In previous studies of this type, this benchmarking approach ruled out any potential performance benefit for using available QA hardware platforms in hybrid optimization algorithms and practical applications, as established algorithms outperformed or were competitive with the QA hardware in both solution quality and computation time. However, in this work, we report that with the release of D-Wave Systems’ Advantage Performance Update computer in 2021, our benchmarking approach can no longer rule out a potential run time performance benefit for this hardware. In particular, we show that there exist classes of combinatorial optimization problems where this QA hardware finds high-quality solutions around fifty times faster than a wide range of heuristic algorithms under best-case QA communication overheads and around fifteen times faster under real-world QA communication overheads. This work thus provides compelling evidence that quantum computing technology has the potential for accelerating certain combinatorial optimization tasks. This represents an important and necessary first condition for demonstrating that QA hardware can have an impact on solving practical optimization problems.

Although this work demonstrates encouraging results for the QA computing model, we also emphasize that it does not provide evidence of a fundamental or irrefutable performance benefit for this technology. Indeed, it is quite possible that dramatically different heuristic algorithms (Dunning et al. 2018; Mohseni et al. 2021) or alternative hardware technologies (McMahon et al. 2016; Goto et al. 2019; Matsubara et al. 2020; Kowalsky et al. 2022) can reduce the run time performance benefits observed in this work. We look forward to and encourage ongoing research into benchmarking the QA computing model, as closing the performance gap presented in this work would provide significant algorithmic insights into heuristic optimization methods, benefiting a variety of practical optimization tasks.

This work begins with a brief introduction to the types of combinatorial optimization problems that can be solved with QA hardware and the established benchmarking methodology in Sect. 2. It then presents a summary of the key outcomes from a large-scale benchmarking study in Sect. 3, which required hundreds of hours of compute time. In Sect. 4, the paper concludes with some discussion of the limitations of our results and future opportunities for QA hardware in combinatorial optimization. Additional details regarding the experimental design, as well as further analyses of computational results, are provided in the appendices.

2 Quantum annealing for combinatorial optimization

Available QA hardware is designed to perform optimization of a class of problems known as Ising models, which have historically been used as fundamental modeling tools in statistical mechanics (Gallavotti 2013). Ising models are characterized by the following quadratic energy (or objective) function of \(\mathcal {N} = \{1, 2, \dots , n\}\) discrete spin variables, \(\sigma _{i} \in \{-1, 1\}, \; \forall i \in \mathcal {N}\):

$$\begin{aligned} E(\sigma ) = \sum _{{(i, j) \in \mathcal {E}}} {J}_{ij} \sigma _{i} \sigma _{j} + \sum _{{i \in \mathcal {N}}} {h}_{i} \sigma _{i} , \end{aligned}$$
(1)

where the parameters, \({J}_{ij}\) and \({h}_{i}\), define the quadratic and linear coefficients of this function, respectively. The edge set, \(\mathcal {E} \subseteq \mathcal {N} \times \mathcal {N}\), is used to encode a specific sparsity pattern in the Ising model, which is determined by the physical system being considered. The optimization task of interest is to find the lowest energy configuration(s) of the Ising model, i.e.,

$$\begin{aligned} \begin{aligned}&\underset{\sigma }{\text {minimize}}{} & {} E(\sigma ) \\&\text {subject to}{} & {} \sigma _{i} \in \{-1, 1\}, \, \forall i \in \mathcal {N}. \end{aligned} \end{aligned}$$
(2)

At first glance, the lack of constraints and limited types of variables make this optimization task appear distant from real-world applications. However, the optimization literature on quadratic unconstrained binary optimization (QUBO), which is equivalent to minimization of an Ising model’s energy function, indicates how this model can encode a wide range of practical optimization problems (Kochenberger et al. 2014; Lucas 2014).
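To make the optimization task concrete, the following sketch evaluates the energy function of Eq. (1) and solves Problem (2) by exhaustive enumeration. The helper names and the 3-spin instance coefficients are illustrative, and enumeration is, of course, tractable only for very small \(n\):

```python
import itertools

def ising_energy(h, J, sigma):
    """Evaluate the Ising objective E(sigma) from Eq. (1)."""
    energy = sum(Jij * sigma[i] * sigma[j] for (i, j), Jij in J.items())
    energy += sum(hi * sigma[i] for i, hi in h.items())
    return energy

def brute_force_minimum(h, J, n):
    """Enumerate all 2^n spin configurations; only feasible for small n."""
    best_sigma, best_energy = None, float("inf")
    for bits in itertools.product([-1, 1], repeat=n):
        sigma = dict(enumerate(bits))
        e = ising_energy(h, J, sigma)
        if e < best_energy:
            best_sigma, best_energy = sigma, e
    return best_sigma, best_energy

# A 3-spin toy chain: ferromagnetic couplings favor aligned spins,
# and the field h_0 = -1 favors sigma_0 = +1.
h = {0: -1.0, 1: 0.0, 2: 0.0}
J = {(0, 1): -1.0, (1, 2): -1.0}
sigma, energy = brute_force_minimum(h, J, 3)
print(sigma, energy)  # {0: 1, 1: 1, 2: 1} -3.0
```

The exponential cost of this enumeration is precisely what motivates both the heuristics benchmarked later in this work and the quantum lifting described next.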

2.1 Foundations of quantum annealing

The central idea of QA is to leverage the properties of quantum systems to minimize discrete-valued functions, e.g., finding optimal solutions to Problem (2). The mathematics of QA is comprised of two key elements: (i) leveraging quantum states to lift the minimization problem into an exponentially larger space and (ii) slowly interpolating (i.e., annealing) between an initial easy problem and the target problem to find high-quality solutions to the target problem. The quantum lifting begins by introducing, for each spin, \(\sigma _i \in \{-1,1\}\), a \(2^{|\mathcal N |} \times 2^{|\mathcal N |}\) dimensional matrix, \(\widehat{\sigma }_i\), expressible as a Kronecker product of \({|\mathcal N |}\) \(2 \times 2\) matrices,

$$\begin{aligned} \widehat{\sigma }_i = \underbrace{\begin{pmatrix} 1 &{} 0 \\ 0 &{} 1 \end{pmatrix} \mathop {\otimes } \cdots \mathop {\otimes } \begin{pmatrix} 1 &{} 0 \\ 0 &{} 1 \end{pmatrix}}_{1\,\textrm{to}\, i-1} \mathop {\otimes } \underbrace{\begin{pmatrix} 1 &{} 0 \\ 0 &{} -1 \end{pmatrix}}_{i} \mathop {\otimes } \underbrace{\begin{pmatrix} 1 &{} 0 \\ 0 &{} 1 \end{pmatrix} \mathop {\otimes } \cdots \mathop {\otimes } \begin{pmatrix} 1 &{} 0 \\ 0 &{} 1 \end{pmatrix}}_{i+1\,\textrm{to}\,{|\mathcal N |}} . \end{aligned}$$
(3)

In this lifted representation, the value of a spin, \(\sigma _i\), is identified with the two possible eigenvalues, 1 and \(-1\), of the matrix \(\widehat{\sigma }_i\). The quantum counterpart of the energy function defined in Eq. (1) is the \(2^{|\mathcal N |} \times 2^{|\mathcal N |}\) matrix obtained by substituting spins, \(\sigma _{i}\), with the \(\widehat{\sigma }_{i}\) matrices, defined in Eq. (3), within the algebraic expression for the energy. That is,

$$\begin{aligned} \widehat{E} = \sum _{{(i,j) \in \mathcal{E}}} J_{ij} \widehat{\sigma }_i \widehat{\sigma }_j + \sum _{i \in \mathcal{N}} h_i \widehat{\sigma }_i . \end{aligned}$$
(4)

Notice that the eigenvalues of the matrix \(\widehat{E}\) are the \(2^{|\mathcal N |}\) possible energies obtained by evaluating \(E(\sigma )\) from Eq. (1) for all possible configurations of spins. This implies that finding the minimum eigenvalue of \(\widehat{E}\) is equivalent to solving Problem (2). This lifting is clearly impractical in the classical computing context, as it transforms a minimization problem over \(2^{|\mathcal N |}\) configurations into computing the minimum eigenvalue of a \(2^{|\mathcal N |} \times 2^{|\mathcal N |}\) matrix. The key motivation for the QA computational approach is that it is possible to model \(\widehat{E}\) with only \(|\mathcal{N} |\) quantum bits (qubits), so it is feasible to compute over this exponentially large matrix.
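The lifting of Eqs. (3) and (4) can be checked numerically on a toy instance. The sketch below (with hypothetical helper names) builds each \(\widehat{\sigma }_i\) as a Kronecker product and confirms that the smallest eigenvalue of \(\widehat{E}\) equals the brute-force minimum of \(E(\sigma )\):

```python
import numpy as np
from functools import reduce

I = np.eye(2)
Z = np.array([[1.0, 0.0], [0.0, -1.0]])  # the 2x2 factor at position i in Eq. (3)

def lifted_sigma(i, n):
    """Build sigma_hat_i from Eq. (3): identity on every site except Z at site i."""
    return reduce(np.kron, [Z if k == i else I for k in range(n)])

def lifted_energy_matrix(h, J, n):
    """Build E_hat from Eq. (4) as a dense 2^n x 2^n matrix."""
    E = np.zeros((2**n, 2**n))
    for (i, j), Jij in J.items():
        E += Jij * lifted_sigma(i, n) @ lifted_sigma(j, n)
    for i, hi in h.items():
        E += hi * lifted_sigma(i, n)
    return E

n = 3
h = {0: -1.0}
J = {(0, 1): -1.0, (1, 2): -1.0}
E_hat = lifted_energy_matrix(h, J, n)
# E_hat is diagonal, and its diagonal entries are the 2^n classical energies;
# its smallest eigenvalue is the optimum of Problem (2), here -3.
print(np.min(np.diag(E_hat)))
```

The dense construction above requires \(O(4^n)\) memory, which illustrates why this lifting is only useful when \(\widehat{E}\) is realized physically with \(|\mathcal{N} |\) qubits rather than stored classically.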

The annealing process in QA provides a method for steering a quantum system into the a priori unknown eigenvector that minimizes Eq. (4) (Kadowaki and Nishimori 1998; Farhi et al. 2000). First, the system is initialized at an a priori known minimizing eigenvector of a simple (“easy”) energy matrix, \(\widehat{E}_0\). After the system has been initialized, the energy matrix is interpolated from the easy problem to the target problem slowly over time. Specifically, the energy matrix at a point during the anneal is \(\widehat{E}_a(\Gamma ) = (1-\Gamma )\widehat{E}_0 + \Gamma \widehat{E}\), with \(\Gamma \) varying from zero to one. The annealing time is the physical time taken by the system to evolve from \(\Gamma =0\) to \(\Gamma =1\). When the anneal is complete (\(\Gamma =1\)), the interactions in the quantum system are described by the target energy matrix. For suitable starting energy matrices, \(\widehat{E}_0\), and a sufficiently slow annealing time, the adiabatic theorem demonstrates that a quantum system remains at the minimal eigenvector of the interpolating matrix, \(\widehat{E}_a(\Gamma )\) (Born and Fock 1928; Kato 1950; Jansen et al. 2007), and therefore achieves the minimum energy of the target problem.
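The interpolation \(\widehat{E}_a(\Gamma )\) can likewise be illustrated numerically. The sketch below assumes a common choice of starting matrix, \(\widehat{E}_0 = -\sum _i \widehat{\sigma }^x_i\) (a transverse field, whose minimizing eigenvector is the uniform superposition over all spin configurations); the 3-spin target instance is illustrative:

```python
import numpy as np
from functools import reduce

I = np.eye(2)
X = np.array([[0.0, 1.0], [1.0, 0.0]])  # Pauli-X
Z = np.array([[1.0, 0.0], [0.0, -1.0]])  # Pauli-Z

def site_operator(op, i, n):
    """Place a 2x2 operator at site i of an n-site Kronecker product."""
    return reduce(np.kron, [op if k == i else I for k in range(n)])

n = 3
# "Easy" starting matrix: its minimizing eigenvector is known a priori.
E0 = -sum(site_operator(X, i, n) for i in range(n))
# Target matrix E_hat for a toy chain with J_01 = J_12 = -1 and h_0 = -1.
E_target = (-site_operator(Z, 0, n) @ site_operator(Z, 1, n)
            - site_operator(Z, 1, n) @ site_operator(Z, 2, n)
            - site_operator(Z, 0, n))

# Track the minimum eigenvalue of E_a(Gamma) along the interpolation.
for gamma in (0.0, 0.5, 1.0):
    Ea = (1 - gamma) * E0 + gamma * E_target
    ground = np.linalg.eigvalsh(Ea)[0]
    print(f"Gamma={gamma:.1f}: minimum eigenvalue {ground:.3f}")
# At Gamma=1 the minimum eigenvalue equals the target problem's optimum, -3.
```

This classical eigenvalue computation only tracks the endpoint of the anneal; the adiabatic theorem is the statement about the system's continuous-time dynamics remaining in the minimal eigenvector throughout.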

2.2 Quantum annealing hardware

The computers developed by D-Wave Systems realize the QA computational model in hardware with more than 5000 qubits. However, the engineering challenges of building real-world quantum computers are significant and have an impact on the previously discussed theoretical model of QA. In particular, QA hardware is an open quantum system, meaning that it is affected by environmental noise and decoherence. The coefficients in Eq. (1) are constrained to the ranges, \(-4 \le {h_{i}} \le 4\), \(-1 \le {J_{ij}} \le 1\), and nonzero \({J_{ij}}\) values are restricted to a specific sparse lattice structure (i.e., \(\mathcal {E}^H \subseteq \mathcal {E}\)), which is determined by the hardware’s implementation. (See “Appendix A” for details.) The D-Wave hardware documentation also highlights five sources of deviation from ideal system operations called integrated control errors, which include background susceptibility, flux noise, digital-to-analog conversion quantization, input/output system effects, and variable scale across qubits (D-Wave Systems 2020). These implementation details impact the performance of QA hardware (Nelson et al. 2021). Consequently, QA hardware often does not find globally optimal solutions but instead finds near-optimal solutions, e.g., within 1% of the best-known solutions (Coffrin et al. 2019; Pang et al. 2021). All of these deviations from the ideal QA setting present notable challenges for encoding and benchmarking combinatorial optimization problems with available QA hardware platforms.

2.3 Benchmarking quantum annealing hardware

Due to the challenges associated with mapping established optimization test cases to specific QA hardware (Coffrin et al. 2019), the QA benchmarking community has adopted the practice of building instance generation algorithms that are tailored to specific quantum processing units (QPUs) (King et al. 2015; Hen et al. 2015; King et al. 2017; Denchev et al. 2016; Albash and Lidar 2018; Pang et al. 2021). The majority of the proposed problem generation algorithms build Ising model instances that are defined over a specific QPU’s hardware graph, i.e., \((\mathcal {N}, \mathcal {E}^H)\), or subsets of this graph, which are typically referred to as hardware-native problems.

In this work, we build upon an earlier class of hardware-native instances termed corrupted biased ferromagnets, or CBFMs, as proposed by Pang et al. (2021). Given the QPU graph, \((\mathcal {N}, \mathcal {E}^H)\), the CBFM model adopts the following distributions for hardware-native instances:

$$\begin{aligned} \begin{aligned} P({J}_{ij} = 0) = 0, \, P({J}_{ij} = -1) = 0.625, \, P({J}_{ij} = 0.2) = 0.375, \, \forall (i, j) \in \mathcal {E}^H \\ P({h}_i = 0) = 0.97, \, P({h}_i = -1) = 0.02, \, P({h}_i = 1) = 0.01, \, \forall i \in \mathcal {N}. \end{aligned} \end{aligned}$$
(CBFM)

This instance model is characterized by ten parameters, which define the probabilities that the h and J terms in the Ising model take on zero, positive, or negative values, as well as the magnitudes of those values. Benchmarking these instances on the previous generation of D-Wave’s QPU architecture (i.e., the 2000Q platform using the Chimera hardware graph) showed promising performance against state-of-the-art classical alternatives, although a clear wall-clock run time benefit was not achieved (Pang et al. 2021).

In this work, we design a variant of the CBFM problem class called CBFM-P, which is tailored to D-Wave’s first Advantage QPU platform. The model parameters for this problem class are

$$\begin{aligned} \begin{aligned} P({J}_{ij} = 0) = 0.35, \, P({J}_{ij} = -1) = 0.10, \, P({J}_{ij} = 1) = 0.55, \, \forall (i, j) \in \mathcal {E}^H \\ P({h}_i = 0) = 0.15, \, P({h}_i = -1) = 0.85, \, P({h}_i = 1) = 0, \, \forall i \in \mathcal {N}. \end{aligned} \end{aligned}$$
(CBFM-P)

The CBFM-P parameters differ from CBFM, as the Advantage QPU architecture features a different and denser hardware graph called Pegasus, whose topology is detailed in “Appendix A”. These new parameters were discovered using a metaheuristic approach that explored different combinations of the ten parameters in this model and sought to maximize the problem’s difficulty. In each evaluation of the metaheuristic, a combination of parameters was selected, one random instance was generated following this parameterization, and a variety of classical solution methods were executed on the instance. The instance difficulty was determined by comparing the lower and upper bounds of solutions found by these classical solution methods. Although this approach is naive, we found that it was sufficient for the objectives of this study. We expect that there exist classes of more challenging hardware-native instances on the Pegasus graph, but identifying these classes is left for future work.
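The instance-generation step can be sketched as follows. In practice the hardware graph \((\mathcal {N}, \mathcal {E}^H)\) must be queried from the target QPU; the toy graph below is a stand-in, and the helper name is hypothetical:

```python
import random

def generate_cbfm_p(nodes, edges, seed=0):
    """Sample a CBFM-P instance over a given hardware graph (N, E^H)."""
    rng = random.Random(seed)
    # Quadratic terms: P(J=0)=0.35, P(J=-1)=0.10, P(J=1)=0.55.
    J = {e: rng.choices([0.0, -1.0, 1.0], weights=[0.35, 0.10, 0.55])[0]
         for e in edges}
    # Linear terms: P(h=0)=0.15, P(h=-1)=0.85, P(h=1)=0.
    h = {i: rng.choices([0.0, -1.0, 1.0], weights=[0.15, 0.85, 0.0])[0]
         for i in nodes}
    return h, J

# A small cycle graph standing in for a Pegasus subgraph.
nodes = range(6)
edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (0, 5)]
h, J = generate_cbfm_p(nodes, edges)
```

Because each coefficient is sampled independently, generating one instance is linear in \(|\mathcal {N} |+ |\mathcal {E}^H |\); the expensive part of the metaheuristic search described above is evaluating each candidate instance's difficulty with classical solvers.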

3 Optimization performance analysis

In this section, we compare the performance of the D-Wave Advantage QPU and a variety of classical algorithms for optimization of CBFM-P Ising models. Specifically, we consider the following established classical algorithms:

  • A greedy algorithm based on steepest coordinate descent (SCD) (Pang et al. 2021);

  • An integer quadratic programming (IQP) model formulation solved using the commercial mathematical programming solver Gurobi (Billionnet and Elloumi 2007);

  • Simulated annealing (SA) (van Laarhoven and Aarts 1987; D-Wave Systems 2022);

  • A spin-vector Monte Carlo (SVMC) algorithm, which was proposed to approximate the behavior of QA (Shin et al. 2014);

  • Parallel tempering with iso-energetic clustering moves (PT-ICM) (Zhu et al. 2015).

SCD and IQP are general optimization approaches, intended to serve as strawman comparisons to understand solution quality, while SA, SVMC, and PT-ICM reflect high-performance classical competitors, which provide different tradeoffs in run time and solution quality. Details of these methods and others that were considered are discussed in “Appendix B”. All of these classical optimization algorithms were executed on a system with two Intel Xeon E5-2695 v4 processors, each with 18 cores at 2.10 GHz, and 125 GB of memory. The parameterizations used by each algorithm in this work are also detailed in “Appendix B”.
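As one illustration of the simplest baseline, the sketch below implements the greedy SCD idea: repeatedly flip the single spin whose flip most decreases Eq. (1), stopping at a local minimum. The cited implementation may differ in details such as tie-breaking and random restarts:

```python
def scd(h, J, sigma):
    """Steepest coordinate descent on an Ising model: repeatedly flip the
    spin giving the largest energy decrease until no flip improves."""
    # Precompute each spin's neighborhood from the edge set.
    neighbors = {i: [] for i in h}
    for (i, j), Jij in J.items():
        neighbors[i].append((j, Jij))
        neighbors[j].append((i, Jij))
    while True:
        best_i, best_delta = None, 0.0
        for i in h:
            # Flipping sigma_i changes the energy of Eq. (1) by
            # -2 * sigma_i * (h_i + sum_j J_ij sigma_j).
            field = h[i] + sum(Jij * sigma[j] for j, Jij in neighbors[i])
            delta = -2.0 * sigma[i] * field
            if delta < best_delta:
                best_i, best_delta = i, delta
        if best_i is None:
            return sigma  # local minimum: no strictly improving flip
        sigma[best_i] = -sigma[best_i]

# Toy 3-spin chain; from this start, one flip reaches the global optimum.
h = {0: -1.0, 1: 0.0, 2: 0.0}
J = {(0, 1): -1.0, (1, 2): -1.0}
sigma = scd(h, J, {0: 1, 1: 1, 2: -1})
print(sigma)  # {0: 1, 1: 1, 2: 1}
```

Each iteration costs \(O(|\mathcal {N} |+ |\mathcal {E}^H |)\) as written; maintaining incremental local fields is the usual optimization, which this sketch omits for clarity.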

For the QA hardware comparison, we use the Advantage_system4.1 QPU accessed through D-Wave Systems’ LEAP cloud platform. The largest system we consider features \(|\mathcal {N} |= 5{,}387\) discrete variables and \(|\mathcal {E}^{H} |= 25{,}324\) quadratic coefficients in the Pegasus topology. Solving a hardware-native optimization problem on this platform consists of (i) programming an Ising model, (ii) repeating the annealing and read-out process a number of times, and (iii) returning the highest quality solution found over all replicates. In this analysis, we hold the annealing time constant at \(62.5 \upmu \)s, which is justified in “Appendix C”. The number of anneal-read cycles is varied between 10 and \(5{,}\!120\) to produce different total run times. We also leverage the spin reversal transforms feature, provided by the LEAP platform, after every 100 anneal-read cycles to mitigate the undesirable impacts of the aforementioned integrated control errors. For each Ising instance, this protocol typically requires less than two seconds of QPU compute time and less than 10 seconds of total wall-clock time.

3.1 A characteristic example

Here, we present an evaluation of the above optimization techniques on a characteristic problem instance of the largest CBFM-P Ising models that we considered on the Advantage_system4.1 QPU, with 5,387 variables. For each solution technique, parameters that control the execution time of the algorithm (e.g., the number of sweeps in SA or the wall-clock time limit of the IQP method) were varied to understand their effects on solution quality. These parameters are detailed in “Appendix B”. All other parameters remained fixed.

Fig. 1 Evaluation of solution quality for a characteristic example of the CBFM-P instance class with 5,387 decision variables. Although the Advantage_system4.1 QPU does not find the best-known solution, it consistently and quickly finds solutions within \(0.5\%\) of the best-known solution. Here, the dashed line corresponds to the achieved solution quality from QA when using 2,560 anneal-read cycles, as used in subsequent analyses. For comparison, the dotted line corresponds to a traditional optimization tolerance of \(0.01\%\), as typically used by mathematical programming solvers as a termination criterion

Benchmarking results for the CBFM-P instance “16” are shown in Fig. 1. Here, the horizontal axis measures the execution time of each algorithm, where each point indicates the best solution at the end of an independent algorithm execution with some set termination criterion. The vertical axis measures the solution quality as the relative difference from the best-known solution. Specifically, each solution’s relative difference is computed as

$$\begin{aligned} \% \text {Relative Difference} = 100\% \left( \frac{|\bar{E} - E^{*} |}{|E^{*} |}\right) , \end{aligned}$$
(5)

where \(E^{*}\) is the best-known objective value, i.e., the energy of Eq. (1) for the best-known solution, and \(\bar{E}\) is the objective value obtained for a specific solver and execution time.
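Eq. (5) is straightforward to compute. For example, a solver energy of \(-99.5\) against a best-known energy of \(-100\) yields a \(0.5\%\) relative difference:

```python
def relative_difference(E_bar, E_star):
    """Percent relative difference of Eq. (5) from the best-known energy."""
    return 100.0 * abs(E_bar - E_star) / abs(E_star)

print(relative_difference(-99.5, -100.0))  # 0.5
```

Note that the absolute value in the numerator means this measure does not distinguish a solver that slightly improves on the best-known energy from one that falls equally short of it; in this study \(E^{*}\) is the best energy found by any method, so \(\bar{E} \ge E^{*}\).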

In Fig. 1, we first observe that the QPU (i.e., the solid black line) is shown to find high-quality, but not optimal, solutions at very fast timescales (between 0.01 and 2.40 seconds), with relative quality differences between \(0.2\%\) and \(0.5\%\) of the best-known solution. Note that each execution time comprising the solid black line reflects a setting where the classical computer is colocated with and has exclusive access to the QPU. In practice, QPU access is managed by D-Wave’s remote cloud service, LEAP, which has overheads in both communication and job scheduling. These solve times are reflected by the open points in Fig. 1, which add between one and five seconds of overhead to the total idealized solve times. Impressively, even accounting for these significant overheads, the QPU is still able to obtain high-quality solutions well before all other classical methods that are considered.

Although the Advantage_system4.1 QPU is capable of quickly obtaining high-quality solutions in short amounts of time, it appears to reach a solution quality limit around \(0.2\%\). This relative difference is over an order of magnitude larger than the standard termination criterion used by mathematical programming solvers, i.e., an optimality gap of \(0.01\%\) or less, delineated by the dotted line in Fig. 1. To facilitate a comparison of the run time performance gained by the Advantage_system4.1 QPU, we thus propose a measurement that evaluates the ability of classical algorithms to match the solution quality found by the QPU after 2,560 anneal-read cycles. The measurement we use is similar to determining the intersection with the dashed line in Fig. 1, albeit on a linear instead of logarithmic scale.

In this instance example, the best solution obtained by the QPU after 2,560 anneal-read cycles is discovered after around 1.2 seconds when neglecting overheads and 4.3 seconds when including overheads. Most solution techniques (i.e., SVMC, IQP, and SCD) do not reach this solution quality after one hour of computation. Simulated annealing matches this quality after around 132 seconds and, linearly interpolating between the two nearest points before and after the intersection, PT-ICM is estimated to match this quality after around 77 seconds. Thus, the best-case performance of the QPU in this experiment, which assumes colocation with and direct access to the QPU, provides a 64 times improvement in run time, from 77 seconds with PT-ICM to 1.2 seconds. A similar comparison using the wall-clock run time yields an improvement of around 18 times. That is, even when including the overhead of communicating with D-Wave’s LEAP cloud service, the QPU is capable of providing a high-quality solution over an order of magnitude faster than all tested classical methods.

3.2 Problem scaling run time trends

In this subsection, we investigate how the run time performance of the QPU is impacted by the size of the problem that is considered. Unlike the previous section, here, we consider solution statistics that are aggregated over 50 distinct CBFM-P instances per problem size. Similar to the run time ratios discussed in Sect. 3.1, we estimate the amount of time required for the classical algorithms to match, on average, the solution quality reported by the QPU using an annealing time of \(62.5 \upmu \)s and 2,560 anneal-read cycles. This experiment is performed for Pegasus lattice sizes ranging from two to sixteen, yielding problems with 40 to 5,387 decision variables. For each instance, if a classical algorithm exactly matches the best solution (objective) found by the QPU, this time-to-match measurement is the earliest solve time at which that solution is obtained. If a classical algorithm finds a solution that does not strictly match but is better than the solution found using the QPU, the time-to-match measurement is estimated via a linear interpolation between the time at which the better solution is obtained and the time at which the worse solution, preceding it, is obtained.
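The time-to-match estimate described above can be sketched as follows. A solver trajectory is assumed to be a sequence of (solve time, best energy) pairs with non-increasing energies, and the trajectory values below are hypothetical:

```python
def time_to_match(times, energies, target):
    """Estimate the earliest time a solver's trajectory reaches the QPU's
    target energy (lower is better). If the target is crossed between two
    recorded points, linearly interpolate between them."""
    for k, (t, e) in enumerate(zip(times, energies)):
        if e <= target:
            if k == 0:
                return t  # matched at the first recorded point
            t0, e0 = times[k - 1], energies[k - 1]
            # Here e0 > target >= e, so the interpolation is well defined.
            frac = (e0 - target) / (e0 - e)
            return t0 + frac * (t - t0)
    return None  # never matched the target quality

# Hypothetical trajectory: -98 at 10 s, then -102 at 20 s. A QPU target
# of -100 is matched at an interpolated 15 s.
print(time_to_match([10.0, 20.0], [-98.0, -102.0], -100.0))  # 15.0
```

Interpolating on energy rather than simply taking the later time gives the classical solver the benefit of the doubt, so the run time ratios reported below are, if anything, conservative.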

Fig. 2 Estimated relative computation times (“run time ratios”) required for classical algorithms to match the solution quality of the Advantage_system4.1 QPU as a function of problem size. Each point represents a mean computed over 50 random CBFM-P instances per problem size. Error bars correspond to standard errors of each mean run time ratio, and points are plotted only if the solver could match QA objectives for all 50 instances. The computational benefits of the QPU begin to become apparent for problems with around 1,000 decision variables, and these run time benefits increase steadily with problem size

The results of conducting these scaling experiments and analyses are summarized in Fig. 2. Figure 2a illustrates the idealized run time improvements (i.e., without including communication overheads) as a function of the number of variables in the Ising model. It is clear that the problem scale has a large impact on the usefulness of QA hardware. For small problem sizes (e.g., less than 500 variables), the QPU run time is greater than most of the classical algorithms, as indicated by a run time ratio less than \(10^{0}\). For instances containing roughly 1,000 variables or more, the QA hardware begins to have run time performance benefits, and only the best classical heuristics, i.e., PT-ICM and SA, are capable of matching the QA hardware’s solution quality, albeit sometimes after a significant amount of time. Note that points are excluded from Fig. 2a if a solver did not match the QA solution quality for all 50 instances. For example, SA matched the solution quality for only 49 of 50 instances for Pegasus lattice sizes of 9 and 10, and hence these points are excluded from this plot.

For the two most competitive classical methods, PT-ICM and SA, Fig. 2b shows that the run time benefits of the QA hardware increase steadily with problem size. This trend holds when considering both the idealized computation setting (solid lines) and the real-world setting that includes communication and scheduling overheads (dashed lines). In particular, the estimated \(15 \times \) run time ratio for the largest problem size is encouraging, as this suggests that the solutions identified by the QPU, even when accessed via a cloud computing service, can be obtained quickly enough to accelerate the performance of classical heuristic methods.

4 Limitations and opportunities

Sections 3.1 and 3.2 provide evidence that there exist classes of Ising models where available QA hardware can provide run time performance improvements over classical alternatives. This is an encouraging result, but it is also important to recognize some limitations of this study and available QA hardware.

Limitations of This Study: The foremost limitation of this work is that it considers Ising models that are hardware-native. Such models provide best-case scenarios for QA hardware and, thus far, have not reflected sparsity patterns of realistic combinatorial optimization tasks. Although this work demonstrates an important necessary condition for having a performance benefit on practical problems, it is not a sufficient condition. Benchmarking real-world problems is required to show that these benefits can also be realized in that context.

We also note that most of the classical algorithms employed in this work did not effectively exploit parallelism, and all except SVMC and PT-ICM used their single-threaded variants. Parallelism of classical algorithms may reduce or eliminate the performance benefits presented in this work. Further, the benchmarks considered in this study did not evaluate other novel computing technologies or special-purpose hardware (e.g., McMahon et al. 2016; Goto et al. 2019; Matsubara et al. 2020; Honjo et al. 2021; Kowalsky et al. 2022), which could provide improved performance on CBFM-P instances. Both of these avenues should be explored in future work to improve heuristic algorithms and better exploit computational resources.

Finally, we also recognize that this study does not attempt to demonstrate nor assert the much sought-after scaling advantage from quantum annealing (Rønnow et al. 2014), even for the contrived class of CBFM-P instances that are considered. This work has provided encouraging initial evidence of a class of Ising models where QA hardware can provide a practical, constant factor performance improvement over available classical algorithms.

Limitations of Current QA Hardware: The primary limitation of the QA hardware identified in this study is that it appears to approach a limit on solution quality for the largest CBFM-P instance class, i.e., around \(0.20\%\) from the best-known solution. More evidence for this behavior is provided in “Appendix C”. As such, this work adopted a time-to-match measurement of performance, which is atypical for an optimization benchmarking study. Additional research is required to develop extensions of the simple QA optimization protocols used in this work to understand if the hardware can achieve solutions that are within 0.01% of global optimality, which would make this hardware’s performance consistent with standard optimality tolerances used by commercial optimization tools. QA hardware improvements to reduce noise and integrated control errors would also serve to further close this gap.

Future Opportunities: Despite the limitations of this work and current QA hardware, our results provide encouraging evidence that QA hardware is reaching a point where existing classical optimization algorithms can be practically outperformed, especially over very short timescales. If QA hardware continues to increase in the number of qubits and hardware graph connectivity while also reducing noise properties, it is reasonable to expect that the performance gap on hardware-native problem instances will continue to increase, as suggested by the results of Sect. 3.2. Recently, D-Wave Systems announced their plans to develop an Advantage 2 QPU, which will support over 7,000 decision variables and a denser hardware graph (D-Wave Systems 2021). If the trends observed in this work continue on this new platform, identifying even more dramatic performance gains should be possible. Acknowledging these anticipated hardware improvements, as well as the empirical findings of this study, revisiting the topic of demonstrating a QA scaling advantage (as in Denchev et al. 2016; Albash and Lidar 2018) is a natural next step to establish a stronger case for a fundamental performance benefit of the QA computing model for combinatorial optimization.

5 Conclusion

After roughly twenty years of research and development and ten years of focused commercial development, we believe quantum annealing technology has reached a level of technical maturity and performance that warrants serious consideration by the operations research community. This work has shown, for the first time, an order-of-magnitude run time performance benefit for quantum annealing over a wide range of classical alternatives, even when accounting for the substantial overheads involved in the practical usage of commercial quantum annealing services. Nonetheless, significant open challenges remain in translating these performance results into benefits for practical optimization tasks. There may be significant unrealized opportunities to hybridize this new computing technology into existing mathematical programming algorithms and impact real-world optimization challenges. We sincerely hope that this work will inspire the operations research community to increase its consideration of the quantum annealing computing model and continue exploring how it can potentially benefit mathematical optimization algorithms and practical applications. To support follow-on works along these lines, all of the test cases and runtime results used in the production of this article are made available at https://github.com/lanl-ansi/arXiv-2210.04291.