Abstract
We show a protocol achieving the ultimate Heisenberg-scaling sensitivity in the estimation of a parameter encoded in a generic linear network, without employing any auxiliary networks, and without the need of any prior information on the parameter nor on the network structure. As a result, this protocol does not require a prior coarse estimation of the parameter, nor an adaptation of the network. The scheme we analyse consists of a single-mode squeezed state and homodyne detectors in each of the M output channels of the network encoding the parameter, making it feasible for experimental applications.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Increasing the level of precision achievable in the estimation of physical properties of systems, such as temperatures, optical lengths and magnitude of external fields among others, is one of the multiple applications of quantum technologies that have been extensively studied in the recent years. In particular, the goal of quantum metrology—the field of science laying between quantum mechanics and estimation theory—is to propose and analyse estimation protocols that surpass the precision achievable by classical strategies by employing quantum probes and quantum measurement schemes. In fact it is well known that the classical limit on the precision achievable in the estimation of an unknown parameter when employing N probes, known as shot-noise limit, for which the error is of order \(1/\sqrt{N}\), can be surpassed by quantum strategies achieving the ultimate Heisenberg limit, where the estimation scales as 1/N [1,2,3,4,5,6,7,8,9].
The first proposed protocols reaching Heisenberg-scaling sensitivity heavily employed entanglement as a metrological resource [1,2,3], and several entanglement-based strategies have been recently studied with interesting results, especially in the cases of simultaneous estimation of multiple parameters with non-commuting generators [5,6,7,8,9,10]. Nonetheless, the entanglement fragility and the complicated procedures needed to generate entangled metrological probes, such as NOON or GHZ states [11, 12], are two of the challenges that stimulated the search for more feasible estimation schemes making use of protocols implementing metrological resources that are easier to generate and to manipulate. Squeezed light [13, 14] manifests useful properties (e.g. robustness to decoherence, relatively easy implementation, reduced noise below the vacuum shot-noise) which make it a perfect candidate as a feasible metrological resource [15,16,17,18,19]. Motivated by these favourable properties, many works have recently focused on the analysis and proposal of Gaussian metrological schemes, namely involving squeezed states as probes and homodyne detection as measurement, and the ultimate precision that these can achieve in the estimation of a single localised parameter [15,16,17,18, 20, 21], a function of parameters [22,23,24], or a single distributed parameter, such as the temperature or the electromagnetic field, affecting several component of the network [8, 25,26,27,28,29,30]. These schemes typically perform an analysis based on the study of the Cramér–Rao bound (CRB) [31], or its quantum counterpart [32], to assess the ultimate precision achievable in the estimation, and show whether the Heisenberg scaling can be achieved. Although the CRB analysis does not generally assure that the ultimate bound on the precision found can be achieved globally, namely with an estimation strategy which does not depend on the true value of the parameter to be estimated [33, 34], local estimations are relevant in the typical interferometric framework, in which small deviations of the parameters need to be measured. Interestingly, it has been recently found that it is always possible to reach Heiseneberg-scaling sensitivity regardless of the structure of the network encoding the parameter, only employing a single squeezed vacuum state, a single-homodyne measurement, and an auxiliary network suitably engineered, whose preparation only requires a knowledge on the unknown parameter that can be obtained by a classical measurement [26, 29, 30]. The need for an auxiliary stage in such protocols arises from the fact that in general the probe is scattered by the network in all its output ports, while only a single port is eventually measured through homodyne detection, so that an auxiliary network is required in order to refocus the probe on the only channel observed. A question that naturally arises is whether incrementing the number of observed channels would ease, if not completely lift, the requirement of an auxiliary stage and ultimately the requirement of a prior classical knowledge on the unknown parameter. Moreover, different Gaussian protocols rely on encoding the information about the unknown parameter on the displacement of the probe, requiring that a portion of the resources in the probe are employed in a non-vanishing displacement [28, 35]. Despite concentrating all the photons in the squeezing is known to be the optimal allocation of the resources in the probe [26], encoding the parameter into a non-vanishing displacement can reduce the estimation process into the relatively simple task of inferring the parameter from the expectation value of a Gaussian probability density function [28].
In this work we investigate the ultimate precision achievable in the estimation of a parameter encoded in a generic linear network, when employing a single-mode squeezed coherent Gaussian state and performing homodyne detection on all the output channels (see Fig. 1). We show that, without making any assumption on the structure of the linear network nor on the nature of the parameter, it is always possible to reach Heisenberg-scaling sensitivity with such set-up, without the use of any auxiliary network. This allows for estimation protocols not requiring a preparatory stage nor a prior coarse estimation of the parameter, as opposed to the single-channel homodyne protocols in Refs. [29, 30]. We also show that two independent contributions on the precision arise from our analysis: one originated from the presence of displaced photons in addition to squeezed photons, and the other from the squeezing of the probe. We find that both contributions can reach Heisenberg-scaling sensitivity independently, and this can be achieved expectedly when the local oscillators phases are chosen such that the noise in the outcome is reduced, namely when the squeezed quadratures are observed in each output channel. Thus, differently from the schemes in Refs. [29, 30], it is then possible to employ this set-up to retrieve information on the parameter through measurements of the average value (i.e. the displacement) of the signal observed with homodyne detection, as well as through the modulation of the noise. Although it is not required to reach the Heisenberg-scaling sensitivity, the presence of an auxiliary network in general affects the precision of the estimation through a pre-factor multiplying the scaling. This comes in useful in those cases where priority is given to increasing the precision, at the expenses of engineering an auxiliary network to be added before the estimation protocol is started.
2 Set-up
Let us consider a generic \(M\times M\) passive linear network whose action on any injected photon probe is given by the unitary operator \({\hat{U}}_{\varphi }\), in which the unknown parameter \(\varphi \) to be estimated is encoded in an arbitrary manner. The linearity and passivity of the network allow us to describe it with an \(M\times M\) unitary matrix \(U_{\varphi }\) related to the evolution operator \({\hat{U}}_{\varphi }\) by
The input probe is prepared in a single-mode squeezed coherent state \(\vert \varPsi _{\mathrm {in}}\rangle ={\hat{D}}_1(d){\hat{S}}_1(r)\vert \mathrm {vac}\rangle \) with \(\mathrm{N}=\sinh ^2 \mathrm{r} + \mathrm{d}^2/2 \equiv N_\mathrm {S}+ N_\mathrm {D}\) average number photons, where \({\hat{S}}_1(r) = \exp \bigl (r({\hat{a}}_1^{\dag 2}-{\hat{a}}_1^{2})/2\bigr )\) is the squeezing operator with real squeezing parameter r, and \({\hat{D}}_1(d) = \exp \bigl (d({\hat{a}}^\dag _1-{\hat{a}}_1)/\sqrt{2}\bigr )\) is the displacement operator with real displacement d, and it is injected in one input channel, say the first, of the linear network. In this case, only the first row of \(U_{\varphi }\) is relevant in this protocol
where we have made explicit the probability \(P_j\) that each photon exits from the jth output port of the network, and the phase \({\bar{\gamma }}_j\) acquired in the process, with j from 1 to M. The unitarity of \(U_{\varphi }\) assures that \(\sum _j P_j = 1\).
A homodyne detection is then performed at each of the output channels, and the quadratures \({\hat{x}}_{i,\theta _{i}}\) are measured, where \(\theta _i\) is the ith local oscillator reference phase, from which we want to infer the value of \(\varphi \). Due to the Gaussian nature of the scheme, the joint probability distribution \(p({\varvec{x}}|\varphi )\) associated with the M-mode homodyne measurement is Gaussian
Here, \(\varSigma \) is the \(M\times M\) covariance matrix with elements (see “Appendix A”)
where \(\delta _{ij}\) is the Kronecker delta, \(\gamma _i={\bar{\gamma }}_i-\theta _i\) is the phase delay at the output of the ith channel relative to the correspondent local oscillator, and \(\left|\varSigma \right|\) is the determinant of \(\varSigma \), which reads (see “Appendix A”)
and \(\varvec{\mu }\) is the mean vector
For any given unbiased estimator \({\tilde{\varphi }}\), the statistical error in the estimation of \(\varphi \) after \(\nu \) iterations of the measurement is limited by the Cramér-Rao bound (CRB) [31]
where \({\mathcal {F}}(\varphi )\) is the Fisher information
associated with the Gaussian distribution (3), and reads (see “Appendix B”)
where \(C=\left|\varSigma \right|\varSigma ^{-1}\) is the cofactor matrix of \(\varSigma \) and \(\mathrm{Tr}[\cdot ]\) denotes the trace. In the following we will discuss in detail expression (9) in the asymptotic limit of large N, showing which condition must be met in order for this set-up to reach Heisenberg-scaling precision in the estimation of \(\varphi \), and compare differences and advantages with respect to the Heisenberg-scaling single-homodyne schemes [29, 30].
We conclude this section by remarking that, in the case of a single channel, \(M=1\), the last term in the right-hand side of (9) vanishes, and thus the only relevant terms are the first two, containing the derivative of the mean \(\varvec{\mu }\) and of the determinant of \(\varSigma \), which reduces to the variance of a single-homodyne measurement. Interestingly enough, we will show that also in the multi-homodyne case, only the first two terms are relevant for the Heisenberg scaling in the asymptotic regime.
3 Heisenberg scaling of the Fisher information
In order to investigate the asymptotic behaviour of the Fisher information (9), it is convenient to express the elements of the cofactor matrix C in terms of the squeezing factor r (see “Appendix B”):
where
Notice that every element of C in the previous expressions, and of \(\varSigma \) in (4), scale at most as quick as N, namely \(C_{st}=O(N_\mathrm {S})\) and \(\varSigma _{st}=O(N_\mathrm {S})\), while the mean vector \(\varvec{\mu }\) is of order \(O(\sqrt{N_\mathrm {D}})\) (see “Appendix C”), and the same asymptotic bounds hold for their derivatives with respect to \(\varphi \), since neither \(P_i\) nor \({\bar{\gamma }}_i\) depend on N. For this reason, in order for the Fisher information in (9) to asymptotically grow with Heisenberg scaling, it is essential to study the asymptotics of the determinant \(\left|\varSigma \right|\) in (5) and find the conditions for which it does not grow with N.
In fact, it is evident from Eq. (5) that in general \(\left|\varSigma \right|=O(N_\mathrm {S})\), and we show in “Appendix C” that the necessary condition for it to scale slower than \(N_\mathrm {S}\) is that the relative phases \(\gamma _i\) tend to \(\pm \pi /2\) for large \(N_\mathrm {S}\): in other words, the larger the number of photons employed in the squeezing of the probe to reach higher precisions, the closer the local oscillator phase needs to be tuned to the minimum-variance quadrature of each mode.
More precisely, as shown in “Appendix C”, the conditions to reach Heisenberg scaling in the Fisher information (9), read
When these conditions hold, we can introduce the finite quantities \(k_i =\lim _{N_\mathrm {S}\rightarrow \infty } N_\mathrm {S}(\gamma _i\mp \pi /2)\), and the determinant \(\left|\varSigma \right|\) reduces to
while \(\partial _\varphi \left|\varSigma \right|\), \(\partial _\varphi \varSigma \), \(\partial _\varphi C\) and C tend to constant values, and \(\partial _\varphi \varvec{\mu }\) scales as \(\sqrt{N_\mathrm {D}}\), thus making only the first two terms of the Fisher information dominant for large N.
As expected, the determinant of the covariance matrix \(\varSigma \) reaches its minimum value when \(\gamma _i = \pi /2\), or \(k_i=0\) for \(i=1,\dots ,M\), namely when the squeezed quadratures are measured. When conditions (12) are met, we can neglect the trace term in Eq. (9), and we can write
where \(k_\mathrm {avg}\equiv \sum \nolimits _{i=1}^M P_i k_i\), \((\partial \gamma )_\mathrm {avg} \equiv \sum \nolimits _{i=1}^M P_i \partial _\varphi \gamma _i\), and \(\varrho (x) = (8x)^2/(16x^2 + 1)^2\) and \(\zeta (x)=(16x^2 + 1)^{-1}\) are positive, even function which reach their maxima at \(x=\pm 1/4\) and \(x=0\), respectively, namely \(\varrho (1/4)=1\) and \(\zeta (0)=1\). The Cramér-Rao bound (7) with the Fisher information (14) is saturated for large \(\nu \) by the maximum-likelihood estimator [36,37,38], and thus, Heisenberg-scaling precision can be achieved. The expression of the maximum-likelihood estimator for this estimation scheme can be found in “Appendix D”.
Noticeably, both terms in the asymptotic Fisher information (14) give a Heisenberg-scaling precision in the estimation of the parameter \(\varphi \), provided that both the average number of photons in the displacement \(N_\mathrm {D}\) and in the squeezing \(N_\mathrm {S}\) scale with the total average number of photons \(N=N_\mathrm {S}+N_\mathrm {D}\), namely \(N_\mathrm {S}= \beta N\) and \(N_\mathrm {D}= (1-\beta )N\), for any value \(0<\beta \le 1\) independent of N.
Moreover, it is worth noticing that the first term in Eq. (9), and thus in Eq. (14), depends on the information encoded in the displacement of the probe, and thus it vanishes if \(\varvec{\mu } = 0\), namely if the probe is a squeezed vacuum and \(N_\mathrm {D}=0\). The second term instead depends on the information on \(\varphi \) encoded in the variance of the measurement itself: it arises only from the interaction with the squeezed photons and vanishes if \(\partial _\varphi \left|\varSigma \right| = 0\), namely when \(k_\mathrm {avg} = 0\) in Eq. (14), and in particular when \(\gamma _i = \pm \pi /2\), for \(i=1,\dots ,M\), in Eq. (12), corresponding to quadratures with minimum squeezed variances, and thus locally insensible to the variations of the parameter.
Interestingly, this latter case is similar to the single squeezed vacuum and single-homodyne scenario found in the literature [29, 30]: in fact, the second term in (14) represents a generalisation of the single-homodyne Fisher information \({\mathcal {F}}_1(\varphi ) = 8\varrho (k)(\partial _\varphi \gamma )^2 N^2\), and it can be obtained by substituting k and \(\partial _\varphi \gamma \) with their averages over the probabilities \(P_i\), namely \(k\rightarrow \sum _i P_i k_i\) and \(\partial _\varphi \gamma \rightarrow \sum _i P_i \partial _\varphi \gamma _i\).
We have then found that, also when employing multiple homodyne detections, one for each output port of the interferometer, the Heisenberg-scaling precision obtained through measurements of the squeezed noise (i.e. \(\partial _\varphi \varvec{\mu }=0\)) is only reached when the quantum fluctuations of the observed quadratures are reduced to their quantum limit, i.e. \(\left|\varSigma \right| = O(N_\mathrm {S}^{-1})\), while the variations of the unknown parameter \(\varphi \) still yield a visible effect on the outcomes of the measurements, i.e. \(\partial _\varphi \left|\varSigma \right|\) is not vanishing.
However, at the expense of introducing a nonzero displacement in the probe, it is possible to relax the condition \(\partial _\varphi \left|\varSigma \right|\ne 0\), thus allowing us to choose \(k_i=0\) in Eq. (13), thus effectively measuring the maximally squeezed quadratures at \(\gamma _i= \pm \pi /2\). Indeed in such a case, even if the contribution to the Fisher information associated with only the squeezed photons in Eq. (14) is vanishing, it is still possible to reach Heisenberg-scaling precision through the information on the parameter encoded in the displacement of the probe.
An important feature of this protocol, which differentiate it from its single-homodyne counterpart, is that it does not require any adaptation of the network to the value of the unknown parameter, namely no auxiliary networks needs to be added at the input nor the output of \({\hat{U}}_{\varphi }\) to reach Heisenberg-scaling precision. The only condition (12) can be thought as a minimum-resolution requirement on the local oscillators phases, which can thus be achieved without adding further auxiliary networks.
However, this does not mean that the form of the network \(U_{\varphi }\) does not affect the precision of our protocol in the estimation of \(\varphi \): the terms \(k_\mathrm {avg}\) and \((\partial \gamma )_\mathrm {avg}\) appearing in the constant factor in the Fisher information in Eq. (14) depend on the transition probabilities \(P_i\) and on the derivatives of the relative phases \(\partial _\varphi \gamma _i\). In particular, an exceptionally poorly conceived network, e.g. one for which \(\gamma _i\) is independent on \(\varphi \) for every i such that \(P_i \ne 0\), can be associated with a null factor \((\partial \gamma )_\mathrm {avg}\) that sets to zero the Fisher information. In this case, adding a \(\varphi \)-independent auxiliary network V, either at the input or at the output of \(U_{\varphi }\), might modify both \(P_i\) and \(\gamma _i\), and thus \((\partial \gamma )_\mathrm {avg}\).
4 Conclusions
We have shown that performing homodyne measurements at each output channel of an arbitrary linear network encoding an unknown distributed parameter \(\varphi \) to be estimated allows us to reach Heisenberg-scaling precision for a single-mode squeezed probe with no prior information on \(\varphi \). The information on \(\varphi \) is encoded both in the displacement and in the squeezing of the probe, leading to two independent contributions which can both provide Heisenberg-scaling sensitivity. We have shown that the determinant of the covariance matrix associated with the measurement outcomes plays an important role in the enhanced sensitivity: in particular, we demonstrated that the conditions to reach Heisenberg scaling in either of the two contributions, which can be met manipulating the phases of the local oscillators, correspond to imposing that the determinant of the covariance matrix is of order \(N^{-1}\) for large N. Differently from protocols involving only homodyne measurements at a single channel, here there is no need for a refocusing auxiliary stage: the procedure is independent of the network and of the value of the parameter. This allows us to safely entrust the measurement operation to an independent party without sharing any information on the structure of the network, possibly opening up a further path towards secure sensing and cryptographic quantum metrology [39,40,41]. On the other hand, we showed that, despite not required to achieve the Heisenberg limit, one can still employ an auxiliary stage to further enhance the estimation precision by a constant factor. As a future step, it would be interesting to extend the results hereby presented beyond the Cramér–Rao bound analysis, also in the framework of a parameter distributed arbitrarily in a network.
References
V. Giovannetti, S. Lloyd, L. Maccone, Quantum-enhanced measurements: beating the standard quantum limit. Science 306(5700), 1330–1336 (2004)
V. Giovannetti, S. Lloyd, L. Maccone, Quantum metrology. Phys. Rev. Lett. 96, 010401 (2006)
J.P. Dowling, Quantum optical metrology—the lowdown on high-n00n states. Contemp. Phys. 49(2), 125–143 (2008)
V. Giovannetti, S. Lloyd, L. Maccone, Advances in quantum metrology. Nat. Photonics 5, 010401 (2011)
J.P. Dowling, K.P. Seshadreesan, Quantum optical technologies for metrology, sensing, and imaging. J. Lightw. Technol. 33, 2359–2370 (2015)
A. De Pasquale, P. Facchi, G. Florio, V. Giovannetti, K. Matsuoka, K. Yuasa, Two-mode bosonic quantum metrology with number fluctuations. Phys. Rev. A 92, 042115 (2015)
S. Zhou, M. Zhang, J. Preskill, L. Jiang, Achieving the Heisenberg limit in quantum metrology using quantum error correction. Nat. Commun. 9, 1–11 (2018)
W. Ge, K. Jacobs, Z. Eldredge, A.V. Gorshkov, M. Foss-Feig, Distributed quantum metrology with linear networks and separable inputs. Phys. Rev. Lett. 121, 043604 (2018)
K. Qian, Z. Eldredge, W. Ge, G. Pagano, C. Monroe, J.V. Porto, A.V. Gorshkov, Heisenberg-scaling measurement protocol for analytic functions with quantum sensor networks. Phys. Rev. A 100, 042304 (2019)
T.J. Proctor, P.A. Knott, J.A. Dunningham, Multiparameter estimation in networked quantum sensors. Phys. Rev. Lett. 120, 080501 (2018)
D.M. Greenberger, M.A. Horne, A. Zeilinger, Going Beyond Bell’s Theorem (Springer Netherlands, Dordrecht, 1989), pp. 69–72
P. Kok, H. Lee, J.P. Dowling, Creation of large-photon-number path entanglement conditioned on photodetection. Phys. Rev. A 65, 052104 (2002)
M.O. Scully, M.S. Zubairy, Quantum Optics (Cambridge University Press, Cambridge, 1997)
W. Schleich, Quantum Optics in Phase Space (Wiley, New York, 2011)
A. Monras, Optimal phase measurements with pure gaussian states. Phys. Rev. A 73, 033821 (2006)
L. Pezzé, A. Smerzi, Mach-Zehnder interferometry at the Heisenberg limit with coherent and squeezed-vacuum light. Phys. Rev. Lett. 100, 073601 (2008)
M.D. Lang, C.M. Caves, Optimal quantum-enhanced interferometry using a laser power source. Phys. Rev. Lett. 111, 173601 (2013)
J. Aasi, J. Abadie, B.P. Abbott, R. Abbott, T.D. Abbott, M.R. Abernathy, C. Adams, T. Adams, P. Addesso, R.X. Adhikari et al., Enhanced sensitivity of the ligo gravitational wave detector by using squeezed states of light. Nat. Photonics 7, 613–619 (2013)
L. Maccone, A. Riccardi, Squeezing metrology: a unified framework. Quantum 4, 292 (2020)
M. Aspachs, J. Calsamiglia, R Muñoz. Tapia, E. Bagan, Phase estimation for thermal gaussian states. Phys. Rev. A 79, 033834 (2009)
C. Oh, C. Lee, C. Rockstuhl, H. Jeong, J. Kim, H. Nha, S.-Y. Lee, Optimal Gaussian measurements for phase estimation in single-mode gaussian metrology. NPJ Quantum Inf. 5(1), 10 (2019)
D. Gatto, P. Facchi, V. Tamma, Phase space Heisenberg-limited estimation of the average phase shift in a Mach-Zehnder interferometer. Int. J. Quantum Inf. 2020, 1941019 (2020)
Y. Xia, W. Li, W. Clark, D. Hart, Q. Zhuang, Z. Zhang, Demonstration of a reconfigurable entangled radio-frequency photonic sensor network. Phys. Rev. Lett. 124, 150502 (2020)
D. Triggiani, P. Facchi, V. Tamma, Heisenberg scaling precision in the estimation of functions of parameters in linear optical networks. Phys. Rev. A 104, 062603 (2021)
Q. Zhuang, Z. Zhang, J.H. Shapiro, Distributed quantum sensing using continuous-variable multipartite entanglement. Phys. Rev. A 97, 032329 (2018)
T. Matsubara, P. Facchi, V. Giovannetti, K. Yuasa, Optimal gaussian metrology for generic multimode interferometric circuit. New J. Phys. 21, 033014 (2019)
D. Gatto, P. Facchi, F.A. Narducci, V. Tamma, Distributed quantum metrology with a single squeezed-vacuum source. Phys. Rev. Res. 1, 032024 (2019)
X. Guo, C.R. Breum, J. Borregaard, S. Izumi, M.V. Larsen, T. Gehring, M. Christandl, J.S. Neergaard-Nielsen, U.L. Andersen, Distributed quantum sensing in a continuous-variable entangled network. Nat. Phys. 16, 281–284 (2020)
G. Gramegna, D. Triggiani, P. Facchi, F.A. Narducci, V. Tamma, Heisenberg scaling precision in multi-mode distributed quantum metrology. New J. Phys. 23, 053002 (2021)
G. Gramegna, D. Triggiani, P. Facchi, F.A. Narducci, V. Tamma, Typicality of Heisenberg scaling precision in multimode quantum metrology. Phys. Rev. Res. 3, 013152 (2021)
H. Cramér, Mathematical Methods of Statistics, vol. 9 (Princeton University Press, Princeton, 1999)
A. Holevo, Statistical decision theory for quantum systems. J. Multivar. Anal. 3(4), 337–394 (1973)
M. Hayashi, Comparison between the Cramer–Rao and the mini-max approaches in quantum channel estimation. Commun. Math. Phys. 304, 689–709 (2011)
M. Hayashi, S. Vinjanampathy, L.C. Kwek, Resolving unattainable Cramer–Rao bounds for quantum sensors. J. Phys. B: At. Mol. Opt. Phys. 52, 015503 (2018)
M.R. Grace, C.N. Gagatsos, S. Guha, Entanglement-enhanced estimation of a parameter embedded in multiple phases. Phys. Rev. Res. 3, 033114 (2021)
E.L. Lehmann, Elements of Large-Sample Theory (Springer, Berlin, 1999)
V.K. Rohatgi, A.M.E. Saleh, An Introduction to Probability and Statistics (Wiley, New York, 2000)
E. Polino, M. Valeri, N. Spagnolo, F. Sciarrino, Photonic quantum metrology. AVS Quantum Sci. 2(2), 024703 (2020)
Z. Huang, C. Macchiavello, L. Maccone, Cryptographic quantum metrology. Phys. Rev. A 99, 022314 (2019)
P. Yin, Y. Takeuchi, W.-H. Zhang, Z.-Q. Yin, Y. Matsuzaki, X.-X. Peng, X.-Y. Xu, J.-S. Xu, J.-S. Tang, Z.-Q. Zhou, G. Chen, C.-F. Li, G.-C. Guo, Experimental demonstration of secure quantum remote sensing. Phys. Rev. Appl. 14, 014065 (2020)
N. Shettell, D. Markham, E. Kashefi, A cryptographic approach to quantum metrology (2021)
S. Xu, M. Darouach, J. Schaefers, Expansion of det(a+b) and robustness analysis of uncertain state space systems. IEEE Trans. Autom. Control 38(11), 1671–1675 (1993)
Acknowledgements
This work was supported by the Office of Naval Research Global (N62909-18-1-2153). PF is partially supported by Istituto Nazionale di Fisica Nucleare (INFN) through the project QUANTUM, and by the Italian National Group of Mathematical Physics (GNFM-INdAM).
Open Access
This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Joint Detection Probability
Here we will first obtain the expectation value \(\varvec{\mu }\) associated with the homodyne measurements on the probe after the interaction with the linear network shown in (6), then we derive the expression of the covariance matrix \(\varSigma \) shown in (4), and its determinant \(\left|\varSigma \right|\) in (5).
The initial phase-space displacement of the injected probe \(\varvec{\alpha }_0 = \langle \varPsi _{\mathrm {in}}\vert \hat{\varvec{z}} \vert \varPsi _{\mathrm {in}}\rangle \), where \(\hat{\varvec{z}} = ({\hat{x}}_1, \dots , {\hat{x}}_M, {\hat{p}}_1, \dots , {\hat{p}}_M)\) and \(\vert \varPsi _{\mathrm {in}}\rangle = {\hat{D}}_1(d){\hat{S}}_1(r)\vert \mathrm {vac}\rangle \) is a 2M-vector
with \(d=\sqrt{N_\mathrm {D}}\). This vector is transformed by the linear network, and at the output reads
where R is the \(2M\times 2M\) orthogonal and symplectic matrix associated with the interferometer unitary matrix \(U_{\varphi }\)
The local oscillator phases in the homodyne measurements are described by the \(2M\times 2M\) orthogonal matrix
with \(\varTheta = \mathrm {diag}(\mathbf {\theta }) = \mathrm {diag}(\theta _1,\dots ,\theta _M)\). This matrix represents a clockwise rotation in phase space for each of the M channels of the network, of angles \(\theta _i\) for the ith mode. The mean vector \(\varvec{\mu }\) in Eq. (6) is then given by the first M elements of \(O_{\mathbf {\theta }}\varvec{\alpha }\).
The \(2M \times 2M\) symplectic covariance matrix \(\varGamma _0\) of the squeezed state \({\hat{S}}_1(r )\vert \mathrm {vac}\rangle \) reads
where \({\mathcal {R}}\) is the \(M\times M\) diagonal matrix \({\mathcal {R}} = \mathrm {diag}(r,0,\dots ,0)\). Once again, the action of the linear network \(U_{\varphi }\) is represented by the orthogonal and symplectic matrix R in Eq. (17), so that the covariance matrix of the probe at the output is
where, by direct calculation,
The covariance matrix \(\varSigma =\varSigma _x\) at the detection stage is then obtained by extracting the first M rows and columns from the matrix \(O_{\mathbf {\theta }} \varGamma O^\mathrm {T}_{\mathbf {\theta }}\), and thus its elements as shown in (4) can be easily obtained.
To evaluate the determinant \(\left|\varSigma \right|\), we first notice that \(\varGamma _0\equiv {\mathbb {I}}/2 + K_0\), where \({\mathbb {I}}\) is the identity matrix and \(K_0 = \varGamma _0 - {\mathbb {I}}/2\) is a diagonal matrix of rank 2. Being the rank invariant under orthogonal rotations, the same holds true for \(O_{\mathbf {\theta }} \varGamma O^\mathrm {T}_{\mathbf {\theta }} \equiv {\mathbb {I}}/2 + K\), with \(\mathrm{rank}(K) = 2\). By definition of rank, none of the sub-matrices of K can have rank greater than 2; hence, we can write
with \(\mathrm{rank}(A) \le 2\), being A a sub-matrix of K. We can then apply the result presented in “Appendix E” to \(\varSigma \) and write \(\left|\varSigma \right|\) as a sum of determinants of the matrices obtained replacing any number of columns of \({\mathbb {I}}/2\), with the respective columns of A
where the first term is the determinant of \({\mathbb {I}}/2\), the terms in the first summation are the contributions that arise substituting the ith column of \({\mathbb {I}}/2\) with the ith column of A, and the terms in the last summations from substituting the ith and jth columns, and we also exploited the symmetry of A. Noticeably, since \(\mathrm{rank}{A}\le 2\), all the contributions involving the replacement of three or more columns of A are vanishing. By direct calculation, the expression in (5) can then be easily obtained.
Appendix B: Fisher information
In this Appendix we will obtain the expression for the Fisher information shown in Eq. (9), and the expression for the cofactor matrix in Eq. (10).
By plugging the probability density function in Eq. (3) into the definition of the Fisher information (8), one can easily obtain
where we used the matrix identity \(\partial _\varphi \varSigma ^{-1}=-\varSigma ^{-1}(\partial _\varphi \varSigma )\varSigma ^{-1}\). We now can express the inverse of the covariance matrix in terms of its cofactor matrix C and its determinant, namely \(\varSigma ^{-1} = C/\left|\varSigma \right|\), where the symmetry of the covariance matrix allows us to consider directly the cofactor matrix, and not its transpose. The second term reads
We recognise in the first term of Eq. (27) Jacobi’s formula for the derivative of the determinant, \(\partial _\varphi \left|\varSigma \right| = \mathrm{Tr}[C \partial _\varphi \varSigma ]\), which allows us to obtain the expression shown in Eq. (9)
In order to explicit the cofactor matrix C in terms of the elements of \(\varSigma \), and thus in terms of squeezing parameter r, transition probabilities \(P_i\), and relative phases acquired \(\gamma _i\), \(i=1,\dots , M\), as displayed in (10), we first need to make some observations. First, the (s, t)-cofactor \(C_{st}\), which is defined as the determinant of the \(L-1\times L-1\) sub-matrix of \(\varSigma \) obtained deleting the sth row and tth column, then multiplied by \((-1)^{s+t}\), can also be thought as the determinant of the \(L\times L\) matrix \(\varSigma ^{[s,t],1}\), where we denote with \(X^{[s,t],n}\) the matrix obtained from the matrix X replacing all the elements in the the sth row and in the pth column with zeros, except the element (s, t) which is replaced by n, namely
Second, as discussed in “Appendix A”, \(\varSigma = {\mathbb {I}}/2 + A\), where A is a symmetric matrix with \(\mathrm{rank}(A) = \rho \le 2\). Thus, we can write the (s, t)-cofactor as \(C_{st}=\left|({\mathbb {I}}/2)^{[s,t],0} + A^{[s,t],1}\right|\), and evaluate this determinant as a sum of determinants of matrices obtained swapping columns of \(({\mathbb {I}}/2)^{[s,t],0}\) and \(A^{[s,t],1}\), as discussed in detail in “Appendix E”. Noticeably, by replacing a row and a column of A may increase its rank by one, so that \(\mathrm{rank}(A^{[s,t],1})\le 3\). It is convenient now to consider separately the simpler case \(s=t\) first, and then \(s\ne t\).
We notice that the matrix \(({\mathbb {I}}/2)^{[s,s],0}\) has a single zero eigenvalue, and thus each contribution to \(C_{ss}\) is non-vanishing only if the sth columns of \(({\mathbb {I}}/2)^{[s,s],0}\) is replaced. We thus obtain
which is the sum of terms obtained substituting the sth, the sth and ith, and the sth, ith and jth columns, respectively. Noticeably, replacing more than 3 columns yields vanishing contributions, since \(\mathrm{rank}(A^{[s,t],1})\le 3\). When \(s\ne t\), \(({\mathbb {I}}/2)^{[s,t],0}\) has two null eigenvalues; hence, all the non-vanishing contributions must replace the sth and tth columns. For example, the only contribution obtained swapping the sth and tth columns is of the type
where we also exploited the symmetry of A; the contribution obtained swapping the sth, tth and ith columns, with \(i\ne s,t\) are of the type
where once again, we exploited the symmetry of A. Replacing more than 3 columns once again yields no contributions since \(\mathrm{rank}(A^{[s,t],1})\le 3\). The final expression for \(C_{st}\), \(s\ne t\) thus reads
Replacing in (30) and (33) the definition of \(A = \varSigma - {\mathbb {I}}/2\), it is straightforward to obtain Eqs. (10).
Appendix C: Asymptotics
In this appendix we will study the asymptotic regime of the Fisher information (9),
for large \(N = N_\mathrm {S}+ N_\mathrm {D}= \sinh ^2 r + d^2\). We will show that the only conditions needed to reach Heisenberg scaling are the ones shown in (12), and that in this case the asymptotic expression for the Fisher information is (14).
First, from the explicit expressions of \(\varSigma \) in (4) and C in (10), we notice that each of their matrix elements is at most of order of \(N_\mathrm {S}\), since \(\sinh r = \sqrt{N_\mathrm {S}}\), the single-photon probabilities \(P_i\) are independent of \(N_\mathrm {S}\), and the cosine and sine functions are limited.
The same holds true for their derivatives: in particular \(\partial _\varphi P_i\) and \(\partial _\varphi \gamma _i = \partial _\varphi {\bar{\gamma }}_i\) do not depend on \(N_\mathrm {S}\), where \(\gamma _i = {\bar{\gamma }}_i-\theta _i\) is the phase acquired by the signal through the ith output port relatively to the local oscillators reference phases. Moreover, similar considerations can be applied for \(\varvec{\mu } = O(\sqrt{N_\mathrm {D}})\), and specifically for its derivative \(\partial _\varphi \varvec{\mu }\). This implies that, in order to reach a Heisenberg-scaling sensitivity, namely a scaling of order of \(N^2\) in the Fisher information, the determinant \(\left|\varSigma \right|\) cannot be of any order higher than \(N^0\).
We thus first focus our attention on \(\left|\varSigma \right|\)
shown in Eq. (5). In particular, we will suppose that, for large N, \(\gamma _i\) tend to finite values \(\gamma _{0i}\), namely that \(\gamma _i = \gamma _{0i} + k_i N^{-\alpha }\), with \(k_i\in {\mathbb {R}}\) of order 1 and \(\alpha > 0\), since if \(\gamma _i\) were to grow with N (i.e. \(\alpha <0\)), it would give rise to an oscillating asymptotic behaviour to \(\left|\varSigma \right|\).
By expanding the squeezing parameter r in powers of \(N_\mathrm {S}\) in (35), we obtain
where
In order to cancel the scaling with \(N_\mathrm {S}\), \(D_1\) must be equal to, or tend to, zero.
After some trigonometry, and exploiting the passivity of the linear network \(U_{\varphi }\) which sets \(\sum _i P_i = 1\), we can rewrite
which tends to zero iff \(\gamma _{0i} = \pi /2 + n\pi \), with \(n\in {\mathbb {Z}}\).
In particular, the scaling of \(\left|\varSigma \right|\) for large \(N_\mathrm {S}\) will be of order \(N_\mathrm {S}^0\) or lower only if \(\gamma _i = \pi /2 + n\pi + k_i N_\mathrm {S}^{-\alpha }\), with \(\alpha \ge 1/2\). To see that, we notice that, for \(\gamma _{0i}=\pi /2\), \(D_1\) and \(D_2\) scale with \(N_\mathrm {S}^{-2\alpha }\), while \(D_3\) scale with \(N_\mathrm {S}^0\). Thus, \(\left|\varSigma \right|\) scales with \(N_\mathrm {S}^{1-2\alpha }\) for \(\alpha \le 1\) (and in particular with \(N_\mathrm {S}^0\) for \(\alpha =1/2\)), and with \(N_\mathrm {S}^{-1}\) for \(\alpha > 1\). Noticeably, also \(D_2\) tends to zero iff \(\gamma _{0i} = \pi /2 + n\pi \).
We now study the asymptotics of the numerators appearing in the Fisher information, when the condition \(\gamma _i = \pi /2 + k_i N_\mathrm {S}^{-\alpha }\), with \(\alpha \ge 1/2\), is true. We first obtain the derivative of \(\varSigma \) from Eq. (4), and substitute \(\gamma _i = \pi /2 + k_i/N_\mathrm {S}^\alpha \), with \(\alpha \ge 1/2\),
and we notice that it scales at most with \(N^{1-\alpha }\) for \(1/2 \le \alpha < 1\) and at most with \(N^0\) for \(\alpha \ge 1\). We then analyse the auxiliary term (11)
of which we evaluate the derivative when \(\gamma _i = \pi /2 + k_i/N_\mathrm {S}^\alpha \), with \(\alpha \ge 1/2\), \(i=1,\dots ,M\)
which scales at most with \(N_\mathrm {S}^{1-\alpha }\) for large \(N_\mathrm {S}\).
Moreover, the covariance matrix \(\varSigma \) in (4) asymptotically reads
and thus, inserting (42) and (44) in the cofactor matrix (10), we obtain
Finally, we can write
It is easy to see that \(\partial _\varphi \left|\varSigma \right|\) scales at most with \(N_\mathrm {S}^{1-\alpha }\), since \(\sum _i^M \partial _\varphi P_i = 0\), while the derivatives of the elements of C scale at most with \(N_\mathrm {S}^{1-\alpha }\) for \(1/2\le \alpha <1\), and at most with \(N_\mathrm {S}^0\) for \(\alpha \ge 1\).
Our last step is to evaluate the asymptotics for \(\partial _\varphi \varvec{\mu }\), which can be easily evaluated differentiating equation (6)
Now we can finally draw our conclusions and obtain the scaling of the Fisher information (9) by putting together all the asymptotic regimes found. First, we notice that, independently of \(\alpha \ge 1/2\), the last term in Eq. (9) always scales with \(N_\mathrm {S}\), hence only reaching shot-noise precision. The only terms which allow the Heisenberg scaling are then the first two: in fact, the first term scales with \(N_\mathrm {D}N_\mathrm {S}^{2\alpha -1}\) for \(1/2\le \alpha \le 1\), and with \(N_\mathrm {D}N_\mathrm {S}\) for \(\alpha > 1\), and thus reaching Heisenberg scaling for \(\alpha \ge 1\) (condition \(\alpha >1\) includes the case \(\gamma _i=\pi /2\)), while the second term reaches sub-shot-noise scaling \(N_\mathrm {S}^{2-2\left|\alpha -1\right|}\) for \(1/2< \alpha < 3/2\), with Heisenberg scaling for \(\alpha =1\).
This shows that the only condition to reach Heisenberg scaling is that
for \(i=1,\dots ,M\), as shown in (12), or equivalently that asymptotically \(\gamma _i \simeq \pi /2 + k_i/N_\mathrm {S}\) with \(k_i\in {\mathbb {R}}\) of order 1, as well as that the only relevant terms under this condition are the first two in expression (9), and that only for \(\alpha = 1\) the first term is non-vanishing.
We can now finally prove (14). Substituting condition (50) inside Eqs. (41) and (43), we obtain from (46) the asymptotics
where we once again exploited the passivity of the interferometer, so that \(\sum _{i=1}^M P_i = 1\), while we obtain from (36)
which is the expression shown in (13). Lastly, from Eq. (49) when conditions (50) hold, we obtain the asymptotics
Eqs. (51), (52) and (53) yield the asymptotic expression for the Fisher information shown in (14).
Appendix D: Maximum-likelihood estimator
We will now obtain the implicit equation that defines the maximum-likelihood estimator \({\tilde{\varphi }}_\mathrm {MLE}\), which saturates the Heisenberg-scaling Cramér–Rao bound (7), with the Fisher information given in Eq. (14), in the asymptotic regime of large \(\nu \).
Let us then imagine that, after \(\nu \) iterations, we collect \(\nu \) sets of outcomes \(\varvec{x}_i\), \(i=1,\dots ,\nu \) from the M homodyne measurements at the output of the linear network \({\hat{U}}_{\varphi }\). The likelihood that these outcomes are observed for a given value \(\varphi \) of the unknown parameter is given by
where \(p( \varvec{x}_j | \varphi )\) is the probability density function given in (3). The maximum-Likelihood estimator \({\tilde{\varphi }}\equiv {\tilde{\varphi }}(\varvec{x}_1,\dots ,\varvec{x}_\nu )\) is defined as the value of \(\varphi \) which maximises the likelihood \({\mathcal {L}}\) that the outcomes \(\varvec{x}_1,\dots ,\varvec{x}_\nu \) are observed, and it is usually found by maximising the log-likelihood function
where we exploited Jacobi’s formula for the derivative of the determinant of a matrix, the identity \(\mathrm{Tr}(\varSigma ^{-1}\partial _\varphi \varSigma )= -\mathrm{Tr}(\partial _\varphi \varSigma ^{-1} \varSigma )\), and the symmetry of \(\varSigma \). Once inserting the expressions for the covariance matrix \(\varSigma \) from (4) and for the mean vector \(\varvec{\mu }\) from (6) into the previous equation, one is able to obtain with numerical methods the maximum-likelihood estimator as the \(\varphi \) that solves (55).
Equation (55) largely simplifies when the probe is a squeezed vacuum state—i.e. \(\varvec{\mu }=0\)—or when the maximally squeezed quadratures are measured—i.e. \(\partial _\varphi \left|\varSigma \right| = 0\). In the first case, it becomes
where it is possible to recognise the usual mean squared error estimator \({\tilde{\varSigma }} = \sum _{j=1}^\nu {\varvec{x}}_j{\varvec{x}}_j^\mathrm {T}/\nu \) for the variance matrix \(\varSigma \), so that the solution of Eq. (56) can be seen as the value of \(\varphi \) that sets equal to zero a weighted mean of the elements of covariance matrix estimator. In the latter case, Eq. (55) becomes
which can be seen as a weighted mean of the estimators \(\tilde{\varvec{\mu }}=\sum _{j=1}^\nu \varvec{x}_j /\nu \) of the mean \(\varvec{\mu }\).
Appendix E: Useful formulas for the determinant of a sum of two matrices
Let us consider an \(L\times L\) matrix Z which can be written as \(Z = D + W\), where \(D = \mathrm {diag}(d_1,\dots ,d_L)\) is a real diagonal matrix, and \(\mathrm{rank}(W) = \rho \le L\). In this appendix, we will show a way to write the determinant \(\left|Z\right|\) in terms of the elements of W, which is convenient for our purposes.
We exploit the identity [42]
where \(\varTheta _\alpha (X,Y)\) is the sum of the determinants of the matrices obtained by replacing any set of \(\alpha \) columns (rows) of X with the corresponding \(\alpha \) columns (rows) of Y. Since \(\mathrm{rank}(W) = \rho \), the rank of any \(\alpha \times \alpha \) sub-matrix of W is zero if \(\alpha >\rho \), so we can write
Let us make explicit the first, easier terms of the summation with the purpose to grasp the gist of this expression. For \(\alpha = 0\), no columns are replaced from D, hence \(\varTheta _0(D,W) = \left|D\right| = \prod _k d_k\). We notice that, if at least one of the eigenvalues of D is zero, this term vanishes. For \(\alpha = 1\), \(\varTheta _1(D,W)\) is the sum of determinants of matrices of the form
with \(i=1,\dots ,L\). Due to the structure of this matrices, their determinant is straightforward and reduces to \(W_{ii}\times \prod _{k\ne i} d_k\). This means that, in general, \(\varTheta _1(D,W)=\sum _i W_{ii}\times \prod _{k\ne i} d_k\). A key observation to make is that if two or more eigenvalues of D are zero, all these determinants are zero, and thus \(\varTheta _1(D,W)=0\); if instead a single eigenvalue is zero, say \(d_j=0\), with \(j=1,\dots ,L\), then only one of these determinants is non-vanishing, i.e. the determinant of the matrix obtained by replacing the jth column of D. In this case then we have \(\varTheta _1(D,W)=W_{jj}\times \prod _{k\ne j} d_k\).
Let us now consider lastly the case \(\alpha = 2\). The matrices whose determinants contribute to \(\varTheta _2(D,W)\) are of the form
with \(i<i'=1,\dots ,L\). Once again, the determinants of these type of matrices are easy to be evaluated and read \(|W^{(i,i')}|\,\prod _{k\ne i,i'}d_k\), where
We notice that \(|W^{(i,i')}| = |W^{(i',i)}|\). Thus, in general, \(\varTheta _2(D,W) = \sum _i\sum _{j>1} |W^{(i,i')}|\,\prod _{k\ne i,i'}d_k\). Once again, key observations can be made: if D has at least three null eigenvalues, then \(\varTheta _2(D,W)\) is vanishing. If only two eigenvalues are zero, e.g. \(d_j=d_{j'}=0\), then there is only one contribution to \(\varTheta _2(D,W)\), given by the matrix obtained substituting the jth and \(j'\)th columns of D, and in this case we have \(\varTheta _2(D,W) = |W^{(j,j')}|\,\prod _{k\ne j,j'}d_k\). If only one eigenvalue is zero, namely \(d_j=0\), then \(\theta _2(D,W)\) is given by the sum of all the determinants of the matrices where the jth column has been replaced, namely \(\varTheta _2(D,W) = \sum _{i\ne j}|W^{(i,j)}|\,\prod _{k\ne i,j}d_k\).
Similarly, it is possible to extend these considerations to every value of \(\alpha \) and finally obtain the compact form
where \({\mathcal {C}}^L_\alpha \) is the set of all the combinations of \(\alpha \) items in a set of L, and \(W^{(\gamma )}\) denotes the \(\alpha \times \alpha \) sub-matrix of W obtained by selecting the rows and columns with indices \(\gamma _1,\dots ,\gamma _\alpha \).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Triggiani, D., Facchi, P. & Tamma, V. Non-adaptive Heisenberg-limited metrology with multi-channel homodyne measurements. Eur. Phys. J. Plus 137, 125 (2022). https://doi.org/10.1140/epjp/s13360-021-02337-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjp/s13360-021-02337-4