Automated multidimensional single molecule fluorescence microscopy feature detection and tracking

Original Paper, European Biophysics Journal

Abstract

Characterisation of multi-protein interactions in cellular networks can be achieved by optical microscopy using multidimensional single molecule fluorescence imaging. Proteins of different species, individually labelled with a single fluorophore, can be imaged as isolated spots (features) of light of different colours in different channels, and their diffusive behaviour in cells directly measured through time. Challenges in data analysis have, however, thus far hindered its application in biology. A set of methods for the automated analysis of multidimensional single molecule microscopy data from cells is presented, incorporating Bayesian segmentation-based feature detection, image registration and particle tracking. Single molecules of different colours can be simultaneously detected in noisy, high background data with an arbitrary number of channels, acquired simultaneously or time-multiplexed, and then tracked through time. The resulting traces can be further analysed, for example to detect intensity steps, count discrete intensity levels, measure fluorescence resonance energy transfer (FRET) or changes in polarisation. Examples are shown illustrating the use of the algorithms in investigations of the epidermal growth factor receptor (EGFR) signalling network, a key target for cancer therapeutics, and with simulated data.




Acknowledgments

The authors gratefully acknowledge the support of the Biotechnology and Biological Sciences Research Council through grant BB/G006911/1.

Author information

Corresponding author

Correspondence to Marisa L. Martin-Fernandez.

Appendices

Appendix 1: Feature detection

Each feature is assumed to have the same shape, or profile, t(x, y), which is normalised so that \(\int_{-\infty}^{\infty} \int_{-\infty}^{\infty} t(x, y) \hbox{d}x \hbox{d}y=1.\) For sources of negligible size t(x, y) will correspond to the point spread function of the microscope at the detector. The signal (excluding noise contributions) in the image at position (x, y) due to a feature at position (x = X, y = Y) with total integrated intensity I is then

$$ s(x, y)=It(x-X, y-Y). $$

Now consider a pixelised image with pixel centres at \(x=x_i\;(i=0, 1, 2,\ldots,N_x-1)\) and \(y=y_j\;(j=0, 1, 2,\ldots,N_y-1)\). The implementation described in this paper uses \(x_i = i + 0.5\), \(y_j = j + 0.5\), i.e. x and y are measured in units of pixel size with pixel centres at half-integer values. The signal in image pixel (i, j) due to a feature at (X, Y) is then

$$ s_{ij}=I\int\limits_{x=i}^{i+1}\int\limits_{y=j}^{j+1}t(x-X, y-Y) \hbox{d}x \hbox{d}y=It_{ij}(X, Y), $$

where the profile has been integrated over the pixel area. \(s_{ij}\) is a component of a vector s of size \(N_x N_y\), which represents the signal in all pixels. Similarly, vector t(X, Y) has components \(t_{ij}\), and corresponds to the vector t in Hobson et al. (2009), except that in Hobson et al. (2009) t is normalised to unit peak; for the definition here the amplitude A simply becomes the intensity I. Note that this integration of the profile over each pixel area correctly models the pixelation of the image, avoiding the “pixelation noise” effect discussed in Thompson et al. (2002).

If t(x, y) is a Gaussian profile with standard deviation \(\sigma_t\),

$$ t(x-X, y-Y)=\frac{1}{2\pi\sigma_t^2}\exp\left(-\frac{(x-X)^2+(y-Y)^2}{2\sigma_t^2}\right) $$

and

$$ \begin{aligned} t_{ij}(X, Y) & = \frac{1}{4} \left(\hbox{erf}{\left({\frac{i+1-X}{\sigma_t\sqrt{2}}}\right)} -\hbox{erf}{\left({\frac{i-X}{\sigma_t\sqrt{2}}}\right)}\right)\\ & \quad \times \left(\hbox{erf}{\left({\frac{j+1-Y}{\sigma_t\sqrt{2}}}\right)} -\hbox{erf}{\left({\frac{j-Y}{\sigma_t\sqrt{2}}}\right)}\right). \end{aligned} $$
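For concreteness, \(t_{ij}(X, Y)\) can be computed directly from this expression. Below is a minimal sketch in Python, assuming NumPy and SciPy are available; the function name and example values are illustrative, not from the published code.

```python
import numpy as np
from scipy.special import erf

def pixel_integrated_gaussian(nx, ny, X, Y, sigma_t):
    """Return t_ij(X, Y): an (ny, nx) array of the Gaussian profile
    integrated over each unit pixel (pixel i spans x in [i, i+1])."""
    i = np.arange(nx)
    j = np.arange(ny)
    s = sigma_t * np.sqrt(2.0)
    # The 2-D profile is separable, so integrate in x and y independently.
    fx = 0.5 * (erf((i + 1 - X) / s) - erf((i - X) / s))
    fy = 0.5 * (erf((j + 1 - Y) / s) - erf((j - Y) / s))
    return np.outer(fy, fx)  # sums to ~1 for a feature well inside the image

# Example: signal s_ij from a feature of intensity I = 500 at (X, Y) = (12.3, 7.8)
s = 500.0 * pixel_integrated_gaussian(25, 15, 12.3, 7.8, sigma_t=1.3)
```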

Bayes’s theorem relates the posterior probability, Pr(M|d, K), that a particular hypothesis (or model), M, is true given some data, d, and background knowledge, K, to the probability, Pr(d|M, K), that the data would have been measured if the hypothesis were true, given the same background information. Pr(d|M, K) is called the likelihood, and is generally much easier to specify directly than the posterior. Bayes’s theorem states that

$$ {\hbox{Pr}}(M|d,K)=\frac{{\hbox{Pr}}(d|M,K){\hbox{Pr}}(M|K)}{{\hbox{Pr}}({d|K})} $$

The hypothesis M may represent a particular model proposed to explain a system, or a set of parameters for such a model, the data set d some observations or measurements of the system, and the background knowledge K is any information about the system known prior to making the observations in question, for example known constraints on particular properties of the system. Pr(M|K) is the prior probability for the hypothesis, encoding the assumed probability of the hypothesis based on background information/bias only, before making the observations. If M represents a set of parameters, then the prior probability would be set to 0 for parameter values known to be impossible. Pr(d|K) is commonly called the evidence. For parameter estimation, where M represents a set of parameters of an assumed model, and the most probable range of parameter values consistent with the data is sought, the evidence is just a normalising factor and can be ignored. In the case of model selection, however, the evidence is crucial.

We wish to compare the posterior probabilities of two hypotheses at each point in the image: \(H_1\), that a feature with some intensity I sits there on a background \(B_1\), and \(H_0\), that there is background \(B_0\) only. The ratio of the posterior probabilities, ρ, is given by

$$ \rho=\frac{{\hbox{Pr}}(H_1|{\bf d})}{{\hbox{Pr}}(H_0|{\bf d})}, $$

where d is the image data (Hobson et al. 2009). From Bayes’s theorem this is

$$ \rho=\frac{{\hbox{Pr}}({\bf d}|H_1){\hbox{Pr}}(H_1)}{{\hbox{Pr}}({\bf d}|H_0){\hbox{Pr}}(H_0)}, $$

where \(\hbox{Pr}(H_1)\) and \(\hbox{Pr}(H_0)\) are the prior probabilities of the two hypotheses, and \(\hbox{Pr}({\bf d}|H_1)\) and \(\hbox{Pr}({\bf d}|H_0)\) are the evidences for models \(H_1\) and \(H_0\) respectively. The evidences are calculated from the likelihoods and prior probabilities over all the possible parameters for each model,

$$ {\hbox{Pr}}({\bf d}|H_1)=\int\int{\hbox{Pr}}({\bf d}|I,B_1,H_1) {\hbox{Pr}}(I,B_1|H_1) \hbox{d}I \hbox{d}B_1 $$

and

$$ {\hbox{Pr}}({\bf d}|H_0)=\int{\hbox{Pr}}({\bf d}|B_0,H_0){\hbox{Pr}}(B_0|H_0) \hbox{d}B_0. $$

Writing \(\rho_0=\frac{{\hbox{Pr}}(H_1)}{{\hbox{Pr}}({H_0})}=\left\langle n \right\rangle\) (where \(\left\langle n \right\rangle\) is the expected number of features per pixel in the image; Hobson et al. (2009)) gives the evidence ratio

$$ R=\frac{\rho}{\rho_0}=\frac{{\hbox{Pr}}({\bf d}|H_1)}{{\hbox{Pr}}({\bf d}|H_0)}. $$

R is calculated for each pixel in the image with the feature profile for model \(H_1\) centred at the pixel centre. Pixels in this evidence map with \(R>R_{\rm min}\) (\(R_{\rm min}=1/\left\langle n \right\rangle\)) can be identified as features, i.e. the locations of the fluorophores, since in these pixels model \(H_1\) is more probable than model \(H_0\). In practice the threshold on R, \(R_{\rm min}\), can be lowered or raised to increase or decrease the feature detection rate (and with it the false discovery rate). Since the expected number of single point emitting sources is not necessarily easy to determine, and there is some freedom in choosing the priors on I and B (e.g. changing \(I_{\rm max}\) and \(B_{\rm max}\), see below), which also changes the evidence, such tuning of the threshold is generally necessary.
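As an illustration of how R can be evaluated, the sketch below computes the evidence ratio for a single image patch by brute-force integration over the uniform priors, assuming independent Gaussian pixel noise of known standard deviation; the likelihood actually used in the paper follows Hobson et al. (2009), and the grid sizes and prior limits here are arbitrary example choices.

```python
import numpy as np

def log_likelihood(d, model, sigma_n):
    # ln Pr(d|model) up to a constant, which cancels in the ratio R
    return -0.5 * np.sum((d - model) ** 2) / sigma_n ** 2

def evidence_ratio(d, t, sigma_n, I_max=2000.0, B_max=20.0, n_grid=64):
    """R = Pr(d|H1)/Pr(d|H0) for an image patch d, with the feature
    profile t (same shape as d) centred on the candidate pixel."""
    I_grid = np.linspace(0.0, I_max, n_grid)
    B_grid = np.linspace(0.0, B_max, n_grid)
    dI, dB = I_grid[1] - I_grid[0], B_grid[1] - B_grid[0]
    # ln Pr(d|I, B1, H1) over the (I, B) grid
    ll1 = np.array([[log_likelihood(d, I * t + B, sigma_n)
                     for B in B_grid] for I in I_grid])
    # ln Pr(d|B0, H0) over the B grid
    ll0 = np.array([log_likelihood(d, np.full_like(d, B), sigma_n)
                    for B in B_grid])
    m = max(ll1.max(), ll0.max())  # subtract the peak to avoid overflow
    ev1 = np.exp(ll1 - m).sum() * dI * dB / (I_max * B_max)  # uniform priors
    ev0 = np.exp(ll0 - m).sum() * dB / B_max
    return ev1 / ev0
```

In a full implementation the same integrals would be evaluated, far more efficiently, at every pixel to build the evidence map.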

Single evidence ratio peaks are identified by segmenting the evidence map above the threshold into local maximum regions using a watershed-like algorithm, as follows.

  1. Find the pixel with the highest value in the evidence map above the threshold, \(R_{\rm min}\), which has not already been identified as associated with a source. If there is no such pixel the segmentation is complete.

  2. Identify this pixel as the location of a new source. This becomes the current pixel and its value of R is called \(R_{\rm current}\).

  3. Grow the set of pixels that are associated with this source by adding any of the eight pixels directly or diagonally adjacent to the current pixel that satisfy \(R_{\rm min} < R \leq R_{\rm current}\). For each such added pixel repeat this step using that pixel as the new current pixel and its R as \(R_{\rm current}\), recursively continuing until no further pixels can be added.

  4. Go to step 1.

This method will find all peaks above \(R_{\rm min}\) and will correctly identify multiple peaks within single regions of the map above \(R_{\rm min}\). For each such segmented region, the pixel with the maximum value of R is identified as the location of the source. See Fig. 10 for an example, and the sketch below for an illustrative implementation.
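The following Python sketch implements the segmentation with the recursion of step 3 replaced by an explicit stack; the array and function names are assumptions of this sketch, not the published code.

```python
import numpy as np

def segment_evidence_map(R_map, R_min):
    """Return a list of (i, j) source locations, one per evidence peak."""
    labels = np.zeros(R_map.shape, dtype=int)  # 0 = not yet assigned
    sources = []
    neighbours = [(-1, -1), (-1, 0), (-1, 1), (0, -1),
                  (0, 1), (1, -1), (1, 0), (1, 1)]
    while True:
        # Step 1: highest unassigned pixel above the threshold
        masked = np.where((labels == 0) & (R_map > R_min), R_map, -np.inf)
        peak = np.unravel_index(np.argmax(masked), R_map.shape)
        if masked[peak] == -np.inf:
            break  # segmentation complete
        # Step 2: the peak is the location of a new source
        sources.append(peak)
        label = len(sources)
        labels[peak] = label
        stack = [peak]
        # Step 3: grow the region, never climbing above R_current
        while stack:
            ci, cj = stack.pop()
            R_current = R_map[ci, cj]
            for di, dj in neighbours:
                ni, nj = ci + di, cj + dj
                if (0 <= ni < R_map.shape[0] and 0 <= nj < R_map.shape[1]
                        and labels[ni, nj] == 0
                        and R_min < R_map[ni, nj] <= R_current):
                    labels[ni, nj] = label
                    stack.append((ni, nj))
        # Step 4: loop back to step 1
    return sources
```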

Fig. 10

Feature detection. Three panels showing the image data, the evidence map and the segmented evidence map. The segmented map shows the identification of two features within one region of the map above R min. The pixel with the highest value of R in each segmented region is the corresponding source location (identified by arrows 1 and 2)

Constraints on the feature intensity and background are imposed by choosing appropriate prior probability distributions. We assume priors \(\hbox{Pr}(I, B_1|H_1) = \pi_I(I)\,\pi_B(B_1)\) and \(\hbox{Pr}(B_0|H_0) = \pi_B(B_0)\), with \(\pi_I(I)\) and \(\pi_B(B)\) uniform for I and B in a finite range, i.e.

$$ \begin{aligned} \pi_I(I) & = \left\{ \begin{array}{ll} 1/I_{\rm max} & {\rm if} \ 0 \le I \le I_{\rm max} \\ 0 & \hbox{otherwise} \end{array} \right.\\ \pi_B(B) & = \left\{ \begin{array}{ll} 1/B_{\rm max} & {\rm if} \ 0 \le B \le B_{\rm max} \\ 0 & \hbox{otherwise} \end{array} \right. . \end{aligned} $$

For each detected feature location the most probable intensity and background level, \(\hat{I}\) and \(\hat{B}_1,\) can be determined by Bayesian parameter estimation (Hobson et al. 2009). Maps of \(\hat{I}\) can be efficiently calculated for each pixel at the same time as R, making estimates of the feature intensities available as soon as the feature locations have been identified.

The covariance matrix of \(P_1 = \hbox{Pr}(I, B|{\bf d}, H_1)\), \({\bf C} = (-{\bf H})^{-1}\), can be used to determine the root mean square uncertainties in \(\hat{I}\) and \(\hat{B}_1\), \(\sigma(\hat{I})=\sqrt{{\bf C}_{11}}\) and \(\sigma(\hat{B}_1)=\sqrt{{\bf C}_{22}}\), where H is the Hessian matrix

$$ {\bf H}=\left( \begin{array}{cc} \frac{\partial^2\ln{P_1}}{\partial I^2} & \frac{\partial^2\ln{P_1}}{\partial I \partial B} \\ \frac{\partial^2\ln{P_1}}{\partial B \partial I} & \frac{\partial^2\ln{P_1}}{\partial B^2} \end{array} \right), $$

evaluated at \((\hat{I}, \hat{B}_1)\). Note that these uncertainties do not account for any error in the feature FWHM or position, and as such are likely to underestimate the error to some extent.
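As a sketch of how these uncertainties might be obtained numerically, the snippet below approximates H by central finite differences of \(\ln{P_1}\) at the maximum and inverts \(-{\bf H}\); lnP1 is assumed to be a callable returning the log posterior, and the step size is an arbitrary example.

```python
import numpy as np

def fit_uncertainties(lnP1, I_hat, B_hat, h=1e-3):
    """Return (sigma(I_hat), sigma(B_hat)) from C = (-H)^(-1)."""
    f = lnP1
    # Central finite-difference second derivatives at (I_hat, B_hat)
    d2_II = (f(I_hat + h, B_hat) - 2 * f(I_hat, B_hat) + f(I_hat - h, B_hat)) / h**2
    d2_BB = (f(I_hat, B_hat + h) - 2 * f(I_hat, B_hat) + f(I_hat, B_hat - h)) / h**2
    d2_IB = (f(I_hat + h, B_hat + h) - f(I_hat + h, B_hat - h)
             - f(I_hat - h, B_hat + h) + f(I_hat - h, B_hat - h)) / (4 * h**2)
    H = np.array([[d2_II, d2_IB],
                  [d2_IB, d2_BB]])
    C = np.linalg.inv(-H)
    return np.sqrt(C[0, 0]), np.sqrt(C[1, 1])
```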

The feature detection was tested by simulating multiple data sets (as described in “Simulations” and “Appendix 3”), each with 100 frames of 100 constant intensity, non-moving features of FWHM 3 pixels, separated by at least 9 pixels. Data sets were simulated with feature intensities varying from 10 to 2,000 photons per frame (integrated over the profile), and with constant uniform background intensity of 0, 1, 5 or 10 photons per pixel per frame. Poisson noise, a linear gain of 200 and readout noise of standard deviation 5 were added. Each simulation was performed using a Gaussian profile and also using an Airy disc profile; in the latter case, the Airy profile contribution to a particular pixel was determined by integrating an oversampled Airy profile over the pixel. The feature detection and measurement were performed on each simulated data set, and the number of features correctly detected and the number of false detections (detections not corresponding to true features) were determined. Figure 11 shows the results as plots of the true-positive and false discovery rates as functions of photons in a feature and signal-to-noise at the peak of a feature. For a signal-to-noise in the peak pixel above about 1 the detection rate is better than 98% and the false discovery rate less than a few percent. There is little difference in these figures whether the model profile uses an Airy disc or a Gaussian profile, confirming that the Gaussian model in the analysis is justified for typical single molecule data. For the simulations with zero background emission but low signal-to-noise, the false discovery rate is a little higher with an Airy disc than with a Gaussian; this appears to be occasional false detection of features from emission in the Airy rings, due to the approximate image noise estimation and the Gaussian profile assumption. However, this should not affect single molecule data from cells, in which there is always some background intensity. There is no difference in the true positive rate (detection rate) between Airy and Gaussian profile simulations. This will be revisited when the full physical noise model has been implemented, to see if it affects this observation.

Fig. 11

Feature detection metrics measured from various simulated data sets (see text). The true-positive rate and false discovery rate as functions of the signal-to-noise at the peak of a feature (upper panel) and the number of photons in a feature (lower panel). The true-positive rate, TPR, is the fraction of true features that were detected, i.e. TPR = number of true features detected/total number of true features. The false discovery rate, FDR, is the fraction of feature detections that do not correspond to a true feature, FDR = number of false positives/total number of detections (true and false). The signal-to-noise, S/N, is defined here as S/N = model intensity at peak of feature/SD of intensity at peak

Appendix 2: Registration transformation

The function to transform a position \({\bf r}=\left( \begin{array}{c} x \\ y \end{array} \right)\) in the non-reference channel to the corresponding point \({\bf r}_{\rm ref}=\left( \begin{array}{c} x_{\rm ref} \\ y_{\rm ref} \end{array} \right)\) in the reference channel is

$$ \begin{aligned} {\bf r}_{\rm ref} & = {\bf c}_{\rm ref}+{\bf Rot}(-\theta){\bf Rot}(\theta_S){\bf S} \,{\bf Rot}(-\theta_S)\left({\bf r}-{\bf c}+{\bf D}\right) \\ & = {\bf A}({\bf r}, {\bf T}). \end{aligned} $$

\({\bf c}\) and \({\bf c}_{\rm ref}\) are the midpoints of the channel image region for the non-reference and reference channels respectively, and are adopted as sensible origins for the transformations. Rot(ϕ) is the matrix \(\left( \begin{array}{cc} \cos\phi & -\sin\phi\\ \sin\phi & \cos\phi \end{array} \right)\), which rotates a vector clockwise through angle ϕ. \({\bf D}=\left( \begin{array}{c} D_x \\ D_y \end{array} \right)\) is the translation to be applied, \({\bf S}=\left(\begin{array}{cc} S_1 & 0\\ 0 & S_2 \end{array} \right)\) specifies the orthogonal scaling factors, \(S_1\) and \(S_2\), \(\theta_S\) is the angle specifying the orientation of the scaling axes and θ is the angle of the rotation. The parameters to be determined are \({\bf T} = [D_x, D_y, S_1, S_2, \theta, \theta_S]\).
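This transformation translates directly into code. The following NumPy sketch is a transcription of the expression above; the function names are illustrative.

```python
import numpy as np

def rot(phi):
    """Rot(phi): matrix rotating a vector clockwise through angle phi."""
    return np.array([[np.cos(phi), -np.sin(phi)],
                     [np.sin(phi),  np.cos(phi)]])

def register(r, T, c, c_ref):
    """A(r, T): map a point r in the non-reference channel to the
    reference channel. T = [Dx, Dy, S1, S2, theta, theta_S]."""
    Dx, Dy, S1, S2, theta, theta_S = T
    S = np.diag([S1, S2])  # orthogonal scaling factors
    M = rot(-theta) @ rot(theta_S) @ S @ rot(-theta_S)
    return c_ref + M @ (r - c + np.array([Dx, Dy]))
```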

The Scott and Longuet-Higgins algorithm (Scott and Longuet-Higgins 1991) is used to match beads in each non-reference channel with their counterparts in the reference channel. If the positions of the \(N_{\rm bead,ref}\) beads identified in the reference channel are \({\bf r}_{{\rm bead,ref},i}\;(i=1\ldots N_{\rm bead,ref})\) and in the non-reference channel \({\bf r}_{{\rm bead},j}\;(j=1\ldots N_{\rm bead})\), a proximity matrix, G, is calculated with elements \(G_{ij}=\exp\left(-\frac{\left|{\bf r}_{{\rm bead,ref},i} -{\bf r}_{{\rm bead},j}\right|^2}{\sigma^2_{\rm slh}}\right)\), where \(\sigma_{\rm slh}\) controls the scale on which features in different channels may be associated and influences the effectiveness of the algorithm. A value \(\sigma_{\rm slh} = 10\) pixels was used. The algorithm exploits the properties of the singular value decomposition of G to produce a modified proximity matrix that can be used to identify which bead in the reference channel corresponds to which in the non-reference channel. For the small rotations (typically <1°) and scalings (typically <4%) found, and provided the translations are not too large (approximately <10 pixels), this algorithm works very well. In cases where the offset, D, is larger than about 10 pixels and the algorithm fails, the problem is easily overcome by trying offsets \({\bf r}_{\rm off}=\left(\begin{array}{c} 10p \hbox{ pix} \\ 10q \hbox{ pix} \end{array} \right)\) added to \({\bf r}_{{\rm bead},j}\), with integers p and q both in the range \(-3\ldots 3\), before calculating the proximity matrix.
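A sketch of the pairing step, assuming NumPy, is shown below. The modified proximity matrix is obtained by setting all singular values of G to one, and a pair is accepted when its entry dominates both its row and its column, following Scott and Longuet-Higgins (1991); the variable names are illustrative.

```python
import numpy as np

def match_beads(ref, pos, sigma_slh=10.0):
    """Return (i, j) pairs matching reference beads ref (N1, 2) to
    non-reference beads pos (N2, 2)."""
    # Proximity matrix G_ij from squared inter-bead separations
    d2 = ((ref[:, None, :] - pos[None, :, :]) ** 2).sum(axis=2)
    G = np.exp(-d2 / sigma_slh ** 2)
    # Replace the singular values by 1 to get the pairing matrix P
    U, _, Vt = np.linalg.svd(G, full_matrices=False)
    P = U @ Vt
    pairs = []
    for i in range(P.shape[0]):
        j = int(np.argmax(P[i]))
        if i == int(np.argmax(P[:, j])):  # mutual best association
            pairs.append((i, j))
    return pairs
```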

With corresponding beads in each channel identified, determination of the transformation parameters simply requires finding the transformation for which non-reference channel beads are best aligned with their reference channel counterparts; it is not necessary to consider all possible combinations of beads in one channel with beads in the other. If the corresponding pairs of the \(N_{\rm matched}\) matched beads have positions \({\bf r}_{{\rm bead},k}\) and \({\bf r}_{{\rm bead,ref},k}\;(k=1\ldots N_{\rm matched})\), define

$$ \Updelta r_{{\rm bead},k}({\bf T})=\left|{\bf A}({\bf r}_{{\rm bead},k}, {\bf T})-{\bf r}_{{\rm bead,ref},k}\right|, $$

the difference between the reference bead position and the position of the non-reference bead registered with the transformation T. One approach to determining the optimal transformation parameters would be to find the T that minimises the sum of the squared \(\Updelta r_{{\rm bead},k}({\bf T})\)s, i.e. to minimise \(\sum_{k=1}^{N_{\rm matched}} \Updelta r_{{\rm bead},k}({\bf T})^2\); however, this method would allow any mismatched beads to influence the determined transformation. Although the Scott and Longuet-Higgins method does a good job of matching beads, there are often a small number of mismatched beads (Fig. 1c). An alternative objective function, P(T), is therefore used, with

$$ P({\bf T})=\sum_{k=1}^{N_{\rm matched}} \exp\left(-\frac{\Updelta r_{{\rm bead},k}({\bf T})^2}{2\sigma_P^2}\right), $$

which must be maximised, where \(\sigma_P\) controls the sensitivity to mismatch. In this objective function any beads that are significantly mismatched compared to \(\sigma_P\) will produce small local peaks in P(T) away from the main maximum, so as long as the optimiser finds the main maximum these mismatched beads will not influence the final parameters. Two passes of optimisation are performed, first with \(\sigma_P = 10\) pixels, which broadens the maximum, ensuring the global maximum is identified approximately, then a second pass beginning at the maximum of P(T) from the first pass but with \(\sigma_P\) set to the standard deviation of the feature profiles, \(\sigma_t\). The second pass thus homes in on the peak more precisely, with P(T) approximating a correlation coefficient of the profile of each bead with its counterpart. Optimisation of P(T) is performed using the Downhill Simplex method (Press et al. 1992).
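The two-pass optimisation might be sketched as follows, using SciPy's Nelder–Mead implementation of the Downhill Simplex method; register() is the transformation sketch from earlier in this appendix, and the remaining names are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def neg_P(T, beads, beads_ref, c, c_ref, sigma_P):
    """-P(T): negated objective, since the optimiser minimises."""
    dr2 = np.array([np.sum((register(r, T, c, c_ref) - r_ref) ** 2)
                    for r, r_ref in zip(beads, beads_ref)])
    return -np.sum(np.exp(-dr2 / (2.0 * sigma_P ** 2)))

def fit_transformation(beads, beads_ref, c, c_ref, sigma_t):
    T0 = np.array([0.0, 0.0, 1.0, 1.0, 0.0, 0.0])  # identity transform
    # Pass 1: sigma_P = 10 pixels broadens the maximum
    res = minimize(neg_P, T0, args=(beads, beads_ref, c, c_ref, 10.0),
                   method='Nelder-Mead')
    # Pass 2: refine with sigma_P = sigma_t from the pass-1 optimum
    res = minimize(neg_P, res.x, args=(beads, beads_ref, c, c_ref, sigma_t),
                   method='Nelder-Mead')
    return res.x
```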

To assess the quality of the determined transformation solution, three scores are calculated. Bead positions are registered, and those that are within \(\sigma_t/2\) of their corresponding reference channel bead are identified as matched. The scores are

  • \(f_{\rm matched}\), the ratio of matched features to maximum possible matches, where the number of maximum possible matches is the sum of \(\min(N_{\rm bead}, N_{\rm bead,ref})\) over images of beads,

  • \(\Updelta r_{69}\) and \(\Updelta r_{95}\), the largest separation of the best matched 69%/95% of the matched beads.

\(f_{\rm matched} = 0.7\) is used as a threshold below which registration is assumed to have failed. \(\Updelta r_{69}\) and \(\Updelta r_{95}\) are useful diagnostics giving an idea of the typical agreement of registered features, but they combine the localisation uncertainty in the measurement of the beads with the error in the registration. As such they can be used as estimates of the upper limit of the registration error, with \(\Updelta r_{95}\) the more conservative estimate.
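A small sketch of these scores, assuming the separations of the matched beads after registration have been collected into an array dr, is:

```python
import numpy as np

def registration_scores(dr, n_possible):
    """Return f_matched, delta_r_69 and delta_r_95 for separations dr."""
    f_matched = dr.size / n_possible
    dr_sorted = np.sort(dr)
    dr69 = dr_sorted[int(np.ceil(0.69 * dr.size)) - 1]  # best-matched 69%
    dr95 = dr_sorted[int(np.ceil(0.95 * dr.size)) - 1]  # best-matched 95%
    return f_matched, dr69, dr95
```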

Appendix 3: Simulations

This appendix details the model used for creating simulated data sets that may then be used to test the single molecule analysis methods.

  • Registration transformations for each non-reference channel (as described in the “Registration transformation” section) are generated. The parameters \([D_x, D_y, S_1, S_2, \theta, \theta_S]\) for each channel are sampled from probability distributions that approximate the typical range of values seen in real data sets.

  • A number, \(N_{\rm tracks}\), of tracks is modelled.

    • Each track consists of a group of \(N_{\rm fluor}\) co-located fluorophores moving together and individually fluorescing/blinking/bleaching.

    • \(N_{\rm fluor}\) is sampled from a Poisson distribution of specified mean (currently 2), avoiding \(N_{\rm fluor} = 0\).

    • Each fluorophore is either off (not yet fluorescing), on (fluorescing), blinking or photo-bleached.

    • Fluorophores are individually switched on, may then blink, recover and photo-bleach all at times determined randomly according to rates specified for each event.

    • Each track has an initial position, speed and direction of motion chosen at random, the speed chosen from a specified normal distribution. At each time step the direction and speed are modified by applying changes sampled from normal distributions with specified standard deviations.

    • At each time point the track intensity is \(ev_{\rm fac} I_{\rm fluor}\) times the number of fluorescing fluorophores, where \(I_{\rm fluor}\) is the individual fluorophore intensity and \(ev_{\rm fac}=\exp(-d/d_{\rm ev})\). d is the depth of the fluorophores in the evanescent field, and is constant and randomly chosen for each track between 0 and twice the evanescent field depth, \(d_{\rm ev}\).

    • At present the intensity of fluorophores is the same in each channel. Models for different types of experiment will be implemented to improve on this, e.g. FRET and co-localisation experiments.

  • Single molecule images are then modelled at each time point, considering each fluorophore group as a Gaussian spot of common full width half maximum (FWHM). Real images contain background structures on a variety of spatial scales, for example auto-fluorescence, which will challenge feature detection algorithms. To simulate background emission with spatial structure on a range of spatial scales, a single image consisting of Perlin noise is generated and used as the background signal in all channels at every time point. This Perlin noise image consists of random spatial fluctuations in intensity, on spatial scales L ranging from the model feature profile FWHM to the image size, with amplitudes proportional to 1/L. This is not necessarily an accurate reflection of the background structure, but is a simple parametrised way to add challenging background emission that can be improved upon when the background properties are better characterised. The intensity incident on each pixel is then the sum of the contributions from fluorophores and the background structure, from which the number of photons in the pixel is Poisson sampled. The quantum efficiency of the detector is modelled by sampling from a binomial distribution (which, convolved with the initial Poisson distribution, gives another Poisson) and a linear gain is then applied, followed by the addition of Gaussian noise and an offset to model the bias and readout noise. This is a simplified model for a CCD response (an illustrative sketch is given after this list). Corresponding images are produced for each channel, but with the background image, fluorophore positions and FWHMs transformed according to the registration vectors before sampling onto the model CCD/pixel grid.

  • Bead images are also simulated, where bead samples are modelled as a random spatially uniform distribution of Gaussian point sources, sampled on the model detector using the same detector model as above.
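An illustrative version of the simplified CCD response described in the image-modelling step above might read as follows; the gain of 200 and readout noise of 5 match the simulations of Appendix 1, while the quantum efficiency and offset values are example assumptions of this sketch.

```python
import numpy as np

def ccd_response(incident, qe=0.9, gain=200.0, read_noise=5.0,
                 offset=100.0, rng=None):
    """incident: per-pixel intensity (photons) from fluorophores plus
    background. Returns simulated detector counts."""
    rng = rng or np.random.default_rng()
    photons = rng.poisson(incident)       # photon shot noise
    detected = rng.binomial(photons, qe)  # quantum efficiency (binomial
                                          # thinning of a Poisson is Poisson)
    counts = gain * detected              # linear gain
    # Bias (offset) and Gaussian readout noise
    return counts + rng.normal(offset, read_noise, size=counts.shape)
```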


Cite this article

Rolfe, D.J., McLachlan, C.I., Hirsch, M. et al. Automated multidimensional single molecule fluorescence microscopy feature detection and tracking. Eur Biophys J 40, 1167–1186 (2011). https://doi.org/10.1007/s00249-011-0747-7
