Identifying Weak Signals in Inhomogeneous Neuronal Images for Large-Scale Tracing of Sparsely Distributed Neurites

Li, Shiwei; Quan, Tingwei; Zhou, Hang; Yin, FangFang; Li, Anan; Fu, Ling; Luo, Qingming; Gong, Hui; Zeng, Shaoqun

doi:10.1007/s12021-018-9414-9

Original Article
Open access
Published: 11 January 2019

Volume 17, pages 497–514 (2019)

Neuroinformatics

Shiwei Li^1,2,
Tingwei Quan^1,2,3,
Hang Zhou^1,2,
FangFang Yin^1,2,
Anan Li^1,2,
Ling Fu^1,2,
Qingming Luo^1,2,
Hui Gong^1,2 &
Shaoqun Zeng^1,2

Abstract

Tracing neurites constitutes the core of neuronal morphology reconstruction, a key step toward neuronal circuit mapping. Modern optical-imaging techniques allow observation of nearly complete mouse neuron morphologies across brain regions or even the whole brain. However, highly automated reconstruction of neurons, i.e., reconstruction that needs only a few manual edits, requires discrimination of weak foreground points from the inhomogeneous background. We constructed an identification model, where empirical observations made from neuronal images were summarized into rules for designing feature vectors that classify foreground and background, and a support vector machine (SVM) was used to learn these feature vectors. We embedded this constructed SVM classifier into a previously developed tool, SparseTracer, to obtain SparseTracer-Learned Feature Vector (ST-LFV). ST-LFV can trace sparsely distributed neurites with weak signals (contrast-to-noise ratio < 1.5) against an inhomogeneous background in datasets imaged by widely used light-microscopy techniques like confocal microscopy and two-photon microscopy. For 12 sub-blocks extracted from different brain regions, the average recall and precision rates were 99% and 97%, respectively. These results indicated that ST-LFV is well suited for weak signal identification with varying image characteristics. We also applied ST-LFV to trace long-range neurites from images where neurites are sparsely distributed but their image intensities are weak in some cases. When tracing these long-range neurites, a single manual edit was required to obtain results equivalent to the ground truth, compared with the 20 manual edits required by SparseTracer. This improvement in the level of automatic reconstruction indicates that ST-LFV has the potential to rapidly reconstruct sparsely distributed neurons at the large scale.

Introduction

Structural and functional mapping of neuronal circuits is one of the central tasks in neuroanatomical studies (Mitra 2014; Osten and Margrie 2013). Mapping the neuronal circuit largely depends on reconstructing the morphologies of neurons (Parekh and Ascoli 2013; Donohue and Ascoli 2011; Meijering 2010; Svoboda 2011), which are usually considered the basic structural units of the circuit (Marx 2012). Neurites form the core of neuronal morphologies (Parekh and Ascoli 2015; Peng et al. 2015); hence, tracing neurites plays an important role in neuronal morphology reconstruction.

In recent years, a series of breakthroughs in molecular labeling (Feng et al. 2000; Jefferis and Livet 2012; Luo and Callaway 2008; Ugolini 2010) and optical imaging techniques (Chung and Deisseroth 2013; Gong et al. 2013; Gong et al. 2016; Osten and Margrie 2013; Ragan et al. 2012; Silvestri et al. 2012) have enabled the rapid collection of brain-wide neuronal images at submicron resolutions. These techniques have been used to map the neuronal circuit of mice (Osten and Margrie 2013; Fürth et al. 2018). However, automatic tracing methods are error-prone and may fail in neuronal structures with weak signal intensity against an inhomogeneous background. These structures hamper the accuracy of overall neuronal reconstruction. In addition, tracing the circuit from tens of thousands of images is laborious (Marx 2012; Zingg et al. 2014).

Generally, the above challenges originate from the optical imaging strategy and the complicated nature of neuronal morphologies. First, whole brain imaging is usually implemented at a relatively low spatial sampling rate to achieve a balance between the sampling rate and the imaging speed. Second, neurites with small radii (several hundred nanometers) contain few fluorescent molecules. These two facts contribute to the presence of neurites with low signal intensity after fluorescent imaging. Third, long-term imaging procedures and structural differences between brain regions result in an inhomogeneous background, which increases the difficulty of identifying weak signals.

Some key characteristics of neuronal images are illustrated in Fig. 1. Two sub-blocks were extracted from a whole-brain imaging dataset (Fig. 1a-c), and one sub-block (Fig. 1c) contains a neurite with a contrast-to-noise ratio (Song et al. 2004) as low as 2.09 (see Supplementary note). Furthermore, when computing the foreground and background profiles of a portion of two sub-blocks (Fig. 1d, e), the background intensity of sub-block b is even higher than the foreground intensity of sub-block c (Fig. 1f), indicating the existence of an inhomogeneous background.

Many methods exhibit great neurite tracing performance and demonstrate a good ability to identify neurites with weak signals, such as model-fitting (Zhao et al. 2011; Santamaria-Pang et al. 2015), open-snake (Wang et al. 2011; Cai et al. 2006; Luo et al. 2015; Xu and Prince 1998), graph-based (Peng et al. 2010; Turetken et al. 2011; Basu et al. 2013; Chothani et al. 2011; Yang et al. 2013), principal curve (Bas and Erdogmus 2011), voxel scooping (Rodriguez et al. 2009), multi-scale tracking (Choromanska et al. 2012; Frangi et al. 1988), and density filter (Radojevic and Meijering 2017) techniques. However, most of these methods require fine and frequent parameter tuning to trace neurites with weak signals, which may make it difficult to separate weak signals from an inhomogeneous background. Recent machine learning methods (Li et al. 2017; Chen et al. 2015; Megjhani et al. 2015; Gu et al. 2017; Hernandez-Herrera et al. 2016; Becker et al. 2013) can provide more accurate tracing results than traditional approaches. However, the largest reported volume size is limited to hundreds of megabytes and the corresponding computational cost ranges from several tens of minutes to several hours (Hernandez-Herrera et al. 2016; Li et al. 2017). This indicates that these methods may not be able to trace neurites rapidly in large-scale images without the help of GPU computing or distributed computing, for two primary reasons. First, machine learning methods consider many image features and result in a complicated framework, thereby requiring intensive computations. Second, some methods separate the procedures of identifying foreground voxels and tracing the voxels into neurite skeletons (Li et al. 2017; Chen et al. 2015; Megjhani et al. 2015; Gu et al. 2017; Hernandez-Herrera et al. 2016; Becker et al. 2013). These techniques attempt to identify as many foreground voxels as possible in order to recover the detailed shape of a neurite. Thus, larger images incur heavier computational costs.

In this study, we propose a method for identifying weak signals and embed this method into the neurite tracing pipeline. Our strategy closely links the identification and tracing procedures and requires only a few foreground voxels in the tracing process for identification. We observed many neuronal images and determined identification rules: the local background is smooth, the neurite has a strong anisotropic shape, and the difference in image intensities between the neurite and its local background makes them separable. These rules are applicable to several neuronal images but may not hold in all cases. Applying preprocessing techniques to the image may extend the application range of the rules. These rules can be summarized as a feature vector that distinguishes between foreground and background voxels. By training the feature vectors on foreground and background voxels, we obtained a classifier (Suykens and Vandewalle 1999; Cortes and Vapnik 1995) that was combined with our previous SparseTracer tool (Li et al. 2016) to give SparseTracer-Learned Feature Vector (ST-LFV). We have verified that ST-LFV accurately identifies weak signals from sparsely distributed neurites in light microscopic images and overcomes the identification difficulties caused by inhomogeneous backgrounds across different mouse brain regions. The observed rules used in ST-LFV can be applied to BigNeuron, DIADEM, and MOST datasets collected with various light microscopy techniques. In addition, the results demonstrated that ST-LFV significantly enhances the performance of SparseTracer in the large-scale tracing of sparsely distributed neurites.

Methods

The components of the proposed ST-LFV are outlined in this section. First, we describe the method used to extract the feature vectors, which display the differences between the foreground and background voxels. Second, we introduce a support vector machine (SVM) to the feature vector space to construct a classifier that can detect weak signals (Suykens and Vandewalle 1999; Cortes and Vapnik 1995). Third, we integrate this constructed classifier into our previous SparseTracer (Li et al. 2016) for better neurite tracing performance. We also discuss the parameter selection procedure for ST-LFV and validate the proposed mechanism.

Feature Extraction for Identifying Weak Signals

The extraction of representative image features is based on several assumptions about the images. Our assumptions are that the shape of a neurite can be described by a series of touching cylinders; that the background is locally smooth; and that, in a small local region, the foreground and background can be identified using a threshold value. For a given voxel, we extract its features using the image intensities of neighboring regions. This extraction includes three steps: i) set a series of threshold values for labeling the connected components of a given point; ii) generate the connected components using the threshold values; iii) use the generated components to construct the feature vector of this point. In the following, we describe how to extract features and explain why the extracted features are consistent with our assumptions.

Step i) Set a series of descending threshold values for labeling the neighboring regions of a point. For a given point p*, its corresponding threshold values are calculated by

$$ thr(m)=\begin{cases}\left(1-m{c}_1\right)s\left({p}^{\ast}\right), & \text{if } {c}_1s\left({p}^{\ast}\right)\ge {c}_2\\ s\left({p}^{\ast}\right)-m{c}_2, & \text{otherwise}\end{cases} $$

(1)

where p* denotes the 3D coordinates of a point. For simplicity, we also denote the point itself by p*. When the coordinate elements of p* are integers, p* is regarded as a voxel. s(p*) is the weighted average image intensity of p* and its neighboring voxels; c₁ and c₂ are two predetermined constants, c₁ = 0.025 and c₂ = 1.5; m is an integer ranging from 0 to 8; and thr(m) is a threshold value that decreases as m increases. s(p*) is calculated by

$$ s\left({p}^{\ast}\right)=\frac{\sum_{p\in T\left[{p}^{\ast}\right]}\exp\left(-\frac{1}{2}{\left\Vert p-{p}^{\ast}\right\Vert}_2^2\right)s(p)}{\sum_{p\in T\left[{p}^{\ast}\right]}\exp\left(-\frac{1}{2}{\left\Vert p-{p}^{\ast}\right\Vert}_2^2\right)} $$

(2)

where T is the voxel set that includes the voxel [p*] and its 6-voxel neighborhood; [·] represents the operation of rounding each coordinate of a point to its nearest integer; p has the same definition as p*; s(p) is the intensity value of voxel p; and $ {\left\Vert \cdot \right\Vert}_2^2 $ represents the squared 2-norm.

Note that the threshold value thr(m) is codetermined by the ratio 1-mc₁ and the weighted intensity s(p*) of the given voxel. To simplify the description, we regard the threshold value as a function of 1-mc₁, where m = 0, 1, …, 8, and call 1-mc₁ the invariable ratio. For small values of s(p*), the step c₁s(p*) between successive thresholds in the first case of Eq. (1) is small, so thr(m) decreases slowly as m increases. To prevent this, we set a lower bound (c₂ = 1.5) on the step between successive thresholds (second case of Eq. (1)).
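As an illustration of Eqs. (1) and (2), the following minimal Python sketch computes the weighted intensity and the nine descending thresholds. The function names, the nested-list image layout `image[z][y][x]`, and the skipping of neighbors outside the volume are assumptions made for this example and are not taken from the GTree implementation.

```python
import math

C1 = 0.025  # ratio step c1 (value given in the text)
C2 = 1.5    # minimum intensity step c2 (value given in the text)

def weighted_intensity(image, p):
    """Eq. (2): Gaussian-weighted mean over voxel [p] and its 6-neighborhood.

    `image` is assumed to be a nested list indexed as image[z][y][x].
    Neighbors that fall outside the volume are skipped (an assumption).
    """
    x0, y0, z0 = (int(round(c)) for c in p)
    offsets = [(0, 0, 0), (1, 0, 0), (-1, 0, 0),
               (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
    nz, ny, nx = len(image), len(image[0]), len(image[0][0])
    num = den = 0.0
    for dx, dy, dz in offsets:
        x, y, z = x0 + dx, y0 + dy, z0 + dz
        if 0 <= x < nx and 0 <= y < ny and 0 <= z < nz:
            d2 = (x - p[0]) ** 2 + (y - p[1]) ** 2 + (z - p[2]) ** 2
            w = math.exp(-0.5 * d2)
            num += w * image[z][y][x]
            den += w
    return num / den

def thresholds(s, c1=C1, c2=C2, count=9):
    """Eq. (1): thresholds thr(0), ..., thr(count - 1) for weighted intensity s."""
    if c1 * s >= c2:
        # bright point: thresholds are fixed ratios of s
        return [(1 - m * c1) * s for m in range(count)]
    # dim point: thresholds drop by the fixed step c2 instead
    return [s - m * c2 for m in range(count)]
```

For a point with s(p*) = 100 the ratio branch applies and the thresholds run from 100 down to 80; for s(p*) = 20 the fixed-step branch applies and they run from 20 down to 8.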

Step ii) Extract the connected components of a given point with the threshold values. For the given point p* and a threshold value thr(m), we use the region growing method to generate a connected component in which the voxels connect with each other and have image intensities greater than thr(m). The generated region is included in the pre-determined neighborhood N(p*) (19 × 19 × 19 voxels) of the given point p*. The steps for generating the connected component are described below.

(ii-a) Set the point p* (indicated by an arrow in Fig. 2a) as the initial seed, and search for its neighboring voxels according to

$$ {G}_1(m)=\left\{p\in {N}_1\subset N\left({p}^{\ast}\right)|s(p)> thr(m)\right\} $$

(3)

where N₁ is the 26-voxel neighborhood of point p*, whose 3D coordinates are (x, y, z), and [p*] denotes p* with each coordinate rounded to the nearest integer. The voxel [p*] and the searched voxels with image intensities greater than thr(m) are labeled. These labeled voxels form G₁(m).

(ii-b) In the unlabeled region of N(p*), search for the 26-voxel neighborhoods of every voxel in the set G₁(m), denoted by N₂. According to N₂ and the threshold thr(m), use Eq. (3) to generate G₂(m) and then label the resulting set of voxels.

(ii-c) Repeat step ii-b until no new voxel sets can be generated in N(p*). The labeled sets G₁(m), G₂(m), … form the connected components of p* with respect to the threshold thr(m), denoted by G(thr(m)).

Figures 2b, c illustrate how to obtain the connected regions of a foreground point and a background voxel, respectively, under three threshold values. The size of the connected region of a given point depends on the threshold value thr(m), which is codetermined by the ratio 1-mc₁ and the weighted average of the image intensities (Eqs. (1) & (2)). Thus, identical ratios, e.g., 1.0, 0.9, 0.8, do not indicate the same threshold values in the extraction of the connected regions.
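Steps (ii-a) to (ii-c) amount to a breadth-first region growing with 26-connectivity, confined to the neighborhood N(p*). The sketch below is one possible reading of that procedure; the function name, the nested-list image layout `image[z][y][x]`, and the clipping of the neighborhood at the volume border are assumptions for illustration.

```python
from collections import deque
from itertools import product

def grow_region(image, seed, thr, half=9):
    """Return the set of voxels connected to `seed` with intensity > thr.

    Growth uses 26-connectivity and stays inside the (2*half+1)^3 cube
    centered on the seed (19 x 19 x 19 for half=9), clipped to the volume.
    `seed` is an integer (x, y, z) tuple, i.e. the rounded point [p*].
    """
    nz, ny, nx = len(image), len(image[0]), len(image[0][0])
    sx, sy, sz = seed
    offsets = [o for o in product((-1, 0, 1), repeat=3) if o != (0, 0, 0)]
    labeled = {seed}  # the seed voxel itself is always labeled
    queue = deque([seed])
    while queue:
        x, y, z = queue.popleft()
        for dx, dy, dz in offsets:
            q = (x + dx, y + dy, z + dz)
            if q in labeled:
                continue
            qx, qy, qz = q
            in_volume = 0 <= qx < nx and 0 <= qy < ny and 0 <= qz < nz
            in_cube = (abs(qx - sx) <= half and abs(qy - sy) <= half
                       and abs(qz - sz) <= half)
            if in_volume and in_cube and image[qz][qy][qx] > thr:
                labeled.add(q)
                queue.append(q)
    return labeled
```

Each pass of the `while` loop plays the role of one G₁(m), G₂(m), … expansion; the loop ends when no unlabeled voxel above the threshold is adjacent to the labeled set.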

Step iii) Calculate the feature vector of a given point. For the given point p*, we can obtain nine connected regions with respect to the threshold value thr(m), m = 0, 1, ..., 8. We define the volume filling rate as the ratio of the connected component volume (number of voxels) and the neighborhood region volume, given by

$$ {r}_m=\frac{\varOmega \left(G\left( thr(m)\right)\right)}{\varOmega \left(N\left({p}^{\ast}\right)\right)}\kern0.5em ,m=0,1,...,8 $$

(4)

where Ω(·) is the total number of voxels in a region and r_m represents the volume filling rate and is the m^th element of the feature vector x of the given point p*.

We explain why the extracted features of a point are consistent with our assumptions. If a point belongs to the background, the volume filling rate in the feature vector will rapidly increase to 1.0 because of the smoothness of the local background. For the feature vector of a foreground point, the volume filling rate will increase much more slowly, and may not even reach 1.0, as the cylindrical shape of a neurite takes up a small amount of space in its neighborhood and the intensities of the foreground and background voxels will be different. The differences between foreground and background feature vectors are illustrated in Fig. 2d.
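The contrast between the two kinds of feature vectors can be made concrete with Eq. (4). In the short Python sketch below, the component sizes are hypothetical numbers chosen only to illustrate the expected behavior; they are not measurements from the paper.

```python
NEIGHBORHOOD_SIDE = 19  # neighborhood edge length in voxels (from the text)

def volume_filling_rates(component_sizes, side=NEIGHBORHOOD_SIDE):
    """Eq. (4): r_m = |G(thr(m))| / |N(p*)| for each threshold index m."""
    total = side ** 3  # 6859 voxels for a 19 x 19 x 19 neighborhood
    return [size / total for size in component_sizes]

# Hypothetical component sizes for thr(0), ..., thr(8); illustrative only.
background_sizes = [120, 2500, 6859, 6859, 6859, 6859, 6859, 6859, 6859]
foreground_sizes = [3, 9, 20, 31, 45, 60, 78, 95, 110]

background_vector = volume_filling_rates(background_sizes)
foreground_vector = volume_filling_rates(foreground_sizes)
```

With these illustrative inputs the background vector saturates at 1.0 after two thresholds, whereas every element of the foreground vector stays below 0.02.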

SVM Classifier Used to Identify Weak Signals

This section first describes the automatic extraction of training sets from neuronal images, and then explains how to build the SVM classifier after obtaining the training set.

In a supervised learning framework, a training set is necessary. Here, the training set contains the feature vectors of the foreground and background points. The automatic generation of training sets requires some foreground and background points to be obtained computationally, which is practical for the following reasons. Existing tracing methods can identify weak signals to a certain extent and can therefore provide foreground points. We used our SparseTracer tool (Li et al. 2016) to trace neurites and extracted foreground points from the traced results. The traced results are composed of a series of points in which adjacent points are connected, providing the skeleton of a neurite. These skeleton points can be regarded as foreground points (Chen et al. 2015). If fewer than 500 skeleton points were available, we calculated the feature vectors of all skeleton points. Otherwise, we sorted the signal intensities of these points in ascending order and chose the skeleton points with mid-level signal intensities. Finally, we calculated the feature vectors corresponding to the selected skeleton points. These feature vectors constitute the positive training set, denoted by S_train (left of Fig. 2e).
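The selection of positive training points can be sketched as follows. The text does not specify how "mid-level" intensities are delimited; taking the central window of the sorted order is an assumption of this example, as is the function name.

```python
def select_training_points(points, intensities, limit=500):
    """Pick at most `limit` skeleton points for the positive training set.

    With `limit` points or fewer, every point is kept. Otherwise the points
    are sorted by intensity and the central `limit` entries are taken;
    reading "mid-level" as this central window is an assumption.
    """
    if len(points) <= limit:
        return list(points)
    order = sorted(range(len(points)), key=lambda i: intensities[i])
    start = (len(order) - limit) // 2
    return [points[i] for i in order[start:start + limit]]
```

The feature vectors of the returned points would then be computed to form S_train.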

To obtain negative training samples, we randomly (from a uniform distribution) selected points from the neuronal images, equal in number to the positive training samples. We calculated their feature vectors (i.e., the negative training samples), denoted by B_train (right of Fig. 2e). The selected points may include a few foreground voxels, meaning that the negative training set may contain some positive samples. However, this selection is reasonable because, in most cases, the foreground voxels occupy less than 0.1% of the total number of voxels in neuronal images, according to our calculations (not shown). Thus, the number of foreground feature vectors included in the negative training set is negligible. In addition, SVM can tolerate a certain degree of error in constructing the training set. We identified whether a feature vector in the negative training set B_train is an outlier (the foreground feature vector) by measuring two degrees of similarity: one is the inner product between this feature vector and the mean values of B_train, and the other is the inner product between this feature vector and the mean values of S_train. If the former is larger than the latter, the vector is regarded as an outlier and is deleted from B_train. The vectors remaining in the dataset comprise the negative training samples.

To simplify the description, we used {y_k, x_k}, k = 1, 2, …, K to denote the positive and negative training sets. Here, y_k = 1 or −1, x_k is a feature vector, and K is the total number of training feature vectors in S_train and B_train combined. If y_k = 1, x_k is positive and equal to an element in S_train. Otherwise, x_k is equal to an element in B_train. After obtaining the training set, we introduced a linear SVM (Suykens and Vandewalle 1999; Cortes and Vapnik 1995) to build a supervised classifier. This classifier distinguishes between foreground and background voxels, and can be written as

$$ \begin{aligned} &\min_{\mathbf{w},b,e} F\left(\mathbf{w},b,e\right)=\frac{1}{2}{\mathbf{w}}^T\mathbf{w}+\gamma \frac{1}{2}\sum_{k=1}^K{e}_k^2\\ &\text{subject to}\quad {y}_k\left[{\mathbf{w}}^T{\mathbf{x}}_k+b\right]=1-{e}_k,\quad k=1,2,\ldots ,K \end{aligned} $$

(5)

where $ {\left\{{y}_k,{\mathbf{x}}_k\right\}}_{k=1}^K $ are the training samples described above, and x_k refers to the k^th feature vector. Variable e_k represents the error term. γ is used to control the tradeoff between the training error and generalization ability. The optimization problem in Eq. (5) is then converted into an unconstrained optimization problem (Rockafellar 1973; Hestenes 1969):

$$ \underset{w,b,e;\alpha }{\min }L\left(\mathbf{w},b,e;\alpha \right)=F\left(\mathbf{w},b,e\right)-{\sum}_{k=1}^K{\alpha}_k\left\{{y}_k\left[{\mathbf{w}}^T{\mathbf{x}}_k+b\right]-1+{e}_k\right\} $$

(6)

where α_k is the Lagrange multiplier. Using the Kuhn–Tucker conditions, we could obtain the optimal solution (Suykens and Vandewalle 1999), and the corresponding supervised classifier can be denoted by

$$ R\left(\mathbf{x}\right)=\operatorname{sgn}\left(\sum_{k=1}^K{\alpha}_k^{\ast }{y}_k{\mathbf{x}}_k^T\mathbf{x}+{b}^{\ast}\right)=\operatorname{sgn}\left({\mathbf{w}}^{\ast T}\mathbf{x}+{b}^{\ast}\right) $$

(7)

where x represents the input feature vector; α_k^* and b* are obtained by solving the optimization problem (6), and w* = Σ_k α_k^* y_k x_k. sgn() represents the sign function. If w*^Tx + b* > 0, the input belongs to the positively labeled set; otherwise, it belongs to the negatively labeled set. Applying the classifier to the training samples shows that most of the positive and negative values are close to 1 and −1, respectively (Fig. 2f), which illustrates the large differences in the feature vectors of the foreground and background points (Fig. 2e).
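For readers who want to experiment with Eqs. (5) to (7), the following Python sketch trains a linear least-squares SVM by solving the linear system that results from the Kuhn–Tucker conditions (eliminating w and e leaves the unknowns b and α). It is a minimal illustration under stated assumptions: the value of γ is a hypothetical default, the dense Gaussian elimination is chosen for self-containment rather than speed, and none of the names come from the authors' software.

```python
def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(p * q for p, q in zip(u, v))

def solve_linear_system(a, rhs):
    """Solve a x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [rhs[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x

def train_ls_svm(xs, ys, gamma=10.0):
    """Return (w, b) for Eq. (5); gamma=10.0 is a hypothetical default.

    Solves [[0, y^T], [y, Omega + I/gamma]] [b; alpha] = [0; 1],
    where Omega[k][l] = y_k * y_l * x_k . x_l.
    """
    k = len(xs)
    a = [[0.0] + [float(y) for y in ys]]
    for i in range(k):
        row = [float(ys[i])]
        for j in range(k):
            omega = ys[i] * ys[j] * dot(xs[i], xs[j])
            row.append(omega + (1.0 / gamma if i == j else 0.0))
        a.append(row)
    sol = solve_linear_system(a, [0.0] + [1.0] * k)
    b, alpha = sol[0], sol[1:]
    dim = len(xs[0])
    w = [sum(alpha[i] * ys[i] * xs[i][d] for i in range(k)) for d in range(dim)]
    return w, b

def classify(w, b, x):
    """Eq. (7): +1 for foreground, -1 for background."""
    return 1 if dot(w, x) + b > 0 else -1
```

A call such as `w, b = train_ls_svm(feature_vectors, labels)` followed by `classify(w, b, x)` mirrors the decision rule w*^Tx + b* > 0 described above.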

Using the Identification Model for Neurite Tracing

Neurite tracing is the process of obtaining the skeleton of a neurite. A key component of neurite tracing is the accurate identification of foreground points. When tracing a neurite, if the current tracing point is identified as a background point, the tracing will be terminated. We applied the identification model described above to our SparseTracer tool to obtain better neurite tracing results. The pipeline is described as follows (Fig. 3).

Step 1) Use SparseTracer to trace the neurite. When the point p_{n+1} is identified as a background point, tracing stops and an initial skeleton is generated, represented by P = {p₁, p₂, ..., p_i, ..., p_n}, where p_i is the i^th point on the skeleton.

Step 2) Extract the feature vectors of foreground and background points separately. These form the positive and negative training sets, respectively. Note that the foreground points are the skeleton points of traced neurites generated with SparseTracer.

Step 3) Obtain the SVM classifier with the training set.

Step 4) Apply the obtained classifier to the identification of points p_n and p_{n+1}. If one of these two points is identified as a foreground point, continue tracing with SparseTracer, and go to Step 5). Otherwise, go to Step 6).

Step 5) During tracing, whenever SparseTracer identifies the last two tracing points as background points, the SVM classifier is automatically activated. If the classifier also identifies both points as background points, terminate the tracing and go to Step 6); otherwise, continue tracing with SparseTracer and the SVM classifier.

Step 6) If point p₁ is not a branching point, carry out the same tracing and identification procedure as in Step 4) and Step 5), starting from the points p₂ and p₁, until the termination condition is satisfied, namely, the last two tracing points are identified as background points by the SVM classifier. Otherwise, finish tracing this neurite.

The above steps describe how our identification model and SparseTracer collaborate to achieve better neurite tracing. Note that our identification model learns from the characteristics of an image stack and is applied only to that image stack. If the neurites in a given image stack cannot be traced completely in one iteration, we adopt the following strategy: once our constructed classifier detects foreground points with weak signal intensities, these points are used to update the positive training set, and a new classifier is built for the next tracing iteration. This strategy helps to provide nearly complete neurite reconstructions for a given image stack.
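The termination logic of Steps 4) and 5) can be summarized in a short control-flow sketch. All three callables below are hypothetical stand-ins (SparseTracer's point proposal and foreground test, and the SVM decision); returning the path without its final point follows the convention P = {p₁, ..., p_n} of Step 1) and is an assumption of this example.

```python
def trace_direction(start, propose_next, tracer_is_foreground,
                    svm_is_foreground, max_steps=10000):
    """Extend a skeleton from `start` until both checks reject the last two points.

    `propose_next(path)` returns the next candidate point; the two predicates
    return True when a point is judged to be foreground. All three are
    placeholders for the real tracer and classifier.
    """
    path = [start]
    for _ in range(max_steps):
        path.append(propose_next(path))
        last_two = path[-2:]
        if any(tracer_is_foreground(p) for p in last_two):
            continue  # the tracer alone still accepts at least one point
        if any(svm_is_foreground(p) for p in last_two):
            continue  # the classifier rescues a weak-signal point
        return path[:-1]  # both points rejected by tracer and classifier
    return path
```

Step 6) would call the same routine a second time from the other end of the skeleton when p₁ is not a branching point.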

Parameter Settings in the Identification Model

To construct the identification model for detecting weak signals, certain key parameters must be pre-determined, including the size of the training set, the ratios used in feature vector extraction, and the size of the neighborhood.

Size of the Training Set

The positive training set depends on the tracing results. If the total number of foreground points in the traced neurites is less than a pre-determined threshold (500 in this study), we selected all foreground points and calculated their feature vectors to form the positive training set. In this case, although the size of the positive training set may be small (dozens of points), the SVM classifier still behaves well. This is why we do not use upsampling to increase the size of the training set. When the traced neurites include many points, numerous positive feature vectors can be generated. In this case, the upper limit for the number of feature vectors in the positive training set is 500. This threshold is based on a tradeoff between computational cost and classification performance, as the inclusion of more training samples may not improve identification performance. This selection ensures that the time required to identify weak signals is approximately the same as that required for neurite tracing. In addition, balanced training sets are ideal for supervised SVM classifiers, and so the negative training set is similar in size to the positive training set (Tang et al. 2009).

Ratios Used in Feature Vector Extraction

The feature vector of a point depends on certain ratio settings, as described in Eq. (1). In our analysis, the ratios range from 0.8–1, and the difference between two adjacent ratios is c₁ = 0.025. The choice of a small c₁ is based on one of our assumptions, namely, the local background is smooth. Consequently, a slight decrease in the ratio (i.e., a small c₁) can fill the entire neighborhood of a background point through region growing, and the corresponding elements of its feature vector will be equal to 1. Therefore, a small value of c₁ can capture the smoothness of the background. A rapidly decreasing ratio (i.e., a large c₁) would indicate lower threshold values. With these lower thresholds, the region of a weak signal point would quickly fill its entire neighborhood, and thus the features of the neurite’s morphology would not be captured. The parameter c₂ covers situations in which the background intensity is very low and c₁ is not sufficient to achieve an appropriate granularity in ratios for feature vector extraction. Similar to c₁, the selection of c₂ aims to maintain the ability to detect weak neurite signals while identifying local background smoothness. Overall, the selection of these two parameters is intended to capture the feature differences between weak signal voxels and background regions.

Size of the Neighborhood in Feature Vector Extraction

In feature vector extraction, the neighborhood of a point contains 19 × 19 × 19 voxels. This is based on the following considerations: if the neighborhood is small, the local morphology of a neurite extracted with a relatively low threshold may, in some situations, span the entire neighborhood, preventing the capture of its local morphology. However, a large neighborhood increases the computational cost of obtaining the connected region, which is a key step in feature vector extraction. Considering the diameter of a neurite (less than 5 μm) (De Paola et al. 2006; Stettler et al. 2006; Loopuijt et al. 2007) and the voxel size (0.5 × 0.5 × 0.5 μm³ to 2 × 2 × 2 μm³), we set the neighborhood range to be 19 × 19 × 19 voxels. This setting satisfies the condition that the local morphology of a neurite occupies a small portion of the neighborhood. All of the parameters discussed in this subsection remain unchanged throughout our analysis.

Multi-Fold Cross-Validation of SVM Classifier

To validate the effectiveness of the constructed SVM classifier in our identification model, we used multi-fold cross-validation (Kohavi 1995). The procedure of cross-validation is as follows: in the image stack, we used our method to generate a data set containing 500 foreground feature vectors and 500 background feature vectors. The data set was randomly partitioned into 10 equally sized subsets, each containing the same number of foreground and background feature vectors. Of the 10 subsets, a single subset was retained as testing data and the remaining 9 subsets were used as training data. This process was repeated 10 times, once for each subset, and correspondingly, 10 testing errors were generated. We averaged these testing errors to evaluate the SVM classifier (see Table S1).
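The partitioning described above corresponds to a stratified 10-fold split. A minimal Python sketch is given below; `train` and `predict` are placeholders for whichever classifier is being validated, and the fixed random seed is an assumption added for reproducibility.

```python
import random

def stratified_folds(labels, k=10, seed=0):
    """Split sample indices into k folds with class counts balanced per fold."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for cls in sorted(set(labels)):
        idx = [i for i, y in enumerate(labels) if y == cls]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)  # deal indices round-robin
    return folds

def cross_validate(xs, ys, train, predict, k=10, seed=0):
    """Average test error over k folds.

    `train(xs, ys)` returns a model and `predict(model, x)` returns a label;
    both are supplied by the caller.
    """
    errors = []
    for fold in stratified_folds(ys, k, seed):
        held_out = set(fold)
        kept = [i for i in range(len(ys)) if i not in held_out]
        model = train([xs[i] for i in kept], [ys[i] for i in kept])
        wrong = sum(predict(model, xs[i]) != ys[i] for i in fold)
        errors.append(wrong / len(fold))
    return sum(errors) / len(errors)
```

With 500 foreground and 500 background vectors, each fold holds 50 of each class, so every classifier is trained on 900 vectors and tested on 100.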

Evaluation of Automated Neurite Tracing Methods

The precision and recall rates are often used to quantify the difference between the automatic and manual reconstruction given by a series of traced skeleton points. Here, each skeleton point has three-dimensional coordinates, and also can be regarded as a voxel if its coordinate elements are integers. These evaluation measurements were used in our previous studies (Quan et al. 2016; Li et al. 2016). The precision and recall are computed according to the numbers of true positive points. A true positive point is defined as follows: For any given point on the automatic reconstruction, find its nearest point on the manual reconstruction. If the distance between the given point and the found point is less than a pre-determined threshold (6 μm in this study), the given point is considered to be a true positive point. The pre-determined distance threshold judges whether a point in one skeleton is equated to a point in another skeleton or not. The parameter is set based on the consideration of the morphological characteristics of thick dendrites and total length of a neuron. According to our previous work (Quan et al. 2016), the evaluation results change slightly when this parameter ranges from 6 μm to 10 μm. The precision is then defined as the ratio of the number of true positive points to the total number of points in the automated reconstruction. The recall is defined as the ratio of the number of true positive points to the total number of points in the manual reconstruction.
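The two measures can be computed directly from the definitions above. The sketch below follows the text literally: true positives are counted on the automatic reconstruction, and that same count is used for both ratios. The function names are illustrative, and the brute-force nearest-point search is chosen for brevity rather than speed.

```python
import math

def nearest_distance(p, reference):
    """Euclidean distance from point p to its nearest point in `reference`."""
    return min(
        math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))
        for q in reference
    )

def precision_recall(auto, manual, max_dist=6.0):
    """Precision and recall of `auto` against `manual` skeleton points.

    A point of `auto` is a true positive when its nearest `manual` point lies
    closer than `max_dist` (6 um in the text). Coordinates are assumed to be
    in micrometers.
    """
    true_pos = sum(1 for p in auto if nearest_distance(p, manual) < max_dist)
    return true_pos / len(auto), true_pos / len(manual)
```

Because the same true-positive count enters both ratios, this reading presumes that the automatic and manual skeletons are sampled at comparable point spacing.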

Results

To evaluate the performance of the proposed ST-LFV, the fMOST (Gong et al. 2013), DIADEM (Brown et al. 2011), and BigNeuron (Peng et al. 2015) datasets were used. The fMOST dataset includes typical sub-blocks from different mouse brain regions collected with the fMOST imaging system (Gong et al. 2013) using a voxel size of 0.3 × 0.3 × 1 μm³. These voxels were automatically merged to sizes ranging from 0.5 × 0.5 × 0.5 μm³ to 2 × 2 × 2 μm³, a range suitable for our tool GTree, with which the analysis here was performed. The DIADEM (www.diademchallenge.org) and BigNeuron (http://alleninstitute.org/bigneuron/data/) datasets are freely available; information about these datasets can be found on the respective websites. We performed experiments on a computer workstation (Intel® Xeon® CPU 3.46 GHz computing platform, Quadro K4000 3G GPU, 192 GB RAM, Windows 7). Our analysis involved two algorithms: an automatic tracing algorithm, SparseTracer, and the combination of SparseTracer and the learned feature vectors in ST-LFV. Both algorithms (SparseTracer and ST-LFV) are integrated into our software GTree (https://github.com/GTreeSoftware/GTree/releases). GTree is implemented using widely used open source standards and programming practices (C++ with ITK and VXL libraries, graphical user interfaces written with Qt version 5.5.1 and VTK version 8.0) and is released under the GNU General Public License (https://github.com/GTreeSoftware/GTree/blob/master/License.md). When we used SparseTracer to analyze each image stack, we carefully selected the parameters to ensure optimal tracing results. When using ST-LFV, the default settings for the tracing parameters were used, as they can provide an initial training set in most situations; the identification parameters are fixed and discussed in detail in the Methods section.

We first demonstrated the ability of ST-LFV to trace neurites in inhomogeneous neuronal images. Two image stacks were selected from a whole-brain imaging dataset and the relevant image characteristics were identified (Fig. 4a-f). We evaluated the image quality of the selected sub-blocks (Fig. 4b, e) using the contrast-to-noise ratio (CNR) (Song et al. 2004). The CNRs of the target areas (Fig. S1) range from 1.53 to 2.68, indicating that some regions have small differences between their foreground and background intensities. However, by modifying the signal range in the visualization mode, we were able to discriminate the foreground from the background, and thus manual tracing could achieve the ground-truth results (red curves in Fig. 4a, d). We further calculated the image intensities of the skeleton points on manually traced neurites (red curves in Fig. 4a, d) from the original image and its estimated background image (Quan et al. 2013). In most cases, the background intensities (blue curve in Fig. 4c) estimated from Fig. 4a are 2–3 times larger than the foreground intensities (red curve in Fig. 4f) calculated from Fig. 4d. This indicates that the background intensities vary sharply in brain-wide imaging datasets. For this case, we compared the tracing results obtained using SparseTracer with those from ST-LFV (Fig. 4g, h). SparseTracer can trace the neurite well (Fig. 4d, h) with a suitable threshold. However, the same threshold is not applicable in other cases (Fig. 4a, g). In contrast, ST-LFV attains tracing results that are almost equal to the ground truth. We concluded that ST-LFV can overcome the influence of the inhomogeneous background to trace neurites successfully.

We used an experimental dataset to verify the ability of ST-LFV to identify weak signals. The experimental dataset included a neurite with weak signals at several sites (Fig. 5a). We selected two sites and extracted their corresponding sub-blocks (Fig. 5b, c). These two sub-blocks had CNRs of 1.47 and 1.21, respectively. We manually traced the neurites in these two sub-blocks and computed the foreground and background values of the skeleton points in the traced neurites (Fig. 5d, e). These results (Fig. 5b-e) show that the neurites have weak signal intensities at some sites. SparseTracer cannot handle this case, tracing only parts of the neurite (left and middle panels in Fig. 5f), despite repeated efforts to select suitable thresholds. Compared with SparseTracer, ST-LFV yields good tracing results (red curve, right panel in Fig. 5f) that are broadly equivalent to the ground truth (green curve in Fig. 5f). ‘High threshold’ and ‘low threshold’ in Fig. 5f have the same meaning as in Figs. 4g, h, and are explained in the legend of Fig. 4.

Furthermore, we showed that ST-LFV is superior to SparseTracer for tracing neurites using 12 experimental image stacks containing sub-blocks of 600 × 600 × 600 voxels. These datasets are distributed across different brain regions, among which the background intensities vary markedly (Fig. S2). Two typical datasets and their corresponding tracing results are presented in Fig. 6a, b. The sites of these two datasets are labeled with yellow arrows in Fig. 6c. The sites of the other ten datasets are also shown in Fig. 6c. When using SparseTracer to analyze these datasets, we selected the tracing threshold value that maximized the average tracing accuracy and used this setting for all datasets. This sometimes caused SparseTracer to produce more traced segments than the manual reconstruction contains (white circles in middle panel, Fig. 6a). Unlike SparseTracer, ST-LFV generates a separate identification model for each dataset to identify untraced foreground points. We used multi-fold cross-validation (see section 2.5) to validate the constructed SVM classifier in the identification model (Table S1). The highest error rate was 2.1% (dataset 2); the other datasets had error rates of less than 0.5%. The cross-validation results indicate the robustness of the SVM classifiers to image stacks from different brain regions. Furthermore, we quantified the tracing results from SparseTracer and ST-LFV (Fig. 6d, e). The average precision and recall rates were 93% and 86% for SparseTracer and 99% and 97% for ST-LFV, respectively. These results indicate that ST-LFV can achieve almost complete neurite reconstruction using our identification model.
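The multi-fold cross-validation mentioned above can be illustrated with a small sketch. It is not the validation code used in the study: a nearest-centroid rule stands in for the SVM so that the example needs only the standard library, and the two-dimensional feature vectors, the function names, and the fold assignment are all hypothetical.

```python
import math

def fit_centroids(xs, ys):
    """Stand-in classifier (NOT an SVM): store the mean feature vector per class."""
    centroids = {}
    for label in set(ys):
        members = [x for x, y in zip(xs, ys) if y == label]
        centroids[label] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def predict_centroid(centroids, x):
    """Assign x to the class whose centroid is nearest."""
    return min(centroids, key=lambda label: math.dist(centroids[label], x))

def kfold_error(samples, labels, k, fit, predict):
    """k-fold cross-validation: each sample is tested exactly once by a model
    trained on the other k-1 folds; returns the overall error rate."""
    n = len(samples)
    errors = 0
    for fold in range(k):
        test_idx = [i for i in range(n) if i % k == fold]
        train_idx = [i for i in range(n) if i % k != fold]
        model = fit([samples[i] for i in train_idx], [labels[i] for i in train_idx])
        errors += sum(predict(model, samples[i]) != labels[i] for i in test_idx)
    return errors / n

# Hypothetical, well-separated feature vectors: label 1 = foreground, 0 = background.
foreground = [(5.0 + 0.1 * (i % 7), 4.0 + 0.1 * (i % 5)) for i in range(40)]
background = [(0.1 * (i % 7), 0.1 * (i % 5)) for i in range(40)]
samples = foreground + background
labels = [1] * 40 + [0] * 40

error_rate = kfold_error(samples, labels, 5, fit_centroids, predict_centroid)
print(error_rate)
```

Swapping `fit_centroids` and `predict_centroid` for an SVM trainer and predictor leaves the fold logic unchanged.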

We also compared the reconstruction performance of our previous method SparseTracer, our method ST-LFV, Open-Snake (Wang et al. 2011), and UltraTracer (Peng et al. 2017) on various datasets. The datasets are from the BigNeuron project, the DIADEM challenge, and MOST data. According to the reconstruction and quantified results (Fig. 7 and Table S2), all the tested methods behaved well on datasets with simple neuron structure and clean background (Fig. 7a, b). Open-Snake and UltraTracer showed their own advantages on some specific neurite structures (Fig. 7c, d), such as neurons consisting of short, thick neurite segments (Fig. 7d). When tracing axonal neurites with weak image intensity, SparseTracer, Open-Snake, and UltraTracer were challenged (Fig. 7e), and some of them even failed to cope with this case (Fig. 7f). Note that when using SparseTracer, the tracing seeds were manually provided except for Fig. 7e. This is because the image (Fig. 7e) includes many separate neurites, and manually selecting tracing seeds is relatively time-consuming. These results provide some evidence that identifying the weak foreground in neuronal images is still a challenging problem for state-of-the-art methods. In this demonstration, ST-LFV can identify almost all axonal neurites.

Next, we demonstrated that our identification model is not limited to images with a smooth background. Synthetic datasets were used for this purpose. Each dataset consisted of 517 × 515 × 517 voxels and contained several neurites. The signal intensity of the first three images in Fig. 8a is 255, but with different noise levels (Gaussian white noise with zero mean and standard deviation of 20, 60, or 100). The last image corresponds to signal and noise intensities of 150 and 100, respectively. These four images have CNRs (Welvaert and Rosseel 2013) of 12.75, 4.25, 2.55, and 1.5, respectively. An anisotropic Rudin-Osher-Fatemi (ROF) denoising method (Goldstein and Osher 2009; Rudin et al. 1992) was applied to smooth the image stack (Fig. 8b). We then validated the identification model drawn from the corresponding smoothed image stacks. When building the model, the positive training sets were calculated from the initial traced results (red curves in top-left panel, Fig. 8a). The negative training sets were formed of randomly chosen voxels. We manually checked the validity of the testing sets, i.e., that the positive and negative feature vectors corresponded to foreground and background voxels, respectively. The voxels from these four smoothed image stacks (Fig. 8b) had the same coordinates, and each smoothed image stack generated a testing set (Fig. 8c). In the testing sets, even with the high noise level, the positive and negative feature vectors were still separable (right panel in Fig. 8c). The results indicate that our feature vectors differentiate the foreground and background sufficiently well to handle images with nonsmooth backgrounds when an appropriate denoising method is used. We evaluated the error rate of the SVM classifiers on four testing sets (Fig. 8d). The highest error rate is 3% for the data with the highest noise level (fourth image in Fig. 8b). From these results, we conclude that our identification model can be applied to image stacks with a nonsmooth background.
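The CNR values quoted for the four synthetic images are reproduced by dividing the signal intensity by the noise standard deviation. The short check below assumes a zero-mean background, so that the contrast equals the signal intensity itself; this reading matches the reported numbers but is an assumption about how the ratio was evaluated here.

```python
# (signal intensity, noise standard deviation) for the four synthetic images
cases = [(255, 20), (255, 60), (255, 100), (150, 100)]

BACKGROUND_MEAN = 0  # assumption: zero-mean noise on an otherwise empty background

# Contrast-to-noise ratio: (signal - background mean) / noise standard deviation
cnrs = [(signal - BACKGROUND_MEAN) / noise_sd for signal, noise_sd in cases]
print(cnrs)
```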

We investigated whether our assumptions were applicable to DIADEM datasets. Two datasets were used for this purpose. The dataset shown in Fig. 9a (Neocortical Layer 6 Axons dataset) was imaged by a two-photon microscope. The other in Fig. 9b (OP dataset) was imaged by 2-channel confocal microscopy. We extracted the foreground points and background points and calculated the corresponding feature vectors. The calculated feature vectors (Fig. 9c-f) illustrate the large feature differences between the extracted foreground and extracted background points. These differences result in a classifier for detecting weak signals with a low training error (1.3% for Fig. 9g and 0% for Fig. 9h, respectively). Note that, in generating the training set, the extracted background points included a few points located in regions adjacent to or at the boundaries of neurites (Fig. 9a). This may reduce the power of the classifier. We quantified this negative influence by applying cross-validation and the corresponding error rate is 0.75%. For the clean image background in Fig. 9b, the corresponding error rate is 0.05%.

We also demonstrated that our assumptions hold for BigNeuron data using two typical datasets (checked6_mouse_tufts and checked_mouse_korea) with a noisy background (Fig. 10a, b). Despite the presence of such noise, the foreground and background feature vectors are different and can be easily classified into two groups (Fig. 10c-f). These feature vectors were used to derive classifiers with training error rates of 0.4% (Fig. 10g) and 1.2% (Fig. 10h). The training errors can be attributed to the interference of the noisy background. We further validated the identification model using cross-validation. The estimated error rates are 0.05% and 0.1%, respectively. The low error rates indicate large differences between the features of the foreground and background points. These results indicate that our assumptions are consistent with the features of the BigNeuron datasets, and that the generated model is valid.

Furthermore, we checked that the constructed classifiers (Fig. 10g, h) produce better tracing results. We compared the tracing results from SparseTracer with those given by ST-LFV (Fig. 11a-d), and found that SparseTracer was unable to trace some neurites with weak signals (arrows in Fig. 11a, c). ST-LFV produced tracing results that included almost all of the neurites that could not be traced by SparseTracer. These results verify that the classifiers (Fig. 10g, h) are applicable even when there are several training errors.

The superior tracing performance of ST-LFV resulted in a vast improvement in the automation level of the SparseTracer software, enabling the rapid tracing of neurites in large-scale datasets. We selected a dataset that included several long axons, with a total size of 1.99 × 1.93 × 1.32 mm³ (voxel size, 0.3 × 0.3 × 1 μm³, 105 GB). We adopted the divide-and-conquer strategy used in our previous work (Li et al. 2016) for the analysis of large datasets. We also added a manual editing module to SparseTracer to supply the initial tracing direction and sites for continued tracing when detection fails because the signal is too weak. This manual editing module allows SparseTracer to produce the same tracing results as ST-LFV (Fig. 12b, c). However, the number of manual edits required for SparseTracer (20) is far greater than that for ST-LFV (1). This demonstrates the advantage of ST-LFV for large-scale tracing. We quantified this advantage by comparing the total time required when using ST-LFV with that when using SparseTracer. Tracing neurites with SparseTracer requires intensive manual edits, and a skilled annotator may require 20 min. The same annotator would require only 5 min to finish the same task using ST-LFV. In addition, we measured the time required to build the identification model and compared it with that for neurite tracing. Fifteen sub-blocks from the dataset in Fig. 12 were used for this purpose, and the corresponding information is presented in Table S3. According to the tracing pipeline (Methods section 2.3), foreground identification is closely linked with neurite tracing, and the model only examines the small number of voxels at which tracing failed. Therefore, the time required to build and customize the identification model is less than that for tracing neurites. In this comparison, the image reading time was ignored as it would be negligible for the data storage system (a RAID is connected to the workstation directly) used in this study. From the above comparisons, we conclude that ST-LFV significantly improves the tracing performance of SparseTracer and is a valuable resource for large-scale neurite tracing.
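As a rough consistency check on the dataset size quoted above, the voxel count follows from the physical extent and the voxel size. The sketch below assumes 16-bit voxels and binary gigabytes (2^30 bytes); neither is stated in the text, so both should be read as assumptions rather than facts about the dataset.

```python
# Physical extent (mm) and voxel size (micrometres) quoted in the text.
extent_mm = (1.99, 1.93, 1.32)
voxel_um = (0.3, 0.3, 1.0)

# Number of voxels along each axis: convert mm to micrometres, divide by voxel size.
dims = [round(e * 1000.0 / v) for e, v in zip(extent_mm, voxel_um)]

BYTES_PER_VOXEL = 2  # assumption: 16-bit intensities (not stated in the text)
total_bytes = dims[0] * dims[1] * dims[2] * BYTES_PER_VOXEL
size_gib = total_bytes / 2**30  # assumption: "GB" is read as 2**30 bytes

print(dims, round(size_gib, 1))
```

Under these assumptions the estimate lands close to the quoted 105 GB.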

Discussion

ST-LFV relies on more human-derived rules about the images than other methods (Li et al. 2017; Chen et al. 2015). These rules are based on several assumptions that are commonly applicable to neuronal images collected with optical microscopes, and provide a basis for constructing a feature vector that displays the differences between foreground and background voxels. The robustness of our model was verified by multi-fold cross-validation (see results in Table S1). We also demonstrated that, under our assumptions, the feature vector of a weak signal voxel is essentially different from that of a background voxel (Fig. S3). By using more rules, our method avoids a complicated procedure for extracting valid features and identifying weak signals, thus eliminating the need for intensive computation. This is the primary reason why ST-LFV is suitable for large-scale tracing of neurites.

In ST-LFV, unlike other methods (Li et al. 2017; Chen et al. 2015), the identification model is embedded in the tracing procedure. When the tracing termination conditions are triggered, the identification model operates to allow tracing to continue after the model has identified the current tracing point as a foreground voxel. The identification model is linked to the tracing procedure and enhances the ability of ST-LFV to trace neurites with weak signals. Other methods separate the identification and tracing procedures, first using a machine learning method to identify as many foreground voxels as possible and then performing the tracing procedure, i.e., extracting the skeleton based on the identified foreground voxels. These methods aim to identify all foreground voxels, and are therefore relatively computationally expensive, which is an obstacle to the large-scale tracing of neurites. ST-LFV only activates the identification model when the tracing termination conditions have been triggered, and thus only identifies a few foreground voxels when tracing a neurite. This contributes to the ability of ST-LFV to rapidly trace neurites in large-scale images.
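The control flow described above, in which the identification model is consulted only after a termination condition fires, can be sketched on a one-dimensional intensity profile. Everything in this example is a hypothetical stand-in: the profile, the threshold-based termination test, and the simple local-contrast rule used in place of the trained SVM classifier.

```python
def trace(signal, threshold, is_foreground=None):
    """Walk along a 1-D intensity profile from index 0.

    Returns (last index reached, number of classifier calls).
    """
    calls = 0
    pos = 0
    while pos + 1 < len(signal):
        nxt = pos + 1
        if signal[nxt] < threshold:       # termination condition triggered
            if is_foreground is None:     # plain threshold tracing stops here
                break
            calls += 1                    # identification model consulted only now
            if not is_foreground(nxt):
                break                     # model agrees: background, stop tracing
        pos = nxt                         # continue tracing
    return pos, calls

# Hypothetical profile: a bright stretch over a high background, a weak stretch
# over a low background, a bright stretch again, then true background.
signal     = [100, 100, 100, 30, 30, 100, 100, 20, 20]
background = [ 60,  60,  60, 20, 20,  20,  20, 20, 20]
MARGIN = 5  # hypothetical contrast margin for the stand-in rule

def local_contrast_rule(i):
    """Stand-in for the trained classifier: compare against the local background."""
    return signal[i] - background[i] > MARGIN

threshold_only = trace(signal, threshold=50)
with_model = trace(signal, threshold=50, is_foreground=local_contrast_rule)
print(threshold_only, with_model)
```

In this toy case the classifier is called at three positions only, which mirrors the point made in the text that few voxels need to be identified per traced neurite.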

Foreground identification is an indispensable step in neurite tracing. The identification model in ST-LFV is used to distinguish foreground voxels from the background. In general, our identification model could be employed alongside many widely used tracing methods, such as the model fitting method (Zhao et al. 2011) and the principal curves method (Bas and Erdogmus 2011). The termination condition of these methods is activated when the local structure information becomes inadequate. Combined with our model, these methods could potentially identify weaker foreground signals and continue tracing.

As in many machine learning methods (Suykens and Vandewalle 1999; Cortes and Vapnik 1995), obtaining a training set is a key part of ST-LFV. In our method, the training set contains positive (foreground) and negative (background) feature vectors, is automatically generated, and does not require manual labeling. A positive feature vector in the training set is determined by its corresponding foreground point (see the Methods section). The point is automatically generated by a tracing procedure such as SparseTracer (Li et al. 2016) or other methods (Bas and Erdogmus 2011; Rodriguez et al. 2009). A negative feature vector is determined by a corresponding point drawn at random (according to a uniform distribution) from the image. This random selection ensures that the chosen point has a very low probability of being in the foreground. The probability is low because the distribution of neurites is sparse and the number of foreground voxels is far smaller than the total number of voxels. According to the above analysis, the automatic generation of a training set from a sparse image is feasible.
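The sparsity argument above can be made concrete with a small sketch. The volume size matches the 600 × 600 × 600 sub-blocks used in the Results, but the neurite geometry (a straight segment with a 3 × 3 voxel cross-section), the number of negative samples, and the function name are hypothetical choices made only for illustration.

```python
import random

def sample_negatives(shape, n, rng):
    """Draw n voxel coordinates uniformly at random from a volume of the given shape."""
    return [tuple(rng.randrange(s) for s in shape) for _ in range(n)]

shape = (600, 600, 600)

# Hypothetical traced neurite: a straight segment along z with a 3 x 3 cross-section.
foreground = {
    (300 + dx, 300 + dy, z)
    for dx in (-1, 0, 1)
    for dy in (-1, 0, 1)
    for z in range(shape[2])
}

rng = random.Random(0)  # fixed seed so the sketch is repeatable
negatives = sample_negatives(shape, 1000, rng)

# Probability that one uniformly drawn voxel lies in the foreground.
contamination = len(foreground) / (shape[0] * shape[1] * shape[2])
expected_foreground_hits = len(negatives) * contamination
print(contamination, expected_foreground_hits)
```

With a foreground fraction of this order, a thousand random draws are expected to contain far fewer than one foreground voxel, which is why no manual labeling of negatives is needed in the sparse case.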

In SparseTracer, we used a constrained principal curve to trace neurites with weak signals (Li et al. 2016; Quan et al. 2016). In general, it is difficult to obtain a forward tracing direction from this type of neurite because of inadequate local structure information. In this case, the constrained principal curve introduces directional information for the traced points, and this direction becomes the forward tracing direction. This feature allows SparseTracer to detect weaker signals than other methods (Rodriguez et al. 2009; Wang et al. 2011; Xiao and Peng 2013). When analyzing images that include weak signals, SparseTracer demonstrates highly accurate tracing performance (>85% recall at >90% precision). However, like most other methods, SparseTracer uses a set of thresholds to determine whether the tracing should be terminated. This termination condition may not be suitable when detecting weak signals from an inhomogeneous background (see Figs. 4g, h). This type of weak signal detection is a common task in the process of tracing large-scale neurites. Considering this situation, we proposed an identification model that can be combined with SparseTracer, i.e., ST-LFV, to enable the large-scale tracing of neurites.

The identification model in ST-LFV is based on rules that are suitable for various types of neuronal images. However, directly using the identification model may be inappropriate for images whose characteristics do not satisfy our assumptions. Consider the following two examples. 1) The neurites or somas are densely distributed in the images. In this case, the randomly chosen voxels used to construct the negative training set will include a relatively high number of foreground voxels, which will reduce the performance of the identification model. This problem can be addressed by cleaning the negative training set. For instance, the foreground region could be labeled using neurite tracing and soma shape reconstruction methods (Quan et al. 2013; Quan et al. 2014; Luengo-Sanchez et al. 2015; Varando et al. 2018), and feature vectors whose corresponding voxels lie in the labeled regions could then be removed from the negative training set. 2) For images without a smooth background, the background features may deviate from the assumption used to construct the identification model. In this case, an anisotropic ROF denoising method (Goldstein and Osher 2009) or a bias correction method (Sing et al. 2015) is a good way to ensure that the smoothed background satisfies our assumption. The choice of approach should be based on the image characteristics. In short, pre-processing methods can extend the application range of our identification model. However, some cases may still result in failure to identify weak signals. In case 1), some tracing methods may fail to trace neurites that are highly disconnected, i.e., neurites that can be modeled as a series of sufficiently separable clusters of foreground points rather than as a series of connected cylinders. Such tracing failures may in turn prevent the successful application of the identification model. In case 2), there may be images with low z-resolution in which the neural signal is not sparsely distributed. The identification will then fail because the extraction of background feature vectors will be restricted by the dense foreground voxels.

We demonstrated that ST-LFV can detect weak foreground voxels when tracing sparsely distributed neurites. ST-LFV also has potential advantages in tracing neurites from large-scale images in which neurites are sparsely distributed. However, it is worth noting that the identification model in ST-LFV is activated only when a starting point and a tracing direction are provided. For a partially traced neurite, ST-LFV may generate a complete reconstruction, but it cannot trace neurites that the tracing method fails to detect at all, because no starting point or tracing direction is available. In addition, like other tracing methods (Peng et al. 2010; Peng et al. 2017; Quan et al. 2016; Wang et al. 2011; Wearne et al. 2005), ST-LFV still faces many difficulties in tracing neurons on a brain-wide scale, which can be attributed to the following causes. First, brain-wide neurite tracing involves identifying individual neurons among densely packed neurites, a problem that still challenges current tracing methods. Second, because of the tree structure of a neuron, tracing errors can accumulate. Tracing a long-projection neuron makes this situation worse, as errors can propagate across the whole brain. This means that, in brain-wide reconstruction, a single tracing error can cause an unacceptable result in some cases. Finally, brain-wide tracing of neurons requires handling image stacks of terabytes or even tens of terabytes, which calls for integrating big-data techniques with neurite tracing methods. Thus, many challenges still exist in the brain-wide tracing of individual neurons. To overcome these challenges, appropriate methods should be developed and integrated into a software tool.

Conclusion

We have proposed a method for identifying weak neurite signals against a background that is inhomogeneous but locally smooth. We verified that the extracted features, which differentiate the foreground and background, are widely applicable to various types of light-microscopic images containing sparsely distributed neurites. The identification method was shown to improve the accuracy of neurite tracing, provided that our rules are consistent with the image characteristics. We further demonstrated that this identification method is suitable for the large-scale tracing of sparsely distributed neurites, which may aid in the reconstruction of neurons across different brain regions.

Information Sharing Statement

Our method is integrated into GTree, an open-source software package available at https://github.com/GTreeSoftware/Release/releases. We also provide some example datasets used in our paper, which can be downloaded at https://github.com/GTreeSoftware/TEST_DATA/releases/tag/ST-LFV. Readers interested in other datasets are welcome to contact us.

References

Bas, E., & Erdogmus, D. (2011). Principal curves as skeletons of tubular objects: Locally characterizing the structures of axons. Neuroinformatics, 9(2–3), 181–191.
Basu, S., Condron, B., Aksel, A., & Acton, S. T. (2013). Segmentation and tracing of single neurons from 3D confocal microscope images. IEEE J Biomed and Health Informatics, 17(2), 319–335.
Becker, C., Rigamonti, R., Lepetit, V., & Fua, P. (2013). Supervised feature learning for curvilinear structure segmentation. Proc Int Conf Med Image Comput Comput Assist Intervent (MICCAI), 526–533.
Brown, K. M., Barrionuevo, G., Canty, A. J., De Paola, V., Hirsch, J. A., Jefferis, G. S., et al. (2011). The DIADEM data sets: Representative light microscopy images of neuronal morphology to advance automation of digital reconstructions. Neuroinformatics, 9(2–3), 143–157.
Cai, H., Xu, X., Lu, J., Lichtman, J. W., Yung, S. P., & Wong, S. T. C. (2006). Repulsive force based snake model to segment and track neuronal axons in 3D microscopy image stacks. Neuroimage, 32(4), 1608–1620.
Chen, H., Xiao, H., Liu, T., & Peng, H. (2015). SmartTracing: Self-learning-based neuron reconstruction. Brain Informatics, 2(3), 135–144.
Choromanska, A., Chang, S.-F., & Yuste, R. (2012). Automatic reconstruction of neural morphologies with multi-scale tracking. Front Neural Circuits, 6, 25.
Chothani, P., Mehta, V., & Stepanyants, A. (2011). Automated tracing of neurites from light microscopy stacks of images. Neuroinformatics, 9(2–3), 263–278.
Chung, K., & Deisseroth, K. (2013). CLARITY for mapping the nervous system. Nat Methods, 10(6), 508–513.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Mach Learn, 20(3), 273–297.
De Paola, V., Holtmaat, A., Knott, G., Song, S., Wilbrecht, L., Caroni, P., & Svoboda, K. (2006). Cell type-specific structural plasticity of axonal branches and boutons in the adult neocortex. Neuron, 49(6), 861–875.
Donohue, D. E., & Ascoli, G. A. (2011). Automated reconstruction of neuronal morphology: An overview. Brain Res Rev, 67(1–2), 94–102.
Feng, G., Mellor, R. H., Bernstein, M., Keller-Peck, C., Nguyen, Q. T., Wallace, M., Nerbonne, J. M., Lichtman, J. W., & Sanes, J. R. (2000). Imaging neuronal subsets in transgenic mice expressing multiple spectral variants of GFP. Neuron, 28(1), 41–51.
Frangi, A. F., Niessen, W. J., Vincken, K. L., & Viergever, M. A. (1998). Multiscale vessel enhancement filtering. MICCAI, 98, 130–137.
Fürth, D., Vaissière, T., Tzortzi, O., Xuan, Y., Märtin, A., Lazaridis, I., et al. (2018). An interactive framework for whole-brain maps at cellular resolution. Nat Neurosci, 21(1), 139–149.
Goldstein, T., & Osher, S. (2009). The split Bregman method for L1 regularized problems. SIAM J Imaging Sci, 2(2), 323–343.
Gong, H., Zeng, S., Yan, C., Lv, X., Yang, Z., Xu, T., Feng, Z., Ding, W., Qi, X., Li, A., Wu, J., & Luo, Q. (2013). Continuously tracing brain-wide long-distance axonal projections in mice at a one-micron voxel resolution. Neuroimage, 74, 87–98.
Gong, H., Xu, D., Yuan, J., Li, X., Guo, C., Peng, J., Li, Y., Schwarz, L. A., Li, A., Hu, B., Xiong, B., Sun, Q., Zhang, Y., Liu, J., Zhong, Q., Xu, T., Zeng, S., & Luo, Q. (2016). High-throughput dual-colour precision imaging for brain-wide connectome with cytoarchitectonic landmarks at the cellular level. Nat Commun, 7, 12142.
Gu, L., Zhang, X., Zhao, H., Li, H., & Cheng, L. (2017). Segment 2D and 3D filaments by learning structured and contextual features. IEEE Trans Med Imag, 36(2), 596–606.
Hernandez-Herrera, P., Papadakis, M., & Kakadiaris, I. A. (2016). Multi-scale segmentation of neurons based on one-class classification. J Neurosci Methods, 266, 94–106.
Hestenes, M. R. (1969). Multiplier and gradient methods. J Optim Theory Appl, 4(5), 303–320.
Jefferis, G. S., & Livet, J. (2012). Sparse and combinatorial neuron labelling. Curr Opin Neurobiol, 22(1), 101–110.
Kohavi, R. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection. Proc 14th Int Joint Conf Artificial Intelligence, 1137–1143.
Li, S., Zhou, H., Quan, T., Li, J., Li, Y., Li, A., et al. (2016). SparseTracer: The reconstruction of discontinuous neuronal morphology in noisy images. Neuroinformatics, 15(2), 133–149.
Li, R., Zeng, T., Peng, H., & Ji, S. (2017). Deep learning segmentation of optical microscopy images improves 3D neuron reconstruction. IEEE Trans Med Imag, 36(7), 1533–1541.
Loopuijt, L. D., Silva, F. M., Hirt, B., Vonthein, R., & Kremers, J. (2007). Dendritic thickness: A morphometric parameter to classify mouse retinal ganglion cells. Braz J Med Biol Res, 40, 1367–1382.
Luengo-Sanchez, S., Bielza, C., Benavides-Piccione, R., Fernaud-Espinosa, I., DeFelipe, J., & Larrañaga, P. (2015). A univocal definition of the neuronal soma morphology using gaussian mixture models. Front Neuroanat, 9, 137.
Luo, L., & Callaway, E. K. (2008). Genetic dissection of neural circuits. Neuron, 57(5), 634–660.
Luo, G., Sui, D., Wang, K., & Chae, J. (2015). Neuron anatomy structure reconstruction based on a sliding filter. BMC Bioinformatics, 16, 342.
Marx, V. (2012). Technology feature charting the brain's networks. Nature, 490(7419), 293–298.
Megjhani, M., Rey-Villamizar, N., Merouane, A., Lu, Y., Mukherjee, A., Trett, K., Chong, P., Harris, C., Shain, W., & Roysam, B. (2015). Population-scale three-dimensional reconstruction and quantitative profiling of microglia arbors. Bioinformatics, 31(13), 2190–2198.
Meijering, E. (2010). Neuron tracing in perspective. Cytometry A, 77(7), 693–704.
Mitra, P. P. (2014). The circuit architecture of whole brains at the mesoscopic scale. Neuron, 83(6), 1273–1283.
Osten, P., & Margrie, T. W. (2013). Mapping brain circuitry with a light microscope. Nat Methods, 10(6), 515–523.
Parekh, R., & Ascoli, G. A. (2013). Neuronal morphology goes digital: A research hub for cellular and system neuroscience. Neuron, 77(6), 1017–1038.
Parekh, R., & Ascoli, G. A. (2015). Quantitative investigations of axonal and dendritic arbors: Development, structure, function, and pathology. Neuroscientist, 21(3), 241–254.
Peng, H., Ruan, Z., Atasoy, D., & Sternson, S. (2010). Automatic reconstruction of 3D neuron structures using a graph-augmented deformable model. Bioinformatics, 26(12), i38–i46.
Peng, H., Hawrylycz, M., Roskams, J., Hill, S., Spruston, N., Meijering, E., & Ascoli, G. A. (2015). BigNeuron: Large-scale 3D neuron reconstruction from optical microscopy images. Neuron, 87(2), 252–256.
Peng, H., Zhou, Z., Meijering, E., Zhao, T., Ascoli, G. A., & Hawrylycz, M. (2017). Automatic tracing of ultra-volumes of neuronal images. Nat Methods, 14(4), 332.
Quan, T., Zheng, T., Yang, Z., Ding, W., Li, S., Li, J., Zhou, H., Luo, Q., Gong, H., & Zeng, S. (2013). NeuroGPS: Automated localization of neurons for brain circuits using L1 minimization model. Sci Rep, 3.
Quan, T., Li, J., Zhou, H., Li, S., Zheng, T., Yang, Z., et al. (2014). Digital reconstruction of the cell body in dense neural circuits using a spherical-coordinated variational model. Sci Rep, 4, 4970.
Quan, T., Zhou, H., Li, J., Li, S., Li, A., Li, Y., Lv, X., Luo, Q., Gong, H., & Zeng, S. (2016). NeuroGPS-tree: Automatic reconstruction of large-scale neuronal populations with dense neurites. Nat Methods, 13(1), 51–54.
Radojevic, M., & Meijering, E. (2017). Automated neuron tracing using probability hypothesis density filtering. Bioinformatics, 33, 7.
Ragan, T., Kadiri, L. R., Venkataraju, K. U., Bahlmann, K., Sutin, J., Taranda, J., Arganda-Carreras, I., Kim, Y., Seung, H. S., & Osten, P. (2012). Serial two-photon tomography for automated ex vivo mouse brain imaging. Nat Methods, 9(3), 255–U248.
Rockafellar, R. T. (1973). A dual approach to solving nonlinear programming problems by unconstrained optimization. Math Program, 5(1), 354–373.
Rodriguez, A., Ehlenberger, D. B., Hof, P. R., & Wearne, S. L. (2009). Three-dimensional neuron tracing by voxel scooping. J Neurosci Methods, 184(1), 169–175.
Rudin, L. I., Osher, S., & Fatemi, E. (1992). Nonlinear total variation based noise removal algorithms. Physica D: Nonlinear Phenomena, 60(1–4), 259–268.
Santamaria-Pang, A., Hernandez-Herrera, P., Papadakis, M., Saggau, P., & Kakadiaris, I. A. (2015). Automatic morphological reconstruction of neurons from multi-photon and confocal microscopy images using 3D tubular models. Neuroinformatics, 13(3), 297–320.
PubMed Google Scholar
Silvestri, L., Bria, A., Sacconi, L., Iannello, G., & Pavone, F. S. (2012). Confocal light sheet microscopy: Micron-scale neuroanatomy of the entire mouse brain. Opt Express, 20(18), 20582–20598.
CAS PubMed Google Scholar
Sing, J. K., Adhikari, S. K., & Kahali, S. (2015). On estimation of bias field in MRI images. In CGVIS 2015 IEEE international conference (pp. 269–274).
Google Scholar
Song, X., Pogue, B. W., Jiang, S., Doyley, M. M., Dehghani, H., Tosteson, T. D., & Paulsen, K. D. (2004). Automated region detection based on the contrast-to-noise ratio in near-infrared tomography. Appl Opt, 43(5), 1053–1062.
PubMed Google Scholar
Stettler, D. D., Yamahachi, H., Li, W., Denk, W., & Gilbert, C. D. (2006). Axons and synaptic boutons are highly dynamic in adult visual cortex. Neuron, 49, 877–887.
CAS PubMed Google Scholar
Suykens, J. A., & Vandewalle, J. (1999). Least squares support vector machine classifiers. Neural Process. Lett, 9(3), 293–300.
Google Scholar
Svoboda, K. (2011). The past, present, and future of single neuron reconstruction. Neuroinformatics, 9(2–3), 97–98.
PubMed Google Scholar
Tang, Y., Zhang, Y.-Q., Chawla, N. V., & Krasser, S. (2009). SVMs modeling for highly imbalanced classification. IEEE Trans Syst, Man, Cybern B, Cybern, 39(1), 281–288.
Turetken, E., Gonzalez, G., Blum, C., & Fua, P. (2011). Automated reconstruction of dendritic and axonal trees by global optimization with geometric priors. Neuroinformatics, 9(2–3), 279–302.
PubMed Google Scholar
Ugolini, G. (2010). Advances in viral transneuronal tracing. J Neurosci Methods, 194(1), 2–20.
PubMed Google Scholar
Varando, G., Benavides-Piccione, R., Muñoz, A., Kastanauskaite, A., Bielza, C., Larrañaga, P., & DeFelipe, J. (2018). MultiMap: A tool to automatically extract and analyse spatial microscopic data from large stacks of confocal microscopy images. Front Neuroanat, 12.
Wang, Y., Narayanaswamy, A., Tsai, C. L., & Roysam, B. (2011). A broadly applicable 3-D neuron tracing method based on open-curve snake. Neuroinformatics, 9(2–3), 193–217.
PubMed Google Scholar
Wearne, S. L., Rodriguez, A., Ehlenberger, D. B., Rocher, A. B., Henderson, S. C., & Hof, P. R. (2005). New techniques for imaging, digitization and analysis of three-dimensional neural morphology on multiple scales. Neuroscience, 136(3), 661–680.
CAS PubMed Google Scholar
Welvaert, M., & Rosseel, Y. (2013). On the definition of signal-to-noise ratio and contrast-to-noise ratio for fMRI data. PLoS One, 8(11), e77089.
CAS PubMed PubMed Central Google Scholar
Xiao, H., & Peng, H. (2013). APP2: Automatic tracing of 3D neuron morphology based on hierarchical pruning of a gray-weighted image distance-tree. Bioinformatics, 29(11), 1448–1454.
CAS PubMed PubMed Central Google Scholar
Xu, C. Y., & Prince, J. L. (1998). Snakes, shapes, and gradient vector flow. IEEE Trans Imag Process, 7(3), 359–369.
CAS Google Scholar
Yang, J., Gonzalez-Bellido, P. T., & Peng, H. (2013). A distance-field based automatic neuron tracing method. BMC Bioinformatics, 14(1), 1–11.
Google Scholar
Zhao, T., Xie, J., Amat, F., Clack, N., Ahammad, P., Peng, H., Long, F., & Myers, E. (2011). Automated reconstruction of neuronal morphology based on local geometrical and global structural models. Neuroinformatics, 9(2–3), 247–261.
PubMed PubMed Central Google Scholar
Zingg, B., Hintiryan, H., Gou, L., Song, M. Y., Bay, M., Bienkowski, M. S., Foster, N. N., Yamashita, S., Bowman, I., Toga, A. W., & Dong, H. W. (2014). Neural networks of the mouse neocortex. Cell, 156(5), 1096–1111.
CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

This work was supported by the Science Fund for Creative Research Group of China (Grant No. 61421064), the National Program on Key Basic Research Project of China (Grant No. 2015CB7556003), the National Natural Science Foundation of China (Grant No. 81327802), the Science Fund for Young and Middle-aged Creative Research Group of the Universities in Hubei Province (Grant No. T201520), the Natural Science Foundation of Hubei Province (Grant No. 2014CFB564), and the Director Fund of WNLO.

Author information

Authors and Affiliations

Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
Shiwei Li, Tingwei Quan, Hang Zhou, FangFang Yin, Anan Li, Ling Fu, Qingming Luo, Hui Gong & Shaoqun Zeng
MOE Key Laboratory for Biomedical Photonics, Collaborative Innovation Center for Biomedical Engineering, School of Engineering Sciences, Huazhong University of Science and Technology, Wuhan, 430074, Hubei, China
Shiwei Li, Tingwei Quan, Hang Zhou, FangFang Yin, Anan Li, Ling Fu, Qingming Luo, Hui Gong & Shaoqun Zeng
School of Mathematics and Economics, Hubei University of Education, Wuhan, 430205, Hubei, China
Tingwei Quan

Corresponding author

Correspondence to Tingwei Quan.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

ESM 1 (DOCX 1059 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Li, S., Quan, T., Zhou, H. et al. Identifying Weak Signals in Inhomogeneous Neuronal Images for Large-Scale Tracing of Sparsely Distributed Neurites. Neuroinform 17, 497–514 (2019). https://doi.org/10.1007/s12021-018-9414-9

Published: 11 January 2019
Issue Date: October 2019
DOI: https://doi.org/10.1007/s12021-018-9414-9
