Introduction

The spatial pattern of forest trees is the distribution and spatial arrangement of trees in a population and can be described by the spatial correlation of the marks characterizing the structure of individual trees in a forest stand (Wang et al. 2010; Podlaski 2019). The spatial pattern can also characterize the spatial correlation of potential intraspecific and interspecific interactions among individuals (Pillay and Ward 2012; Erfanifard and Sheikholeslami 2017) and reveal information on stand history, population dynamics, competition and their survival in different environments (Zhang et al. 2010; Alvarez et al. 2011; Lin et al. 2011; Genet et al. 2014).

Spatial patterns can be divided into unmarked and quantitatively marked patterns depending on the involvement of mark and into univariate, bivariate and multivariate patterns according to the number of species involved. A mark is information that characterizes individual trees and can be qualitative or quantitative (Wiegand and Moloney 2013). A quantitative mark is a scalar that describes a size variable of an individual tree, such as tree height, diameter at breast height, and crown width; whereas a qualitative mark is a descriptor such as tree species or tree habit. Summary statistics have frequently been used to quantify statistical properties of spatial patterns and to provide a brief and concise description of patterns using numbers, functions and diagrams at multiple scales (Wiegand and Moloney 2004; Illian et al. 2008). Commonly used summary statistics are the pair correlation function, mark connection function and its variations. Specific null models are also used to randomize certain aspects of the data, while holding others constant according to the specific type of patterns (Wiegand and Moloney 2013), aiming to create patterns that are expected in the absence of a particular ecological mechanism of interest (Graves 1996).

However, the spatial correlations of tree location and facilitated or competitive interaction and of size and facilitated or competitive interaction are still not clear due to the lack of corresponding marks and summary statistics. Previous research with traditional census data has usually focused only on tree height and diameter at breast height, which lack insight into the three-dimensional structure of individual trees. Laser scanning technology, which can provide a three-dimensional structure of a forest tree, has provided a new solution (Henning and Radtke 2006; Packalen et al. 2013; Taubert et al. 2015; Lin et al. 2016; Fischer et al. 2020). Airborne laser scanning (ALS) technology can be used to collect a point cloud covering a large area, and individual trees can be delineated from the ALS point cloud through segmentation algorithms, from which tree marks can be further extracted (Zhao et al. 2014).

Aiming at exploring whether there are spatial correlations (1) among individual locations based on a given coordinates (i.e., if trees are spatially clustered, random, or segregated), (2) among quantitative marks representing tree size (i.e., if trees are significantly large or small at certain scales), (3) among vectorial marks representing individual facilitated or competitive interactions (i.e., if trees are spatially facilitated or competitive at certain scales), (4) among quantitative and vectorial marks (i.e., if facilitated trees are large or competitive trees are small at certain scales). Four types of patterns were investigated: (1) unmarked pattern, (2) quantitatively marked pattern, (3) vectorially marked pattern and (4) quantitatively and vectorially marked pattern. For the vectorial mark-related patterns, we developed new methods to analyze. All patterns were tested with the 1-ha plot both univariately and bivariately, so eight types of patterns are presented in this study. However, because the species information of the 9-ha plot was not specified, we did only a univariate pattern analysis for the 9-ha plot and thus present only four types of patterns. The analysis based on the 9-ha plot showcases the implementation of ALS for forest trees patterns analysis over a much wider area compared with traditional measurements.

Materials and methods

Data collection and mark extraction

Study area and data collection and preparation

A 1-ha plot and a 9-ha plot were established in a deciduous forest located in Wytham Woods (1°20′ W, 51°47′ N), Oxfordshire, UK. These two plots are part of the ForestGEO global network run by the University of Oxford (Butt et al. 2009; Calders et al. 2018). The mean annual temperature is 10 °C, the mean annual rainfall is 726 mm, and the mean annual radiation is 118 W/m2. The plots are dominated by Acer pseudoplatanus (sycamore), Fraxinus excelsior (ash) and Corylus avellana (hazel), followed by Quercus robur (English oak) and Crataegus monogyna (common hawthorn).

We collected a terrestrial laser scanning (TLS) point cloud of the 1-ha plot with a RIEGL VZ-400 terrestrial laser scanner in leaf-off (December 2015, January 2016) condition at multiple locations, which were connected by reflective targets (Calders et al. 2016). The vegetation point density was about 11,000 points per m2 according to the visual interpretation. We also collected an ALS point cloud with a Leica ALS-50 II scanner during leaf-on (June 2014) (Butt et al. 2009; Calders et al. 2018). The point cloud density was 6.0 points per m2. The southwest coordinate of the 1-ha plot is located at latitude 51.7750579, longitude − 1.33904729, accurately measured with the differential GPS. We collected census data containing species information for the 1-ha plot, which was used for a bivariate pattern analysis. We also collected an ALS point cloud of the 9-ha plot on the side of the 1-ha plot.

Through careful visual inspection, 523 individual trees were manually extracted from the 1-ha TLS point cloud, then manually assigned species information using the census data. The 1-ha plot contains 364 sycamore trees and 159 other species. The average tree height is 15.42 m, median height is 16.08 m, average crown width is 6.18 m, and median crown width is 5.61 m.

Because an ALS point is most likely from the same tree as its spatially nearest TLS point, we tagged all ALS points with corresponding trees based on TLS points that had been georeferenced. These tagged ALS points are called “true ALS” hereafter. We also segmented the ALS point cloud of the 1-ha and 9-ha plot with automatic segmentation algorithms, including canopy height model-based approach, pit-free canopy height model-based approach, and point cloud–layer stacking seed point-based approach and a metabolic transporting distance-based approach (Millikan et al. 2019; Zaforemska et al. 2019; Hastings et al. 2020; Xin et al. 2021). The approach with highest accuracy was then selected for segmenting ALS. The segmented ALS points are hereafter called segmented ALS. The true ALS was used to assess the accuracy of the segmented ALS-based marks in the pattern analysis of the 1-ha plot.

Mark extraction

Tree height and crown projection area (CPA) were used as quantitative marks and were extracted from the segmented ALS point cloud. The height of a tree is the elevation from ground to the highest ALS point of the tree (Kwak et al. 2007). The CPA, an essential factor describing horizontal tree size, is the area of the convex hull of ALS points in a horizontal plane (Xu et al. 2013). However, the CPA is often overlooked in patterns analysis due to the inaccessibility of census data and is replaced by crown widths in two directions.

The essential mark we extracted from the tree crown is the crown displacement vector (CDV), a vectorial mark. CDV is the vector from tree top to crown centroid, which is the centroid of the convex hull of the ALS points. Note that the CDVs we used in this study are two-dimensional vectors in a horizontal plane, although they have vertical component. An essential assumption is that the x-, y-coordinates of the tree top are the same as that of the tree root and tree location; thus, CDV can represent the crown displacement. The ecological significance of CDV pairs of two trees lies in their characterization of the facilitation or competition between the two trees (Mead 1966; Brisson 2001).

We analyzed eight types of patterns based on the quantitative and vectorial marks, four of which were new developed here, then developed corresponding summary statistics and null models. The patterns characterize the forest from several perspectives, including the spatial distribution of trees (i.e., if trees are spatially clustered, random or segregated), spatial correlation of tree size (i.e., if trees are significantly large or small at certain scales), spatial correlation of facilitation/competition (i.e., if trees are spatially facilitated or competitive at certain scales), and spatial correlation between tree size and facilitation/competition (i.e., if facilitated trees are large or competitive trees are small at certain scales), all considered both univariately and bivariately. In this study, we employed 19 Monte Carlo simulations with a confidence level of 95% to determine whether a pattern is significantly different from that produced by null models (Wiegand and Moloney 2013). The summary statistics can be tested for significance against the corresponding null model, and whether there is a structure in the pattern that does not exist in the null model (Wiegand and Moloney 2013).

In general, summary statistics can be divided into cumulative and noncumulative according to the mode of recruitment of individual trees, as shown in Fig. 1. Bandwidth b, a buffer zone threshold, equals half the value of ring width, used to determine whether a neighbour tree is distance \(r\) away from a focal tree. According to Illian et al. (2008), bandwidth can be set as.

Fig. 1
figure 1

Schematic diagram of a noncumulative and b cumulative summary statistics. Each dot stands for an individual tree, r stands for scale, and b stands for bandwidth. a Noncumulative summary statistics recruit trees at the distance of scale r with bandwidth b from the focal tree, that is, to recruit the trees located in a ring with a ring width of 2b centred at the focal tree, three trees are recruited in this example. b Cumulative summary statistics recruit all trees within the distance of scale r from focal tree, that is, to recruit the trees located in a disk with a radius of r centred at the focal tree; four trees are recruited in this example. The noncumulative and cumulative summary statistics iterate all the individual trees in the plot as the focal tree

$$d=0.1/\sqrt{\lambda },$$
(1)

where \(\lambda\) is the tree density of the plot. There are 523 trees in the 1-ha plot of this study, then \(d=0.1/\sqrt{523/10000}\approx 0.44 m\). However, the values of summary statistics at different scales fluctuate strongly with the bandwidth of 0.5 m. Thus, we set bandwidth as 1.0 m for all the patterns for the 1-ha and 9-ha plots to avoid potential strong fluctuation. It is clear that the cumulative statistics gloss over some details of the spatial patterns shown in their noncumulative counterpart, while noncumulative statistics always show some stochastic noise. The question as to whether or not to use noncumulative or cumulative statistics depends on whether the goal is to reveal specific scale effects among the trees separated by distance r or to clarify the overall effects of all trees within distance r (Wiegand and Moloney 2013). In this study, aimed at revealing more details at different scales, all the results are based on the noncumulative summary statistics with a bandwidth b of 1 m.

Accuracy assessment of segmented ALS-based marks and pattern analysis

To analyze the eight types of spatial patterns in a larger area using ALS data, we thus used the algorithm-segmented ALS point cloud, instead of the matched true ALS. We assessed the accuracy of marks extracted from segmented ALS by comparing the mean error of segmented ALS-based quantitative marks with the true ALS-based marks, and the mean errors of magnitude of segmented ALS-based CDVs along the x- and y- coordinates were calculated. Meanwhile, to test the applicability of the segmented ALS for spatial pattern analysis, we calculated the summary statistics based on the segmented ALS and the true ALS and used cosine similarity (CS) and correctly detected rate (CDR) to assess accuracy of the indices.

Cosine similarity, a metric used to measure how similar the two multi-dimensional vectors are, calculates the cosine of the angle of the two multi-dimensional vectors (Nguyen and Bai 2011). CS is defined as

$$\mathrm{CS}\left({\mathrm{v}}_{\mathrm{s}},{\mathrm{v}}_{\mathrm{t}}\right)=\frac{{\sum }_{i=1}^{m}{\mathrm{v}}_{\mathrm{s}}\left(i\right)\cdot {\mathrm{v}}_{\mathrm{t}}\left(i\right)}{\sqrt{{\sum }_{i=1}^{m}{\mathrm{v}}_{\mathrm{s}}^{2}\left(i\right)}\cdot \sqrt{{\sum }_{i=1}^{m}{\mathrm{v}}_{\mathrm{t}}^{2}\left(i\right)}},$$
(2)

where \(m\) counts the number of scales \(r\), and \({v}_{\mathrm{s}}\) and \({v}_{t}\), the vectors composed of summary statistics values at a series of scales, were calculated from segmented ALS and true ALS respectively. \({v}_{s}\) and \({v}_{t}\) equal to

$${v}_{s}=[{v}_{s}\left(1\right),{v}_{s}\left(2\right),\dots ,{v}_{s}\left(m\right)]$$
(3)
$${v}_{t}=[{v}_{t}\left(1\right),{v}_{t}\left(2\right),\dots ,{v}_{t}\left(m\right)]$$
(4)

The value of CS is always within the range of − 1 and 1. The larger value of CS signifies that the segmented ALS-based summary statistics values are more similar to that based on true ALS, indicating the better results of segmented ALS-based analysis. Instead, the smaller value signifies worse analysis results. The CS, a direct index to evaluate the analytical results, compares the values based on segmented and true ALS.

We also used CDR to indirectly evaluate the applicability of segmented ALS on pattern analysis. There are three possibilities for the relative relationship between the summary statistics value and its simulation envelopes produced by the null model, value above the simulation envelopes (positive correlation or clustered), below simulation envelopes (competition or segregated) and within simulation envelopes (no spatial correlation or random). We defined it as correctly detected if the segmented and the true ALS-based the summary statistics values have same relative relationship in comparison with simulation envelopes, otherwise, incorrectly detected. Thus, CDR is the ratio of correctly detected number of scales to the number of all scales as follows:

$$\mathrm{CDR}=\frac{{N}_{\mathrm{C}}}{{N}_{\mathrm{C}}+{N}_{\mathrm{I}}},$$
(5)

where, \({N}_{\mathrm{C}}\) is the number of scales where spatial correlations are correctly detected, \({N}_{\mathrm{I}}\) is the number of scales where spatial correlations are incorrectly detected. CS evaluates the accuracy of the segmented ALS-based results in a direct way and CDR evaluates the accuracy in an indirect way.

Analysis 1: unmarked pattern

The ecological question of univariate unmarked pattern is to characterize the spatial distribution of trees, to test whether they are clustered, random or segregated distribution at a series of spatial scales. The ecological question of bivariate unmarked pattern is to characterize the spatial distribution of other species (one or more species) around the focal species, to test whether other species are aggregately, randomly or separately distributed surround focal species. Commonly used summary statistics are univariate and bivariate Ripley (1976)’s L function (cumulative) and pair correlation function (noncumulative) (Illian et al. 2008). The null model for univariate unmarked pattern is to randomize the two-dimensional tree locations in the horizontal plane. For bivariate unmarked pattern is to randomize the two-dimensional tree locations within each component pattern using a toroidal shift approach in the horizontal plane (Wiegand and Moloney 2013).

Analysis 2: quantitatively marked pattern

The ecological question of univariate quantitatively marked pattern is to characterize spatial correlation of tree size. The commonly used summary statistic is the univariate mark correlation function, its estimator is calculated as follows (Wiegand and Moloney 2013):

$${\widehat{k}}_{\mathrm{t},\mathrm{uni}}\left(r\right)=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }t\left({m}_{i},{m}_{j}\right)\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{{c}_{\mathrm{t}} {\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}},$$
(6)

where \({k}_{\mathrm{t},\mathrm{uni}}\left(r\right)\) is the value of univariate mark correlation function at a scale of \(r\), \(n\) is the number of trees, \(t\left({m}_{i},{m}_{j}\right)\) is a test function and has multiple expressions (Illian et al. 2008), \({c}_{\mathrm{t}}\) is the nonspatial mean of the test function, yielding the asymptotic value of test function, \(||{p}_{i}-{p}_{j}||\) is the distance separating focal tree \({p}_{i}\) and neighbor tree \({p}_{j}\), \(k(||{p}_{i}-{p}_{j}||-r)\) is a kernel function determining whether or not two trees are located distance \(r\) apart and equals 1 if the value \(||{p}_{i}-{p}_{j}||-r\) is smaller than bandwidth \(b\) and equals 0 if greater than bandwidth \(b\), \({w}_{i,j}\) is the plot edge correction factor (Goreaud and Pélissier 1999). In this study, we used the periodic edge correction method to avoid any strange or spurious configurations of the trees along the plot borders; hence, \({w}_{i,j}=1\). The estimators visit all pairs of trees separated by distance \(r\) and calculate the mean value of the test function over these pairs.

The ecological question of the bivariate quantitatively marked pattern is to characterize the spatial correlation of tree size of two species. The bivariate mark correlation function is calculated as follows (Illian et al. 2008):

$${\widehat{k}}_{\mathrm{t},\mathrm{biv}}\left(r\right)=\frac{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }t\left({m}_{i},{m}_{j}\right)\cdot {C}_{\mathrm{lm}}({p}_{i},{p}_{j})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{{c}_{\mathrm{t}}\cdot {\sum }_{\mathrm{i}=1}^{{n}_{1}}{\sum }_{\mathrm{j}=1}^{{n}_{2},\ne }{C}_{\mathrm{lm}}({p}_{i},{p}_{j})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}},$$
(7)

where \({k}_{\mathrm{t},\mathrm{biv}}\left(r\right)\) is the summary statistics value at scale of \(r\), \({n}_{1}\) is the number of focal species, \({n}_{2}\) is the number of other species, and \({C}_{\mathrm{lm}}({p}_{i},{p}_{j})\) is a judging function that yields a value of 1 if the focal tree \({p}_{j}\) is of species \(l\) and neighbour tree \({p}_{i}\) is of type \(m\) or other species, and 0 otherwise.

The null model of univariate quantitatively marked pattern randomizes the mark (i.e., tree height or CPA) all over the trees, which we called random labelling, and maintains the tree locations unchanged. That is, the marks are randomly shuffled over all trees. The null model of bivariate quantitatively marked pattern randomizes the mark within each species and maintains the tree locations unchanged.

Analysis 3: vectorially marked pattern

Under the assumption of environmental homogeneity, tree growth is not affected by environmental factors (e.g. illumination, nutrition and moisture), then the CDV pair is able to describe the facilitation or competition of two trees, which are separated by distance r.

We defined facilitation and competition of focal and neighbour trees by their CDVs as follows. An auxiliary vector from the focal tree to the neighbour tree was established. If the angle between the CDV of the focal tree and auxiliary vector is smaller than 45° and the angle between the CDV of neighbour tree and auxiliary vector is greater than 135°, then there is facilitation between the focal tree and the neighbour tree. If the angle between the CDV of focal tree and auxiliary vector is greater than 135° and the angle between the CDV of neighbour tree and auxiliary vector is smaller than 45°, then there is a competition between focal tree and neighbour tree. Otherwise, we assume that there is no facilitation or competition between focal and neighbour trees. Note that the vectors are two dimensional in horizontal plane, and the vertical component was not considered.

We proposed a vectorially marked pattern, where the vectorial mark (i.e., CDV) is attached to the univariate or bivariate pattern. Analogous to the use of the mark correlation function for a qualitatively marked pattern, mark connection functions are developed to characterize the spatial relationship among the trees regarding CDV. The ecological question of vectorially marked pattern is to determine whether the CDV pairs of the individual trees separated by distance \(r\) are spatially correlated. Analogous to the summary statistics for bivariate unmarked pattern, the mark connection function for vectorially marked pattern is estimated with the ratio of the product densities.

The second-order product density characterizes the frequency of possible configurations of two trees in the plot (Illian et al. 2008). For a homogeneous pattern in which the properties of the pattern are the same in all directions and at all locations, the second-order product density for unmarked pattern can be estimated as follows:

$${\widehat{d}}^{{^{\prime}}\left(r\right)}=\frac{n}{A}\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }k\left(\left|\left|{p}_{i}-{p}_{j}\right|\right|-r\right)\cdot {w}_{i,j}}{2\pi r}=\frac{1}{2\pi rA}{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }k\left(\left|\left|{p}_{i}-{p}_{j}\right|\right|-r\right)\cdot {w}_{i,j},$$
(8)

where \(A\) is the area of the plot, \(n/A\) is the tree density within the plot, and \(\left(\left|\left|{p}_{i}-{p}_{j}\right|\right|-r\right)\cdot {w}_{i,j}/2\pi r\) is the tree density around tree \(i\) at the distance of \(r\), that is the tree density within the ring with the width of two times of bandwidth while the ring is centred on tree \(i\).

The second-order product density for vectorially marked pattern can be estimated as follows:

$${\widehat{d}}_{V}\left(r\right)=\frac{1}{2\uppi rA}{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }t({v}_{i},{v}_{j})\cdot V({v}_{i},{v}_{j})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j},$$
(9)

where \({\mathrm{v}}_{\mathrm{i}}\) and \({\mathrm{v}}_{\mathrm{j}}\) are the crown displacement vectors of the focal tree and neighbour tree, respectively. \(V({v}_{i},{v}_{j})\), a judging function, returns a value of 1 for facilitation of the focal and neighbour tree, which is determined by the CDV pair, a value of − 1 for competitive interaction, and a value of 0 otherwise. The quantity \(t({v}_{i},{v}_{j})\) can be expressed as

$$t\left({v}_{i},{v}_{j}\right)=\left|{v}_{i}\cdot {v}_{j}\right|,$$
(10)

where \(t\left({v}_{i},{v}_{j}\right)\) is the absolute value of dot product \({\mathrm{v}}_{\mathrm{i}}\) and \({\mathrm{v}}_{\mathrm{j}}\), and \(t({v}_{i},{v}_{j})\cdot V({v}_{i},{v}_{j})\) in Eq. 4 quantifies the degree of facilitation or competition of tree pairs. To some extent, \(t({v}_{i},{v}_{j})\) is a quantitative mark for the vector and is similar to the test function \(t\left({m}_{i},{m}_{j}\right)\) in Eq. 1.

The function \(\hat{d}^{\prime}\left( r \right)\) counts all tree pairs, but \(\hat{d}_{V} \left( r \right)\) only counts the tree pairs which are facilitated or competitive due to the judging function \(V({v}_{i},{v}_{j})\). Thus, the estimator of second-order product density for the unmarked pattern is accordingly modified as

$$\widehat{d}\left(r\right)=\frac{1}{2\pi rA}{\sum\limits }_{i=1}^{n}{\sum\limits }_{j=1}^{n,\ne }\left|V({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}})\right|\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j},$$
(11)

where \(\left|V({v}_{i},{v}_{j})\right|\) the absolute value of the judging function, equals 1 or 0. Therefore, the noncumulative mark connection function for the univariate vectorially marked pattern can be estimated as

$${\widehat{q}}_{\mathrm{V},\mathrm{uni}}\left(r\right)=\frac{{\widehat{d}}_{\mathrm{V}}\left(r\right)}{\widehat{d}\left(r\right)}=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }t({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}})\cdot V({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }\left|V({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}})\right|\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}$$
(12)

where the subscript \(\mathrm{V}\) refers to the function judging the relative directions of CDVs. Analogous to Eq. 7, the estimator for the cumulative mark connection function for the univariate vectorially marked pattern can be estimated as

$${\widehat{q}}_{\mathrm{V},\mathrm{uni}}^{\mathrm{cum}}\left(r\right)=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }t({{\varvec{v}}}_{i},{{\varvec{v}}}_{j})\cdot V({{\varvec{v}}}_{i},{{\varvec{v}}}_{j})\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }\left|V({{\varvec{v}}}_{i},{{\varvec{v}}}_{j})\right|\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}$$
(13)

where the superscript \(\mathrm{cum}\) refers to cumulative, the kernel function \(I(||{p}_{i}-{p}_{j}||,r)\) equals 1 if the neighbour tree \({p}_{j}\) is located within distance \(r\) from focal tree \({p}_{i}\) and 0 otherwise. Compared with the noncumulative function, the cumulative function glosses over some details, but is less likely to be affected by scale because the cumulative functions involve more trees at the same scale, which reduces the potential stochastic noise (Wiegand and Moloney 2004). Note that expressions without superscript \(\mathrm{cum}\) are all noncumulative summary statistics [e.g.\({q}_{\mathrm{V},\mathrm{uni}}\left(r\right)\),\({q}_{\mathrm{V},\mathrm{biv}}\left(r\right)\), \({k}_{\mathrm{t},\mathrm{V},\mathrm{uni}}\left(r\right)\) and \({k}_{\mathrm{t},\mathrm{V},\mathrm{biv}}\left(r\right)]\).

Accordingly, to characterize the spatial correlation of facilitation or competition of two species, we developed the noncumulative and cumulative bivariate mark connection function for vectorially marked pattern as follows:

$${\widehat{q}}_{\mathrm{V},\mathrm{biv}}\left(r\right)=\frac{{\widehat{d}}_{\mathrm{V}}\left(r\right)}{\widehat{d}\left(r\right)}=\frac{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }t({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }\left|V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\right|\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}$$
(14)
$${\widehat{q}}_{\mathrm{V},\mathrm{biv}}^{\mathrm{cum}}\left(r\right)=\frac{{\widehat{d}}_{\mathrm{V}}\left(r\right)}{\widehat{d}\left(r\right)}=\frac{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }t({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }\left|V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\right|\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}},$$
(15)

where \({n}_{1}\) is the number of focal tree species, \({n}_{2}\) is the number of neighbour tree species. \({\widehat{q}}_{\mathrm{V},\mathrm{biv}}\left(r\right)\) recruits neighbour trees of species 2 at distance \(r\) away from the focal tree of species 1, while \({\widehat{q}}_{\mathrm{V},\mathrm{biv}}^{\mathrm{cum}}\left(r\right)\) recruits trees within distance \(r\).

For the univariate vectorially marked pattern, the null model is to randomize CDVs of all the trees while the tree locations remain unchanged. But for bivariate vectorially marked pattern, the null model is to randomize vectorial marks of trees within each component pattern, because the two component patterns are a priori different (Wiegand and Moloney 2013).

Analysis 4: quantitatively and vectorially marked pattern

The quantitatively and vectorially marked pattern is a univariate or bivariate unmarked pattern attached a quantitative mark (e.g., tree height or CPA) and a vectorial mark (i.e., CDV). The ecological question is to determine the spatial correlation between quantitative mark and vectorial mark. For example, we may expect that tree pairs with facilitation would be taller or with larger CPA than expected given over all sizes of facilitated or competitive tree pairs. Accordingly, this pattern requires a null model where quantitative mark or vectorial mark fixed, while the other mark is randomly shuffled over trees. Note that the null model of univariate pattern shuffles marks over all trees, but that of the bivariate pattern shuffles marks within each species and avoids an interspecific shuffle.

We also built summary statistics for a quantitatively and vectorially marked pattern, analogous to the former functions. The estimator of non-normalized univariate mark correlation functions yields

$${\widehat{k}}_{\mathrm{t},\mathrm{V},\mathrm{uni}}^{{^{\prime}}}\left(r\right)=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }{t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\cdot V({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }\left|V\left({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}}\right)\right|\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}},$$
(16)

where \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)={m}_{i}+{m}_{j}\) if \(V\left({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}}\right)=1\), \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)=2{m}_{\mathrm{max}}-({m}_{i}+{m}_{j})\) if \(V\left({\mathrm{v}}_{\mathrm{i}},{\mathrm{v}}_{\mathrm{j}}\right)=-1\), and \({m}_{\mathrm{max}}\) is the maximum value of the quantitative mark, and equals highest tree height or largest CPA.

It is clear that if there is no spatial correlation among marks, the estimator of the non-normalized univariate mark correlation functions \({\widehat{k}}_{\mathrm{t},\mathrm{V},\mathrm{uni}}^{{^{\prime}}}\left(r\right)\) equals the asymptotic value of \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\). Thus, to ensure the comparability at difference scales r, the estimator (Eq. 11) should be normalized by \(c\), the asymptotic value of \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\). The expectation of all facilitated tree pairs yields

$$E\left({m}_{i}+{m}_{j}\right)=2\mu$$
(17)

and the expectation of all competitive tree pairs yields

$$E\left(2{m}_{\mathrm{max}}-({m}_{i}+{m}_{j})\right)=2{m}_{\mathrm{max}}-2\mu .$$
(18)

Thus,

$$\begin{aligned} c=\{{n}_{\mathrm{pos}}\cdot E\left({m}_{i}+{m}_{j}\right)+{n}_{\mathrm{neg}}\cdot E[2{m}_{\mathrm{max}}-({m}_{i}+{m}_{j})\}/ \\ {n}_{\mathrm{pos}+\mathrm{neg}}=(2\mu \left({n}_{\mathrm{pos}}-{n}_{\mathrm{neg}}\right)+2{m}_{\mathrm{max}}{n}_{\mathrm{neg}})/{n}_{\mathrm{pos}+\mathrm{neg}}, \end{aligned}$$
(19)

where \({n}_{\mathrm{pos}}\) and \({n}_{\mathrm{neg}}\) are the numbers of facilitated or competitive tree pairs respectively, and \({n}_{\mathrm{pos}+\mathrm{neg}}={n}_{\mathrm{pos}}+{n}_{\mathrm{neg}}\).

Furthermore, the estimator of normalized univariate and bivariate mark correlation function yields

$${\widehat{k}}_{\mathrm{t},\mathrm{V},\mathrm{uni}}\left(r\right)=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }{t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{c\cdot {\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }\left|V\left({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}}\right)\right|\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}$$
(20)
$${\widehat{k}}_{\mathrm{t},\mathrm{V},\mathrm{biv}}\left(r\right)=\frac{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }{t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot {C}_{lm}({p}_{i},{p}_{j})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}{c\cdot {\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }\left|V\left({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}}\right)\right|\cdot {C}_{lm}({p}_{i},{p}_{j})\cdot k(||{p}_{i}-{p}_{j}||-r)\cdot {w}_{i,j}}.$$
(21)

The test function \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\) for univariate pattern are calculated for pairs of trees that fulfill two conditions: the focal tree and neighbour tree are either facilitated or competitive, and the trees are separated by distance \(r\).

Accordingly, the cumulative univariate and bivariate functions are estimated as follows:

$${\widehat{k}}_{\mathrm{t},\mathrm{V}}^{\mathrm{cum}}\left(r\right)=\frac{{\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }{t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}{c\cdot {\sum }_{i=1}^{n}{\sum }_{j=1}^{n,\ne }\left|V\left({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}}\right)\right|\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}$$
(22)
$${\widehat{k}}_{\mathrm{t},\mathrm{V},\mathrm{biv}}^{\mathrm{cum}}\left(r\right)=\frac{{\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }{t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\cdot V({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}})\cdot {C}_{lm}({p}_{i},{p}_{j})\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}}{c\cdot {\sum }_{i=1}^{{n}_{1}}{\sum }_{j=1}^{{n}_{2},\ne }\left|V\left({\mathbf{v}}_{\mathbf{i}},{\mathbf{v}}_{\mathbf{j}}\right)\right|\cdot {C}_{lm}({p}_{i},{p}_{j})\cdot I(||{p}_{i}-{p}_{j}||,r)\cdot {w}_{i,j}},$$
(23)

where the neighbour trees are within distance \(r\) of focal tree. Analogous to the cumulative form of functions for the vectorially marked pattern, cumulative functions for quantitatively and vectorially marked pattern reduce the potential stochastic noise, but gloss over some details, especially at the larger scales due to the higher numbers of tree pairs. Note that both the noncumulative estimator (Eqs. 19 and 20) and cumulative estimator (Eqs. 21 and 22) need \(c\), the asymptotic value of \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\), because both of them contain the test function \({t}_{\mathrm{sum}}\left({m}_{i},{m}_{j}\right)\), though noncumulative estimators only recruit trees at distance \(r,\) while cumulative estimators recruit trees within distance \(r\).

The null models for the functions of this type of pattern are also random labelling. It requires a null model where the quantitative marks are fixed, but the vectorial marks are randomly shuffled over all trees (for univariate pattern) or within each component pattern (for bivariate pattern). Alternatively, another type of null model is to maintain the vectorial mark fixed, but to shuffle the quantitative marks over all trees (for univariate pattern) or within each component pattern (for bivariate pattern).

Results

Comparison of matched true ALS- and algorithm-segmented ALS-based results

Accuracy assessment of the four ALS point cloud segmentation results showed that the metabolic transport distance-based approach (Xin et al. 2021) yields a coefficient of determination (R2) for tree height and crown width of detection of 0.9108 and 0.6193, respectively, and the overall R2 of tree height and crown width are 0.8045 and 0.4743, respectively, the mean error of tree height is 2.20 m, evaluated against the true ALS-based tree height and crown width. The other three approaches delineate understories with lower R2 of tree height and crown width, and the mean errors of tree height ranged from 6.24 to 6.58 m (detailed by Xin et al. 2021). Thus, we employed the metabolic transport distance-based approach for the ALS point cloud segmentation of the 1-ha and 9-ha plots, which showed that there are 4380 trees in the 9-ha plot.

The validity of using segmented ALS in spatial pattern analysis was evaluated by the mean error of marks, CS and CDR. The mean errors of quantitative and vectorial mark are shown in Table 1. The error of the two quantitative marks was 2.21 m. The mean errors of the two components of the vectorial mark were 1.04 m and 1.14 m, implying that the segmented ALS-based structure parameters could be used for the spatial pattern analysis.

Table 1 Accuracy of quantitative and vectorial marks based on a segmented ALS point cloud, compared with a true ALS point cloud. Mean error is the average absolute value of difference between the segmented and the true ALS. CDVx and CDVy are the x and y components of crown displacement vector, respectively

CS and CDR were used to assess the results for the eight types of segmented ALS-based pattern analysis in comparison to the true ALS-based results. From most patterns, values for assessment index CS were greater than 0.94, the average value of CS of all eight types of patterns was also 0.94, indicating a high similarity of the two lines of summary statistics. The CDR values of eight types of patterns vary from 70 to 86% (mean: 79%), which means about a number of 39.5 scales of all 50 scales were correctly detected.

In conclusion, according to the summary statistic lines and indices of CS and CDR, the algorithm segmented ALS has an excellent performance in the eight types of spatial patterns. We found that tree height-marked analysis usually get better results compared with CPA-marked analysis due to the coefficient of determination R2 of tree height based on ALS segmentation algorithm is better than that of crown width. The performance test of segmented ALS, which used the true ALS as the criterion, is the precondition for segmented ALS-based analysis and also for further larger plot and larger scale analysis without census data.

The eight types of spatial patterns analysis

The spatial distributions of the locations and marks of trees in the 1-ha plot were shown in Fig. 2, we analyzed the univariate patterns (Fig. 2a–d) in Sect. 3.2.1 and bivariate patterns in Sect. 3.2.2. The eight types of spatial patterns we analyzed were listed in Table 2.

Fig. 2
figure 2

The eight types of spatial patterns of the 1-ha plot. Each dot or circle represents an individual tree. a Spatial distribution of 523 individual trees. b Disk area of the black dot is proportional to CPA. c The length of the red arrow is proportional to the length of CDV. d Similar to b and c, but the length of red arrow is constant. e Spatial distribution of sycamore (open circles, 364 individuals) and other species (grey dots, 159 individuals). f Similar to b and e. g Similar to c and e. Red and blue arrows are the CDVs of sycamore and other species, respectively. h Similar to d and g

Table 2 The eight types of spatial pattern analysis of forest trees based on qualitative, quantitative and vectorial marks, where the type iii, iv, vii and viii were newly developed. The new patterns analyzed the facilitation and competition and its impact on tree size. Each type has a specific ecological question, corresponding null model and summary statistics and analyzes the different types of spatial correlations among trees in the plot

Univariate patterns

We analyzed four types of univariate patterns, as shown in the unmarked (Fig. 3a), quantitatively marked (Fig. 3b, c), vectorially marked (Fig. 3d), and quantitatively and vectorially marked (Fig. 3e, f) patterns. The trees showed strong segregation at scales of 1–3 m due to the close tree spacing, and rapidly turned to an aggregation at the scale of 5 m (Fig. 3a). Trees were significantly spatially aggregated at scales of 5–11 m, and peaked at scales of 4 and 11 m.

Fig. 3
figure 3

Summary statistics (red lines) for the eight types of patterns in the 1-ha plot based on algorithm-segmented ALS, and the simulation envelopes (grey lines) produced by null models. The x-axes are the spatial scales with a step of 1 m, indicating the spatial extent; the y-axes show the values of dimensionless summary statistics. In each diagram, values above their envelopes indicate spatial aggregation (a and g), large tree size (b, c, h and i), facilitation (d and j), or s above their envelopes indicate a spatial segregation (a and g), small tree size (b, c, h and i), competition (d and j), or competition and small tree size (e, f, k and l). The scales with significant departures are listed below each diagram. The quantitative marks H and CPA are tree height and crown projection area, respectively. The test function used for pattern ii and vi is \(t\left({m}_{i},{m}_{j}\right)={(m}_{i}-\mu ){(m}_{j}-\mu )\), where \({m}_{i}\) and \({m}_{j}\) are the quantitative mark values of the focal and neighbour tree, respectively, and \(\mu\) is the mean mark. Different patterns employ different null models

The tree height-marked quantitative pattern (Fig. 3b) showed a significant departure at scales of 1–3, 6–9 and 11 m under the test function.

$$t\left({m}_{i},{m}_{j}\right)={(m}_{i}-\mu ){(m}_{j}-\mu ),$$
(24)

where \({m}_{i}\) and \({m}_{j}\) are tree heights of the focal and the neighbor tree, respectively, and \(\mu\) is the mean height. A significant departure from the simulation envelopes indicates a spatial correlation among the quantitative marks of the focal tree and neighbour tree. Values of \({k}_{t,uni}\left(r\right)\) lying above the simulation envelopes for null model indicate facilitation among the quantitative mark of focal tree and neighbour tree; that is, the focal tree and neighbour tree both are small trees or both are large trees. At scales of 1–3 m, Fig. 3b showed the positive correlation between the height of focal tree and neighbour tree, and at scales of 6–9 and 11 m, it showed a competitive interaction of the height. By contrast, the CPA-marked quantitative pattern (Fig. 3c) was less correlated with tree distribution, and it did not show significance at most of scales, except the slightly negatively departure from simulation envelopes at the scale of 3 and 6 m. The results showed that CPA is less likely to be affected by tree populations compared with tree height.

However, we are not yet able to determine whether they are small or large trees. Also, values below simulation envelopes indicate a negative correlation, and we are still not able to determine the size of focal and neighbour trees until joining the vectorially marked pattern.

Vectorially marked pattern (Fig. 3d) showed spatial correlation of facilitation among trees at the scales of 5–6 and 9–10 m; that is, CDVs of the focal tree and neighbour trees are more likely to point to each other, indicating the facilitation among trees at these scales. Thus, combined with Fig. 3b, we can estimate that both the focal and neighbour trees are potentially tall trees at a scale of 6 and 9 m, and combined with Fig. 3c, the focal tree is more likely to be with a large crown, while neighbours have a small crown.

Tree height- and vector-marked quantitative and vectorial patterns (Fig. 3e) showed a significantly positive departure from envelopes at the scale of 9–12 m, which indicates t facilitation among focal and neighbour trees and that they are tall trees. These results further confirmed the pattern analysis results of Fig. 3b, d. CPA- and vector-marked quantitative and vectorial pattern (Fig. 3f) also showed significantly positive departure at scales of 9–12 m, indicating the facilitation and large CPA of focal and neighbour trees. Figure 3c showed significant departure at the scales of 3 and 6 m, while Fig. 3f did not show any significant departure at scales smaller than 9 m, indicating that the small CPA is not correlated with the tree interactions.

On the whole, the univariate pattern showed that at small scales (1–3 m), trees tend to be spatially segregated and have small crowns and short heights due to the competition among trees for nutrients. At larger scales (around 5–10 m), trees tend to facilitate with each other, resulting in a spatially clustered distribution and taller trees, but the CPA did not show much spatial correlation with tree interactions. Thus, the vectorially marked pattern and quantitatively and vectorially marked pattern indicate a specific spatial relationship of size and interaction of trees.

Bivariate patterns

The bivariate pattern analysis tests for an interspecific spatial correlation, and the summary statistics different from those of the univariate pattern. Bivariate pattern analysis goes through the specific species separately, while the univariate pattern goes through all trees without distinguishing them by species. Results of bivariate pattern analysis are shown in Fig. 3g–l, where the “two” species are dominant species (sycamore) and the other species. The unmarked pattern Fig. 3g did not show a significant departure from envelopes at small scales (1–15 m), indicating that sycamore and the other trees are randomly distributed with each other at these scales, and there was no significant spatial correlation of tree locations.

The tree height-marked quantitative bivariate pattern (Fig. 3h) showed significant departure at a scale of 3–10 m, indicating a positive correlation between the two species for tree height; that is, both focal sycamore and neighbor trees (other species) are tall trees or both are short trees. CPA-marked pattern (Fig. 3i) showed there is slightly significant departure at a scale of 11 m. However, the sizes of sycamore and other species are still not clear; more types of patterns are needed.

The undetermined spatial relationship of tree size of sycamore and other species could be clarified by a bivariate vectorial pattern (Fig. 3j). As shown in Fig. 3j, there is significant departure at a scale of 9 m, indicating facilitation between sycamore and the other species. When these results are combined with results of the quantitatively marked pattern, we can judge that both the sycamore and other species are with large trees. Moreover, we can further clarify using the quantitatively and vectorially marked pattern that both Fig. 3k, l showed a significant departure at a scale of 12 m, indicating a positive correlation of tree size with interspecific interactions among sycamore and other species. Thus, both bivariate vectorially marked pattern and bivariate quantitatively and vectorially marked pattern clarified the connection between tree size and interspecific interaction.

Univariate pattern analysis ignores the species information, and only focuses on one species or all species, while the bivariate pattern analysis focuses on two species or on one focal species and one other species. There is less significant departure of the bivariate pattern than that of the univariate pattern according to the results, indicating that the interspecific interaction is weaker than the intraspecific interaction in the 1-ha plot. Moreover, trees tend to be competitive and compete with each other at small scales, and are facilitated at large scales. Note that a significant departure occurred more randomly at scales greater than 20 m, showing weaker spatial correlation at these scales and less ecological significance at larger distances among trees; thus, we only focused on scales less than 20 m in this study.

Extending the analysis to the 9-ha plot based on the ALS point cloud

The ALS has the ability to acquire a point cloud in a large area, so we also analyzed the spatial pattern of the 9-ha plot based on the ALS point cloud, as shown in Fig. 4.

Fig. 4
figure 4

Summary statistics (red lines) for the four types of univariate patterns in the 9-ha plot. The simulation envelopes (grey lines) were produced by null models. The x-axes are the spatial scales with a step of 1 m; y-axes show the values of dimensionless summary statistics. In each diagram, values above their envelopes indicate spatial aggregation a, large tree size (b and c), facilitation (d), or facilitation and large tree size (e and f). Values above their envelopes indicate spatial segregation a, small tree size (b and c), competition (d), or competition and small tree size (e and f). The scales with significant departures are listed below each diagram. The quantitative marks H and CPA are the tree height and crown projection area, respectively. The test function used for pattern ii and vi is \(t\left({m}_{i},{m}_{j}\right)={(m}_{i}-\mu ){(m}_{j}-\mu )\), where \({m}_{i}\) and \({m}_{j}\) are the quantitative mark values of the focal and neighbour tree, respectively, and \(\mu\) is the mean mark. The scales with significant departures are listed below each diagram. Different patterns employ different null models. The bivariate pattern analysis was not conducted due to the lack of species information

The univariate pattern analysis of the 9-ha plot provided a wider perspective for exploring the spatial correlation of trees (Fig. 4). Similar to the analytical methods for ecological significance of the 1-ha plot, the unmarked pattern (Fig. 4a) showed that trees are aggregated at the scale of 6–8 m, and the tree height-marked quantitative pattern showed (Fig. 4b) that the heights of focal and neighbour trees are positively correlated, while the height- and vector- marked pattern (Fig. 4e) showed that the trees tend to be shorter and interacting competitively; thus, there is competitive inhibition among trees at the scale of 6–7 m. However, near 2 m, trees tend to be separately distributed (Fig. 4a) and competitive (Fig. 4d). Based on the quantitatively and vectorially marked pattern analysis (Fig. 4e, f), trees tend to be short and with small crowns at scales near 2 m due to competitive inhibition. Compared with the 1-ha plot (Fig. 3), the 9-ha plot (Fig. 4) tended to have less significant departures from the simulation envelopes. The reason lies in the number of trees; the 9-ha plot contains 4380 trees, while the 1-ha plot contains 523 trees, so there is less potential stochastic noise in the 9-ha plot, and the values of summary statistics of the 9-ha plot show relatively less departure than in the 1-ha plot. Note that the cumulative summary statistics also have less stochastic noise compared with the noncumulative summary statistics, though the noncumulative may gloss over some details.

Discussion

Previous spatial analytical methods have described the scattering of individual trees and the spatial distribution of tree size, whereas our newly proposed spatial analysis methods use new quantitative and vectorial marks to determine the driving factors that led to the spatial patterns. That is, traditional methods only analyze unmarked and quantitatively marked patterns such as a significant segregated distribution of certain species adults and a significant clustered distribution of subadults and juveniles at extreme small scales (Nguyen et al. 2014; Gu et al. 2019), a wide clustered distribution of some typical species at small scales (Nguyen et al. 2016), or a significantly smaller tree size of a minor species around the dominant species (Wiegand and Moloney 2013). In this study, we proposed new types of patterns based on the new marks CPA and CDV, which have not been used in other ecological studies, to further clarify the spatial correlations of tree facilitation or competition, of tree size and facilitation or competition. We found significant facilitation among the trees in the plot, which led to a larger tree size, and had a greater impact on tree height than crown size.

Trees could be spatially facilitated or competitive at any scale. The univariate and bivariate pattern of the 1-ha plot (Fig. 3) and the univariate pattern of the 9-ha plot (Fig. 4) showed that trees are more likely to spatially interact at smaller scales, and the significance of the effect (i.e., the departure from null models) is weaker at large scales. This result is consistent with other reports that tree spatial correlation diminishes with distance (Miao et al. 2014; Erfanifard et al. 2016; Moreno-Fernández et al. 2020). In fact, all the proposed univariate and bivariate patterns, including the quantitatively and vectorially marked related patterns, could be further expanded to trivariate patterns, which incorporate spatial correlations for three species and also need corresponding summary statistics and mull models (Velázquez et al. 2016).

Tree height, diameter at breast height and crown width have been commonly used marks in ecological studies, while crown width can be considered as a simplified form of CPA. In this study, we used CPA as a quantitative mark, measuring the vertical size of individuals. We also proposed the vectorial mark, CDV, a new type of mark that is a measure of the crown centre displacement from the stem of an individual tree. More three-dimensional marks could also be potentially extracted from ALS point cloud. They could be quantitative, describing the size of trees from different dimensions, or vectorial, representing the crown asymmetry more accurately, or other new types of marks. In addition, terrestrial laser scanning point clouds could also be used to provide more detailed three-dimensional structural marks, although the cost will be greater.

We examined the performance of the pattern analysis models only for 1-ha and 9-ha plots, but the two plots are both mixed forest. More types of forests, such as a coniferous forest or tropical forest, need to be tested to evaluate the broad applicability of the proposed models (Carrer et al. 2018). Nevertheless, it is still a challenge to derive species information from the ALS point cloud, which is required by bivariate or further trivariate pattern analysis. Fortunately, we could obtain species information from census data or simultaneous remote sensing data (e.g., spectral or hyperspectral images).

Conclusions

Quantitative and vectorial marks were derived from ALS point clouds, and the spatial patterns of two forest stands in Oxfordshire, UK was analyzed. To reveal details on the three-dimensional structure and facilitation and competition of trees, we used newly proposed marks including CPA, which measures the horizontal size of trees, and CDV, which measures the crown displacement of trees. New types of patterns suitable for these new marks were developed, including the vectorially marked pattern and the quantitatively and vectorially marked pattern. Correspondingly, we developed new summary statistics and null models and assessed the utility of the newly proposed marks. The newly proposed patterns were useful in clarifying the spatial correlation of intraspecific and interspecific facilitated or competitive interactions and the spatial correlation of tree size and facilitated or competitive interactions at a series of scales. They also helped to connect the spatial relationship of quantitative and vectorial marks. ALS-based marks were introduced for spatial pattern analysis, and their applicability shown. We perfected the spatial pattern system and further clarified the spatial correlations of individual trees in forests. This study thus provides a new perspective to reveal the relationship between facilitation and competition and tree size at a series of scales in forests, and gain insight into the ecological driving factors of tree distribution and in relation to tree size and location.