Marked spatial point processes: current state and extensions to point processes on linear networks

Within the applications of spatial point processes, it is increasingly becoming common that events are labeled by marks, prompting an exploration beyond the spatial distribution of events by incorporating the marks in the undertaken analysis. In this paper, we first consider marked spatial point processes in $\R^2$, where marks are either integer-valued, real-valued, or object-valued, and review the state-of-the-art to analyze the spatial structure and type of interaction/correlation between marks. More specifically, we review cross/dot-type summary characteristics, mark-weighted summary characteristics, various mark correlation functions, and frequency domain approaches. Second, we propose novel cross/dot-type higher-order summary characteristics, mark-weighted summary characteristics, and mark correlation functions for marked point processes on linear networks. Through a simulation study, we show that ignoring the underlying network gives rise to erroneous conclusions about the interaction/correlation between marks. Finally, we consider two applications: the locations of two types of butterflies in Melbourne, Australia, and the locations of public trees along the street network of Vancouver, Canada, where trees are labeled by their diameters at breast height.


Introduction
Significant advancements in data collection and storage capacities have led to vast point pattern data availability from diverse sources, which often give access to (precise) spatial locations and time occurrences of events together with further valuable point-specific information, i.e. marks.In some cases, spatial locations might be restricted by the entities of some spatially embedded relational systems, which consequently limit where events could occur.
Thus, the state space that accommodates the events plays a vital role in advanced statistical analysis.Not surprisingly, the growing availability and accessibility of point pattern data, which find applications in distinct scientific fields, generally stimulated an increasing interest in developing suitable statistical/mathematical tools for the analysis of point patterns.
Within the literature, applications often include astronomy (Feigelson and Babu, 2012), zoology (Russell et al., 2016), ecology (Daniel et al., 2020), forestry (Gavrikov and Stoyan, 1995;Yazigi et al., 2021), geology (Flagg et al., 2020), and health (Bayisa et al., 2023).Within these applications, typical examples of marks, which are point specific, include, e.g., the shape of galaxies, animal sightings of (non)invasive species, the diameter of trees at breast height (dbh), habitat characteristics, the magnitude of earthquakes, and exposure levels to pollutants.In any such application, the interesting aim is not only to make inferences about the spatial distribution of events and their potential interaction but also to understand the association among the corresponding marks.For instance, in the case of forestry and ecology, spatial variation of dbh measurements for pairs of distinct trees and spatial correlation between sighting locations for pairs of different animal species might be interesting.Note that understanding how the dbh values differ across spatial locations might provide valuable insights into tree growth patterns, resource distribution, and ecological interactions within the forest ecosystem.Also, studying sighting locations might uncover relationships and dependencies between animal distributions, shedding light on potential habitat preferences, interspecies dynamics, and ecological coexistence patterns.These objectives are usually addressed by employing different so-called mark summary characteristics, which are expected to account for the specificity of the marks and, potentially, the constrained spatial domain of the points.This paper discusses the existing methodologies and state-of-the-art for marked spatial point processes in R 2 and proposes some novelties in the context of point processes on linear networks.
The literature for spatial point processes usually employs diverse exploratory tools, such as summary characteristics, to e.g.investigate both pairwise and higher-order interactions between points, as well as to validate fitted models.As a general construction principle, any such summary characteristic quantitatively assesses the average interrelation between points and/or marks within a specific interpoint distance.Initially, the main attention was paid to unmarked stationary and homogeneous point processes, where the spatial distribution of points is translation invariant, and the intensity function remains constant over the corresponding state space, leading to the development of the K-function (Ripley, 1976), used to study the pairwise relationships between points, and the J-function (van Lieshout and Baddeley, 1999), which goes beyond the second-order analysis.Both these summary characteristics are used to identify clustering and/or inhibition among points.These were later extended to marked stationary point processes, developing cross-type second/higherorder summary characteristics for point processes with integer-valued marks (Lotwick and Silverman, 1982;van Lieshout, 2006), and mark-weighted second-order summary charac-teristics for real-valued marks (Penttinen et al., 1992;Schlather, 2001).A different line of research focused on the pairwise association/variation of real-valued marks, including the mark variogram (Cressie, 1993;Wälder and Stoyan, 1996;Stoyan and Wälder, 2000), which has similarities with the (semi-)variogram commonly used in geostatistical contexts, and Stoyan's mark correlation function (Stoyan and Stoyan, 1994).Similar to the classic point process characteristics, any such tool is constructed based on the marks for pairs of points at arbitrary interpoint distances.While, in practice, points might exhibit inhomogeneity, all of these summary characteristics for real-valued marks are only defined for stationary point processes, and their inhomogeneous versions remain an open topic for future research.
Other approaches to investigating the dependencies between marks and spatial locations are proposed by Schlather et al. (2004); Guan (2006); Guan and Afshartous (2007).Moreover, some recent developments for the analysis of stationary spatial point processes, which simultaneously possess both integer-valued and real-valued marks, include some graphical model approaches, and partial characteristics proposed by Eckardt and Mateu (2019a,b).Instead of considering a spatial domain perspective, they applied a frequency domain representation to the marked points to compute different (partial) spectral density characteristics.This highlights the energy distribution of the point patterns over a range of frequencies, unlike the summary characteristics which focus on point-wise spatial interactions.Although frequency domain methods offer a highly flexible and computationally efficient way to investigate the structural interrelation of complex point processes, methodological contributions and practical applications within the spatial point process literature remain limited.
Turning back to unmarked point processes, Baddeley et al. (2000) and van Lieshout (2011) extended the second-and higher-order summary characteristics to inhomogeneous settings for particular classes of point processes, including second-order intensity-reweighted stationary and intensity-reweighted moment stationary processes; the former is a particular case of the latter.The extended versions of these summary characteristics to take integervalued marks into account are proposed by Møller and Waagepetersen (2003); Cronie and van Lieshout (2016).However, as of today, to the best of our knowledge, methods for realvalued marks still rely on the stationary assumption, and their extensions to non-stationary setting remains open and challenging.
In an effort to study non-integer/real-valued marks, the focus is directed toward proposing novel methodologies capable of handling diverse forms of marks.More specifically, by borrowing ideas from functional data analysis (Ramsay and Silverman, 1997), extensions of Stoyan's mark correlation function (Stoyan and Stoyan, 1994) to function-valued marks are proposed by Comas et al. (2011Comas et al. ( , 2013) ) for stationary point processes, and Ghorbani et al. (2021) proposed a framework for functional marked point processes together with some weighted marked reduced moment measures.Moreover, Eckardt et al. (2023) andEckardt et al. (2023) extended some summary characteristics for spatial point processes with integervalued marks to the case of multivariate point processes with multivariate function-valued marks and constrained vector-valued quantities.These non-integer valued marks are summarized into the class of object-valued marks, where instead of a scalar mark, each point is augmented by a non-scalar mark, i.e. an object-valued attribute, that lives on a suitable mark space whose precise form depends on the object under study.Suitable choices for the mark space include the Banach/Hilbert space for function-valued marks and the simplex for constrained vector-valued quantities.As in standard mark point process investigations, these extended tools aim to explore the spatial variation/association of the specific objects, e.g.curves, for pairs of points at arbitrary distances.
In the past two decades, significant attention has been given to point processes on linear networks, where spatial locations of events are constrained.Okabe and Yamada (2001) and Xie and Yan (2008) proposed a network-based version of Ripley's K-function and an intensity estimator, which replaced the Euclidean distance with the shortest-path distance without taking the geometry of the underlying network into account, leading to biased results; note that the underlying network itself often has a non-uniform distribution.As for taking the geometry of the underlying network into account, novel methodologies focusing on intensity function and summary characteristics were developed.Utilizing shortest-path distances, Ang et al. (2012) proposed geometrically-corrected second-order summary characteristics for both homogeneous and inhomogeneous point processes on linear networks.
The developed framework by Ang et al. (2012) was subsequently expanded to the case of point processes on linear networks with integer-valued marks (Baddeley et al., 2014), and spatio-temporal point processes on linear networks (Moradi and Mateu, 2020).Moreover, Rakshit et al. (2017) proposed the consideration of regular distances on linear networks and defined novel versions of the summary characteristics proposed by Ang et al. (2012) based on a more general class of metrics.Later, employing regular distances, Cronie et al. (2020) proposed higher-order summary characteristics for point processes on linear networks.In terms of intensity functions, various techniques are proposed, which are either based on kernel functions (McSwiggan et al., 2017;Moradi et al., 2018) or Voronoi tessellations (Moradi et al., 2019;Mateu et al., 2020).Computational issues became evident on large networks, leading to a fast kernel-based intensity estimator Rakshit et al. (2019) and an efficient way to compute the K-functions for point processes on linear networks Rakshit et al. (2019).
Excluding the second-order summary characteristics for multitype point processes on linear networks proposed by Baddeley et al. (2014), and a kernel-based smoothing approach for scalar marks on linear networks by Rakshit et al. (2019), the literature certainly lacks marked-based methodologies to analyze events occurring on linear networks.In particular, no specific contributions go beyond integer-valued marks.
Knowing the limitations mentioned above, we propose novel methodologies for spatial point processes on linear networks that possess either integer-or real-valued marks.More specifically, in the case of real-valued marks, we present extensions of various marked summary characteristics, e.g.Stoyan's mark correlation and mark-weighted summary characteristics to linear network settings.In the cases of multivariate/multitype point processes on linear networks, we take the higher-order summary characteristics, defined by Cronie et al. (2020), and propose their cross versions, which are of great use to reveal the type of interac-tions between points with different marks.In Section 2, we present two motivating datasets, one on a planar state space and one on a linear network.We then, in Section 3, provide a detailed overview of the state-of-the-art for the analysis of marked spatial point processes in R 2 .In particular, we start by presenting various summary characteristics for inhomogeneous multivariate/multitype point processes and then present several mark summary characteristics for stationary point processes with real-valued marks.This section further covers mark characteristics for object-valued marks as well as distinct frequency domain methods.Section 4 starts by reviewing second-order summary characteristics for inhomogeneous multivariate/multitype point processes on linear networks together with defining their mark-weighted versions and then proposes higher-order summary characteristics for such point processes.It then extends several mark summary characteristics for point processes with real-valued marks to settings where events happen on linear networks, accompanied by a numerical evaluation highlighting the importance of considering the underlying networks.
Lastly, in Section 5, we present the outcomes obtained from the analysis of two considered datasets and close the paper with a discussion in Section 6.

Data
To illustrate the use of both the existing methodologies and our novel contributions, we consider two spatial point pattern datasets sourced from publicly accessible open data repositories.These datasets are records detailing butterfly diversity in Melbourne, Australia (Figure 1) and data concerning public street trees in Vancouver, Canada (Figure 2).
The butterfly dataset was made accessible through the Our City's Little Gems project (OCLG), on the Open Science Framework, and it was sourced from the open data repository maintained by the city of Melbourne, Australia1 .Field researchers undertook data collection during January-March 2017, employing a transect tetrad study design established by the OCLG initiative (Kirk et al., 2017).The original data show concentrations of butterflies dispersed across multiple locations throughout the city, but we focus on some public parks around Melbourne Zoo which contain 1396 butterflies.However, after cleaning the dataset and maintaining those locations with unique species types, we obtained a point pattern of 84 points with two types of butterflies Little Blue, with 64 points, and Cabbage White, with 20 points.We use the convex hull of all points as the observed window.3 Marked spatial point processes in R 2 Let x " tx i , mpx i qu n i"1 be an observed finite marked point pattern within a window W Ă R 2 , where x i is the spatial location of i-th event and mpx i q is some associated mark living on a suitable mark space M. It is assumed that x has been generated by an underlying random mechanism X, called a marked spatial point process, on the product space R 2 ˆM, which is equipped with the spatial distance dpu, vq " ||u ´v||, u, v P R 2 , and the Lebesgue measure | ¨|.For an unmarked version of X, denoted by X, it is said that X is a homogeneous point process with intensity λ if ErN p X X Aqs " λ|A|, A Ă R 2 , where N is a count function.
Otherwise, X is an inhomogeneous process, and so that λp¨q governs the spatial distribution of points.Heuristically, λpuq|du|, u P R 2 , is the probability of finding a point of X within a vicinity of u of size |du|.From now on, whenever we refer to the intensity function, we are specifically referring to the intensity function of an unmarked point process.Furthermore, according to Campbell's formulas, for any non-negative measurable function f : R where λ pmq p¨q is called the m-th order product intensity of X, and ‰ means that the sum is taken over distinct m-tuples.Note that (1) is a particular case of (2) when f pxq " 1tx P Au where 1 is an indicator function.In a similar manner, λ pmq pu 1 , . . ., u m q ś m i"1 |du i | is the probability of having points of X in m infinitesimal disjoint areas around u 1 , . . ., u m with sizes |du 1 |, . . ., |du m |.
As the product intensity functions of X are not designed to study the correlation among points, if all m-th order product intensities exist, one may wish to employ the correlation function g m pu 1 , . . ., u m q " λ pmq pu 1 , . . ., u m q λpu 1 q ¨¨¨λpu m q " m ÿ j"1 ÿ D 1 ,...,D j ξ N pD 1 q ptu i : i P D 1 uq ¨¨¨ξ N pD j q ptu i : i P D j uq , where ř D 1 ,...,D j ranges over all partitions tD 1 , . . ., D j u of t1, . . ., mu into j non-empty and disjoint sets, N pD j q is the cardinality of the index set D j , and the ξ-functions are permutation invariant (van Lieshout, 2011).If such, the point process X is called intensityreweighted moment stationary.Note that, if m " 2 it reduces to the so-called second-order intensity-reweighted stationary (Baddeley et al., 2000), where one can obtain g 2 pu, vq ǵ1 puq " pξ 2 pu, vq `1q ´1 " ξ 2 pu, vq, where g 2 pu, vq is the so-called pair correlation function.A more stringent condition, which entails that X has an identical distribution to X `a " x `a : x P X, i.e., invariance under translations, leads to the concept of stationarity.
Next, by increasing marks' complexity, we present different summary characteristics designed to investigate the interactions between marks subject to an interpoint distance 0 ă r ă 8.Under specific assumptions, such as mark independence, these summary characteristics have predetermined values, making them valuable benchmarks for indicating interactions between marks.Thus, they can be effectively utilized in the hypothesis testing framework, enabling us to draw inferences about the spatial structure of marks and identify potential deviations from mark independence.Depending on the data in question, null models include e.g. the superposition of some independent point processes and the random labeling of points generated by subsequent processes.

Discrete and integer-valued marks
When marks represent a set of countably many entities, which correspond to the simplest case of all potential marked point process settings, each point is assigned a single label 1 ď k ď n, meaning that an integer-valued quantity categorizes points.The underlying process is called a multivariate/multitype point process in this case.More specifically, the observed point pattern x will be denoted as a collection tx 1 , . . ., x k u, k ě 2, where for each point pattern x i , i " 1, . . ., k, we have mp¨q " i, meaning that each point pattern x i consists of some points with a specific mark.Within the literature, multivariate/multitype point patterns are commonly analyzed through the so-called cross/dot-type extensions of the classical summary characteristic such as the K-functions and the pair correlation functions (Ripley, 1976;Baddeley et al., 2000), the nearest-neighbor distance distribution function H, the empty space function F , and the J-function (van Lieshout and Baddeley, 1996;van Lieshout, 2011).The cross-type summary characteristics concern pairs of points with distinct marks, whereas the dot-type summary characteristics focus on the relationship between points with a particular mark and all the remaining points.These summary characteristics were initially developed for homogeneous marked spatial point processes (Lotwick and Silverman, 1982;Harkness and Isham, 1983;van Lieshout and Baddeley, 1999).However, since, in practice, homogeneity is rarely satisfied, we here only present the inhomogeneous versions of such summary characteristics, which reduce to their homogeneous counterparts when intensity is constant.We note that the inhomogeneous K-functions and pair correlation functions for multivariate/multitype point processes in R 2 are defined for second-order intensity-reweighted stationary processes (Møller and Waagepetersen, 2003), whereas the inhomogeneous nearest-neighbor distance distribution function, the inhomogeneous empty space function, and the inhomogeneous J-function demand intensity-reweighted moment stationarity (Cronie and van Lieshout, 2016).
For a second-order intensity-reweighted stationary multivariate/multitype point process X in R 2 , the cross-type K-function is given as where X i , X j are point processes with marks i, j, respectively, and λ j p¨q is the intensity function of X j .If X i does not depend on X j , then K inhom ij prq " πr 2 , which can serve as a criterion to discover independence between sub-processes of different types.If K inhom ij prq ą πr 2 , one can conclude that the expected number of points of type j around points of type i, within an interpoint distance r, is more than the expected number under mark independence, indicating a tendency to occur around points of types i.On the contrary, K inhom ij prq ă πr 2 means that points of type j tend to maintain an interpoint distance from points of type j.

Note that K inhom
ii p¨q gives the K-function for X i .Moreover, the dot-type K-function is of the form where λp¨q is the intensity function of X.In practice, (5) measures the expected number of points of any type around the points with type i.Furthermore, one could obtain the cross/dot-type versions of both the inhomogeneous pair correlation function and the inho- Apart from the above cross/dot-type second-order summary characteristics, when X is further intensity-reweighted moment stationary, there exist the cross-type inhomogeneous nearest-neighbor distance distribution function and the cross-type inhomogeneous J-function where λj " inf uPX j λ j puq, and which is the inhomogeneous empty space function of X j and does not depend on the choice u (Cronie and van Lieshout, 2016).Moreover, if X i and X j are independent, then H inhom ij prq " F inhom j prq and consequently J inhom ij prq " 1, which can be used as a benchmark to detect independence between points of different types.Note that these functions are not symmetric in i and j meaning that e.g., H inhom ij prq ‰ H inhom ji prq, and by setting j " i in ( 6) and ( 7) one obtains the nearest-neighbor distance distribution function and the J-function of X i . Furthermore, where F inhom prq is the inhomogeneous empty space function of X. Due to the properties of J-functions under superposition of stationary and independent processes, van Lieshout and Baddeley (1999) introduced the I-function, which its analogous for inhomogeneous processes becomes I inhom prq " ř k i"1 p i J inhom i prq ´Jinhom prq where p i is the probability of type i.Under independence of marks, the I-function becomes zero, whereas deviations from zero suggest either positive or negative associations among the components.
If point processes X i and X j are independent, then one expects to have J inhom ij prq " 1 at least for small distances, while J inhom ij prq ą 1 points to a tendency for point of type j to happen near points of type i, and J inhom ij prq ă 1 means that points of type j prefer not to occur close to points of type i showing an inhibition between the two types (van Lieshout, 2011; Cronie and van Lieshout, 2016).In general terms, cross-type summary characteristics are employed to examine whether the presence of a specific type can influence the occurrence of points belonging to another type.

Mark connection and mingling functions
Within the context of discrete/integer-valued marks, a variety of more accessible empirical summary characteristics are available for stationary point processes.For instance, the mark connection functions p ij prq " p i p j ρ ij prq{ρprq, where p i is the probability of type i and ρp¨q is the pair correlation function, can be intuitively interpreted as the conditional probability that two points at a distance r ą 0 from each other possess marks i and j provided that these points belong to the point process X; under the independence of X i and X j , we have p ij prq " p i p j (Stoyan and Stoyan, 1994;Illian et al., 2008;Baddeley, 2010).For p ij prq ą p i p j , the mark connection function indicates a tendency of aggregation of components i and j around each other while p ij prq ă p i p j suggests repulsion.Moreover, the normalized mark mingling function is given by where c " ř k i"1 pn i pn ´ni qq{pnpn ´1qq is a normalizing constant, n i is the number of points of type i, and ř ‰ means that points are distinct (Lewandowski and Pommerening, 1997;Pommerening et al., 2011;Hui and Pommerening, 2014).The mark mingling function νprq may be used to disclose aggregation of points with (dis)similar marks so that under the independence of marks we have νprq « 1, while νprq ą 1 indicates heterospecific attraction, and νprq ă 1 suggests conspecific attraction.Note that the main difference between the mingling function and the cross/dot-type summary characteristics is that the mingling function quantifies the closeness of points with distinct types.

Real-valued marks
Unlike discrete/integer-valued marks, which are investigated by reformulating a marked point process into k ą 1 components, real-valued marks are commonly analyzed through test functions t f : M ˆM Ñ R `, which is usually a function of the marks of two points located at a distance of r ą 0 from each other (Penttinen and Stoyan, 1989;Schlather, 2001).Moreover, in contrast to the presented summary characteristics in Section 3.1, which address the aggregation/repulsion between points of different types, test functions t f analyze numerical differences between the marks of distinct points as a function of distance.For instance, neighboring points may exhibit (dis)similar mark values, and specific points might exert dominance over the marks of neighboring points, having large marks while other points in their vicinity possess smaller marks (Illian et al., 2008, Chapter 5).
For stationary point processes, the t f -correlation function κ t f prq is given as where c t f is a normalizing factor; note that the numerator in ( 12) is a conditional expectation with respect to the joint distribution of marks.Thus, one can see how marks of the points, located at a distance r from each other, interact in comparison to the average behavior of marks (Penttinen and Stoyan, 1989;Baddeley, 2010).Indeed, the explicit form of these mark characteristics relies on the choice of test functions, which often account for the spatial variation of marks as well as their pairwise relationships.Within the literature, denoting the average and the variance of all marks as µ m and σ 2 m , different test functions and normalizing factors are proposed, which are given in Table 1.We note that Stoyan and Stoyan (1994) proposed the so-called nearest-neighbor indexes, which are similarly constructed through different test functions but only accounting for the marks at a point x P X and its nearest neighbor, rather than considering all potential pairs of points within a distance r from x.
Table 1: Common test functions for point processes with real-valued marks.For Schlather's I, µ m prq is the mean of the marks for all points located at a distance r from x, and for the mark differentiation function c ∆ t f " 1 ´p2{npn ´1qq where mpx piq q is the i-th increasingly-ordered mark, R 1 " 0 and R i " ř i´1 j"1 mpx pjq q, for 2 ď i ď n.
The mark variogram γ mm prq (Cressie, 1993;Wälder and Stoyan, 1996;Stoyan and Wälder, 2000) measures the local half-squared differences among marks of a pair of points while Stoyan's mark correlation function κ mm prq (Stoyan and Stoyan, 1994) focuses on the mean of the local product of such marks.In other terms, the objective of the mark variogram γ mm prq is to assess the pairwise differences between the marks associated with a pair of points separated by a distance r, which can provide valuable insights into the spatial relationship and variability of marks within a specified distance range.Concerning Stoyan's mark correlation function κ mm prq, the pairwise product is expected to coincide with the global squared mean of the marks µ 2 m under the independence assumption such that κ mm prq " 1.If nearby points have smaller (larger) marks, their average product will also be small (large) and deviate from 1.We note that Stoyan's mark covariance function Cov mm prq (Stoyan, 1984) is indeed a linear transformation of his mark correlation function such that both Cov mm prq and κ mm prq essentially convey the same message (Schlather, 2001).Regarding the use of r-mark correlation functions, κ m‚ prq and κ ‚m prq, both reflect the average of the mark for either of the two considered points with respect to a distance r.Except for mark independence, the averaged r-mark correlation will not be equal to the overall mark mean since either the mark of the first or second point is chosen.Shedding some light on Beisbart's and Kerscher's version of κ mm prq, to have a large contribution in ( 12), only one of the points is required to have a large mark.However, under mark Independence, this mark correlation function coincides with twice the mark mean yielding κ Bei mm prq " 1.In contrast to κ mm prq and κ Bei mm prq, Isham's function κ Ish mm prq reveals a Pearson-type correlation for marks.Schlather's function I mm prq gives insight into how marks, for a pair of points with an interpoint distance r, are related to each other (Schlather et al., 2004).Basically, this function centers each mark by the conditional mean mark µ m prq, i.e., the r-mark correlation function, and normalizes the product by the mark variance σ 2 m .Shimanti's function I mm prq is constructed similarly but uses the global mark mean instead of the conditional mark mean (Shimatani, 2002).Finally, the mark differentiation function ∆ mm prq shows how the ratio of marks varies with respect to a distance r.In the case of mark independence, the minimum and the maximum of marks are assumed to be, on average, similar such that the proportion becomes 1, i.e., ∆ mm prq " 0 (Pommerening et al., 2011;Hui and Pommerening, 2014).
Aiming at defining a K-function for stationary point processes with real-valued marks and a constant intensity λ, which accounts for the correlation between points as well as between marks, Penttinen et al. (1992) proposed a mark-weighted K-function as  A further summary statistics for a stationary point process X, with an intensity function λ and the pair correlation ρp¨q, is where |dx|, |dy| are sizes of two infinitesimal small areas around x and y separated by a distance r.In a similar manner as K t f prq in (13), U p¨q considers the correlation between points and between marks.Since under complete spatial randomness ρp¨q " 1, and for independent marks κ t f p¨q " 1, thus, in the combination of those cases, U p¨q reduces to the second-order product density of X (Capobianco and Renshaw, 1998;Renshaw, 2002).
If at least one of these two does not vanish, then U p¨q is a version of the second-order product density of X weighted by the spatial interaction between the points via ρp¨q and/or association between the marks via κ t f p¨q.We note that further summary characteristics, considered within the literature, include the (non)cumulative density correlation functions (Fedriani et al., 2015).
Turning to the case where each data point has two distinct marks m i p¨q, i " 1, 2, by considering a bivariate test function t f pm 1 pxq, m 2 pyqq " m 1 pxqm 2 pyq, x, y P X, Stoyan (1987) proposed a bivariate mark correlation function with a normalizing constant µ 1 µ 2 where µ i , i " 1, 2, is the mean of mark m i p¨q, i " 1, 2; see also Raventós et al. (2011); Wiegand and Moloney (2013).Note that such an idea can be used to construct other bivariate and multivariate (nearest-neighbor-based) test functions (Pommerening and Grabarnik, 2019;Eckardt et al., 2023).In addition, the cross/dot-type pair correlation function and the bivariate/multivariate mark summary characteristics can also be used to derive cross/dot-type and multivariate versions of U prq (Eckardt and Mateu, 2019a;Eckardt et al., 2021).Such extended versions not only help to investigate the interrelations between different types of points but also allow to understand the association between different real-valued quantities over space.

Frequency domain approaches
Next, we discuss characterizations of marked spatial point processes through frequency domain methods, all of which could be translated into their corresponding distance-based characteristics, e.g.U p¨q, given in ( 14), by using the inverse Fourier theorem.This close relationship of the frequency and distance-based approaches allows to derive planar partial mark characteristics which reflect the interrelation between two components X i , X j , i, j " 1, . . ., k, conditional on all the remaining types.

Frequency domain approaches for discrete and integer-valued marks
In contrast to previous marked summary characteristics, frequency domain characteristics for marked spatial point processes have received limited attention.Bartlett's complete covariance density function (Bartlett, 1964) is given as K ij pu, vq " λ i puqδ ij pu ´vq `ϑij pu, vq, i, j " 1, . . ., k, where ϑ ij p¨q is a covariance density function (Mugglestone and Renshaw, 1996, Equation 2) and δ ij p¨q is a two-dimensional Dirac delta-function.Similar to the covariance density function, K ij pu, vq may be used to describe the cross second-order behavior of points within the corresponding space.However, unlike the covariance function, it controls for multiple coincident points via δ ij .For the components X i , X j , i, j " 1, . . ., k, and at frequencies ω " pω 1 , ω 2 q, using a discrete Fourier transform representation Mugglestone and Renshaw (1996) introduced the cross-spectral density function f ij pωq as where r " dpu, vq, u, v P R 2 , ω J is the transpose of ω, and ı " ?´1.Note that, in this case, K ij p¨q only depends on distances between points.Two components are said to be independent if the cross-spectral density function equals zero for all frequencies.We note that by applying Theorem 8.3.1 of Brillinger (1981), the above spectrum could be translated into a partial version f ij|Xzti,ju pωq, i.e. computing the Schur complement of the cross-spectral densities (Eckardt and Mateu, 2019a).
The cross-spectral function f ij p¨q further leads to the definition of some interesting spectral functions, including the spectral coherence function R 2 ij pωq " f 2 ij pωq{ pf ii pωqf jj pωqq, the cross-amplitude spectrum ζ ij pωq " mod tf ij pωqu, the cross-phase spectrum ℘ ij pωq " argpf ij pωqq, the gain function G i|j pωq " a pf ij pωqR ij pωqq{f i pωq, and also their correspond-ing partial versions (Mugglestone and Renshaw, 1996;Eckardt and Mateu, 2019b).To offer some insight into these functions, the cross-phase spectrum ℘pωq measures the similarity between two point patterns to see if the spectrum of one pattern is a linear shift of that of the other one, while the amplitude spectrum ζ ij pωq encodes the relative magnitude of frequencies for two point patterns.Thus, both functions are useful tools to investigate the characteristics of the empirical spectrum with respect to the frequencies ω.The gain spectrum G i|j pωq, in contrast, can be interpreted as a regression coefficient in a linear regression at frequency ω, for two corresponding components.Additionally, one could make directional inferences by transforming the spectral density functions into the polar form; see e.g.Renshaw andFord (1983, 1984).

Frequency domain approaches for real-valued marks
The accessibility of frequency domain approaches for real-valued marks is more constrained than that of discrete marks.The auto/cross-spectral density functions for real-valued marks, at frequencies ω " pω 1 , ω 2 q, can be obtained from a discrete Fourier transformation of the corresponding U prq functions, as where U ii prq and U ij prq are the auto/cross-type versions of ( 14), respectively; see Renshaw (2002); Eckardt and Mateu (2019a) for more details.Analogous to the multivariate spatial point processes, the marked spectra can be transformed into partial versions using the results of Theorem 8.3.1 of Brillinger (1981).
As of some recent development, Eckardt and Mateu (2019a), using a re-scaled version of the partial spectral coherence, defined a spatial dependence graph model for multivariate point processes with, potentially, real-valued marks, according to which different components are represented as nodes and the conditional independence structure among the k components is reflected by missing edges.

Object-valued marks
We now discuss mark summary characteristics for an observed point pattern x " tx i , opx i qu n i"1 in which each point x i is augmented by a non-scalar quantity o i living on some suitable mark space M, which depends on the specificity of marks in question.Apart from the functionvalued mark setting, where M is the Hilbert/L 2 space (Comas et al., 2008(Comas et al., , 2011(Comas et al., , 2013;;Ghorbani et al., 2021;Eckardt et al., 2023), this newly introduced class of marked spatial point processes also includes the cases where marks are constrained arrays or inherently structured quantities (Eckardt et al., 2023).
Focusing explicitly on function-valued mark scenarios, where for each point x i the corresponding mark opx i q is a function-valued quantity gpx i q on FpT q with T " pa, bq, ´8 ď a ď b ď 8 and gpx i q : T Ď R Þ Ñ R, different mark summary characteristics, similarly to Section 3.2, can be defined through an extended test function t f : F ˆF Ñ R `. Starting with a pointwise specification, Eckardt et al. (2023) proposed a generalized version of the t f -correlation function in (12), given as where m x ptq is the mark of x at t P T , and c t f ptq is, for a fixed t P T , a pointwise normalizing constant corresponding to the expectation of the test function when r tends to infinity.Denoting the mean and variance of all marks at t P T by µ g ptq and σ 2 g ptq, a summary of the extended test functions is presented in Then, any such global characteristic allows, in a L 2 sense, for a similar interpretation as discussed for Table 1.In other words, the global mark variogram γ gg prq gives the average variability over distinct pairs of functions for two points with an interpoint distance r.Under mark independence, the average differences among the curves are expected to coincide with the functional variance such that γ gg prq " 1.Likewise, κ gg prq assesses the product of any two function-valued quantities with respect to the interpoint distance of points x, y P X, where κ gg prq " 1 for independent marks.Unlike κ gg prq, both r-mark correlation functions κ g‚ prq and κ ‚g prq concern the average of the function-valued point attribute of either the first or the second point subject to a distance r.This usually differs from µ g except under mark independence settings.Similarly to Beisbart and Kerscher's original version, κ Bei gg prq reflects the average pairwise sum, i.e. the perturbation, of any two functions for a pair of points at a distance r from each other, which is expected to coincide with twice the functional mean µ g if marks are independent.If two functions of two nearby locations vary strongly from the functional average, the sum of both would also be, on average, different from the expected case leading to κ Bei gg prq ‰ 1. Lastly, in both function-valued versions of Schlather's and Shimanti's I-functions, marks are centered by the conditional and unconditional functional means squared, respectively, and scaled by the functional variance.We note that the same ideas can also be applied to the mark-weighted K-function K t f p¨q, given in ( 13), and the function U prq, given in ( 14), leading to K t f ptq " ş K t f pr, tqdt and U prq " ş U pr, tqdt.
The extensions of the above summary characteristics to multivariate spatial point processes, including cross-type, cross-function and cross-type cross-function mark summary characteristics are discussed by Eckardt et al. (2023).While the cross-type characteristics reveal the interrelation of mark gp¨q for points x P X i and y P X j , the cross-function versions are defined for function-valued marks g 1 p¨q ‰ g 2 p¨q on F 2 at locations x, y P X.
These ideas are also extended to multi-function versions for the sets tx i , mpx i qu n i"1 , with mpx i q P F s , s ą 2, see Eckardt et al. (2023) for details.

Marked spatial point processes on linear networks
In certain point process applications, the spatial distribution of events may become confined within an underlying structure.Instances of these applications include the study of street crimes, traffic accidents, trees positioned alongside roads, and various other scenarios where spatial locations are restricted to a linear network.
A linear network L Ă R 2 is considered as a finite union of some line segments, i.e., L " Ť k i"1 l i where l i " ru i , v i s " ttu i `p1 ´tqv i : 0 ď t ď 1u, u i ‰ v i P R 2 .The linear network L is equipped with a regular distance metric d L , with the shortest-path distance being an example of it (Rakshit et al., 2017).Moreover, no over/under-pass is assumed within L, meaning that each intersection of segments is a node.The total length of L is denoted by |L| " We denote a marked point process on L by X L for which an observed point pattern is denoted by x L , and similarly to (1), if we let XL be the unmarked version of X L , then where λ L p¨q is the intensity function of XL , and d 1 stands for integration with respect to arc length on the network.In this case, the intensity function λ L puq gives the expected number of points per unit length of the network in the vicinity of a location u P L (Baddeley et al., 2021).Similarly to ( 2) and ( 3), for any non-negative measurable function f p¨q on the product and where λ pmq L p¨q is the m-order product intensity of X L , ř D 1 ,...,D j ranges over all partitions tD 1 , . . ., D j u of t1, . . ., mu into j non-empty and disjoint sets, and N pD j q is the cardinality of the index set D j (Cronie et al., 2020).In order to propose marked summary characteristics for point processes on linear networks, similarly to Section 3, we need some form of stationarity.
However, addressing this challenge has proven to be quite complex (Baddeley et al., 2017), as, currently, there is no transformation that can transform points on a linear network with the guarantee that the shifted point will remain on the same network.Cronie et al. (2020) proposed different notions of stationarity on linear networks, according to which whenever the product intensities λ pmq L p¨q, 1 ď m ď k, exist, λL " inf uPL λ L puq ą 0, and for any m P t2, . . ., ku the correlation function g L m p¨q satisfies for any u P L and some function ḡL m : r0, 8q m Ñ r0, 8q, they said that XL is k-th order intensity reweighted pseudostationary (with respect to a regular distance d L ); XL is called intensity reweighted moment pseudostationary (IRMPS) when this holds for any order k ě 2.
Moreover, a homogeneous point process XL , which is also k-th order intensity reweighted pseudostationary, is called k-th order pseudostationary.Finally, a moment pseudostationary point process XL is considered (strongly) pseudostationary if its moments completely and uniquely characterize its distribution.

Discrete and integer-valued marks
We now consider the case where X L can be decomposed into 1 ă k ď n components according to a discrete/integer-valued mark.In this case, X L is called a multivariate/multitype spatial point process on a linear network L, and x L is denoted as a collection of k distinct point patterns tx L 1 , . . ., x L k u, k ě 2. Marked summary characteristics for multivariate/multitype point processes on linear networks can be naturally proposed by mimicking the underlying principles presented in Section 3.1.However, despite the growing availability of multivariate/multitype spatial network-constrained point processes, there exist only a few of such extensions (Spooner et al., 2004;Baddeley et al., 2014;Eckardt andMateu, 2018, 2021).
In particular, most attention is paid to the pair correlation function and the K-function, marked point process X L , we have and where ∇p¨q acts as an edge correction factor, and λ L j p¨q is the intensity function of the jth component.If the employed metric is the shortest-path distance, then ∇tu, d L pu, xqu becomes the reciprocal of the number of points lying exactly d L pu, xq units from u P X i .In a similar manner, one could obtain the cross/dot-type versions of the inhomogeneous pair correlation function.

Mark connection and mingling function
For homogeneous point processes on linear networks, Baddeley et al. (2014) presented the network-based versions of the mark connection and mark equality functions as where ρ L ij p¨q is the cross pair correlation function for the processes with types i, j and ρ L p¨q is the pair correlation of the full process.The mark connection function p L ij pr L q is, given the presence of a pair of points separated by a distance r L , intuitively the conditional probability that those points are of types i, j.Similarly, p L pr L q is the conditional probability that those two points, separated by a distance r L , have the same type.
Next, we propose the counterpart of the normalized mingling function (11) for homoge-neous point processes on linear networks as where c is the same normalizing factor as in (11).Similarly, ν L p¨q may be used to discover aggregation/repulsion between points of different types subject to an interpoint distance r L .Note that, unlike the cross/dot-type summary characteristics, the mingling functions ( 11) and ( 23) quantify the aggregation and/or repulsion tendencies between any given types, rather than focusing on the interaction between two specific types or comparing a single type against all other types.

Higher-order marked inhomogeneous summary characteristics
Next, for IRMPS point processes, we extend the inhomogeneous higher-order summary characteristics of Cronie et al. (2020) to multivariate/multitype settings.In particular, we have the following cross-type summary characteristics and the dot-type ones where F L,inhom pu, r L q and F L,inhom j pu, r L q are the inhomogeneous empty space functions of X L and X L j , respectively.For any two fixed types i, j, J L,inhom ij p¨q " 1 means points of type j are randomly distributed around points of type i, J L,inhom ij p¨q ą 1 means that points of type j tend to happen around points of type i, and J L,inhom ij p¨q ă 1 indicates that points of type j prefer to maintain a distance from points of type i.Turning to the dot-type J-function, if J L,inhom i‚p¨q " 1, it implies that points of different types than i exhibit no interaction with points of type i.When J L,inhom i‚p¨q ą 1, it signifies a propensity for other types to appear in proximity to points of type i.Conversely, J L,inhom i‚ p¨q ă 1 suggests a preference for other types to hold a distance from points of type i.Note that these higher-order marked summary characteristics are expected to provide a deeper understanding of the spatial patterns and clustering/repulsion behavior among points of different types since they go beyond pairwise interactions providing information about interactions involving more than two points.

Real-valued marks
We now turn to a situation where points are labeled by real-valued marks, taking the presented methods in Section 3.2, and present novel mark summary characteristics that take the geometry of the underlying network into account.

Mark correlation functions and mark-weighted summary characteristics
Generalizing the t f -correlation functions in (12) to the present setting, the t L f -correlation function for point processes on linear networks is of the form where c t L f is a normalizing factor; the superscript L emphasizes that the t f -correlation function will be evaluated over the linear network L. The numerator in ( 29) is the conditional expectation of a given test function evaluated over a pair of marks whose corresponding spatial locations are r L distances away, given that the two points belong to the point process X L .The denominator c t L f is the expected value of the considered test function under the mark independence assumption.Applying the same principles as in Section 3.2, the specific linear-network-based mark characteristic depends on the explicit formulation of the test function.A summary of potential test functions for point processes on linear networks and their notations is given in Table 3.
Table 3: Test functions for point processes on linear networks with real-valued marks.The average and variance of all marks are denoted by µ m,L and σ 2 m,L , and µ m,L pr L q is the conditional mean of the marks for points with an interpoint distance r L .

Name of the function
Stoyan's covariance function Cov L mm pr L q mpxqmpyq ´pµ m,L q 2 1 Schlather's I I L mm pr L q pmpxq ´µm,L pr L qqpmpyq ´µm,L pr L qq σ 2 m,L Shimanti's I I Shi,L mm pr L q pmpxq ´µm,L qpmpyq ´µm,L q σ 2 m,L Taking the geometry of linear networks into account and employing a regular metric (Rakshit et al., 2017), the mark variogram γ L mm pr L q quantifies the variability of marks for pairs of points with an interpoint distance r L .Note that, in contrast to the classic mark variogram γ mm prq, given in Table 1, which disregards the underlying geometry of the linear network L and, thus, may yield biased outcomes and potentially mislead interpretations (see Section 4.2.2), the geometrically corrected mark variogram γ L mm pr L q takes the specific properties of the linear network into account and provide more accurate outcomes.The large/small local differences, i.e. high/low local variation, between marks give rise to large/small values for the mark variogram function, deviating from the mark independence assumption.Similarly, the geometrically corrected version of Stoyan's mark correlation κ L mm pr L q quantifies the average pairwise product of marks for points separated by a distance r L along the linear network L. If marks are independent, it coincides with the mark mean squared µ 2 m,L providing that κ L mm pr L q " 1.If marks for nearby points on a linear network differ from the expected mark under independence, their average would also be different from µ 2 m,L and thus κ L mm pr L q ‰ 1.
Similarly, the linear network r L -mark correlation functions would only coincide with the mark mean if the marks are independent subject to a distance r L .Consequently, any deviations of r L -mark correlation functions from one indicate the existence of a structure in the distribution of marks.The geometrically corrected version of Beisbart and Kerscher's mark correlation κ Bei,L mm pr L q expects that the average pairwise sum of marks is expected to be equal to twice the mark mean under independence.Thus, deviations of κ Bei,L mm from unity highlight local dependencies and the existence of a particular structure among marks.In a similar manner, one may describe other mark characteristics presented in Table 3.
In addition to the above test functions, we further, for a homogeneous point process X L with constant intensity λ L , extend the mark-weighted K-functions K t f prq, given in ( 13), to the linear network settings, and propose its counterpart as for inhomogeneous point processes.Implementing the test function t L f p¨q as a weight in the geometrically corrected K-function for spatial point processes on linear networks as in ( 30) and ( 31), gives rise to a scaled version of the original K-function.In particular, since the implemented weight becomes one if marks are independent, the up/down-scaled version of the original K-function highlights some interaction between marks.Under the assumption that points are generated from a Poisson process with independent marks, the above mark-weighted K-functions possess a known value denoted by r L .However, due to the presence of two sources for re-scaling the K-function, namely pairwise interaction between points and between marks, it is advisable to compute both the weighted and unweighted versions for a more comprehensive understanding.Note that the weighting idea is applicable to other summary characteristics such as the inhomogeneous empty space function, the inhomogeneous nearest neighbor distance distribution function, and the inhomogeneous Jfunction.
Next, for a homogeneous point process X L , on a linear network L, with the pair correlation ρ L p¨q, we propose an extension of ( 14) as where |d 1 x| and |d 1 y| are the sizes of two infinitesimal segments around the points x, y P L; similar interpretation, as the planar case, holds for U L pr L q.

Numerical evaluation
This section is devoted to evaluating the performance of the proposed linear-network-based mark characteristics and investigating potential variations between the classic planar mark correlation functions and those that take the underlying linear network into account.In terms of the spatial distribution of points, following the assumptions concerning the definition of mark correlation functions, we consider homogeneous point patterns, with 100 uniformly distributed points, on a dendrite network, which is previously used in Jammalamadaka et al.From each model, we simulate 199 patterns, and then, for each simulated point pattern, we calculate the Stoyan's mark correlation functions κ mm prq and κ L mm pr L q; we let d L be the shortest-path distance.We also let r L " 1.25r (Rakshit et al., 2019), leading to κ mm prq and κ L mm pr L q being evaluated over distances r P r0, 200s and r L P r0, 250s, respectively.
Based on the 199 estimated mark correlation functions, we obtain 95% pointwise envelopes to compare the general behaviors of κ mm prq and κ L mm pr L q when the data are generated on a dendrite network.Figure 4 shows the estimated Stoyan's mark correlation functions for all models together with their 95% pointwise envelopes and average behavior.Looking at the top row of Figure 4, showing the results for the model I, while the structure of the marks is not revealed by Stoyan's mark correlation function κ mm prq (left panel), indicating mark independence, our proposed linear-network-based Stoyan's mark correlation functions κ L mm pr L q (central panel) clearly reveals a positive mark correlation for shortest path distances up to around 175 units.For larger distances, the mark correlation becomes less than one, which means that the average product of marks for points that are apart by a distance higher than 175 units is smaller than the squared mark mean value.To better compare the average behavior of κ mm prq and κ L mm pr L q, the averaged mark correlation functions are displayed in the right panel.Similarly, model II (middle row) and model III (bottom row) show apparent dissimilarities between the planar and the linear network mark correlation functions.Looking at the middle row and the presented results for model II, one can see a similar tendency of having a high mark correlation for smaller distances, which turns out to be lower than the correlation value under mark independence for larger distances.However, the clear difference between the two is that κ mm prq turns from positive to negative quickly, around r " 100, whereas κ L mm pr L q maintain values higher than the expected value under mark independence up to r L " 200.Note that, as the shortest path distances from the points at the border to the central point on the dendrite network increase in value, central points are surrounded mainly by higher mark values.In contrast, very small marks appeared only very close to the dendrite's border.Even though both κ mm prq and κ L mm pr L q may show a positive correlation for smaller distances and a negative correlation for larger distances, their main difference is in the degree of positiveness/negativeness and further on the turning point from positive to negative.This impression also holds for the obtained results for model III displayed in the bottom row.While the κ L mm pr L q clearly show the structure of the marks for small distances, increasing until r L " 50 and then decreasing, κ mm prq entirely show a decreasing trend from small to large distances.Note also the difference between the turning points.
Upon further comparison between κ mm prq and κ L mm pr L q, we proceed by selecting a single realization from each model.We then compare the Stoyan's mark correlation function of these realizations to a 95% pointwise envelope under mark independence.This envelope is constructed using 199 point patterns with randomly allocated marks derived from the original pattern.Looking at Figure 5, a clear difference between the envelopes for κ mm prq and κ L mm pr L q is that κ mm prq shows a quite higher variation for larger distances compared

Real data analysis
We now study the mark structure for two real data presented in Section 2. For the case of butterflies in Melbourne, Australia, we further let the data points be labeled by the intensities so that we can make use of both cross-type and mark correlation functions in our analysis.Intensities, separately for each type, are estimated via the non-parametric Jones-Diggle kernel-based estimators (Jones, 1993) employing Scott's rule of thumb for bandwidth selection (Scott, 2015).In the case of public street trees in Vancouver, Canada, we only make use of our proposed version of Stoyan's mark correlation function for point processes on linear networks and make an additional comparison with its counterpart for point processes in R 2 in the case of real datasets.We point out that both real datasets exhibit inhomogeneity, which is expected when dealing with real phenomena.However, we believe mark correlation functions can serve as good indicators for discovering interactions between marks, although they are essentially developed for stationary and homogeneous point processes.
5.1 Butterflies in Melbourne, Australia  for each type of butterfly by randomly allocating marks.Then, we compare Stoyan's mark correlation function κ mm prq for each type of butterfly with 95% pointwise envelopes obtained from the 199 patterns under mark independence.We can see that, for distances smaller than be the fact that, for small interpoint distances, mark correlation functions for all species have larger values than the expected value under mark independence, except type Populus, for which mark correlation functions stay below the corresponding envelopes for very small interpoint distances.Note that this means for a pair of trees of the type Populus with very small interpoint distances, the dbh value for at least one of them is pretty small.

Discussion
Methods for analyzing marked spatial point processes, when each point is augmented by some auxiliary quantity, have recently witnessed impressive developments.Apart from some extensions of the current methodological toolboxes to the case of point processes with events on linear networks/graphs (Jammalamadaka et al., 2013;Baddeley et al., 2014), parts of the recent literature proposed generalizations to more challenging scenarios where marks are non-scalar.Extensions to the settings where marks are function-valued were considered, followed by some recent contributions covering functional characteristics, where marks are inherently structured objects including compositions, densities, and graphs (Eckardt et al., 2023).In this paper, we have restricted our focus to general marked summary characteristics sphere.As a final limitation, it would also be interesting to develop methodologies and test functions that can incorporate negative marks.

Figure 2
Figure 2 presents the spatial distribution of public trees, excluding park trees, located along the street network of Vancouver, Canada, in 2016.Attributes such as tree species and their corresponding diameter at breast height (dbh) in inches are attached to each location.The original data is sourced from the open data portal 1 of Vancouver, Canada, and consists of 136,574 places of public trees categorized into 282 different species.However, we only consider five species Aquifolium, Arnold, Bignonioides, Involucrata, and Populus, which together include 1,045 trees with a dbh varying between 2 to 94 inches.The street network comprises 49,928 vertices and 55,221 segments and spans a total length of 1,779.547km.

Figure 2 :
Figure 2: Public street trees in Vancouver, Canada, in 2016.Left: different shapes show species of trees; right: colors show diameter at breast height in inches.

K
mm p¨q, by taking the pairwise product of marks as weight into the K-function, the estimated curves become scaled versions of the original K-function, except under mark independence.More specifically, for pairwise positively correlated marks, i.e. if the pairwise product of marks exceeds the overall expected value, the empirical curves are up-scaled versions of K-function.Otherwise, they are down-scaled.To better understand the impact of marks, it is advisable to compare the original K-functions with their mark-weighted counterparts.Further, normalizing K t f prq by its unmarked counterpart, i.e. the Ripley's K-function, one can obtain a cumulative mark-correlation function corresponding to the mean value of the employed test function for a pair of points with an interpoint distance r(Wiegand and Moloney, 2013).We note that D'Angelo et al. (2023)  recently discussed local mark-weighted cumulative summary statistics, which account for the contribution of each marked point to the global second-order summary characteristics.

(
2013);Baddeley et al. (2014).Next, to convert the simulated point patterns into marked point patterns, we generate real-valued quantities according to three distinct scenarios, called models I, II, and III, and assign the simulated marks to each point location.In model I, marks follow the function f px, yq " px `yq{5000, having a trend from bottom-left to topright of the network; in model II, marks are the shortest-path distance from each point to the dendrite's border, and in model III marks are the number of points which have a distance less than 80 to a target point, i.e., the number of nearest neighbor points.Figure3shows examples of the considered models.Note that within all three considered models, there is a clear structure for the marks on the dendrite network.

Figure 3 :
Figure 3: Examples of simulated data.From left to right: Models I, II, and III.Colors show mark values.

Figure 4 :
Figure 4: 95% pointwise envelopes for Stoyan's mark correlation functions based on 199 simulated patterns from Models I, II, and III.From left to right: κ mm prq, κ L mm pr L q, and their average.From top to bottom: Model I, Model II, and Model III.

Figure 5 :
Figure 5: 95% envelopes for Stoyan's mark correlation functions, under mark independence, based on 199 simulated patterns.The solid curves show Stoyan's mark correlation functions for one point pattern per each model I, II, and III (left to right).

Figure 6
Figure6shows the cross-type summary characteristics and Stoyan's mark correlation function for butterflies in Melbourne, Australia.In the cross-type inhomogeneous K-function case, we only use its estimator with an isotropic edge correction.For small distances (r ď 200), the cross-type inhomogeneous K-function moves around its theoretical value for Poisson processes, meaning that the butterflies of type Cabbage White are randomly distributed around butterflies of type Little Blue.However, for larger distances, the cross-type inhomogeneous K-function lies below its theoretical value for Poisson processes, indicating that, at those distances, the number of butterflies of type Cabbage White around butterflies of type Little Blue is smaller than that of complete spatial randomness.A similar message is conveyed by the cross-type inhomogeneous J-function.

Figure 6 :
Figure 6: Mark summary characteristics for butterfly data in Melbourne, Australia.From left to right, cross-type inhomogeneous K-function, cross-type inhomogeneous J-function, Stoyan's mark correlation function for the Cabbage White type, and Stoyan's mark correlation function for the Little Blue type.

Figure 7 :
Figure 7: Stoyan's mark correlation function for the five species of trees alongside the street network of Vancouver, Canada; the real-valued mark is dbh.From left to right: Arnold, Populus, Involucrata, Aquifolium, Bignonioides.The top row considers the network; the bottom row ignores it.
Penttinen et al. (1992)ˇˇu P X f is the expected value of t f p¨q under mark independence for which K t f prq reduces to the so-called Ripley's K-function(Baddeley et al., 2015, Chapter 15); (13) can be extended to inhomogeneous cases.Penttinen et al. (1992)initially considered the Stoyan's mark correlation function in (13), i.e., t f p¨q " κ mm p¨q, and commonly denoted K t f p¨q as K mm p¨q, however, one may use any of the test functions presented in Table1.Note that, in the case of

Table 2
x ptqg y ptq ˇˇx, y P X ‰ needs to divided by µ g ptq(Eckardt et al., 2023).While all the test functions presented in Table2, with respect to the argument t, have

Table 2 :
Pointwise test functions for point processes with function-valued marks.
similar interpretations as those in Table1, they do not convey any information on the overall pairwise interrelation between the function-valued quantities under study.The desired global mark characteristics, however, can be constructed from their pointwise versions by the integration of the normalized expectation of t f pg x ptq, g y ptqq over T .In this respect, the pointwise mark variogram and mark correlation functions translate into global versions γ gg prq " T E " g x ptqg y ptq ˇˇx, y P X ı dt ": ż T κ gg pr, tqdt.