1 Introduction

In agricultural, ecological or environmental sampling, population units at a geographical location are usually arranged in a number of quadrats that naturally or by design fall into a grid structure. In this grid structure, a subset of grid cells is selected under a plan produced on the basis of well-specified sampling methodology. The sampling methodology signifies the underlying probability structure (or sampling variation) of samples generated from the population with certain selection probabilities. These selection probabilities make it possible to draw inferences about overall population units in a finite population setting.

The choice of sampling methodology depends on the objective of the study, the nature of the underlying population, prior knowledge of the sampling region and spatial features of the grid structure. If the sampling region has some sort of spatial pattern, a sampling design that provides a good spatial coverage over the region may provide more information than a sampling plan providing a poor coverage (Bhave and Sadhwani 2021). A spatial pattern in a region may be a function of environmental factors, or chemical, physical and microbiological patterns of the soil map. Often, prior information is available for soil structure from previous experiments, yield maps and soil analysis of the sampling region. On a small scale, variation can also arise from a design difference within agricultural equipment—for example, variation caused by the tine geometry of a seeder. Environmental factors can be assessed by ground elevation, wind pattern and vegetation density of the sampling region or surrounding areas. This paper uses all available prior knowledge on grid structure to construct a sampling design to produce a better spatial pattern coverage for a population under study.

The prior knowledge of the sampling region could be subjective, such as in the form of visual inspection of the sampling region, or could be in the form of an auxiliary variable highly correlated with the variable of interest. The auxiliary variable may be obtained by flying a drone equipped with image processing software over the sampling region. The images can be processed to obtain a rough predictor of the variable of interest on each grid cell over the sampling region. Readers are referred to Kasprzak (2021) for further information regarding image processing in rectangular grids. Prior information regardless of its origin is used to determine relative position (rank) of within-row and/or within-column sampling units prior to selection of samples. For example, ranks, \(1,2,\ldots , p\), of p quadrats in a row allow within-row units to be stratified based on their relative positions (not based on their actual unknown values ) with respect to each other. Ranks in rows and columns then lead to a two-way stratification (row–column structure) of the sampling region. We introduce a sampling method to provide a better spatial coverage than a simple random sampling method over this two-way stratification. In this paper, sampling (rank) coverage, representative sample or spatial pattern coverage are used interchangeably to describe a representative sample that consists of observations across the ranks of the auxiliary ranking variable.

To highlight the significance of the use of relative position information, we consider a sugarcane disease caused by the fungus Pachymetra chaunorhiza (Chau). This disease is associated with yield loss up to 40% in Australia. The fungus invades individual roots and breaks down the internal root tissues. The variable of interest (y) is the amount of disease (log-DNA copies/g sample) that sugarcane plants have in each grid cell. The sampling region consists of 49 grid cells organized into 7 by 7 square plots of sugarcane plants. The values of y are presented in panel (b) of Table 1. The population mean and standard deviation are 15.87 and 15.56 log-DNA copies/g, respectively. In this population, we consider the estimation of the population mean. Before selecting the sample units, we create a two-way stratification based on the relative position (rank) of within-row and within-column units of the square. Ranks can be obtained from a proper ranking procedure that does not require actual measurement of the units, such as brightness of leaf colors, which can be obtained by flying a drone equipped with image processing software. The quadrats in rows and columns then can be ranked independently using the brightness or color intensity of the leaves. Since the row and column rankings are independent, the ranking in the entire square may not have a joint perfect ranking even if the rankings in rows and columns are perfect. On the other hand, it provides a reasonable stratification in both row and column spaces. Once the two-way stratification based on ranks is completed, we select a better representative sample than a Latin square sample by exploiting the ranking structure of the quadrats in the sampling region. The detail of the sampling procedure is provided in Sect. 2.

The row–column structure is similar to a Latin square sampling design, where the population units are organized in a p by p square of grid cells based on two stratification criteria, row and column stratification. A Latin square sample consists of the selection of p cells such that exactly one unit from each of the square’s rows and columns is selected. To our knowledge, Vickery et al. (1949) is the first publication on Latin square sampling, where the average content of a mineral in plant leaves is estimated in a controlled environment. Since then there have been many contributions on the topic. Further details and relevant references about Latin square samples and related sampling designs can be found in Borkowski (2003), Cochran (1972), Kish (1965), Munholland and Borkowski (1996), Raj (1968) and Yates (1960).

The sampling design in this paper differs from existing designs in the cited references. It has potential benefits for constructing a better representative sample by exploring the heterogeneity of the sampling region. The new design, unlike the Latin square sampling which relies only on the grid structure of the region, uses both grid structure and the row and column ranks of the quadrats. Since ranks of quadrats are more closely related to the values of the variable than the positions of the quadrats in a Latin square, they provide a better row–column stratification and lead to a better representative sample.

This paper presents a procedure relevant to the Latin hypercube sampling with ancillary variables by Minasny and McBratney (2006). Their procedure requires a knowledge of probability distributions of ancillary random variables \(X_1\) and \(X_2\), which would be a strong assumption to meet, whereas our procedure relies on relative closeness of units in rows and columns. Hence, our procedure would be less restrictive for practical application.

Use of ranking information in sampling is not new. It has been successfully used in ranked set sampling (RSS) to reduce the sampling cost by stratifying sampling units based on their relative positions in a field experiment (McIntyre 2005). An RSS of set size H and sample size n is constructed in three basic steps. Step 1 selects nH grid cells at random from the population in the sampling region, and partitions them into n sets each of size H. Step 2 ranks the H grid cells in each of those n sets using within-set ranking information and selects cells for the next step. Step 3 measures the actual value of the variable of interest in the h-th ordered grid cell (ranked based on prior information) in the \(n_h\) sets, \(Y_{i[h]}\), \(i=1,\ldots , n_h\), where \( \sum _{h=1}^H n_h=n\), and \(n_h\) is the number of observations from the h-th ranking group. The final set of the n measured cells is called a ranked set sample. Details of RSS design can be found in Deshpande et al. (2006), McIntyre (2005), Murray et al. (2000), Ozturk (2017, 2019), Ozturk and Kravchuk (2021, 2022), Ridout (2003), Takahasi and Wakimoto (1968), Wolfe (2004, 2012) and the references contained therein.

RSS design uses relative closeness in a comparison set of a one dimensional space, but ignores the closeness in row and column structure of a rectangular grid. This paper combines the stratification principle in RSS with the stratification principle in Latin square sampling to create a two-way structure for the sampling region. Stratification of rows uses similarity (or dissimilarity) based on ranks of within-column units (or adjacent units within the column) while stratification of columns uses the relative closeness of within-row units with respect to the variable of interest in the sampling region. The sampling method provides a reasonably good representative sample in both row and column directions of the sampling region. Section 2 introduces the design structure and its properties in a finite population setting. Section 3 constructs an estimator for the variance of the sample mean of the new sampling design. Section 4 provides empirical results to compare the proposed sampling design with its competitors. Section 5 investigates the design effects of the new sampling methodology for three examples from field sampling. Section 6 provides some concluding remarks. All proofs are provided in the appendix.

2 The design structure and its properties

In addition to the fungus population described in the introduction, we provide two more populations to motivate the ranking structure of a sampling region. Munholland and Borkowski (1996) considered estimating the total abundance of trees in the Rocky Mountain area in the USA. The region was subdivided into a square of 64 grid cells, which has a distinct elevation gradient with higher elevations occurring north to south through its center. The abundance of trees appears to be influenced by the change of elevation in the region. The data set is presented in panel (a) of Table 1 for easy reference. We use this data set as our population and estimate the mean of tree abundance in this population. The true values of the mean and standard deviation of abundance of trees are 15.92 and 7.38, respectively.

This population naturally falls into a grid structure. Even though Latin square sampling uses this grid structure to create a better representative sample than a simple random sample, it fails to accommodate the use of available auxiliary information, such as vegetation cover of the sampling region. Total abundance of trees in each grid cell is highly correlated with the vegetation cover. The assessment of vegetation cover may be performed using smartphone based digital technologies in agricultural research (Heinonen and Mattila 2021). If there is no strong decreasing or increasing trend in row and column directions, one can first rank within-row cells and then independently rank the within-column cells using the vegetation cover from the images of aerial photographs. The grid cells are then rearranged based on their row and column ranks to create a two-way stratification based on row and column rank structure. The new sampling methodology selects the sample units from this new ranking structure.

Table 1 Sampling regions for abundance of trees and sugarcane diseases

The third example is again related to sugarcane plants. It involves another disease, caused by the nematode Pratylenchus zeae (Zeae). The nematode causes mechanical damage and discrete light-brown lesions in the root cortex; the lesions become necrotic, darken and coalesce as the nematode population increases. The disease eventually kills the plant or causes a significant amount of yield loss. In this example, the variable of interest is the amount of disease due to nematodes (log-DNA copies/g sample). The population values of y for 49 grid cells are given in panel (c) of Table 1. The mean and standard deviation of this population are 277 and 212.84 log-DNA copies/g, respectively. In this case, bright green leaf colors (or freshness) could be associated with healthy plants and faded green color could be related to the severity of the disease (or large values of y). Flying a drone equipped with an online camera could provide the images of all 49 grid cells. These images lead to a two-way stratification based on row and column ranking similar to the one in the abundance population.

In Fig. 1, we also provide heat map plots of the sampling regions of these three populations to show the spatial patterns in row and column directions. The abundance data shows a strong quadratic trend in rows and relatively strong decreasing trend in columns. The fungus data does not show a particularly strong trend in either direction, but a somewhat increasing trend in columns. The nematode data shows a weak quadratic pattern in columns, and little trend in rows. We show in Sect. 5 that weak spatial patterns in fungus and nematode populations can be improved using one- or two-way stratification using within-row and or within-column ranks.

In these three examples, ranking population units does not require their manual measurement. In general, ranking can be performed based on visual inspection, a yield map, soil analysis of the sampling region or an auxiliary variable obtained by flying a drone equipped with image processing software. Hence, ranking cost will be considered as minimal and have not been included.

Fig. 1
figure 1

The heat maps of tree abundance, fungus and nematode populations

We now provide a general setup for the sampling methodology. We consider a finite population of sampling units \(u_{i,j}, i=1\ldots , rp, j=1,\ldots , p\), arranged in a grid of dimension rp rows and p columns, such that \(N=rp^2\). The variable of interest measured on unit \(u_{i,j}\) is denoted by \(y_{i,j}\). The population values of y will be denoted with \(\varvec{y}=\left\{ y_{1,1},\ldots , y_{rp,p}\right\} \). Prior to sampling, we rank the population units in each row, column or both row and column independently and arrange them based on their rank order as is explained in the three examples above. For example, if we consider ranking within-row units only, we arrange the unit having rank 1 as the first unit in the row, the unit having rank 2 as the next, etc. Rank order of units does not necessarily replace the neighboring units. Rank ordering is constructed using relative closeness based on perceived values of y on the experimental units. The perceived value describes the value of y based on the perception of the ranker using all available ranking information. Hence, units with rank 1 and p could be located next to each other in physical location, but in the two-way grid layout they could be replaced as the first and last in a row. After completing the within-row ranking process and arranging the units by their within-row ranks to form rank columns, if there is no strong increasing or decreasing trend for within-column units, we perform a similar ranking process for within-column units independent of the within-row ranking process and re-arrange this time the units by their within-column ranks to form rank rows. This creates a two-way stratification based on within-row and within-column ranking of the units. We note that row and column rankings are performed independently, Hence, joint ranking may not be in perfect order even if the row and column rankings are perfect.

We superimpose a row–column treatment allocation design on row–column ordered rectangular grid cells. Latin letters are used to label the treatment levels in a row–column design in a design of experiment, but these letters are used here to sample population units rather than for a treatment allocation as in the design of experiment. The basic structure of a row–column sampling design is similar to a row–column design in a design of experiment. To have a better sampling coverage, we require that each letter appears in each row and column exactly one and \(r (r>0)\) times, respectively. The choices of row–column structures of Latin letters will be denoted with \(L_l\), \(l=1, \ldots , N\). In order to account for variation due to different row–column structures, we permute independently the rows and columns of the matrix of Latin letters. For \(p=5\) and \(r=2\), the rectangular grid with a particular row–column structure \((L_1)\) is represented in Table 2. In Table 2, the two 5 by 5 Latin square letters are stacked together for the sake of illustration. This need not be the case in practice since we independently permute the rows and columns of Latin letters.

Table 2 Grid structure of the sampling region and Latin square sampling design \(L_1\)

Sampling units need not be arranged in a rectangular grid. If the sampling frame is arranged in a linear region, such as a riparian region in a river system, a conceptual \(rp \times p\) rectangular grid can be defined with each row corresponding to the runs of p consecutive units and columns representing the ranks of p units in each run. The rectangular grid here corresponds to rp runs along a linear region. In this case, the row–column sampling design not only selects a unit in each run, but also a unit(s) from every rank sequence position.

To simplify the notation, we rewrite the population units in a vector form \(\varvec{v}_i; i=1, \ldots , rp\), where

$$\begin{aligned} \varvec{v}_i^\top =(v_{(i-1)p+1} \ \ldots \ v_{(i-1)p+p})= (u_{i,1}\ \ldots \ u_{i,p}), i=1, \ldots , rp. \end{aligned}$$

and \({ }^\top \) denotes the transpose of the vector. The entire set of population units is represented in \(\varvec{v}^\top =(\varvec{v}_1^\top \ \ldots \ \varvec{v}_{rp}^\top )=(v_1 \ \ldots \ v_{rp^2})\). The vector \(\varvec{v}\) is constructed in such a way that its first p entries are the population units in the first row, the second p entries are the population units in the second row of the ordered grid population, etc.

In this population structure, a row–column sampling procedure with sampling design \(L_l\) involves selecting \(m\ ( 1 < m \le p)\) letters from the collection of p letters at random and identifying corresponding population units as a row–column sample (\(RCS_l\)). For example, with design \(L_1\) in Table 2, we have \(p=5\) , \(m=2\) and \(r=2\). If a particular random selection of letters from \(\{A,B,C,D,E\}\) yields B and D, then our \(RCS_1\) would contain the units

$$\begin{aligned} RCS_1= & {} \{v_2,v_5,v_{6},v_{8}, v_{12},v_{14}, v_{16},v_{19}, v_{23}, v_{25},v_{27},v_{30},v_{31},v_{33}, v_{37},\\&v_{39}, v_{41},v_{44}, v_{48},v_{50}\} . \end{aligned}$$

In general \(RCS_l\) contains rmp observations, rp of which are selected using the first letter, the next rp units are selected using the second letter, etc. For our example in Table 2, when \(p=5\), \(m=2\) and \(r=2\), \(RCS_l\) sample could be defined as follows

$$\begin{aligned} RCS_l= & {} \left\{ \varvec{V}^\top _{1}, \ldots , \varvec{V}^\top _{rp}\right\} , \varvec{V}^\top _{i} = (v_{(i-1)p+1}\ \ldots \ v_{(i-1)p+s_{i,m}})\\&= (u_{i,s_{i,1}} \ \ldots \ u_{i,s_{i,m}}), i=1, \ldots , rp, \end{aligned}$$

where \(s_{i,j}\) is the j-th (\(j=1,\ldots ,m\)) selected sample unit in row i of the row–column grid population. We note that the \(RCS_l\) is a balanced sample since there are m and mr observations from each row and column ranking groups, respectively. The \(RCS_l\) structure helps to explore the two-dimensional space. Ranking between row units explores the heterogeneity among rows while ranking between column units explores the heterogeneity among columns through ranking of experimental units prior to sampling. We do not assume that the ranking process is exact. It could be subjective and in error. The ties in ranking can be resolved at random. Hence, it covers a wide range of sampling designs. For example if \(r=1\) and the ranking process is completely random in both row and column spaces, the \(RCS_l\) sample is equivalent to a simple random sample of size mp. Any improvement in ranking process yields a different design and provides higher design efficiencies. The design efficiency with respect to Latin square sample depends on monotonic trends in row and column spaces and the quality of the ranking process. If the quality of ranking is strong enough to improve the existing trend in row and column spaces, the RCS sample improves the LS sample. Otherwise, it may be less efficient than a Latin square sample.

In this finite population structure, we are interested in estimating the population mean or total. Since N is known, we only consider the estimation of population mean. The population total can be estimated by multiplying the sample mean by N.

Theorem 1 In a row–column sample, for a given row–column structure of Latin letters \(L_l\), inclusion probabilities are given as follows

$$\begin{aligned} \pi _{k}= & {} E(v_k \in RCS_l)=\frac{m}{p}, \quad k=1, \ldots , N ; l=1,\ldots ,N^*_L\\ \pi _{kt}(l)= & {} P((v_k,v_t) \in RCS_l) =\frac{m(m-1)+m(p-m)I_{kt}(l)}{p(p-1)},\\&\quad k \ne t; k,t =1, \dots , N ; l=1,\ldots ,N^*_L, \end{aligned}$$

where \(N^*_L\) is the number of total unique row–column designs, and \(I_{kt}(l)=1\) if \(v_k\) and \(v_t\) share the same letters in \(RCS_l\), and zero otherwise.

We now introduce our estimator of the population mean

$$\begin{aligned} {\bar{y}}_{RCS_l}= \frac{1}{rp^2} \sum _{k=1}^N \frac{Z_k y_k}{\pi _k}, \end{aligned}$$

where \(Z_k=1\) if the unit \(v_k\) is in \(RCS_l\) and zero otherwise. In this formulation, \(Z_k\) is a random variable, and \(y_k\) is a constant. This estimator can be written in a slightly different form

$$\begin{aligned} {\bar{y}}_{RCS_l}= \frac{1}{rp^2} \sum _{j=1}^{rmp} \frac{Y_{s_j}}{\pi _{s_j}}, \end{aligned}$$

Note that \(Y_{s_j}\) is a random variable here since \(s_j\) is random. We note that the estimators for both the \(RCS_l\) and SRS designs are simply the sample means since the inclusion probabilities are always m/p.

For comparison purposes, we also consider a simple random sample (SRS) using the sampling without replacement selection procedure from the same population

$$\begin{aligned} SRS=\left\{ v_{s_1}^*, \ldots , v_{s_{rmp}}^*\right\} . \end{aligned}$$

The marginal and joint inclusion probabilities for SRS are given by \(\pi _k^*=\frac{rmp}{rp^2}\) and \(\pi _{k,t}^*=\frac{(rmp-1)rmp}{(rp^2-1)rp^2} \). An estimator for the population mean using the SRS sample is given by

$$\begin{aligned} {\bar{y}}_{SRS}=\frac{1}{rp^2} \sum _{j=1}^{rmp} \frac{Y_{s_j}^*}{\pi ^*_{s_j}}, \end{aligned}$$

where \(y^*_{s_j}\) is the value of y on unit \(v^*_{s_j}\).

Theorem 2 The estimators \({\bar{y}}_{RCS_l}\) and \({\bar{y}}_{SRS}\) are unbiased for the population mean \(\mu =\sum _{k=1}^N y_k/ N\). The conditional variance of \({\bar{y}}_{RCS_l}\), for a given row–column design \(L_l\), is given by \(\sigma ^2_{RCS_l}\)

$$\begin{aligned} \sigma ^2_{RCS_l}= & {} \frac{p-m}{r^2mp^4} \sum _{k=1}^N y_k^2+\frac{2(p-m)}{r^2mp^4(p-1)}\sum _{k=1}^N \sum _{ k<t}^N\left\{ pI_{kt}(l)-1\right\} y_ky_t. \end{aligned}$$
(1)

The variance of \({\bar{y}}_{SRS}\) is given by

$$\begin{aligned} \sigma ^2_{SRS}= & {} \frac{p-m}{r^2mp^4} \sum _{k=1}^N y_k^2-\frac{2(p-m)}{r^2mp^4(rp^2-1)}\sum _{k=1}^N \sum _{k< t}^Ny_ky_t. \end{aligned}$$

We note that \(\sigma ^2_{RCS_l}\) is a conditional variance for a given design \(L_l\). Since the estimator \({\bar{y}}_{RCS_l}\) is unbiased for \(\tau \) for \(l=1, \ldots , N^*_L\), and each design is equally likely, the unconditional variance of the RCS design is given by

$$\begin{aligned} \sigma ^2_{RCS}=\frac{1}{N^*_L} \sum _{l=1}^{N^*_L} \sigma ^2_{RCS_l}. \end{aligned}$$

We now investigate the conditions under which the RCS estimator is more efficient than a simple random sample estimator. Both sampling designs have the same marginal inclusion probabilities \( (\pi _k=\pi _k^* ),\) but they differ in their joint inclusion probabilities (\(\pi _{kt}(l) \ne \pi _{kt}^*\)). It follows that \(\sigma ^2_{RCS_l} \le \sigma ^2_{SRS}\) if and only if

$$\begin{aligned} (rp^2-1)\sum _{k=1}^N \sum _{k<t}^N\left\{ pI_{kt}(l)-1\right\} y_ky_t \le -(p-1)\sum _{k=1}^N \sum _{k<t}^N y_ky_t . \end{aligned}$$

In the above inequality, we partition the summations with respect to \(I_{kt}(l)=1\) and \(I_{kt}(l)=0\) and organize the terms to write

$$\begin{aligned} p^2r(p-1)\sum _{k=1}^N \sum _{k<t:I_{kt}(l)=1}^N y_ky_t \le p(rp-1)\sum _{k=1}^N \sum _{k<t:I_{kt}(l)=0}^N y_ky_t . \end{aligned}$$
(2)

Note that the summation on the left has \(rp^2(rp-1)/2\) terms, while the summation on the right has \(rp^3(p-1)/2\) terms. Hence, if all y values are positive, we expect that this inequality will hold for most populations we encounter in practice.

Note that using the conditional expectation we can write

$$\begin{aligned} E\left\{ \frac{p^3r(p-1)}{m} Y_{s_1}Y_{s_2}|I_{s_1s_2}(l)=1\right\}= & {} p^2r(p-1)\sum _{k=1}^N \sum _{k<t:I_{kt}(l)=1}^N y_ky_t \end{aligned}$$

and

$$\begin{aligned} E\left\{ \frac{p^2(p-1)(rp-1)}{m(m-1)} Y_{s_1}Y_{s_2}|I_{s_1s_2}(l)=0\right\}= & {} p(rp-1)\sum _{k=1}^N \sum _{k<t:I_{kt}(l)=0}^N y_ky_t, \end{aligned}$$

where \(I_{s_1s_2}(l)=1\) if sample units \(v_{s_1}\) and \(v_{s_2}\) share the same letter in design \(L_l\) and zero otherwise. Using these equalities in the inequality (2), we conclude \( \sigma ^2_{RCS_l} \le \sigma ^2_{SRS}\) if and only if

$$\begin{aligned} rpE\left\{ Y_{s_1}Y_{s_2}|I_{s_1s_2}(l)=1\right\} \le \frac{rp-1}{m-1} E\left\{ Y_{s_1}Y_{s_2}|I_{s_1s_2}(l)=0\right\} . \end{aligned}$$
(3)

In inequality (3), the right-hand side has a larger number of joint expectations from different ranking groups than the left-hand side. Since order statistics are positively correlated, joint expectations from different ranking groups are usually larger than the joint expectation from the same ranking groups. Hence, this inequality indicates that the proposed RCS design is more efficient than a simple random sampling design if the ranking groups are well separated from each other. Since the largest separation happens under perfect ranking, we conjecture that the largest efficiency improvement is achieved under perfect ranking.

3 Variance Estimate

All joint inclusion probabilities are positive. Hence, the variance estimator of the Horvitz–Thompson estimator is unbiased. Using the Horvitz–Thompson variance estimator, an unbiased estimator of \(\sigma ^2_{RCS_l}\) is given by

$$\begin{aligned} {\hat{\sigma }}^2_{RCS_l}=\frac{(p-m)p}{r^2m^2p^4} \sum _{j=1}^{rmp} Y_{s_j}^2+\frac{2}{r^2p^4}\sum _{j=1}^{rmp} \sum _{r \ne j}^{rmp} \left\{ \frac{p(p-m)(pI_{s_js_r}(l)-1)}{m^2\left\{ (m-1)+(p-m)I_{s_js_r}(l)\right\} } \right\} Y_{s_j}Y_{s_r}.\nonumber \\ \end{aligned}$$
(4)

In a similar fashion, an unbiased estimator for \( \sigma ^2_{SRS}\) is given by

$$\begin{aligned} {\hat{\sigma }}^2_{SRS}=\frac{(p-m)p}{r^2m^2p^4} \sum _{j=1}^{rmp} Y_{s_j}^2-\frac{2p(p-m)}{r^2p^4m^2(pmr-1)}\sum _{j=1}^{rmp} \sum _{r \ne j}^{rmp} Y_{s_j}Y_{s_r}. \end{aligned}$$

We note that even though \({\hat{\sigma }}^2_{RCS_l}\) and \( {\hat{\sigma }}^2_{SRS}\) are unbiased estimators for \(\sigma ^2_{RCS_l}\) and \(\sigma ^2_{SRS}\) they are unstable estimators. If the sample units \(s_j\) and \(s_r\) do not share the same letters in \(RCS_l\) and have large y values, the estimator in Eq. (4) frequently produces negative values for some sample selections. For this reason, we introduce another estimator for \(\sigma ^2_{RCS_l}\) based on bootstrap methods.

The RCS is a probability sampling that considers the spatial structure in the grid population. A bootstrap resampling procedure should capture this spatial structure of the sampling region. For a given RCS sample \(RCS_l=\left\{ Y_{s_j}, j=1, \ldots , rpm\right\} \), we construct the bootstrap resample \(RCS_l^b=\left\{ Y^b_{s_j}, j=1, \ldots , rpm\right\} \) by selecting an RCS sample from the population grid using design \(L_l\) for \(b=1, \ldots , B\). All empty units in the bootstrap resample \(RCS_l^b \) are then filled with the nearest neighbors in \(RCS_l\) sample. The bootstrap variance estimate is then given by \({\tilde{\sigma }}^2_{RCS_l}\)

$$\begin{aligned} {\tilde{\sigma }}^2_{RCS_l}=\frac{1}{B-1} \sum _{b=1}^B({\tilde{Y}}^b_{RCS_l}-{\tilde{Y}}_{RCS_l})^2, \quad {\tilde{Y}}_{RCS_l}=\frac{1}{B}\sum _{b=1}^B {\tilde{Y}}^b_{RCS_l}, \quad {\tilde{Y}}^b_{RCS_l}=\frac{1}{rpm} \sum _{j=1}^{rpm} Y^b_{s_j}. \end{aligned}$$

We use the following algorithm to compute the bootstrap variance estimate.

Algorithm 1

Bootstrap variance estimate and confidence interval:

  • Input: \(RCS_l=\left\{ Y_{s_j}, j=1, \ldots , rpm\right\} \) and design \(L_l\).

  • For \(b=1, \ldots , B\) do

    \(\bullet \):

    Permute the rows and columns in Latin square design \(L_l\),

    \(\bullet \):

    Select m letters from the pool of letters in \(L_l\), say, letters A and C for \(m=2\),

    \(\bullet \):

    Select all units corresponding to these m letters, \(RCS_l^b=\left\{ Y^b_{s_j}, j=1, \ldots , rpm\right\} \),

    \(\bullet \):

    Replace each unit \(Y^b_{s_j}\) with its nearest neighbor in \(RCS_l\),

    \(\bullet \):

    Compute \(\tilde{Y}^b_{RCS_l}=\frac{1}{rpm} \sum _{j=1}^{rpm} Y^b_{s_j}\),

  • end for

  • Output: Compute \( \tilde{\sigma }^2_{RCS_l}=\frac{1}{B-1} \sum _{b=1}^B(\tilde{Y}^b_{RCS_l}-\tilde{Y}_{RCS_l})^2\), where \(\tilde{Y}_{RCS_l}=\frac{1}{B}\sum _{b=1}^B\tilde{Y}^b_{RCS_l}\).

  • Output: Compute \((1-\alpha )100\%\) percentile confidence interval \([LL_{\alpha /2}, UL_{1-\alpha /2}]\), where \(LL_{a}\) and \(UL_{1-a}\) are the aB-th and \((1-a)B\)-th smallest observations among \(\tilde{Y}^b_{RCS_l}\), \(b=1, \ldots , B\).

The neighborhood of a unit in a grid cell is defined in terms of its Euclidean distance from another point. Table 3 presents a bootstrap resample \(RCS_l^b\) constructed from an \(RCS_l\) using the fungus population described in Sect. 5. In this table, the underlined entries represent the \(RCS_l\) that corresponds to the randomly selected two Latin letters. The bootstrap resample units are identified by a bold face font that corresponds to randomly selected Latin letters. For example in Table 3, the entry in row 1 and column 4 is a unit in the bootstrap resample. Since this point is not observed in the original sample (not underlined), it has to be filled with its nearest neighbors from the underlined units. The closest underlined units are 5 and 4 in cells (1,3) and (2,4), respectively. So the bootstrap resample for this point becomes the average of 5 and 4. The bootstrap resample for cell (1, 3) is 5 since it is already in the\( RCS_l \) The other bootstrap resample points are computed in a similar fashion.

An approximate \((1-\alpha )100\)-percent bootstrap percentile confidence interval for the population mean \(\mu \) can be obtained from Algorithm 1. We can also construct a \((1-\alpha )100\)-percent confidence interval using a normal approximation

$$\begin{aligned} {\bar{Y}}_{RCS_l} \pm t_{\alpha /2;df} {\tilde{\sigma }}^2_{RCS_l}, \end{aligned}$$

where \(t_{a,df}\) is the a-th upper quantile of a t-distribution with degrees of freedom \(df=mrp-1\).

Table 3 A bootstrap resample from the row–column ranked fungus population. Measured RCS and bootstrap resamples are the underlined and bold-faced entries, respectively

4 Design Effect

In this section, we investigate the design effect of the RCS design. Since RCS considers two-way stratification, we look at six different populations having different spatial structures. The values of the variable y in these populations are generated with different mean functions (\(\mu _{ij}\)) in row and column space. For notational convenience, we define \(i_1=i-1\) and \(j_1=j-1\). The mean functions for population \(P_t\), \(t=1, \ldots , 6\), are given below

$$\begin{aligned}&P1\,:\, \mu _{i,j}= 10, \quad i=1, \ldots ,rp, \quad j=1, \ldots , p \\&P2\,:\, \mu _{i,j}= 4\frac{j_1}{p}, \quad i=1, \ldots ,rp, \quad j=1, \ldots , p \\&P3\,:\, \mu _{i,j}= 4(1-\frac{i_1}{rp}), \quad i=1, \ldots ,rp, \quad j=1, \ldots , p \\&P4 \,:\, \mu _{i,j}=3\left\{ \frac{j_1}{p}+\frac{i_1}{rp}\right\} +\text {sin}\left\{ 6\left( \frac{j_1}{p}+\frac{i_1}{rp}\right) \right\} , \quad i=1, \ldots ,rp, \quad j=1, \ldots , p \\&P5 \,:\, \mu _{i,j}=3\left\{ 4-\frac{6i_1}{rp}\right\} ^2\text {exp}\left\{ -\left( \frac{6i_1}{rp}-3\right) ^2-\left( \frac{6j_1}{p}-2\right) ^2\right\} \\&\quad -10\left\{ 0.2\left( \frac{6i_1}{rp}-3\right) -\left( \frac{6i_1}{rp}-3\right) ^3-\left( \frac{6j_1}{p}-3\right) ^5\right\} exp\left\{ -\left( \frac{6i_1}{rp}-3\right) ^2-\left( \frac{6j_1}{p}-3\right) ^2\right\} \\&\quad \frac{ -1}{3}exp\left\{ -\left( \frac{6i_1}{rp}-2\right) ^2-\left( \frac{6j_1}{p}-3\right) ^2\right\} , \quad i=1, \ldots ,rp, \quad j=1, \ldots , p \\&P6\, :\, \mu _{i,j}= \frac{1}{20}\left\{ \frac{12i_1}{p}-\frac{12j_1}{rp}\right\} ^2 +\frac{1}{20}exp \left\{ \left[ 1-\text {sin}\left( \frac{12i_1}{p}-6\right) \right] ^2\right\} \text {cos}\left( \frac{12j_1}{rp}-6\right) \\&\quad +\frac{1}{20}\text {exp} \left\{ \left[ 1-\text {cos}\left( \frac{12j_1}{rp}-6\right) \right] ^2\right\} \text {sin}\left( \frac{12i_1}{p}-6\right) , \quad i=1, \ldots ,rp, \quad j=1, \ldots , p, \end{aligned}$$

where sin and cos are the sine and cosine functions. In these six populations, all means are shifted to make the smallest cell mean 10. The population values y of each cell is then constructed by adding a random number between 0 and 5 generated by a uniform distribution. For a fair comparison among the six populations, all population units are scaled by dividing each value by the population standard deviation. The heat maps of these populations are presented in Fig. 2. The population P1 does not show any spatial pattern. The populations P2 and P3 have decreasing trends in row and column directions, respectively. The populations P4, P5 and P6 use mean functions for the corrugated inclined plane, peak function and bird functions in Robertson et al. (2018). These six populations cover a wide range of spatial patterns that we might see in practice.

Fig. 2
figure 2

The heatmaps of populations P1, P2, P3, P4, P5 and P6

We performed a simulation study to investigate the design effect of RCS design with respect to LS sampling design using these six populations. Simulation parameters are selected to be \(m=2\), \(r=1\) and \(p=30\). For row and column ranking, we used the Dell and Clutter (1972) model. This model creates a ranking variable U correlated with a response variable y, \(\rho =corr(U,y)\). In the simulation study, the values of \(\rho \) are selected to be \(\rho =0,0.2,\ldots ,1\). The details of this ranking model can be found in Ozturk (2019) and Ozturk and Kravchuk (2021). The simulation size is taken to be 5000.

For one-way stratification based on ranks, in each replication of the simulation, before selecting an RCS sample only the within-row (within-column) units are ranked independently using the Dell and Clutter model with a given value of \(\rho \). No ranking is performed for within-column (within-row) units. Rows and columns of design \(L_l\) are permuted so that each replication uses a different Latin square design. Hence, the simulation study accounts for the variation due to within-row ranking error and selection of row–column design \(L_l, l=1, \ldots , N^*_L\).

For two-way stratification based on ranks, in each iteration, within-row units in the population are first ranked independently in each row using the Dell and Cutter model, and then within-column units in each column are ranked independently. Again rows and columns of design \(L_l\) are permuted in each iteration.

Fig. 3
figure 3

Design effect of RCS samples with respect to LS samples

Figure  3 presents the design effect of the RCS sample with respect to the LS sample for this simulation setting. The design effect is defined as the ratio of variances

$$\begin{aligned} D_{eff}=\frac{\sigma ^2_{LS}}{{\bar{\sigma }}^2_{RCS}}, \end{aligned}$$

where \({\bar{\sigma }}^2_{RCS}\) is the average of \(\sigma ^2_{RCS_l}\) in equation (1) from the 5000 simulation replications. The values of \(D_{eff}\) greater than one indicate that the RCS design is superior to the LS design.

Figure 3 provides the \(D_{eff}\) curves of the six populations for within-row (Row), within-column (Col.) and row–column (RowCol.) rankings when \(m=2\), \(r=1\), and \(p=30\). The design effects for each ranking category are plotted against the ranking correlation \(\rho =0,0.2, \ldots , 0.8\). The design effects are much larger under the perfect ranking (\(\rho =1\)) than under the imperfect ranking (\(\rho <1\)). Hence, they are reported separately in Table 4 to illustrate the detailed feature of \(D_{eff}\) in an appropriate plotting scale when \(\rho <1\). The heat maps in Fig. 2 indicate that the population P1 does not show any spatial trend either in the row or column spaces. Hence, ranking in one of the row, column or row–column directions yields higher design effect for the RCS sample. For the population P1, all three curves lie above 1, indicating that the RCS sample is more efficient than the LS sample. The populations P2 and P3 have a strong increasing and decreasing trend only in one of the row and column spaces. They show no trend (uniform) in the other direction. For example, the within-row units in the population P2 have increasing order, while the within-column units are uniformly distributed. For the population P2, \(D_{eff}\) values are larger than one for the within-column units orderings for all \(\rho \) values. The row and row–column orderings perform poorly for the population P2. This intuitively makes sense since there is already a strong increasing trend in the within-row units, and any ranking in this direction with \(\rho <1\) will destroy this trend and reduce the degree of stratification of the within-row units. Similar observations can be made in the population P3, where there is a strong negative trend in the column direction. Hence, any ranking of the within-column units with (\(\rho <1\)) will reduce the design effect of the RCS sample with respect to the LS sample.

The populations P4, P5 and P6 have strong spatial patterns, but they do not necessarily show a strong increasing or decreasing trend in either the row or column space. Hence, ranking procedures in the RCS sample create strong increasing trends in the within-row and column units, and lead to a better two-way stratification than a Latin square sample. This yields a higher design effect than a LS sample for a moderate ranking correlation \(\rho \). Overall patterns in these populations indicate that the effect of the RCS design is more pronounced for populations lacking a strong trend in row and/or column space. In this case, the ranking procedure exploits the relative closeness (ranks) of sampling units in the grid structure to create a strong two-way stratification, and provide a better spatial coverage in the sampling region.

Table 4 Design effect of RCS design with respect to LS design under perfect ranking (\(\rho =1\))

Table 4 presents the design effects of the RCS sample under a perfect ranking \((\rho =1)\). In this case, the RCS sample outperforms the LS sample regardless of the spatial patterns. The reason for this is that even if the grid structure of the population has a strong linear trend, a perfect ranking does not destroy this trend. Hence, it always improves the LS sample. The improvement is substantial. The smallest \(D_{eff}\) value in Table 4 is 4.883 for the population P5 and row–column ranking, which indicates that the RCS sample is approximately 5 times more efficient than an LS sample.

We performed another simulation study to investigate the properties of the bootstrap variance estimates of RCS sample means. In this part of the simulation, we only considered population P6. Simulation parameters are selected as \(m=1\), \(r=1\). Simulation size was set to be 50000 replications. Again population units are ranked by using Dell and Clutter model with ranking correlation coefficient \(\rho =0,0.2,0.4,0.6,0.8,0.9,\) and 1. Figure 4 illustrates the box plots of the bootstrap variance estimates of the RCS mean estimator for different values of \(\rho \). The first, second and third rows present the box plots for the within-row, within-column and row–column units ordering, respectively. The true value of the variance (variance of the 50000 RCS sample means) is marked with a triangle on each box plot. The box plots indicate that bootstrap variance estimates are skewed right and provide under estimation for the true variance. The amount of bias decreases for large values of \(\rho \). When \(\rho =1\) the true variance is at around the median lines in the box plots. Under estimation of the variance of the estimator is a common problem in a spatially balanced sampling designs in the literature (Stevens and Olsen 2003; Grafström and Schelin 2014; Robertson et al. 2018).

Fig. 4
figure 4

Box plots of bootstrap variance estimates of the RCS sample mean for population 6, \(m=1\), \(r=1\) and \(p=30\)

Table 5 Coverage probabilities of 95% bootstrap percentiles and approximate t-intervals

We also computed the coverage probabilities of the approximate 95% bootstrap and normal approximated confidence intervals of the population mean using the RCS sample means. Table 5 indicates that coverage probabilities are lower than the nominal coverage probability 0.95 for both bootstrap and normal (t-distribution) approximated confidence intervals. On the other hand, the confidence intervals based on normal approximation provide better coverage probabilities than the bootstrap confidence intervals, specially for large \(\rho \) values. For example, if \(\rho \ge 0.8\) the coverage probabilities are all greater than 0.90, and closer to 0.95 for \(\rho =0.9\) and \(\rho =1\).

5 Examples

Fig. 5
figure 5

Design effect of RCS samples with respect to LS samples for tree abundance, fungus and nematode populations

In this section, we performed another simulation study using the RCS and LS sampling designs to estimate the population mean \(\mu \) of the tree abundance, fungus and nematode populations that we introduced in Sects. 1 and 2. For the simulation parameters, we selected \(m=1\), \(r=1\), and ranking correlations \(\rho =0.1,0.2, \ldots , 1.0\). As before, the Dell and Clutter model is used to rank the within-row and -column units. Simulation size is taken to be 5000. Simulation results are presented in Fig.  5. The three curves in the panels represent the design effect of the RCS sample with respect to the LS sample for row, column and row–column ordered population units.

For the tree abundance population, the RCS sample is less efficient than the LS sample when \(\rho < 0.8\). Note that the heat map of this population in Fig. 2 indicates that there is a strong trend in both row and column directions. Hence, use of the imperfect ranking weakens the strong spatial structure and reduces the efficiency of the RCS sample. Since the trend in the column direction is not as strong as in the row direction, the dashed red curve corresponding to the within-column units ordering lies above the other two curves corresponding to the within-row units and row–column units orderings. Under perfect ranking, the RCS sample is superior to the LS sample in both one-way and two-way ranking stratification.

In Fig.  2, while the heat maps are indicating some increasing and decreasing trends in the column direction, they show a small increasing and decreasing trend in the row direction for the fungus and nematode populations, respectively. As expected, the within-row and row–column units ranking (solid black and dotted green curves) out performs within-column unit ranking. These findings are consistent with our simulation results in Section 4, where we showed that RCS samples outperform LS samples if there is no strong increasing or decreasing trend in row and column directions.

6 Concluding Remarks

A Latin square sample ensures a good coverage for the sampling region, but it does not provide a variance estimate for the estimator unless a replicated sample is generated. This article shows that when there is a spatial trend among the units that is not strongly increasing or decreasing in row and column directions in the sampling region, latinized row–column ranked sampling design is generally more efficient than a simple random sample and a Latin square sample of the same sample size. A row–column sample creates a two-way stratification in rows and columns. The cells in a rectangular grid in the sampling region are reorganized based on the relative closeness (ranks) of units within rows and/or columns. The sampling method provides a better sampling coverage by re-indexing physical grid quadrats and ensuring that at least one observation is selected from each rank- row and rank-column. The ranking can be done on the variable of interest and/or one or more than one auxiliary variables. In this paper, we consider a ranking procedure with a single auxiliary variable. We have constructed point estimators, for the population mean and total and their variances. We have also constructed an approximate \((1-\alpha )100\% \) confidence interval for the population mean.

We have investigated the design effect of the new sampling method for a wide ranges of spatial population structures. The ranking structure in the RCS sampling design is superior to the usual LS sampling if there is no strong increasing or decreasing trend in the row and/ or column space of the grid structure. For reasonable ranking information, such as when the correlation between the ranking variable and the variable of interest is greater than 0.4, the proposed sampling design performs better than Latin square and simple random sampling designs of the same sample size as long as the sampling region does not show a very strong increasing or decreasing trend in the rows and columns of the grid structure.

Two confidence intervals are suggested for the population mean. The first interval is the bootstrap percentile confidence interval. The second interval is constructed using the bootstrap variance estimator of the sample mean with a normal approximation. The coverage probabilities for bootstrap confidence intervals are smaller than the nominal coverage probability of 0.95. They are also smaller than the coverage probabilities of the confidence intervals based on normal approximation. The coverage probabilities of the confidence intervals based on normal approximation approach to the nominal coverage probability of 0.95 for larger values of \(\rho \).

In some sampling regions, the size of the grid cells may be large and it may not be easy to quantify the value of the variable of interest y. In this case, we may consider a two-stage sampling design. The stage-one sample would select an RCS sample from the primary population of a rectangular grid. The stage-two sample would select a sample from each of the grid cells in the stage-one sample. Development of this sampling design will be a future project.