Introduction

This paper is part of ongoing research of multidisciplinary character involving the methods of ancient history, archaeology, Geographic Information Systems (GIS), and satellite remote sensing. The researchers’ intention is to contribute to solving the long-standing enigma of the location of one of the most significant and famous battles of the ancient world—the Battle of Gaugamela. The Battle of Gaugamela was fought in 331 BCE between the Greek-Macedonian army under the command of Alexander the Great and the Persian army led by its king Darius III (Marsden 1964; Dąbrowa 1988). Alexander’s victory at Gaugamela certainly changed history—it led to the collapse of the vast Persian kingdom (539–331 BCE) and the beginning of a new age now widely known as the Hellenistic period (circa 331–30 BCE) (Austin 1981; Bugh 2006).

Notwithstanding the great importance of this battle, its precise location has been contested (Seibert 1972). Two main locations have been put forward over the course of the last two centuries of research—the Navkur Plain near Tell Gomel in Kurdish Iraq (Markwart 1905; Marsden 1964; Schachermeyr 1973; Fiey 1965; Reade and Anderson 2013) and the vicinity of modern Mosul in northern Iraq (Sushko 1936; Stein 1942; Badian 2000; Zouboulakis 2015, 2016). However, it should be stressed that the majority of modern scholars support the localization of the famous battle in the Navkur Plain. This identification can be argued on several premises: first, the continuity of local onomastics—the name Tell Gomel can be considered a remnant of the ancient name Gaugamela (Fales and Del Fabbro 2014: n. 6; Morandi Bonacossi and Iamoni 2015, 12, n. 8); second, the existence of a long-distance travel route between Nisibis and Arbela leading through the Navkur Plain in the times of Alexander the Great (Marciak et al. 2020); third, the appropriate topographic and geomorphological features in the Tell Gomel Plain (Marciak et al. 2022). At the same time, the identification in the Karamleis Plain features several serious deficiencies (Marciak et al. 2021a, b). First, its origin was deeply conditioned by the very limited regional experience of the earliest European explorers. The location of Tell Gomel was not yet known to the earliest European explorers, and Karamleis had the advantage of being located on the axis of contemporary travel between Mosul, Erbil, and Baghdad. Furthermore, the use of GIS recently revealed that this identification lacks some fundamental topographical and geomorphological features: visibility on the battlefield and a distinctive accumulation of sloped areas. In our present project, an attempt has been made to bring another aspect of the identification to the fore—namely, possible remains of military camps at the battlefield of Gaugamela, potentially discoverable with the use of satellite archaeology.

The hope that traces of the Macedonian camp could be discovered in the course of archaeological works expressed by Ritter (1840) a long time ago has not been realized so far. Nevertheless, we decided to consider the possibility, as it was recently implied by Reade and Anderson (2013), that such traces could be buried under the current surface and might be revealed with the application of modern technologies, especially remote sensing (as was the case with many Roman marching camps discovered in Europe, mainly in the UK, Germany, and South Moravia in Czechia (Roth 1999; Hanel 2007)); these have sometimes been acknowledged as successors to the Macedonian marching camps (Lévêque 1957; Champion 2009). Owing to the problematic character of the artifacts subject to the search (imprecise historical descriptions and lack of archaeological analogies), photo interpretation of satellite image data was selected as the main research tool. The principles of photo interpretation, including the significance of direct (shape, color, and size) or indirect (e.g., shade) photo-interpretative qualities of features, as well as the utilization of false color composites, were discussed extensively by Jensen (2000) and Lillesand et al. (2015), among others.

Contemporary studies of geographical space, including archaeological research, incorporate multiple spatial digital data (satellite and aerial data, laser scanning, SAR, digital elevation models, and others) (see Linck et al. 2013; Stewart et al. 2016; Stewart 2017; Checa and Reche 2018; Evans and Mourad 2018; Peppa et al. 2018; Hadjimitsis et al. 2019; Kalayci et al. 2019; Luo et al. 2019; Blanco et al. 2020). Such data provide an opportunity to extend research to large areas, to acquire a bird’s-eye view perspective and, most of all, to objectify studies on the basis of quantitative data thus obtained. The question of gaining access to the study area, especially hard-to-reach or no-entry areas (like in our case), is another essential aspect. Moreover, a large variety of Earth’s satellite photos makes it possible to register information about the surface of areas with manifold spatial resolutions and with a wide spectrum of electromagnetic radiation (visible, infrared, and microwave). It is accompanied by a continuously growing availability of image data connected with a dynamically increasing number of registering platforms and resources with free archived data (USGS-NASA Landsat missions, ESA Sentinel-2, MODIS, SPOT, and many others), which are effectively applied to archaeological research (Ur 2006; Parcak 2009).

Considering the above, the authors of this article made an attempt to incorporate satellite images into the analysis of the areas indicated by historians and archaeologists as the most likely locations of the Battle of Gaugamela. The focus was on the northern location—the Navkur Plain—and the southern location—the Karamleis Plain (Fig. 1). Owing to lack of reference standards which would make way for automating the recognition process, photo interpretation, reinforced with multivariate processing of multispectral images, was adopted as the only legitimate and possible method at this stage. In accordance with the elementary principles of photo interpretation (the importance of direct and indirect photo-interpretative features), a search intended to identify the occurrence of sites (the so-called anomalies, artifacts) was conducted. Usually these surface features are either geometrical or have no justification with regard to the current land use (e.g., humidity and/or vegetation). A natural character of such places is certainly possible; however, there are chances they are of anthropogenic origin and, as such, testify to the existence of underground archaeological remains.

Fig. 1
figure 1

Geographical context and study sites: 1, Mahad Hills (MH); 2, Tell Gomel Plain (TG); 3, Hussein Ferrash (HF); 4, Bartella-Khazna (BK); 5, Minarah Shebek (MS)

The ultimate goals of the research included:

  • 1. Indicating the most likely places where the camp and/or the battle were located based on the visual interpretation of an array of satellite data. Considering the limitations to free access to these areas in particular, such activities may provide practical benefits in terms of narrowing the areas that would require field work and/or determining priority research areas.

  • 2. Verifying the applicability of multispectral satellite imaging to this type of research. Such an analysis was conducted on the hills of the Navkur Plain (Pirowski et al. 2021) and provided preliminary conclusions on how to structure an appropriate collection of image data. The main conclusion drawn from this research was the need for complimentary interpretative work on data registered in different seasons, both on the basis of standard color compositions from RGB + NIR bands and products derived from multivariate processing of multispectral bands. Vegetation indices proved to be less applicable. The analysis conducted in this article, which is an extension of previous studies of one area to the total of five, and the suggested new methodology of assessing the interpretative value of data provide a more extensive basis for drawing conclusions and justifying prior observations.

Study area

The study area is located in contemporary northern Iraq, with its center determined by the coordinates of 36°30′ north latitude and 43°24′ of east longitude (Fig. 1). It is situated between two major rivers: the Khosr in the west and the Khazir in the east. This area is the foreland of the Zagros Mountains, which stretch nearly latitudinally several dozen kilometers to the north. The highest elevations are located in the central part and they are made up of such hill chains as Jebel Maqlub (1057 m a.s.l.) and Jebel Bashiqah (661 m a.s.l.), as well as smaller ones: Mahad Hills (490 m a.s.l.) in the north and Jebel Ain as Safrah in the south (673 m a.s.l.). All of them create a natural watershed chain between the Khosr and Khazir Rivers. Elevation differences of the entire area amount to nearly 900 m (min 182 m a.s.l., max 1057 m a.s.l., and mean 347 m a.s.l.). It testifies to a great morphological diversity of the area of less than 2000 km2. The climate of this area is determined by its intercontinental location and belongs to the tropical continental dry type, with a mean annual temperature of + 21 °C. It is characterized by two distinct seasons: dry and hot summer (mean + 32°) and cooler and more humid winter (mean + 8°). Precipitation is low, averaging 380 mm per year, which leads to the occurrence of numerous intermittent rivers.

As the site of the battle has not been unambiguously determined, its location is still subject to discussion. It results from the condition of the preserved historical sources which, unfortunately, do not provide precise geographical and topographical information (Seibert 1972). Moreover, multiple significant accounts include contradictory information: for example, according to Arrian (Robson 1954) the battlefield of Gaugamela was situated 111 km away from the city of Arbela, while according to Curtius Rufus (Rolfe 1946) only about 14 km from the Great Zab River. Ultimately, over the last two centuries of research, several locations of ancient Gaugamela have been put forward: Karamleis (Niebuhr 1776), Qaraqosh (Sushko 1936), Tell Aswad (Spiegel 1851), (a mound to the south of) Wardak (Schweiger-Lerchenfeld 1876)—currently referred to as the southern hypothesis—and Tell Gomel (Hoffmann 1880; Sachau 1883; Von Oppenheim 1900) —called the northern hypothesis. The above premises became the basis for conducting a detailed analysis of the areas potentially connected with the battle site. The authors marked 5 areas to be covered by the analysis: Mahad Hills and Tell Gomel Plain in the north and Hussein Ferrash, Bartella-Khazna, and Minarah Shebek in the south (Fig. 1). All these sites were suggested by previous scholars as possible alternative locations of the Macedonian military camps before the battle. The Mahad Hills were proposed as the location of the Macedonian final camp and the Tell Gomel plain as the actual battlefield by Marciak et al. (2021b). Hussein Ferrash and the vicinity of two nearby towns, Bartella and Khazna, were put forward by Sushko (1936) as the sites of the penultimate and ultimate camps of the Macedonian troops respectively, while Stein (1942) considered Minarah Shebek to have been the place of the Macedonian final camp.

Data

In the first stage, a relevant collection of satellite image data of very high resolution (0.5 × 0.5 m SVHR) was obtained for each investigated area (Bartella-Khazna, Minarah Shebek, and Hussein Ferrash in the Karamleis Plain, as well as Mahad Hills and Tell Gomel Plain in the Navkur Plain). The utilized satellite data included various time points from the period between 2013 and 2020 (Table 1). The selected data derived from two sensors: Pleiades and WorldView-2. Considering the above, the initial task was to standardize the materials in the way that would provide identical underlying satellite data from each of the five examined areas during the interpretation. The collection of basic data contained the total of 15 satellite image scenes—three images obtained for different seasons in each of the five locations (Table 1).

Table 1 Characteristics of satellite data used in the analyses

Previous research conducted in the Navkur Plain (Pirowski et al. 2021) indicated that the difference in spatial resolution between original multispectral images (2 m) and images of spatial resolution enhanced by the integration with a panchromatic image (up to 0.5 m) is crucial, especially for detecting linear features. On the other hand, the applied merge algorithm was of no relevance. Therefore, new materials involving the areas of Bartella-Khazna, Minarah Shebek, and Hussein Ferrash were acquired as MS + PAN integrated images of band spatial resolution artificially increased from 2 to 0.5 m. For the Navkur Plain region, for which WorldView-2 data (independently integrated with various methods) were available, merged data obtained with the high-pass filtering method HPF were selected for the analyses (Schowengerdt 1980; Chavez et al. 1991).

The works conducted in the Navkur Plain (Pirowski et al. 2021) revealed no increased interpretative usefulness of atypical spectral ranges, which were provided by WV-2 (yellow band, RedEdge band, Near Infra Red NIR 2 s band). Information recorded in the bands typical of SVHR recording, i.e., RGB + NIR, proved to be of major importance. These conclusions encouraged the authors to obtain data for the new research areas (the Karamleis Plain) from the Pleiades sensor, which has such a spectral profile, to serve as a sufficient supplementation of already available data for the northern location. At the same time, for the purpose of standardization of interpretative materials, only 4 out of WV-2 bands were used: RGB and NIR1.

In order to ensure the possibility of detecting diversified features, it was significant to provide data from different seasons. It gave an opportunity to highlight (depending on a season) the differences in local moisture of uncovered soils and their structural and textural features (winter, autumn), vegetation (spring, autumn), and the dissimilarities in drying out (autumn). Detailed descriptions of the applied data sets have been comprised in Table 1.

Methods

Detecting a group of surface and linear anomalies is more reliable and comprehensive if the underlying satellite material is diversified and consists of both classical color compositions and further processing. A complementary utilization of such data may verify questionable indications (e.g., the material originating from a particular season or properly processed by enhancing the content of the image can reveal unique features which make it possible to identify a given artifact).

Initially, for all the 15 satellite image scenes, two color compositions were prepared: True Color TC (Blue-Green–Red) in natural colors and Color Infra Red CIR (Green–Red + Near InfraRed) in fake colors. Then, each scene was subject to standardized transformation named principal component analysis (sPCA) (Lillesand et al. 2015; Richards 2013). The PCA transformation leads to obtaining a new uncorrelated set of images, the so-called principal components (PC). It provides an opportunity to develop new color compositions, different from those created on the basis of original spectral bands. In this case, a composition based on PC1, PC2, and PC3 components and PC2, PC3, and PC4 components was created. Subsequently, the PC123 and PC234 images were subject to principal components standardization and inverse transformation was conducted. This procedure is called decorrelation stretch and is usually performed on three selected spectral bands (DStretch) (Campbell 1996). It emphasizes the colors of areas which seem little differentiated in primary compositions. The DStretch procedure is particularly useful for composites containing highly correlated spectral bands characterized by low contrast and poor coloring (e.g., a composite in true colors, also CIR composite to a lesser extent). Owing to these activities, a new array of spectral bands and new images were acquired.

The next stage consisted in calculating the Normalized Difference Vegetation Index (NDVI) for all the 15 scenes (Agapiou et al. 2012; Kalayci et al. 2019). The NDVI is an indicator defining the quantitative share of vegetation (biomass) in a particular area and may highlight the features related to the local diversity of vegetation in a more efficient way than color compositions.

The final stage was the use of a high-pass filter (HPF), allowing the detection of edges. This highlights areas of tonal change in the image, running in any direction, and is used in archaeological interpretation (Lasaponara and Masini 2012; Sarris et al. 2013; Mondino et al. 2012). Since a grayscale image is used for filtering, a WorldView-2 panchromatic image was used for processing for the analysis area of The Navkur Plain (Mahad Hills and Tell Gomel Plain), for the autumn term. In the other cases, only Pleiades RGB and NIR images were available, after fusion with the panchromatic image (PAN). The PAN image was simulated by averaging the brightness from 4 channels, with a weighting of 0.5 introduced for the NIR channel (in the Plejades, the PAN image only partially covers the NIR band). The option of using the first PCA component from the RGB + NIR channels was also tested, but the post-filtered image had less detail. After initial tests, it was considered useful to filter in a larger window, i.e., 11 × 11 (HPF11), apart from the base 3 × 3 filtering kernel (designation in the text: HPF3).

Consequently, 9 new images were drawn up on the basis of the 15 scenes of multispectral satellite images (see Table 1, last column). Therefore, the total of 135 images were subject to interpretation. An exemplary complete set of image data (27 images in total) for the Hussein Ferrash site in the form of a small fragment of the scene has been illustrated in Fig. 2.

Fig. 2
figure 2

An example of a set of image data for interpretation; a fragment of the Hussein Ferrash (HF) area of 1200 m × 800 m

The anticipated linear and surface anomalies (artifacts) in the context of the search for the Macedonian camp were the subject of detailed and multivariate characteristics in the earlier study (Pirowski et al. 2021). Field verification for the Navkur region did not bring about the expected results related to finding possible remains. It confirmed the concerns that possibly no remnants of former camps (a levelled bank or a shallower trench, etc.) have been preserved to the present day. Natural erosive processes, and anthropogenic activity and events occurring in this area throughout such an extensive period of time (over 2300 years) have caused that the camp boundaries—if they are successfully detected—may probably have linear outlines of a geometrical shape identifiable only from the air/satellite perspective and not ground-based observations. In such a case, in order to indicate its potential locations in a satellite image, the only trace can be the occurrence of an anomaly (artifact) in this area, which results from former ground disturbance or the existence of foreign elements in the soil. Depending on the landform and coverage of the area, it may be reflected in multiple ways. Former facilities can manifest themselves in the contemporary topography through a specific distribution of vegetation (connected with the spheres of moisture) or even, in a broader context, through the development and layout of transportation or settlement networks. In fewer cases, it may be directly observed in the images (e.g., changes in the spectral response of the soil, which can be noticed in the areas not covered by vegetation). The contemporary fragmented character of the camp may be another impediment to the identification of its remains. Regardless of the assumed shapes and area of the camp (30 ha), as well as the length of linear features (400 m), there might be no artifacts of such dimensions in the image. It is also possible that when the camp was established natural land features (the river system or ridge lines), which could have been its sides, were used. For that reason, the theoretically presupposed rectangular shape of the camp might not have ever been realized.

Table 2 provides a collection of expected major photo-interpretative features searched for in the created 135 color compositions and grayscale images, potentially relating to particular traces of the camp.

Table 2 Sought-after elements and related photointerpretation features anticipated in the satellite imagery (after Pirowski et al. (2021) changed)

During the photo-interpretative analysis, the scale for assessing the value of single materials ranging from 0.0 (invisible) to 1.0 (the best material for detecting the polygon/line) was adopted. Intermediate values, i.e., 0.25, 0.5, and 0.75, which denoted partly applicable material, were also accepted. Having identified a feature, the adopted values were assigned to all the employed materials in a particular area on every single occasion (27 images = 27 assessments).

An exemplary scoring for one of the polygons is presented in Table 3. In this particular example, marking the boundaries of polygons was most efficient in the DS234 color composition and in the material recorded in spring (the most reliable, precise, and easily discernible). Regular compositions such as CC TC and FCC CIR, as well as NDVI (spring data), were also relatively applicable. Their values of 0.75 imply that the underlying material was more difficult to interpret or that the boundaries of a feature were not easily identifiable in its selected minor parts. The dropping values, e.g., those taken for the winter recording (0.5), are tantamount to the growing deterioration of the readability of images, which entails a less and less reliable identification of features (difficulties in their detection and/or an increasing share of layout fragments in features which are difficult to mark). The applied methodology of an individual assessment conducted for every detected feature was used for both lines and polygons in all the 5 study areas. It is an extension of the methodology suggested by Pirowski et al. (2021), where only the best readings involving a season (without the gradation of applicability) together with a major interpretative feature determining the detection were registered. In the solution adopted in this study, a comprehensive database was built for every artifact regarding the possibility of detection and including detailed information about the applicability of a particular image material.

Table 3 An exemplary scoring for one of the analyzed polygons

In the first stage, the analysis was performed for every type of material in all the images “one by one” (e.g., all the 15 TC-BGR images for the 5 study areas were analyzed, then all the CIR images, etc.) In order to secure the impartiality of detecting artifacts in each subsequent underlying material (NDVI, PCA, DS, HPF, etc.), the analysis of images was conducted after a longer time limit, on data which were artificially rotated by 90, 180, or 270°, with no access to the information on the previously marked features. This stage made it possible to identify materials whose applicability for the detected artifacts amounted to 0.75–1.

In the second stage, the results (marked lines and polygons in various underlying images) were matched and their compatibility was verified. In the case of major discrepancies in their course, the numerical values of applicability were reduced for relevant image data. For example, when in three materials the course drawn for a linear feature was consistent, but in a certain fragment of another one a divergent/incorrect course compared to the previous three was observed or the feature was incomplete/shorter, such a case resulted in decreasing the applicability from 0.75/1 to a lower one, depending on the recognized degree of such a material deficiency. On the other hand, based on consistent interpretations from several materials, the ultimate outline of a surface area or the final course of a linear feature was determined.

In the third stage, it was verified whether the marked linear and surface features are modelled in the materials which had been considered to be of zero applicability in Stage I. Can they be detected anyway, for example, less credibly, fragmentarily, or with conspicuous errors in the course of their layout? If yes, they were attributed the applicability of 0.25.

The analyses of multispectral satellite data were carried out in QGIS 3.10 (Open Source Geospatial Foundation Project 2020) software with plugin SCP (Semi Automatic Classification) and Idrisi Kilimanjaro (Clark Labs 2006). Additionally, the calculations of vector layers and the graphic work were conducted in ArcGIS (ESRI 2020) software.

Results and discussion

Quantity and spatial distribution of the identified features

The summary of data on the number and size of indicated locations regarding the study areas is presented in Table 4. The total of 131 features was detected: 45 polygons and 86 lines, which would require field examination. Assuming that the limit value is 30 ha, the condition was fulfilled by only 12 features, while subsequent 9 features met it conditionally (features with the area of 15–30 ha—the aforementioned uncertainties resulting from the possibility to detect the remains or part of the camp area exclusively). Additionally, the requirement of 400 m in length for a line was fulfilled by 80 out of 86 indicated features. Consequently, taking into account the size threshold for both surface and linear features, 92 features would call for field verification (12 surface and 80 linear ones). It may be easily noticed that mainly a small area of detected polygons contributes to the aggregate decrease in the number of features (merely 12 out of 45 features). By including partly preserved fragments of the former camp and lowering the limit to 15 ha, the total of 21 polygons could be covered by the study.

Table 4 Tabulation of the detected features for all study sites

Should we investigate the overall number of marked features with respect to the division into examined areas, we will notice that the largest numbers of detected features were recorded for the Tell Gomel and Hussein Ferrash areas (over 30). It mainly results from a big amount of linear features.

Figures 3, 4, 5, 67 and 8 present a special layout of marked linear and polygon features for each study area. These are potential archaeological research sites, the purpose of which is to detect the remains of a military camp from fourth century BCE. No characteristic layout or interrelated co-occurrence of the discovered features (e.g., a polygon feature that would be surrounded/bordered with linear features resembling a geometrical shape) can be discerned. The most polygons were detected in the area of Minarah Shebek (Fig. 7), although it needs to be emphasized that some of them overlap. For each of the three areas (Tell Gomel, Bartella-Khazna, Minarah Shebek), 4 polygons measuring ± 30 ha were marked, while in the remaining two areas (Mahad Hills, Hussein Ferrash) the number of identified polygons was smaller and their average size was merely 7–8 ha (Table 4). In total, in all the five areas, only 12 polygons met the requirement of reaching the size of 30 ha, which accounts for about one-fourth of all designated polygons.

Fig. 3
figure 3

All line and polygon features distinguished on Mahad Hills study site (background image: DS234 autumn)

Fig. 4
figure 4

All line and polygon features distinguished on the Tell Gomel study site (background image: PCA123 autumn)

Fig. 5
figure 5

All line and polygon features distinguished on the Hussein Ferrash study site (background image: PCA123 autumn)

Fig. 6
figure 6

All line and polygon features distinguished on the Bartella-Khazna study site (background image: PCA123 autumn)

Fig. 7
figure 7

All line and polygon features distinguished on the Minarah Shebek study site (background image: PCA123 autumn)

Fig. 8
figure 8

All identified line and polygon features (study sites numbering in accordance with Fig. 1)

The situation looks different in the case of lines: most of them were designated in the area of Hussein Ferrash (Fig. 5) and almost all of them exceed the length of 400 m, with the mean length of 1 km. Generally, as many as 93% of all identified lines meet the condition of length, while the average length of all the lines amounts to around 970 m (Table 4).

Considering the number of features detected in the images exclusively, the priority of field study should be given to the area of Hussein Ferrash (37 features), then Tell Gomel (34) and Mahad Hills (23), and finally Minarah Shebek (21) and Bartella-Khazna (16). It should however be noted that some polygons clearly mark the extent of lower towns around tells (see Fig. 5), and not boundaries of a military enclosure. Additionally, many linear features can be tentatively identified as hollow ways, which are broad and shallow linear depressions in the landscape thought to be formed by the continuous passage of human and animal traffic (Ur 2003), and again not as the boundaries of a military structure.

Interpretative value of remote sensing materials

The evaluation of the actual applicability of remote sensing materials can only be performed following a field verification. However, it may be assumed that featuring artifacts themselves—and, consequently, the opportunity to detect them more easily and geometrize them in particular remote sensing data and the products obtained from their processing—can be the basis for ranking image data. Such an approach was adopted in this work considering the fact that the very possibility of identifying surface anomalies, which is frequently difficult or even impossible to capture in the case of in situ work, leads to increasing the efficiency of field work (by indicating the possible locations of the searched historical traces). The total number of “photo interpretative points,” which contain the number of detections and the information on how easy it was to designate them and how reliable these markings are, was considered equivalent to this applicability.

The interpretative analysis began with determining the degree of detectability of particular features expressed through a points system. Table 5 shows the amount of all detected features together with the average number of interpretative points per one feature (the quotient of interpretative points and the number of features). This parameter provides the information on the difficulty in detecting features—the lower the value, the more difficult the feature identification (i.e., feasible only in a limited amount of image data). The values varied considerably, depending on the study area and the type of features. On average, one detected feature had 4.9 photo-interpretative points; however, it was more complicated to detect polygons (the average of 3.7 points) than linear features (the average of 5.5) (Table 5). The highest mean point value for the detected lines amounted to 8.9 pts/feature (Mahad Hills), and 7.2 pts/feature for polygons (Hussein Ferrash). It confirms great sharpness of features searched for in the study, which are currently visible in numerous image data. On the other hand, the lowest values for lines (4.3 pts/feature) were obtained in Tell Gomel and for polygons (1.9 pts/feature) in the area of Minarah Shebek. These low numbers indicate that line objects occur twice less frequently here than in the area with the highest scoring, while polygons are nearly four times less common. All in all, the highest values were obtained for Mahad Hills, which may be accounted for by the fact that in the hills a number of borders of the artifacts subject to the study could be marked along natural, geomorphological elements of the landscape (ridges, watercourses), well-visible in the majority of the employed image materials. The lowest values, on the other hand, were recorded for Bartella-Khazna and Minarah Shebek (the average of 3.2–3.4 pts).

Table 5 Tabulation of the sum of the scores during interpreting according to the image data

Table 5 also comprises the total number of points for the identified features (polygons, lines) divided into the employed image data for every study site. In all the areas, the amount of points awarded to linear objects is much higher, which results from the simple fact that nearly twice as many lines as polygons were detected. Only for the area of Bartella-Khazna is the number of points for both types of features similar. As far as image data with the highest amount of detected features are concerned, for lines these were CIR and TC data, which scored 41% of all points. In the case of polygons, the points were almost evenly distributed in all the data. NDVI and HPF data, which obtained the fewest points (and, consequently, were characterized by the lowest detectability of the sought-after features), were the only exception.

Determining the significance of recording the area in different seasons was another investigated element. Table 6 presents the total number of points awarded during the interpretation of image data, including the information on the type of data and the season of data acquisition. No prevailing season can be observed for polygon features. On the other hand, autumn proved to be the best season for detecting lines in the materials. It can also be noticed in the overall scores—autumn, as the period following plants vegetation, is conducive to the most effective identification of features visible topography.

Table 6 Tabulation of the sum of the scores during interpreting according to the season of the image data acquisition

As far as image data are concerned, the highest scores were obtained for FCC CIR (128.75 pts) and CC TC (118.25 pts), that is, classically applied color compositions. On the other hand, the NDVI turned out to be the least useful material (38 pts) and images after HPF filtration (9 pts + 25.25 pts). The remaining image data scored from 71.25 pts (DS123) to 92 pts (PCA123), but if analyzed jointly (as PCA transformation products), their total amount reached 318.25 pts. Therefore, it may be claimed, that if this additional image processing had not been applied, a smaller number of artifacts would have been detected (it is also confirmed by a detailed “unique feature detection” analysis, see in Table 7).

Table 7 Unique feature detections related to the image data used

The final performed activity was the analysis of “unique feature detection,” which was supposed to check how many features (polygons and lines) were found in only one kind of image data (i.e., TC, CIR, NDVI, PCA123, PCA234, DS123, DS234) or grouped data (composites from TC and CIR, NDVI bands, composites based on PCA, HPF3 + HPF11) during the interpretation. Table 7 depicts this analysis for every study site. The following table reveals that 16% of all detected polygons are present in the image data of one type exclusively (e.g., PCA234). However, if we apply data composites, the number increases remarkably up to 40% (Table 7). On the other hand, for linear features, the total amount of unique feature detection is considerably lower and ranges from 1 to 7%. Therefore, it may be stated that the PCA image material (CC PCA123, CC PCA234, DS123, DS234) actually increases the quantity of polygon and line features that are searched for. However, one should be aware that working solely on PCA composites would be insufficient and unreliable. Finally, it must be remarked that NDVI and HPF images were the only material where no unique features were observed. Consequently, employing NDVI and HPF should be considered of secondary importance, applicable only to support identification.

Additionally, Table 8 shows unique feature detections related to the season of image data acquisition. Numerical statistics of unique artifact detection reveal that around half of the detected features were noticed in the data from a particular season and were not visible in the others (56% for surface features, 47% for linear features). The aggregated data indicate the most features for autumn (68%), then for winter (20%) and the fewest for spring (12%).

Table 8 Unique feature detections related to the season of the image data acquisition

Summary and conclusions

The adopted methodology of precise allocation of interpretative values to remote sensing materials for every detected artifact provided an opportunity to accumulate an extensive amount of information. It also provided the basis for a synthetic analysis regarding the methods of image processing on the one hand and the dates of recording on the other. The assessment of the applicability of the materials prepared for photo interpretation led to the following conclusions:

1. The significance of recording the site in various seasons is very high—the application of data derived from three seasons increased the number of designated locations by around 50% (the so-called unique detections). If it is necessary to conduct works in merely one dataset, it is recommended to employ autumn data (the most interpretative point values were awarded to the autumn data—38% of all). However, the scores for the remaining data, i.e., from winter and spring, are on a very similar level of 30–31%. Owing to high numbers of unique detections, working on data from one acquisition period leads to the decrease in detection by around 30–38% (Fig. 9).

Fig. 9
figure 9

Number of detected features resulting from using data for a specific season for polygons (A) and lines (B)

2. The importance of image processing is considerable. The work on color compositions from PCA and decorrelation stretch provided a greater amount of new locations (the so-called unique detections) than CC TC and FCC CIR themselves—31% more for surface features and 5% more for linear ones (the weighted average of 13.7%). No unique feature detection was obtained with the application of NDVI data. Having differentiated data available in numerous cases provides additional reliability of detection and facilitates marking the boundaries of features.

Owing to the new methodology, based on the obtained results, the impact of data acquisition from different seasons can be estimated more precisely—to be twice as significant as additional image processing from one date of recording (increase/decrease of detections: ~ 30–37% to 14.5% > 2) (Fig. 10).

Fig. 10
figure 10

Number of detected features resulting from using additional imaging processing for polygons (A) and lines (B)

3. Surprisingly low usefulness was noted for NDVI and HPF methods. It seems that the searching and delineation of artifacts based on image features (boundaries, texture, spectral features) that vary and change even within a single object limits the usefulness of an analysis based only on biomass intensity (as is the case with NDVI) or on object boundaries (as is the case with HPF). Perhaps the complementary use of spectral indices other than NDVI (including soil) would improve results in this group of data. On the other hand, for methods based on edge detection, it seems that an additional problem is the relatively large dimensions of the artifacts sought, often of a fuzzy nature. This is clearly visible for the reported very low suitability of HPF for polygons. Better results were obtained for linear objects, but it should be remembered that even they are often wider objects than the edges in the images, and therefore difficult to detect by filtering. This problem was partly solved using a larger filtering window, allowing for correctly mapping of larger structures by eliminating small-scale structures (higher HPF11 scores than HPF3). The NDVI and HPF in comparison with the CC and PCA/DS should be considered supporting data. The usefulness of the HPF method would certainly be greater for detailed work and for the analysis of the boundaries of small-scale objects.

4. The conducted analyses proved the applicability of high-resolution satellite data in detecting local artifacts covered by vegetation and human activity and, consequently, difficult to identify in landscape during surface fieldwork. Generalization of data recorded from space ceiling is, undoubtedly, the factor which facilitates marking such features, both surface and linear ones. Generalization does not mean withdrawing from detailed imaging of the Earth surface, but the ability to include in an analysis and combine diverse information, volatility of the nature of artifacts in space, the interconnections between features and their environment, and the opportunity to determine their functions. All these aspects can be relatively easily studied both in the local (high-scale) and regional (low-scale) contexts.

Undeniably, the detected and described anomalies which are visible in the satellite images are merely introductory indications to conduct thorough fieldwork. They should be considered a tool which provides an opportunity to reduce field research to areas of potentially highest chances for discovering the siting of the camp. Moreover, the possibility to carry out small-scale work on images gives the unquestionable advantage over fieldwork, which is usually performed within a short period of time, in a particular season, and in limited spatial coverage.