1 Introduction

Coronal holes (CHs) are large-scale features in the solar corona often characterized by reduced emission in X-ray and extreme ultraviolet (EUV) which are associated with open magnetic field lines of a dominant polarity. Coronal plasma is accelerated along the open field lines causing a high velocity outflow of particles, often referred to as fast solar wind or high speed solar wind stream (HSS). The plasma depletion causes a reduction of density and temperature in these regions in comparison to the surrounding solar corona. Thus, CHs can be observed as dark structures in the EUV and X-ray emission (see e.g., Schwenn, 2006; Cranmer, 2002, 2009).

To investigate the morphology and intensity of CHs as observed in EUV, as well as their underlying photospheric magnetic field, the identification and extraction of CH boundaries are key. There exist multiple approaches to this topic with one of the most popular using a single wavelength, intensity-based threshold approach on EUV observations. Due to the high contrast and the optimal filter sensitivity, the coronal emission line of 11 times ionized iron (Fe xii: \(193/195\) Å) is often used to extract CHs (e.g., Krista and Gallagher, 2009; Rotter et al., 2012, 2015; Reiss et al., 2015; Caplan, Downs, and Linker, 2016; Boucheron, Valluri, and McAteer, 2016; Hofmeister et al., 2017; Heinemann et al., 2018a). Other intensity based approaches include multi-thermal emission recognition (Garton, Gallagher, and Murray, 2018) and spatial possibilistic clustering (Verbeeck et al., 2014). A different concept is to model the open field, which characterizes CHs, using photospheric magnetograms. Examples include the potential field source surface model (PFSS; Altschuler and Newkirk, 1969), its improved version including the Schatten current sheet, the Wang–Sheeley–Arge model (WSA model; Arge and Pizzo, 2000) and the MULTI-VP model (Pinto and Rouillard, 2017). Studies comparing the two different conceptual approaches have shown significant differences in the size, location, shape and occurrence of the dark and/or open structures defined as CHs (e.g., Lowder et al., 2014; Lowder, Qiu, and Leamon, 2017; Linker et al., 2017; Wallace et al., 2019; Huang, Lin, and Lee, 2019; Asvestari et al., 2019). Additionally new approaches like machine learning/neural networks (e.g., Illarionov and Tlatov, 2018) and extraction methods based on plasma properties (differential emission measure; Raymond and Doyle, 1981; Hahn, Landi, and Savin, 2011) are the topic of current research.

Reliably defining CH boundaries is not only relevant for studying coronal and photospheric properties and their evolution but is also of major scientific importance towards space weather research. Empirical relations between CH area and measured solar wind speed at 1 AU (e.g., Nolte et al., 1976; Vršnak, Temmer, and Veronig, 2007; Tokumaru et al., 2017; Hofmeister et al., 2018) are used for forecasting purposes (e.g., Rotter et al., 2012, 2015; Reiss et al., 2016; Temmer, Hinterreiter, and Reiss, 2018). Moreover, the distance to CH boundary is an important parameter for MHD models simulating the solar wind distribution in interplanetary space (e.g., ENLIL: Odstrčil and Pizzo 1999, EUHFORIA: Pomoell and Poedts 2018 and CORHEL: Riley et al. 2012). When considering CH extraction, usually there is the choice between manual and automated algorithms both of which have advantages and disadvantages. On the one hand, manual extraction of CHs requires a lot of time and experience in order to get reliable results. On the other hand, automated extraction methods are prone to significant errors and artifacts.

In the first part of this study we present a new method for extracting CH boundaries in EUV images by using an intensity threshold which is modulated by the intensity gradient of the CH boundary. The method is based on the work of Rotter et al. (2012), Rotter et al. (2015), and Krista and Gallagher (2009) and is incorporated into an easy-to-use GUI application developed in SSW-IDL. The Collection of Analysis Tools for Coronal Holes (CATCH) application enables users to easily extract and analyze CHs in a supervised semi-automated fashion. CATCH uses a modulated intensity threshold method to extract CH boundaries from EUV images and analyzes the associated properties. In addition, it offers the possibility to investigate the underlying magnetic field. In the second part, we use CATCH to investigate 707 CHs, covering the complete time range of the operational lifetime of the Solar Dynamics Observatory (SDO; Pesnell, Thompson, and Chamberlin, 2012) so far, starting in May 2010, running until February 2019. We derive statistical CH properties of the area, intensity, and the underlying magnetic field including the magnetic fine structure over nearly the full Solar Cycle 24. Furthermore, we present how the parameters for an optimal CH extraction vary during the solar cycle. The CH dataset is available as supplementary material and as an online catalogue under the CDS database using the Vizier catalogue service (Ochsenbein, Bauer, and Marcout, 2000).

2 The “Collection of Analysis Tools for Coronal Holes”

The Collection of Analysis Tools for Coronal Holes (CATCH) was created in order to collect and structure CH identification, extraction and analysis in a handy and fast way without the disadvantages of automatic algorithms as described in Sections 2.1 and 2.2. It enables the user to download and process EUV filtergrams (\(193/195\) Å) and line-of-sight (LoS) magnetograms. CATCH is able to handle data from different spacecraft missions covering the interval from 1996 until now. These are SDO, the Solar Terrestrial Relations Observatory (STEREO; Kaiser et al., 2008) and the Solar and Heliospheric Observatory (SOHO; Domingo, Fleck, and Poland, 1995). Data from the Atmospheric Imaging Assembly (Lemen et al., 2012, AIA; 193 Å), the Extreme ultraviolet Imaging Telescope (Delaboudinière et al., 1995, EIT; 195 Å) and the Extreme UltraViolet Imager (Howard et al., 2008, EUVI; 195 Å) as well as from the Heliospheric and Magnetic Imager (HMI: Schou et al., 2012; Couvidat et al., 2016) and the Michelson Doppler Imager (MDI: Scherrer et al., 1995) can be processed. Additionally, user supplied full–disk images can also be analyzed.

The user can perform CH boundary detection, extraction and analysis using a manually adjustable intensity threshold. The threshold range, in which reasonable CH boundaries can be extracted, can be derived from the intensity histogram of the solar disk. After specifying a threshold, it is applied to the full solar disk and the user may select the structure of interest to calculate its parameters and to get an estimate of the boundary stability and uncertainty. Then by varying the threshold to minimize the boundary uncertainty (\(\epsilon _{A}\)), the user can find an optimized CH boundary in an easy and fast way, even without previous experience in CH extraction. For deriving the properties of a CH, CATCH analyzes five boundaries in an interval of 1 DN (data number) centered around the selected threshold and calculates the mean values. The maximum deviation of the derived values from the calculated mean is the uncertainty. After extracting a satisfactory boundary from EUV filtergrams, CATCH can analyze the properties of the CH. The boundary may then be used on LoS magnetograms (if available) to analyze the underlying photospheric magnetic field of the CH and its fine structure represented by flux tubes. Figure 1 shows an example of how to find the optimal threshold by considering the uncertainty of the extracted CH boundary. The red contour represents the CH boundary (of the chosen threshold), the blue shaded areas are the uncertainties of the boundary. The best boundary for this CH can be identified as shown in panel (d), where the blue shaded area is smallest in comparison to the area enclosed in the CH boundary.

Figure 1
figure 1

Example images of boundary extraction of a CH on August 31, 2014. The red line is the CH boundary (of the chosen threshold) and the blue shaded areas are the uncertainties as described in Section 2.1. Panel (a) shows the CH without boundaries and panel (b) with the CH boundary of a threshold of 31 DN which gives an uncertainty of \(\epsilon_{A}= 47 \%\). In panel (c) a threshold of 37 DN is used which gives an uncertainty of \(\epsilon_{A}= 16\%\). The best boundary for this case (e.g., the lowest uncertainty) is reached with a threshold of 41 DN and is shown in panel (d) with \(\epsilon_{A}= 10\%\).

CATCH calculates a variety of properties of the extracted CH, which include morphological properties, the intensity, boundary stability as well as properties of the underlying photospheric magnetic field and its fine structure (for the full list of calculated parameters, see Table 1). The calculations are based on the studies by Hofmeister et al. (2017), Heinemann et al. (2018a), and Heinemann et al. (2018b).

Table 1 Parameters calculated with CATCH.

For proper image processing and analysis the SSW (SolarSoftWare) package under IDL (Interactive Data Language) is required, therefore the tool is written in SSW-IDL and the code, including an user-manual, is available on the authors GitHub page (https://github.com/sgheinemann/CATCH) or by contacting the author directly via E-mail.Footnote 1 Figure 2 shows the GUI structure of CATCH, displaying the main menu, the data download widget as well as the CH extraction and the magnetic field analysis widget. A more detailed description of CATCH and its functionalities can be found in the user manual.

Figure 2
figure 2

Screenshots of the CATCH GUI showing the main menu (left bottom), the data download application (right bottom), the coronal hole extraction option (left top) and the option for the magnetic field analysis (right top)

2.1 Coronal Hole Extraction from EUV 193/195 Å Filtergrams

2.1.1 Intensity Threshold

The basic principle under which CH extraction operates is an intensity-based threshold technique applied to EUV filtergrams of sufficient contrast, which was developed by Rotter et al. (2012). To find an optimal threshold Krista and Gallagher (2009) derived that an intensity distribution of the solar disk (or a subfield) with a CH present differs significantly from a distribution where CHs are absent. Figure 3 shows as an example the intensity distribution of the solar disk on May 29, 2013. Hereby, the first maximum, seen at lower intensities, represents one or multiple dark structures on the solar disk. It was proposed that an optimal threshold for a CH boundary lies somewhere in the following minimum. However, note that this characteristic shape is often not well established, especially if no large and well defined CHs are present on the solar disk. Also, it has been found that there is a strong solar cycle dependence of the solar disk EUV intensity distribution, which is additionally amended by the current conditions on the Sun (e.g., increased abundance of dark structures or bright active regions). As such, neither a fixed threshold nor a median-intensity dependent threshold, which aims to mitigate intensity variation, perform continuously well. Frequent manual adjustments are needed for optimized results. Thus, the aim is to use an adjustable threshold depending on the current solar conditions, both locally and globally.

Figure 3
figure 3

Normalized distribution of AIA/SDO 193 Å intensities of the solar disk on May 29, 2013 12 UT (see inset). The maximum around 20 DN represents pixel located inside the CH boundary and the shaded area the reasonable threshold range as proposed by Krista and Gallagher (2009).

2.1.2 Intensity Gradient, Uncertainty Estimation and Calculation of CH Properties

The common intensity-based methods have the drawback that the threshold range in which the boundary is considered optimal, is large (see Figure 3, shaded area). To narrow down the range of reasonable thresholds, we propose an intensity gradient method to estimate the boundary stability and give relevant errors to calculated properties. Recent studies, investigating CHs and their boundaries, have revealed a steep intensity gradient at the CH boundary (Hofmeister et al., 2017). This is due to a strong decrease of the plasma density of quiet Sun temperatures around 1.6 MK (Hahn, Landi, and Savin, 2011). Figure 4 shows a representative intensity profile perpendicular to the CH boundary layer, from inside of the CH (\(x=0\)) to outside (\(x=1\)) in arbitrary scale. The y-axis shows the intensity that is scaled to the maximum in this interval which represents the quiet Sun intensity. We see that within a small layer the intensity drops by at least \(40\%\) from the quiet Sun level. This small layer represents the range where CH boundaries are usually extracted. Assuming that the CH boundary is best represented where the intensity profile is changing most strongly, we define the optimum boundary to be placed at the steepest intensity gradient (i.e., gradient has a maximum). In an ideal case, the implication of this definition is that the boundary is approximately constant for small threshold variations around the maximum intensity gradient threshold. This physical 1D principle of the maximum intensity gradient perpendicular to the boundary can be extended to 2D to consider the entire boundary instead of one localized cross–section. This can be done by calculating the change of the CH area for a given intensity threshold by varying the threshold slightly. Using the assumption of a similar intensity gradient along the full boundary, a minimum in the change of the area indicates that, on average, the boundary is located at the maximum gradient, i.e. the optimal threshold.

Figure 4
figure 4

A representative intensity profile perpendicular to the CH boundary and its derivative. Both are scaled to the maximum (\(I_{ \max }=1\)). The \(x\)-axis represents the radial distance from inside of the CH (\(x=0\)) across the boundary to the surrounding quiet Sun (\(x=1\)).

With this definition of the boundary we aim to minimize the variations in different parameters (first of all the area) to properly estimate the boundary. Practically, this is done by calculating the parameters not only for the boundary defined by the selected threshold but also for boundaries of slightly larger and smaller thresholds. From this set of boundaries, a mean value (\(\bar{P}\)) and its uncertainty (\(\epsilon _{P}\)) is calculated. The uncertainty corresponds to the maximum deviation between the determined values and the mean value.

A reasonable CH boundary can be determined by finding the threshold that minimizes \(\epsilon _{A}\) (uncertainty in the CH area) and the CH properties are then given as:

$$ P_{\mathrm{CH}}=\bar{P} \pm \epsilon _{P}. $$
(1)

2.2 Analysis of the Underlying Photospheric Magnetic Field

To extract and investigate CHs, it is not sufficient to only use the information extracted from EUV filtergrams as it lacks information as regards the underlying magnetic structure. The magnetic field configuration is what distinguishes CHs from other dark structures (e.g., filament channels, coronal dimmings) in the solar corona. Studies suggest that it may be possible to differentiate those structures purely from intensity filtergrams (Reiss et al., 2014) but a clear distinction cannot always be made. A much more precise approach is the definition based on the underlying magnetic field (Reiss et al., 2015; Delouille et al., 2018). CHs are defined by their open magnetic field configuration which is reflected in the ratio of the total signed to the unsigned magnetic flux inside the CH and in the skewness of the magnetic field distribution. Filaments and filament channels on the other hand ideally show a symmetric distribution between pixels of positive and negative magnetic flux (closed magnetic structures), as they are located along polarity inversion lines. Thus, analyzing the magnetic field underlying an extracted dark structure reveals its magnetic configuration and enables a clearer classification as CH or filament.

The calculation of the photospheric magnetic field underlying a CH is often performed by a simple projection of the EUV extracted boundary onto the photospheric magnetogram (line-of-sight or radial). However, it is important to stress that there are several uncertainties in the extraction. First of all, the height difference between coronal imaging in EUV (EUV 193 Å: 1.01 – 1.05 R) and photospheric magnetic field (1.00 R). Second, the unknown expansion of the magnetic field over the EUV height. Simple projections will have an increased effect on the CH boundary the further it is located away from the center of the solar disk. Another source of uncertainty arises from the noise level, resolution and smoothing of the magnetogram. This can cause non-trivial effects on parameters like the unsigned magnetic flux, flux balance and skewness of the magnetic field. This complicates a comparison of magnetic field properties derived from differently prepared magnetograms. When interpreting such parameters a relative comparison should be preferred rather than relying on absolute values.

Hofmeister et al. (2019) showed that the photospheric magnetic field underlying CHs can be divided into 3 categories: \(\approx 22 \pm 4\%\) of the signed magnetic flux is contributed by a slightly unbalanced background field. \(\approx 5.0 \pm 0.1\%\) come from small scale unipolar magnetic elements (flux tubes, FTs) nearly symmetrically distributed over both polarities and which are associated with the super-, meso-, and granular motion of the photosphere. The major contribution, on average \(\approx 69 \pm 8\%\), comes from strong and long-lived FTs which have almost exclusively the dominant polarity of the CH. To map these properties, we calculate the contribution of FTs to various CH parameters. We define two FT categories, strong and weak (with the category weak also covering medium FTs; for more details see Heinemann et al. 2018b). FTs are extracted as structures of pixels above a magnetic field strength of 20 G and the mean magnetic field strength of each structure determines the category. If the mean magnetic field strength of one FT is between an absolute value of 20 to 50 G it is categorized as weak, if exceeding 50 G then it is considered strong.

3 Statistical Analysis

3.1 Data and Data Processing

For the presented statistical study, we did not exhaust all the possibilities of CATCH but constrained the used dataset to one spacecraft. SDO was chosen over STEREO because of the availability of magnetic field maps, and over SOHO because of the better resolution and contrast. The dataset ranges from May 2010 until February 2019. The EUV 193 Å filtergrams observed by AIA/SDO as well LoS magnetograms from HMI/SDO were acquired in a 1 day cadence using the Joint Science Operations Center Servers via the CATCH download application. For the magnetograms the 720s LoS data product was preferred over the 45s due to the lower photon noise of \(\approx 3\) G measured near the disk center and a better signal-to-noise ratio (Couvidat et al., 2016).

The EUV filtergrams and magnetograms were prepared to level 1.5 using standard SSW-IDL routines and the EUV filtergrams were down-scaled from a pixel scale of \(4096 \times 4096\) to \(1024 \times 1024\) to significantly enhance the processing speed. Before the extraction, the full-disk filtergrams were corrected for limb brightening using the annulus limb-brightening correction (Verbeeck et al., 2014) which is available in CATCH. The boundaries were smoothed using circular (2-pixel radius) morphological operators (open and close). To avoid the loss of information on the magnetic fine structure, the magnetograms were not down-scaled. The EUV extracted boundaries were re-scaled to fit the magnetograms resolution.

Note that the effects on boundary detection as well as on the calculation of the parameters in the EUV due to down-scaling are negligible. E.g., we tested for an isolated CH, located close to the disk center on May 29, 2013 how the area of the extracted CH changes for a fixed threshold (\(43\%\) of the median solar disk intensity). By varying only the resolutions between \(4096 \times 4096\) and \(1024 \times 1024\), (without smoothing) we find a deviation of the extracted CH of less than \(0.5\%\). Other parameter behave similar. As such, the uncertainties from using different filtergram resolutions to extract CH boundaries is much lower than uncertainties in the extraction itself.

From the daily EUV images, dark structures located close to the central meridian were extracted (center of mass, CoM, located \({\pm}\, 10^{\circ}\)). The extracted structures were limited to the central meridian to reduce longitudinal projection effects due to the spherical nature of the Sun. Polar CHs as well as polar connected CHs were excluded for the same reason. Each structure was extracted only once for each solar disk passage to avoid statistical biases because of similar datapoints. The magnetic properties of each dark structure were investigated and non-CH structures were identified (defined as structures with a flux balance below \(10\%\) or a magnetic field skewness below 1) and discarded from further analysis. This approach yielded 707 CHs over a wide range of sizes and latitudes spanning a time range of more than 8 years.

3.2 Results

We analyzed 707 CHs near their central meridian passage and categorized them by their boundary stability. All the parameters presented here are calculated with CATCH. Our findings are as follows.

3.2.1 Assessment of the Stability of the Extracted Boundaries

First, we assessed the stability of the extracted CH boundaries by analyzing \(\epsilon _{A}\) for the optimal threshold for all 707 CHs. Figure 5a shows the CH area (\(A_{\mathrm{CH}}\)) against its uncertainty (\(\epsilon _{A}\)). We find a dependence on the area which seems to have two causes: (i) the larger impact of stray light for smaller CHs which could partly be compensated by performing a PSF deconvolution before the CH extraction and (ii) the non-zero extent of the boundary layer whose area is growing linearly in contrast to the total CH area (which grows according to a power law). This causes larger percentage variation for smaller CH areas. To correct for this dependence we introduce the category factor (\(\zeta \)) which is given by

$$ \zeta =\frac{\epsilon _{A}}{f_{\mathrm{fit}}(A_{\mathrm{CH}})}, $$
(2)

with \(f_{\mathrm{fit}}(A)\) being the fit shown in Figure 5a as the red line. It is given by

$$ f_{\mathrm{fit}}=3.31\times (A_{\mathrm{CH}})^{-0.53}+4.71, $$
(3)

with \(A_{\mathrm{CH}}\) in units of \(10^{10}\mbox{ km}^{2}\). The resulting \(\zeta \)-factor as a function of CH area is shown in Figure 5b. From this we define three categories of boundary stability:

  1. i)

    high: \(\zeta \leq 1\),

  2. ii)

    medium: \(1 < \zeta \leq 2\),

  3. iii)

    low: \(\zeta > 2\).

We find that \(60.0\%\) of the CHs under study have a high boundary stability, \(34.2\%\) a medium boundary stability and only \(5.8\%\) are of low boundary stability.

Figure 5
figure 5

Panel (a) shows a scatterplot of the CH area against its maximum deviation as given in Section 2.1. The red line shows the fit which is used to calculate the category factor (\(\zeta \)). \(\zeta \) is plotted against the area in panel (b). The shaded areas (green, orange and red) represent the stability assessment of the boundaries as high, medium and low, respectively.

3.2.2 Thresholds

Second, we investigated how the optimal threshold to extract CHs is distributed and varies over the course of the observed time period from 2010 to 2019. This period nearly covers the whole Solar Cycle 24. Figure 6 shows the threshold over time (a) in absolute counts (DN) and (b) in percent of the median intensity of the solar disk. The black line in panel (d) shows the smoothed daily sunspot number by SIDC/SILSO,Footnote 2 which acts as a proxy of the solar activity. We find a clear solar cycle dependence in the optimal threshold (between 25 – 55 DN), which cannot be correctly mitigated by modulation with the median solar disk intensity (of the full solar disk). It seems that the correction is too strong, especially during solar minimum. Additionally, because of the individual configuration of CHs, the optimal threshold may vary by up to \(\approx 20\) DN for any given time. The distribution of thresholds (in DN) is shown in Figure 7a (cyan) with a mean of 43.9 DN and a standard deviation of 12.1 DN. The distribution shifts from \(53.5 \pm 8.6\) DN during solar maximum (red, 2012 – 2014) to \(29.7 \pm 4.6\) DN during decline and minimum (blue, 2017 – 2019). When considering the threshold in percent of the median solar disk intensity (Figure 7b), the mean threshold is \(40.4 \pm 6.3 \%\) with a variation between the solar maximum (red, 2012 – 2014) with \(37.3 \pm 5.0 \%\) and the decline and minimum (blue, 2017 – 2019) with \(46.2 \pm 5.5 \%\). We find the threshold to be independent of the CH size. We believe that the large change of the optimal threshold (in DN) is due to the change in intensity due to the solar cycle evolution (e.g., number of active regions, higher quiet Sun level,…). Figure 8 shows six CHs extracted with a different optimal threshold, varying from 25 to 65 DN.

Figure 6
figure 6

Evolution of the optimal intensity threshold for CH extraction as a function of time. Panel (a) shows the intensity in counts (DN) and panel (b) shows the threshold in percent of the median intensity of the solar disk at the time of the extraction. The blue line represents the mean value and the blue shaded area the \(1\sigma \) range over the whole period studied. The dashed-red line in panel (b) represents the often used threshold value of \(35\%\) of the median solar disk intensity. The errorbars indicate the variation of the threshold for the uncertainty estimation as described in Section 2.1. The third panel (c) shows the latitudinal positions of the CoMs of the CHs under study. The bottom panel (d) shows the smoothed daily sunspot number as provided by the SIDC/SILSO. The black line is the total sunspot number while the red and blue lines show the sunspot number for the northern and southern hemisphere, respectively.

Figure 7
figure 7

Distribution of the optimal threshold during the entire SDO-era (cyan), covering solar maximum (red) and the decline and minimum phase (blue). Panel (a) shows the threshold in absolute counts (DN) and panel (b) in percent of the median solar disk intensity.

Figure 8
figure 8

Sample of CHs extracted with varying optimal thresholds (as defined in Section 2.1). The sample represents the threshold value distribution as given in Figure 7. The optimal thresholds range form 25 to 65 DN and are primarily caused by a different intensity quiet Sun level rather than large changes in the CH intensity. The red boundary corresponds to the boundary derived by the optimal threshold and the blue shaded areas are the uncertainties (see Section 2.1.2). All images are equally scaled.

3.2.3 Area, Intensity, and Position

After investigating the extraction mechanism in terms of intensity threshold, we analyze how CH properties are distributed in our data-set. Figure 9 gives an overview of the main CH properties, i.e., the distribution of the areas, latitudes, and intensities of all CHs under study. Figure 9a shows the distribution of the deprojected areas. We find CH areas ranging from \(1.6 \times 10^{9}\mbox{ km}^{2}\) to \(1.8 \times 10^{11}\mbox{ km}^{2}\), with an average of \((2.69 \pm 2.73) \times 10^{10}\mbox{ km}^{2}\). CHs with an area below \(2\times 10^{10}\mbox{ km}^{2}\) account for \(56\%\) of all CHs, whereas only \(5\%\) of CHs exceed an area of \(8\times 10^{10}\mbox{ km}^{2}\).

Figure 9
figure 9

Distribution of CH properties sorted corresponding to the category factor (green: high, orange: medium and red: low). Panel (a) show the area distribution, panel (b) the distribution of the latitudinal location of the center of mass and the panels (c) and (d) show the distribution of the median and mean intensity within the extracted CH boundaries.

The CoM of CHs under study are distributed over latitudes ranging from \(-\,63^{\circ}\) to \(+\,63^{\circ}\) (Figure 9b). \(39\%\) of all CHs, which are located between an absolute value of \(40^{\circ}\) and \(20^{\circ}\), can be considered medium-latitude CHs and \(50\%\) are considered low-latitude CHs, located below \(20^{\circ}\). We find the CHs to be nearly balanced between the hemispheres (south: \(48\%\) CHs; north: \(52\%\) CHs) without a clear relation to the solar activity (see Figure 6c).

We calculate the median and mean intensity in the 193 Å wavelength for each CH of the dataset. The mean of the median intensities is calculated to be \(29.0 \pm 8.5 \) DN (Figure 9c) and the mean of the mean intensities is \(29.5 \pm 8.4 \) DN (Figure 9d). When only considering the \(50\%\) and \(25\%\) pixel with the lowest intensities we find the mean intensity to be \(23.9 \pm 7.4 \) DN and \(21.2 \pm 6.9 \) DN, respectively.

We investigated the intensity profile of the cross-section of the CHs. To that end we cut the CHs longitudinally through their CoM and superpose the intensity profiles. Figure 10a shows the superposed mean profile (black line) with the \(1\sigma \) uncertainties represented by the shaded area and the second panel (b) shows the derivative of the mean profile. Note that the intensity profiles were scaled so that the CH boundaries correspond to \(x=\pm 1\). We find that when using CATCH to extract CHs the boundary is consequently extracted at the highest gradient in the intensity, which was the initial assumption. With this we can highlight the CH extraction according to a physical principle in contrast to an arbitrarily chosen (or empirically found) value.

Figure 10
figure 10

The superposed intensity profile of the longitudinal cross-sections at the CoM of the CHs under study (a) and its derivative (b). Before superposing, each intensity profile is scaled so that \(x\pm 1\) represents the CH boundaries. The black line is the mean profile and the shaded gray–blue area represents the \(1\sigma \) standard deviation. The dotted vertical lines mark the location of the CH boundary.

3.2.4 Properties of the Underlying Photospheric Magnetic Field

The analysis of the magnetic field properties underlying the CHs yielded a near symmetric distribution of positive and negative polarity CHs (Figure 11a). The mean of the absolute values of the signed mean magnetic field strength (\(|B_{s}|\)) is \(2.9\pm 1.9\) G (Figure 12c). There seems to be no correlation between \(|B_{s}|\) and the CH area (Figure 12a; see also Figure 14). There is, however, a slight difference in the distribution of \(|B_{s}|\) between the solar maximum against the decaying and minimum phase. In the maximum (2012 – 2014) the absolute value of the mean magnetic field strength exhibits a slightly higher average and a wider spread with \(3.4 \pm 2.1\) G than in the decaying phase and solar minimum (2017 – 2019) with \(1.6 \pm 0.8\) G (Figure 12b,c). The increased magnetic field strengths within CHs during solar maximum may be the result of enhanced magnetic activity during the reversal of the solar magnetic field which causes more active regions to appear and consequently decay (Karachik, Pevtsov, and Abramenko, 2010). In Figure 11b the unsigned mean magnetic field strength is shown. We find the mean to be \(7.3 \pm 1.9 \) G and find that \(90\%\) of the CHs have a value below 10 G. We note that the unsigned magnetic flux is strongly dependent on the magnetogram resolution and smoothing, as it is dominated by the sum of the noise of the magnetic field pixel distribution. Therefore, the unsigned magnetic flux should be considered in relative comparison (e.g., between two CHs) rather than in absolute values. This is also true for the skewness and the flux balance. For example, for the isolated CH located near the disk center on May 29, 2013 the skewness changes from 9.2 at a resolution of \(4096 \times 4096\) to 7.4 at a resolution of \(1024 \times 1024\), and the flux balance changes from \(57.2\%\) to \(66.7\%\), respectively.

Figure 11
figure 11

Distribution of magnetic CH properties sorted corresponding to the category factor (green: high, orange: medium and red: low). Panels (a) and (b) show the distribution of the signed and unsigned mean magnetic field strength of the photospheric field below the CH. The distributions of the signed and unsigned flux are shown in the panels (c) and (d). The flux balance, the ratio between the signed and unsigned magnetic flux, is shown in panel (e). Panel (f) shows the distribution of the values for the skewness of the magnetic field distribution.

Figure 12
figure 12

In panel (a) the CH area is plotted against the absolute value of the signed mean magnetic field strength (\(|B_{s}|\)) in double logarithmic depiction. Panel (b) gives the temporal evolution of the absolute value of the signed mean magnetic field strength (\(y\)-axis is logarithmically scaled). Panel (c) shows the distribution of \(|B_{s}|\) for the whole dataset in cyan, for the solar maximum in red and for the declining phase and the minimum in blue.

The signed magnetic fluxes of the CHs seem to be symmetrically distributed between both polarities. The mean of the absolute value is given at \((7.2 \pm 13.5) \times 10^{20}\) Mx with a maximum value of \(6.9 \times 10^{21}\) Mx. The unsigned magnetic fluxes range from \(7.2 \times 10^{19} \) to \(2.0 \times 10^{22} \) Mx with a mean of \((2.0 \pm 3.7) \times 10^{21}\) Mx (Figure 11c,d). The flux balance, the ratio of the signed magnetic flux to the unsigned magnetic flux and with that hinting towards open magnetic flux, shows that the CHs are distributed from \(10\%\) to \(87\%\) with a mean of \(36,3 \pm 16.3\%\) (Figure 11e). The \(47\%\) of CHs that show positive polarity have a mean flux balance of \(37.4\pm 16.7 \%\) whereas the \(53\%\) of CHs that show negative polarity have a mean of \(-(35.6 \pm 15.7) \%\). The shift in the magnetic field distribution that characterizes CHs is shown in Figure 11f. The mean of the absolute value of the skewness is \(8.2 \pm 2.1 \), clearly showing the asymmetry in the magnetic field caused by the abundance of open magnetic field lines. There is no clear difference between polarities or boundary categories.

3.2.5 Flux Tubes

Besides the magnetic parameters for the global structure of a CH, CATCH can analyze the fine structure of the magnetic field in form of FTs or magnetic elements. We analyzed the contribution of the small unipolar FTs categorized as weak (\(20~\mbox{G} < |B_{s,\mathrm{FT}}| < 50~\mbox{G}\)) and strong (\(|B_{s,\mathrm{FT}}| > 50\) G) to the CH parameters. Figure 13 shows the distribution of FT number (a, b), area ratio (c, d) and flux ratio (e, f) for weak and strong FTs respectively. There is no significant difference for the three categories of boundary stability. The number of extracted FTs ranges from 15 to 2670 for the weak and from 1 to 223 for the strong FTs. This gives an average of \(135 \pm 30 \) FTs per \(10^{10}\mbox{ km} ^{2}\) for weak and \(15 \pm 6 \) FTs per \(10^{10}\mbox{ km}^{2}\) for strong FTs. The number of strong FTs per area scales with the signed mean magnetic field strength of the CH (Pearson Correlation Coefficient (ccp): 0.74 with a \(95\%\) confidence interval (CI) of \([0.70,0.78]\)), whereas the weak ones do not (cc\(_{p}=-0.12\) with a \(95\%\) CI of \([-0.18,-0.06]\)).

Figure 13
figure 13

Distribution of FT properties within the CH sorted corresponding to the category factor (green: high, orange: medium and red: low). Panels (a) and (b) show the distribution of the number of FTs per CH (weak and strong, respectively). The distribution of the area ratio of FTs for weak and strong FTs is shown in panel (c) and (d). Panels (e) and (f) show the distribution of the FT flux ratio for weak and strong FTs.

When analyzing the contribution of the weak and the strong FTs to the area and signed magnetic flux of the CH we find that the strong ones are dominating. For most CHs (\(90\%\)) the contribution of the strong FTs to the signed magnetic flux is between \(40\%\) and \(80\%\) with a mean of \(58.1\pm 13.1\%\), although they only cover between \(0.5\%\) and \(6\%\) of the CHs area (on average \(2.6 \pm 1.8\%\)). We find that the coverage of the strong FTs is strongly correlated with the mean magnetic field strength of the CH (cc\(_{p}=0.98\) with a \(95\%\) CI of \([0.97,0.98]\)). In contrast, the weak FTs only contribute \(16.3 \pm 8.8 \%\) of the signed magnetic flux and cover a rather constant CH area of 1.5 – 4% (\(84\%\) of CHs) without a correlation to the mean magnetic field strength of the CH (cc\(_{p}=-0.02\) with a \(95\%\) CI of \([-0.07,0.03]\)).

We note that in the Appendix, Figure 14 the pairwise Spearman correlation coefficients of all parameters calculated in the statistical study using CATCH can be viewed. It should be minded that not all correlations imply a causal relationship, but might be correlated by definition.

Figure 14
figure 14

Spearman correlation matrix of all the CH properties derived in the statistical analysis. The values, the square size as well as the color scheme represent the value of the Spearman correlation coefficient which is given in percent depiction. The black asterisks mark the significance level of the correlation: **** indicates a significance level of \(p \leq 0.001\), *** indicates \(p \leq 0.01\) and ** indicates \(p \leq 0.05\). The values for all correlation coefficients with a significance level higher than 0.05 were omitted. The parameters are listed in Appendix.

4 Discussion

Using the intensity profile perpendicular to the boundary layer of CHs we were able to improve the intensity-based threshold method by Rotter et al. (2012), based on the concept initially proposed by Krista and Gallagher (2009). By adding an estimation of the boundary stability and uncertainty, local as well as global influences on the CH intensity can be described and compensated. By investigating the performance of the newly adjusted threshold method we highlight the advantages of such a supervised method.

4.1 CATCH

Reliable extraction of CHs from EUV observations is an important step towards understanding their configuration, a necessary aspect in solar- and space weather research and space weather applications. Without a precise definition of the CH boundaries, which is then applied to all CHs under study, an analysis is often biased by differences in the extracted boundary and by local conditions which lead to significant problems in the comparison of different studies. Approaches that aim to optimize a threshold for full–disk images (Rotter et al., 2012; Hofmeister et al., 2017; Garton, Gallagher, and Murray, 2018) or synoptic maps (Hamada et al., 2018) can adjust for global changes in the intensity distribution, but do not take into account the local variations. In another study, a dual-threshold-based approach (ezseg: Caplan, Downs, and Linker (2016)) was developed with the aim to consider local variations, if the threshold pair is properly tuned.

Our analysis showed that the optimal threshold (as defined in Section 2) between CHs in one single filtergram may vary significantly due to the abundance and proximity of active regions, quiet Sun areas and bright loops. Wendeln and Landi (2018) found from differential emission measure analysis (DEM) that a significant contribution within CHs comes from stray light of nearby active regions, high overlying loops and the instrumental point spread function (PSF). It is reasonable to suggest that these effects also influence the CH extraction in one (or multiple) wavelengths. However, by individually assessing the boundary of each CH, effects coming from local conditions can be mitigated. We also tested the influence of the PSF by deconvoluting the images before extraction (this option is available in CATCH using SSW-routines) and found clear enhancement in the extraction process but dismissed it for the statistical study because of the greatly increased processing time (up to a factor 100). There are faster options to perform PSF deconvolution when not using IDL (e.g., Prato et al., 2012), which have not been explored yet as CATCH is entirely written in SSW-IDL.

The intensity profile perpendicular to the CH boundary is very dependent on the coronal configuration outside the CH. Active regions have significantly higher intensities than the quiet Sun, but also loops associated with activity near the CH boundary show increased intensities. Enhancements near and at the boundaries may be the result of the CH evolution through the process of interchange reconnection (Madjarska and Wiegelmann, 2009; Edmondson et al., 2010; Yang et al., 2011). It is a known drawback of this method that by considering the average gradient along the full CH boundary, small scale conditions are neglected. The method approximates that the intensity gradient across the boundary of a given CH is constant along the boundary, which we know is not always true. This leads to uncertainties, which we approximate as presented in Section 2.1. To consider such small scale variations, a much more precise definition of the boundary needs to be established which requires a new approach for detecting CH boundaries. Automated threshold techniques are fast, but may extract several CHs in close vicinity which may or may not appear with merged boundaries depending on slight variations in the threshold. However, the threshold–based method described here delivers consistent results when manually supervised. This is due to the constrains set by the boundary gradient approach.

We therefore pursued to further develop this approach, having in mind the advantages of being computationally very inexpensive, fast and flexible. From the statistical results we derive, we find that our method consistently performs well (by standards of visual inspection) over the changing conditions of a full solar cycle and also mitigates local variations. Comparing to the method using a fixed intensity threshold of \(35\%\) of the median solar disk intensity (Vršnak, Temmer, and Veronig, 2007; Rotter et al., 2012; Reiss et al., 2016; Hofmeister et al., 2017; Heinemann et al., 2018a), we find significant deviations for the boundary we would consider as optimal. This is expected as automated methods are often judged by how close they come to manual or manually-tuned methods. We also find that the \(35\%\) of the median solar disk intensity is a good estimate for the mean threshold during solar maximum (Figure 6b red line; Figure 7). In our study the mean threshold for the time period of the solar maximum (2012 – 2014) is found to be \(37.3 \pm 5.0\%\) of the median solar disk intensity. This is very well shown in the solar cycle dependence of the threshold (Figure 6). The threshold may vary even up to 20 DN for a given filtergram but is additionally modulated by a global trend.

By considering all these factors we can highlight the importance of individually extracting CHs without neglecting the local variations on CH size scales. Although manual input is needed, the extraction method implemented in CATCH aims to be as objective as possible without specifying any underlying extraction conditions except for the approach of the boundary gradient.

4.2 Distribution over CHs of Solar Cycle 24

By analyzing the CHs of the SDO-era we not only gain a large sample of different CHs but also cover nearly one full solar cycle. As such, the sample includes CHs from the rising phase (\(\approx 2010 / 2011\)), the maximum phase (≈ 2012 – 2014), the decaying and minimum phase (≈ 2015 – 2019) of this cycle.

The CH parameters derived from the dataset are in good agreement with the study of Hofmeister et al. (2017) who studied 288 low-latitude CHs near the maximum of Solar Cycle 24 and are as such a subset of this study. They found that the CH sizes are distributed around a median of \(2.39 \times 10^{10}\mbox{ km}^{2}\) which is very close to the value derived in this study with a mean area of \((2.69 \pm 2.73) \times 10^{10}\mbox{ km} ^{2}\). Note here that the mean is strongly biased by the large amount of small CHs, of which a large portion is present in solar maximum. The spread in the CH sizes may also be influenced by the few large CHs (\(5 \%\) of CHs with an area exceeding \(8 \times 10^{10}\mbox{ km}^{2}\)). We excluded all polar and polar–connected CHs (as manually defined by the threshold tuning) from this statistical analysis which removes some of the largest CHs observed in this period from the study. This might be the reason why the extracted CH areas do not show the cycle dependence found by the solar cycle study by Lowder, Qiu, and Leamon (2017).

The mean signed magnetic field strength in our study shows a wider spread and higher average during the maximum phase than during the decaying and minimum phase. This was also previously stated by Harvey, Sheeley, and Harvey (1982), who studied 33 CHs at 63 occasions and found that CHs near solar minima have magnetic field strengths ranging from 1 to 7 G, while those detected near solar maxima, range from 3 to 36 G. In comparison, our values for the maximum (\(3.4 \pm 2.1\) G) are significantly lower but for the minimum we have good agreement. The difference may be due to the use of different instrumentation, as it has been shown that different instruments measure significantly different magnetic fields (e.g., Liu et al., 2012). Statistically, we find the mean absolute value of the mean magnetic field strength for all CHs under study to be \(2.9 \pm 1.9\) G distributed from 0.4 to 14.0 G. Results from other studies are found inside this range (\(\approx 3\) G: Bohlin and Sheeley 1978, 1 – 5 G: Obridko and Shelting 1989, Belenko 2001). Considering the property of the CHs open magnetic field configuration, we find that the flux balance, the ratio of the signed to the unsigned magnetic flux which is a measure of the percentage of open flux, is distributed from 10 to \(87\%\) which overlaps with the range found by Hofmeister et al. (2017) of 6 to \(81\%\). A likely reason for the wide spread in the abundance of percentual open flux is that CHs of all evolutionary states are included in the dataset. The open magnetic field of the majority of CHs has been shown to possibly be due to the mean magnetic field strength which varies with the evolution of a CH (Heinemann et al., 2018b). This evolutionary process seems to be governed especially by interchange reconnection (Wang and Sheeley, 2004; Madjarska, Doyle, and van Driel-Gesztelyi, 2004; Krista, Gallagher, and Bloomfield, 2011; Ma et al., 2014; Kong et al., 2018) and flux emergence (Cranmer, 2009, and the references therein).

Hofmeister et al. (2017), Hofmeister et al. (2019), and Heinemann et al. (2018b) found that the abundance of the strong unipolar magnetic elements (flux tubes) is what defines the magnetic configuration of a CH. Notwithstanding that they cover only a small fraction of the CH area they contribute a major part of the total signed magnetic flux of the CH. Hofmeister et al. (2017) found that strong FTs cover \(1\%\) of the CHs area and contribute \(38\%\) to the signed flux. These values are slightly lower than the ones we found in our study with \(r_{A}=2.6\pm 1.8\%\) and \(r_{\Phi }=58\pm 13\%\). This might be due to the differences in the extraction and definition of the strong FTs. Our results are in better agreement with the study of Heinemann et al. (2018b) who found values of \(r_{A}\leq 5 \%\) and \(r_{\Phi }=48\) to \(71\%\). The recent study by Hofmeister et al. (2019) found that these strong FTs have lifetimes larger that those of supergranular cells essentially making them the fundamental building blocks of CHs, and they are not governed by the photospheric network motion.

5 Summary

In this comprehensive study we investigated the intensity gradient across the CH boundary to develop a new CH extraction method using an intensity-based threshold method and to estimate the uncertainties of the extracted CH boundaries. We successfully implemented the flexible and fast method into an easy-to-use GUI and applied it to the SDO-era to extract CHs. We created a CH catalogue of considerable size, covering the time period from May 2010 to February 2019, which includes 707 non-polar CHs that were closely analyzed. Our major findings can be summarized as follows.

  1. i)

    By incorporating the principle of the maximum gradient into the intensity-based threshold method we were able to:

    1. a)

      create, for the first time, CH boundaries with reasonable estimates for the uncertainties;

    2. b)

      achieve a high consistency between boundaries extracted by different users;

    3. c)

      develop an objective as possible CH extraction method, without disregarding the advantages of manual user input.

  2. ii)

    Changes in the threshold due to small scale variations in the vicinity of CHs as well as global intensity variations as a consequence of the solar cycle show the importance of the individual extraction of a CH.

  3. iii)

    By implementing the code into an SSW-IDL GUI we provide an user friendly environment for more objectively extracting CHs for scientific analysis, including reasonable uncertainties.

  4. iv)

    Using CATCH we created an extensive catalogue for the CHs observed by SDO between its operational start in 2010 and February 2019. Over this era, we extracted and analyzed 707 non-polar CHs and found them to exist in sizes ranging from \(1.6 \times 10^{9}\) to \(1.8 \times 10^{11}\mbox{ km}^{2}\). Small CHs (\(<2 \times 10^{10}\mbox{ km}^{2}\)) were found to be most abundant (\(56\%\)). The strength of the photospheric magnetic field underlying the CHs is distributed around \(2.9\pm 1.9\) G which is in agreement with most results found in the literature and shows that CHs are mostly covered by low magnetic field.

  5. v)

    We confirm previous studies (Hofmeister et al., 2017; Heinemann et al., 2018b; Hofmeister et al., 2019) that the magnetic configuration of CHs is highly dependent on the abundance and field strength of the small unipolar magnetic elements (flux tubes), which only cover a small fraction of the CH area.

We plan to continue to develop CATCH. Planned major upgrades are the (partial) implementation in Python and the option to use synoptic magnetograms. Also compatibility with Parker Solar Probe and Solar Orbiter are planned. New functionalities and upgrades will be published on GitHub and future user manual versions.