Statistical Analysis and Catalog of Non-polar Coronal Holes Covering the SDO-era using CATCH

Coronal holes are usually defined as dark structures as seen in the extreme ultraviolet and X-ray spectrum which are generally associated with open magnetic field. Deriving reliably the coronal hole boundary is of high interest, as its area, underlying magnetic field, and other properties give important hints towards high speed solar wind acceleration processes and on compression regions arriving at Earth. In this study we present a new threshold based extraction method that incorporates the intensity gradient along the coronal hole boundary, which is implemented as a user-friendly SSWIDL GUI. The Collection of Analysis Tools for Coronal Holes (CATCH) enables the user to download data, perform guided coronal hole extraction and analyze the underlying photospheric magnetic field. We use CATCH to analyze non-polar coronal holes during the SDO-era, based on 193 {\AA} filtergrams taken by the Atmospheric Imaging Assembly (AIA) and magnetograms taken by the Heliospheric and Magnetic Imager (HMI), both on board the Solar Dynamics Observatory (SDO). Between 2010 and 2019 we investigate 707 coronal holes that are located close to the central meridian. We find coronal holes distributed across latitudes of about +/- 60 degree for which we derive sizes between 1.6 x 10 9 and 1.8 x 10 11 km^2 . The absolute value of the mean signed magnetic field strength tends towards an average of 2.9 +/- 1.9 G. As far as the abundance and size of coronal holes is concerned, we find no distinct trend towards the northern or southern hemisphere. We find that variations in local and global conditions may significantly change the threshold needed for reliable coronal hole extraction and thus, we can highlight the importance of individually assessing and extracting coronal holes.


Statistical Analysis and Catalog of Non-polar Coronal Holes Covering the SDO-era using CATCH
Stephan G. Heinemann 1 · Manuela Temmer 1 · Niko Heinemann 1 · Karin Dissauer 1 · Evangelia Samara 2,3 · Veronika Jerčić 1,4 · Stefan J. Hofmeister 1 · Astrid M. Veronig 1,5 c Springer •••• Abstract Coronal holes are usually defined as dark structures as seen in the extreme ultraviolet and X-ray spectrum which are generally associated with open magnetic field. Deriving reliably the coronal hole boundary is of high interest, as its area, underlying magnetic field, and other properties give important hints towards high speed solar wind acceleration processes and on compression regions arriving at Earth. In this study we present a new threshold based extraction method that incorporates the intensity gradient along the coronal hole boundary, which is implemented as a user-friendly SSWIDL GUI. The Collection of Analysis Tools for Coronal Holes (CATCH) enables the user to download data, perform guided coronal hole extraction and analyze the underlying photospheric magnetic field. We use CATCH to analyze non-polar coronal holes during the SDO-era, based on 193Å filtergrams taken by the Atmospheric Imaging Assembly (AIA) and magnetograms taken by the Heliospheric and Magnetic Imager (HMI), both on board the Solar Dynamics Observatory (SDO). Between 2010 and 2019 we investigate 707 coronal holes that are located close to the central meridian. We find coronal holes distributed across latitudes of about ±60 o for which we derive sizes between 1.6 × 10 9 and 1.8 × 10 11 km 2 . The absolute value of the mean signed magnetic field strength tends towards an average of 2.9 ± 1.9 G. As far as

Introduction
Coronal holes (CHs) are large-scale features in the solar corona often characterized by reduced emission in X-ray and extreme ultraviolet (EUV) which are associated with open magnetic field lines of a dominant polarity. Coronal plasma is accelerated along the open field lines causing a high velocity outflow of particles, often referred to as fast solar wind or high speed solar wind stream (HSS). The plasma depletion causes a reduction of density and temperature in these regions in comparison to the surrounding solar corona. Thus, CHs can be observed as dark structures in the EUV and X-ray emission (see e.g., Schwenn, 2006;Cranmer, 2002Cranmer, , 2009.
To investigate the morphology and intensity of CHs as observed in EUV, as well as their underlying photospheric magnetic field, the identification and extraction of CH boundaries are key. There exist multiple approaches to this topic with one of the most popular using a single wavelength, intensity-based threshold approach on EUV observations. Due to the high contrast and the optimal filter sensitivity, the coronal emission line of eleven times ionized iron (Fe xii: 193/195Å) is often used to extract CHs (e.g., Krista and Gallagher, In the first part of this study we present a new method for extracting CH boundaries in EUV images by using an intensity threshold which is modulated by the intensity gradient of the CH boundary. The method is based on the works of Rotter et al. (2012), Rotter et al. (2015), and Krista and Gallagher (2009) and is incorporated into an easy-to-use GUI application developed in SSW-IDL. The Collection of Analysis Tools for Coronal Holes (CATCH) application enables users to easily extract and analyze CHs in a supervised semi-automated fashion. CATCH uses a modulated intensity threshold method to extract CH boundaries from EUV images and analyzes the associated properties. In addition, it offers the possibility to investigate the underlying magnetic field. In the second part, we use CATCH to investigate 707 CHs covering the complete time range of the operational lifetime of the Solar Dynamics Observatory (SDO; Pesnell, Thompson, and Chamberlin, 2012) so far, starting in May 2010 until February 2019. We derive statistical CH properties of the area, intensity, and the underlying magnetic field including the magnetic fine structure over nearly the full Solar Cycle 24. Furthermore, we present how the parameters for an optimal CH extraction vary during the Solar Cycle. The CH dataset is available as an online catalogue under the CDS database using the Vizier catalogue service (Ochsenbein, Bauer, and Marcout, 2000).

The "Collection of Analysis Tools for Coronal Holes"
The Collection of Analysis Tools for Coronal Holes (CATCH) was created in order to collect and structure CH identification, extraction and analysis in a handy and fast way without the disadvantages of automatic algorithms as described in the Sections 2.1 and 2.2. It enables the user to download and process EUV filtergrams (193/195Å) and line-of-sight (Los) magnetograms. CATCH is able to handle data from different spacecraft missions covering the interval from 1996 until now. These are SDO, the Solar Terrestrial Relations Observatory (STEREO; Kaiser et al., 2008) and the Solar and Heliospheric Observatory (SOHO; Domingo, Fleck, and Poland, 1995). Data from the Atmospheric Imaging Assembly (Lemen et al., 2012, AIA;193Å), the Extreme ultraviolet Imaging Telescope (Delaboudinière et al., 1995, EIT;195Å) and the Extreme UltraViolet Imager (Howard et al., 2008, EUVI;195Å) as well as from the Heliospheric and Magnetic Imager (HMI: Schou et al., 2012;Couvidat et al., 2016) andthe Michelson Doppler Imager (MDI: Scherrer et al., 1995) can be processed. Additionally, user supplied full-disk images can also be analyzed.
The user can perform CH boundary detection, extraction and analysis using a manually adjustable intensity threshold. The threshold range, in which reasonable CH boundaries can be extracted, can be derived from the intensity histogram of the solar disk. After specifying a threshold, it is applied to the full solar disk and the user may select the structure of interest to calculate its parameters and to get an estimate of the boundary stability and uncertainty. Then by varying the threshold to minimize the boundary uncertainty ( A ), the user can find an optimized CH boundary in an easy and fast way, even without previous experience in CH extraction. For deriving the properties of a CH, CATCH analyzes five boundaries in an interval of 1 DN (data number) centered around the selected threshold and calculates the mean values. The maximum deviation of the derived values from the calculated mean is the uncertainty. After extracting a satisfactory boundary from EUV filtergrams, CATCH can analyze the properties of the CH. The boundary may then be used on LoS magnetograms (if available) to analyze the underlying photospheric magnetic field of the CH and its fine structure represented by FTs. Figure 3 shows an example of how to find the optimal threshold by considering the uncertainty of the extracted CH boundary. The red contour represents the CH boundary (of the chosen threshold), the blue shaded areas are the uncertainties of the boundary. The best boundary for this CH can be identified as shown in panel (d), where the blue shaded area is smallest in comparison to the area enclosed in the CH boundary.
CATCH calculates a variety of properties of the extracted CH, which include morphological properties, the intensity, boundary stability as well as properties of the underlying photospheric magnetic field and its fine structure (for the full list of calculated parameters see Tab For proper image processing and analysis the SSW (SolarSoftWare) package under IDL (Interactive Data Language) is required, therefore the tool is written in SSW-IDL and the code, including an user-manual, is available on the authors GitHub page (https://github.com/sgheinemann/CATCH) or by contacting the author directly via E-mail 1 . Figure 4 shows the GUI structure of CATCH, displaying the main menu, the data download widget as well as the CH extraction and the magnetic field analysis widget. A more detailed description of CATCH and its functionalities can be found in the user-manual. a Note, that all magnetic field parameters are calculated using Line-of-Sight magnetograms, which have been corrected for the assumption of radial magnetic field: The deprojection was done using a pixel wise correction with A i,corr = A i cos(α i ) and α being the angular distance from the disk center.

Intensity Threshold
The basic principle under which CH extraction operates is an intensity-based threshold technique applied to EUV filtergrams of sufficient contrast, which was developed by Rotter et al. (2012). To find an optimal threshold Krista and Gallagher (2009) derived that an intensity distribution of the solar disk (or a subfield) with a CH present differs significantly from a distribution where CHs are absent. Figure 1 shows as an example the intensity distribution of the solar disk on May 29, 2013. Hereby, the first maximum, seen at lower intensities, represents one or multiple dark structures on the solar disk. It was proposed that an optimal threshold for a CH boundary lies somewhere in the following minimum. However, note that this characteristic shape is often not well established, especially if no large and well defined CHs are present on the solar disk. Also, it has been found that there is a strong Solar Cycle dependence of the solar disk EUV intensity distribution, which is additionally amended by the current conditions on the Sun (e.g., increased abundance of dark structures or bright active regions). As such, neither a fixed threshold nor a median-intensity dependent threshold, which aims to mitigate intensity variation, perform continuously well. Frequent manual adjustments are needed for optimized results. Thus, the aim is to use an adjustable threshold depending on the current solar conditions, both locally and globally.

Intensity Gradient, Uncertainty Estimation and Calculation of CH Properties
The common intensity-based methods have the drawback that the threshold range in which the boundary is considered optimal, is large (see Figure 1, shaded area). To narrow down the range of reasonable thresholds, we propose an intensity gradient method to estimate the boundary stability and give relevant errors to calculated properties. Recent studies, investigating CHs and their boundaries, have revealed a steep intensity gradient at the CH boundary (Hofmeister et al., 2017). This is due to a strong decrease of the plasma density of quiet Sun temperatures around 1.6 MK (Hahn, Landi, and Savin, 2011). Figure 2 shows a representative intensity profile perpendicular to the CH boundary layer, from inside of the CH (x = 0) to outside (x = 1) in arbitrary scale. The y-axis shows the intensity that is scaled to the maximum in this interval which represents the quiet Sun intensity. We see that within a small layer the intensity drops by at least 40% from the quiet Sun level. This small layer represents the range where CH boundaries are usually extracted. Assuming that the CH boundary is best represented where the intensity profile is changing most strongly, we define the optimum boundary to be placed at the steepest intensity gradient (i.e., gradient has a maximum). In an ideal case, the implication of this definition is that the boundary is approximately constant for small threshold variations around the maximum intensity gradient threshold. This physical 1D principle of the maximum intensity gradient perpendicular to the boundary can be extended to 2D to consider the entire boundary instead of one localized cross-section. This can be done by calculating the change of the CH area for a given intensity threshold by varying the threshold slightly. Using the assumption of a similar intensity gradient along the full boundary, a minimum in the change of the area indicates that, on average, the boundary is located at the maximum gradient, i.e. the optimal threshold. With this definition of the boundary we aim to minimize the variations in different parameters (first of all the area) to properly estimate the boundary. Practically, this is done by calculating the parameters not only for the boundary defined by the selected threshold but also for boundaries of slightly larger and smaller thresholds. From this set of boundaries, a mean value (P ) and its uncertainty ( P ) is calculated. The uncertainty corresponds to the maximum deviation between the determined values and the mean value.
A reasonable CH boundary can be determined by finding the threshold that minimizes A (uncertainty in the CH area) and the CH properties are then given as: (1)

Analysis of the Underlying Photospheric Magnetic Field
To extract and investigate CHs, it is not sufficient to only use the information extracted from EUV filtergrams as it lacks information about the underlying magnetic structure. The magnetic field configuration is what distinguishes CHs from other dark structures (e.g., filament channels, coronal dimmings) in the solar corona. Studies suggest that it may be possible to differentiate those structures purely from intensity filtergrams (Reiss et al., 2014) but a clear distinction cannot always be made. A much more precise approach is the definition based on the underlying magnetic field (Reiss et al., 2015;Delouille et al., 2018). CHs are defined by their open magnetic field configuration which is reflected in the ratio of the total signed to the unsigned magnetic flux inside the CH and in the skewness of the magnetic field distribution. Filaments and filament channels on the other hand ideally show a symmetric distribution between pixels of positive and negative magnetic flux (closed magnetic structures), as they are located along polarity inversion lines. Thus, analyzing the magnetic field underlying an extracted dark structure reveals its magnetic configuration and enables a clearer classification as CH or filament. The calculation of the photospheric magnetic field underlying a CH is often performed by a simple projection of the EUV extracted boundary onto the photospheric magnetogram (line-of-sight or radial). However, it is important to stress that there are several uncertainties in the extraction. First of all, the height difference between coronal imaging in EUV (EUV 193Å: 1.01−1.05 R ) and photospheric magnetic field (1.00 R ). Second, the unknown expansion of the magnetic field over the EUV height. Simple projections will have an increased effect on the CH boundary the further it is located away from the center of the solar disk. Another source of uncertainty arises from the noise level, resolution and smoothing of the magnetogram. This can cause non-trivial effects on parameters like the unsigned magnetic flux, flux balance and skewness of the magnetic field. This complicates a comparison of magnetic field properties derived from differently prepared magnetograms. When interpreting such parameters a relative comparison should be preferred rather than relying on absolute values.
Hofmeister et al. (2019) showed that the photospheric magnetic field underlying CHs can be divided into 3 categories: ≈ 22 ± 4% of the signed magnetic flux is contributed by a slightly unbalanced background field. ≈ 5 ± 0.1% come from small scale unipolar magnetic elements (flux tubes, FTs) nearly symmetrically distributed over both polarities and which are associated with the super-, meso-, and granular motion of the photosphere. The major contribution, on average ≈ 69 ± 8%, comes from strong and long-lived FTs which have almost exclusively the dominant polarity of the CH. To map these properties, we calculate the contribution of FTs to various CH parameters. We define two FT categories, strong and weak (with the category weak also covering medium FTs; for more details see Heinemann et al. 2018b). FTs are extracted as structures of pixels above a magnetic field strength of 20 G and the mean magnetic field strength of each structure determines the category. If the mean magnetic field strength of one FT is between an absolute value of 20 to 50 G it is categorized as weak, if exceeding 50 G then it is considered strong.

Data and Data Processing
For the presented statistical study, we did not exhaust all the possibilities of CATCH but constrained the used dataset to one spacecraft. SDO was chosen over STEREO because of the availability of magnetic field maps, and over SOHO because of the better resolution and contrast. The dataset ranges from May 2010 until February 2019. The EUV 193Å filtergrams observed by AIA/SDO as well LoS magnetograms from HMI/SDO were acquired in a 1 day cadence using the Joint Science Operations Center Servers via the CATCH download application. For the magnetograms the 720s LoS data product was preferred over the 45s due to the lower photon noise of ≈ 3 G measured near the disk center and a better signal-to-noise ratio (Couvidat et al., 2016).
The EUV filtergrams and magnetograms were prepared to level 1.5 using standard SSW-IDL routines and the EUV filtergrams were down-scaled from a pixel scale of 4096 × 4096 to 1024 × 1024 to significantly enhance the processing speed. Before the extraction, the full-disk filtergrams were corrected for limbbrightening using the anulus limb brightening correction (Verbeeck et al., 2014) which is available in CATCH. The boundaries were smoothed using circular (2-pixel radius) morphological operators (open and close). To avoid the loss of information on the magnetic fine structure, the magnetograms were not downscaled. The EUV extracted boundaries were re-scaled to fit the magnetograms resolution.
Note that the effects on boundary detection as well as on the calculation of the parameters in the EUV due to down-scaling are negligible. E.g., we tested for an isolated CH, located close to the disk center on May 29, 2013 how the area of the extracted CH changes for a fixed threshold (43% of the median solar disk intensity). By varying only the resolutions between 4096 × 4096 and 1024 × 1024, (without smoothing) we find a deviation of the extracted CH of less than 0.5%. Other parameter behave similar. As such, the uncertainties from using different filtergram resolutions to extract CH boundaries is much lower than uncertainties in the extraction itself.
From the daily EUV images, dark structures located close to the central meridian were extracted (Center of Mass, CoM located ±10 o ). The extracted structures were limited to the central meridian to reduce longitudinal projection effects due to the spherical nature of the Sun. Polar CHs as well as polar connected CHs were excluded for the same reason. Each structure was extracted only once for each solar disk passage to avoid statistical biases because of similar datapoints. The magnetic properties of each dark structure were investigated and non-CH structures were identified (defined as structures with a flux balance below 10% or a magnetic field skewness below 1) and discarded from further analysis. This approach yielded 707 CHs over a wide range of sizes and latitudes spanning a timerange of more than 8 years.

Results
We analyzed 707 CHs near their central meridian passage and categorized them by their boundary stability. All the parameters presented here, are calculated with CATCH. Our findings are as follows:

Assessment of the Stability of the Extracted Boundaries
First, we assessed the stability of the extracted CH boundaries by analyzing A for the optimal threshold for all 707 CHs. Figure 5a shows the CH area (A CH ) against its uncertainty ( A ). We find a dependence on the area which seems to have two causes: (1) the larger impact of stray light for smaller CHs which could partly be compensated by performing a PSF deconvolution before the CH extraction and (2) the non-zero extent of the boundary layer whose area is growing linearly in contrast to the total CH area (which grows according to a power law). This causes larger percentage variation for smaller CH areas. To correct for this dependence we introduce the category factor (ζ) which can be given as: with f fit (A) being the fit shown in Figure 5a as the red line. It is given by: with A CH in units of 10 10 km 2 . The resulting ζ-factor as function of CH area is shown in Figure 5b. From this we define three categories of boundary stability: We find that 60.0% of the CHs under study have a high boundary stability, 34.2% a medium and only 5.8% are of low boundary stability.

Thresholds
Second, we investigated how the optimal threshold to extract CHs is distributed and varies over the course of the observed time period from 2010 to 2019. This period nearly covers the whole Solar Cycle 24. Figure 6 shows the threshold over time (a) in absolute counts (DN) and (b) in percent of the median intensity of the solar disk. The black line in panel (d) shows the smoothed daily sunspot number by SIDC/SILSO 2 , which acts as a proxy of the solar activity. We find a clear Solar Cycle dependence in the optimal threshold (between 25 − 55 DN) which cannot be correctly mitigated by modulation with the median solar disk intensity (of the full solar disk). It seems that the correction is too strong, especially during solar minimum. Additionally, because of the individual configuration of CHs, the optimal threshold may vary by up to ≈ 20 DN for any given time.
The distribution of thresholds (in DN) is shown in Figure 7a (cyan) with a mean of 43.9 DN and a standard deviation of 12.1 DN. The distribution shifts from 53.5 ± 8.6 DN during solar maximum (red, 2012 − 2014) to 29.7 ± 4.6 DN during decline and minimum (blue, 2017-2019). When considering the threshold in percent of the median solar disk intensity (Figure 7b), the mean threshold is 40.4 ± 6.3% with a variation between the solar maximum (red, 2012 − 2014) with 37.3 ± 5.0% and the decline and minimum (blue, 2017-2019) with 46.2 ± 5.5%. We find the threshold to be independent of the CH size. We believe that the large change of the optimal threshold (in DN) is due to the change in intensity due to the Solar Cycle evolution (e.g., number of active regions, higher quiet Sun level, ...). Figure 8 shows six CHs extracted with a different optimal threshold, varying from 25 to 65 DN.

Area, Intensity, and Position
After investigating the extraction mechanism in terms of intensity threshold, we analyze how CH properties are distributed in our data-set. Figure 9 gives an overview of the main CH properties, i.e., the distribution of the areas, latitudes, and intensities of all CHs under study. Figure 9a shows the distribution of the deprojected areas. We find CH areas ranging from 1.6 × 10 9 km 2 to 1.8 × 10 11 km 2 , with an average of (2.69 ± 2.73) × 10 10 km 2 . CHs with an area below 2 × 10 10 km 2 account for 56% of all CHs, whereas only 5% of CHs exceed an area of 8 × 10 10 km 2 . The CoM of CHs under study are distributed over latitudes ranging from −63 o to +63 o (Figure 9b). 39% of all CHs, which are located between an absolute value of 40 o and 20 o , can be considered medium-latitude CHs and 50% are considered low-latitude CHs, located below 20 o . We find the CHs to be nearly balanced between the hemispheres (South: 48% CHs; North: 52% CHs) without a clear relation to the solar activity (see Figure 6c).
We calculate the median and mean intensity in the 193Å wavelength for each CH of the dataset. The mean of the median intensities is calculated to be 29.0 ± 8.5 DN ( Figure 9c) and the mean of the mean intensities is 29.5 ± 8.4 DN ( Figure 9d). When only considering the 50% and 25% pixel with the lowest intensities we find the mean intensity to be 23.9 ± 7.4 DN and 21.2 ± 6.9 DN respectively.
We investigated the intensity profile of the cross-section of the CHs. To that end we cut the CHs longitudinally through their CoM and superpose the intensity profiles. Figure 10a shows the superposed mean profile (black line) with the 1σ uncertainties represented by the shaded area and the second panel (b) shows the derivative of the mean profile. Note, that the intensity profiles were scaled so that the CH boundaries correspond to x = ±1. We find that when using CATCH to extract CHs the boundary is consequently extracted at the highest gradient in the intensity, which was the initial assumption. With this we can highlight the CH extraction according to a physical principle in contrast to an arbitrarily chosen (or empirically found) value.

Properties of the Underlying Photospheric Magnetic Field
The analysis of the magnetic field properties underlying the CHs yielded a near symmetric distribution of positive and negative polarity CHs (Figure 11a). The mean of the absolute values of the signed mean magnetic field strength (|B s |) is 2.9 ± 1.9 G (Figure 12c). There seems to be no correlation between |B s | and the CH area (Figure 12a; see also Figure 14). There is however a slight difference in the distribution of |B s | between the solar maximum against the decaying and minimum phase. In the maximum (2012 − 2014) the absolute value of the mean magnetic field strength exhibits a slightly higher average and a wider spread with 3.4 ± 2.1 G than in the decaying phase and solar minimum (2017-2019) with 1.6 ± 0.8 G (Figure 12b,c). The increased magnetic field strengths within CHs during solar maximum may be the result of enhanced magnetic activity during the reversal of the solar magnetic field which causes more active regions to appear and consequently decay (Karachik, Pevtsov, and Abramenko, 2010). In Figure 11b the unsigned mean magnetic field strength is shown. We find the mean to be 7.3±1.9 G and that 90% of the CHs have a value below 10 G. We note, that the unsigned magnetic flux is strongly dependent on the magnetogram resolution and smoothing, as it is dominated by the sum of the noise of the magnetic field pixel distribution. Therefore, the unsigned magnetic flux should be considered in relative comparison (e.g., between two CHs) rather than in absolute values. This is also true for the skewness and the flux balance. E.g., for the isolated CH located near the disk center on May 29, 2013 the skewness changes from 9.2 at a resolution of 4096 × 4096 to 7.4 at a resolution of 1024 × 1024, and the flux balance changes from 57.2% to 66.7%, respectively.
The signed magnetic fluxes of the CHs seem to be symmetrically distributed between both polarities. The mean of the absolute value is given at (7.2±13.5)× 10 20 Mx with a maximum value of 6.9 × 10 21 Mx. The unsigned magnetic fluxes range from 7.2 × 10 19 to 2.0 × 10 22 Mx with a mean of (2.0 ± 3.7) × 10 21 Mx (Figure 11c,d). The flux balance, the ratio of the signed magnetic flux to the unsigned magnetic flux and with that hinting towards open magnetic flux, shows that the CHs are distributed from 10% to 87% with a mean of 36, 3 ± 16.3% (Figure 11e). The 47% of CHs that show positive polarity have a mean flux balance of 37.4 ± 16.7% whereas the 53% of CHs that show negative polarity have a mean of −(35.6 ± 15.7)%. The shift in the magnetic field distribution that characterizes CHs is shown in Figure 11f. The mean of the absolute value of the skewness is 8.2 ± 2.1, clearly showing the asymmetry in the magnetic field caused by the abundance of open magnetic field lines. There is no clear difference between polarities or boundary categories.

Flux Tubes
Besides the magnetic parameters for the global structure of a CH, CATCH can analyze the fine structure of the magnetic field in form of FTs or magnetic elements. We analyzed the contribution of the small unipolar FTs categorized as weak (20 G < |B s,FT | < 50 G) and strong (|B s,FT | > 50 G) to the CH parameters. Figure 13 shows the distribution of FT number (a,b), area ratio (c,d) and flux ratio (e,f) for weak and strong FTs respectively. There is no significant difference for the three categories of boundary stability. The number of extracted FTs ranges from 15 to 2670 for the weak and from 1 to 223 for the strong FTs. This gives an average of 135 ± 30 FTs per 10 10 km 2 for weak and 15 ± 6 FTs per 10 10 km 2 for strong FTs. The number of strong FTs per area scales with the signed mean magnetic field strength of the CH (Pearson Correlation Coefficient (cc p ): 0.74 with a 95% confidence interval (CI) of [0.70, 0.78]), whereas the weak ones do not (cc p = −0.12 with a 95% CI of [−0.18, −0.06]).
When analyzing the contribution of the weak and the strong FTs to the area and signed magnetic flux of the CH we find that the strong ones are dominating. For most CHs (90%) the contribution of the strong FTs to the signed magnetic flux is between 40% and 80% with a mean of 58.1 ± 13.1%, although they only cover between 0.5% and 6% of the CHs area (on average 2.6±1.8%). We find that the coverage of the strong FTs is strongly correlated with the mean magnetic field strength of the CH (cc p = 0.98 with a 95% CI of [0.97, 0.98]). In contrast, the weak FTs only contribute 16.3 ± 8.8% of the signed magnetic flux and cover a rather constant CH area of 1.5 − 4% (84% of CHs) without a correlation to the mean magnetic field strength of the CH (cc p = −0.02 with a 95% CI of [−0.07, 0.03]).
We note that in the Appendix, Figure 14 the pairwise Spearman correlation coefficients of all parameters calculated in the statistical study using CATCH can be viewed. We note, that not all correlations imply a causal relationship, but might be correlated by definition.

Discussion
Using the intensity profile perpendicular to the boundary layer of CHs we were able to improve the intensity-based threshold method by Rotter et al. (2012), based on the concept initially proposed by Krista and Gallagher (2009). By adding an estimation of the boundary stability and uncertainty, local as well as global influences on the CH intensity can be described and compensated. By investigating the performance of the newly adjusted threshold method we highlight the advantages of such a supervised method.

CATCH
Reliable extraction of CHs from EUV observations is an important step towards understanding their configuration, a necessary aspect in solar-and space weather research and space weather applications. Without a precise definition of the CH boundaries, which is then applied to all CHs under study, an analysis is often biased by differences in the extracted boundary and by local conditions which lead to significant problems in the comparison of different studies. Approaches that aim to optimize a threshold for full-disk images (Rotter et al., 2012;Hofmeister et al., 2017;Garton, Gallagher, and Murray, 2018) or synoptic maps (Hamada et al., 2018) can adjust for global changes in the intensity distribution, but do not take into account the local variations. In another study, a dual-thresholdbased approach (ezseg: Caplan, Downs, and Linker (2016)) was developed with the aim to consider local variations, if the threshold pair is properly tuned.
Our analysis yielded that the optimal threshold (as defined in Section 2) between CHs in one single filtergram may vary significantly due to the abundance and proximity of active regions, quiet Sun areas and bright loops. Wendeln and Landi (2018) found from differential emission measure analysis (DEM), that a significant contribution within CHs comes from stray light of nearby active regions, high overlying loops and the instrumental point spread function (PSF). It is reasonable to suggest that these effects also influence the CH extraction in one (or multiple) wavelengths. However, by individually assessing the boundary of each CH, effects coming from local conditions can be mitigated. We also tested the influence of the PSF by deconvoluting the images before extraction (this option is available in CATCH using SSW-routines) and found clear enhancement in the extraction process but dismissed it for the statistical study because of the greatly increased processing time (up to a factor 100). There are faster options to perform PSF deconvolution when not using IDL (e.g., Prato et al., 2012), which have not been explored yet as CATCH is entirely written in SSW-IDL.
The intensity profile perpendicular to the CH boundary is very dependent on the coronal configuration outside the CH. Active regions have significantly higher intensities than the quiet Sun, but also loops associated with activity near the CH boundary show increased intensities. Enhancements near and at the boundaries may be the result of the CH evolution through the process of interchange reconnection (Madjarska and Wiegelmann, 2009;Edmondson et al., 2010;Yang et al., 2011). It is a known drawback of this method that by considering the average gradient along the full CH boundary, small scale conditions are neglected. The method approximates that the intensity gradient across the boundary of a given CH is constant along the boundary, which we know is not always true. This leads to uncertainties, which we approximate as presented in Section 2.1. To consider such small scale variations, a much more precise definition of the boundary needs to be established which requires a new approach for detecting CH boundaries. Automated threshold techniques are fast, but may extract several CHs in close vicinity which may or may not appear with merged boundaries depending on slight variations in the threshold. However, the threshold-based method described here delivers consistent results when manually supervised. This is due to the constrains set by the boundary gradient approach.
We therefore pursued to further develop this approach, having in mind the advantages of being computationally very inexpensive, fast and flexible. From the statistical results we derive, we find that our method consistently performs well (by standards of visual inspection) over the changing conditions of a full Solar Cycle and also mitigates local variations. Comparing to the method using a fixed intensity threshold of 35% of the median solar disk intensity (Vršnak, Temmer, and Veronig, 2007;Rotter et al., 2012;Reiss et al., 2016;Hofmeister et al., 2017;Heinemann et al., 2018a), we find significant deviations for the boundary we would consider as optimal. This is expected as automated methods are often judged by how close they come to manual or manually-tuned methods.
We also find that the 35% of the median solar disk intensity is a good estimate for the mean threshold during solar maximum (Figure 6b red line; Figure 7). In our study the mean threshold for the time period of the solar maximum (2012 − 2014) comes to 37.3 ± 5.0% of the median solar disk intensity. This is very well shown in the Solar Cycle dependence of the threshold (Figure 6). The threshold may vary even up to 20 DN for a given filtergram but is additionally modulated by a global trend.
By considering all these factors we can highlight the importance of individually extracting CHs without neglecting the local variations on CH size scales. Although manual input is needed, the extraction method implemented in CATCH aims to be as objective as possible without specifying any underlying extraction conditions except for the approach of the boundary gradient.

Distribution over CHs of Solar Cycle 24
By analyzing the CHs of the SDO-era we not only gain a large sample of different CHs but also cover nearly one full Solar Cycle. As such, the sample includes CHs from the rising phase (≈ 2010/2011), the maximum phase (≈ 2012 − 2014), the decaying and minimum phase (≈ 2015 − 2019) of this cycle.
The CH parameters derived from the dataset are in good agreement with the study of Hofmeister et al. (2017) who studied 288 low-latitude CHs near the maximum of Solar Cycle 24 and are as such a subset of this study. They found that the CH sizes are distributed around a median of 2.39 × 10 10 km 2 which is very close to the value derived in this study with a mean area of (2.69 ± 2.73) × 10 10 km 2 . Note here that the mean is strongly biased by the large amount of small CHs, of which a large portion is present in solar maximum. The spread in the CH sizes may also be influenced by the few large CHs (5% of CHs with an area exceeding 8 × 10 10 km 2 ). We excluded all polar and polar-connected CHs (as manually defined by the threshold tuning) from this statistical analysis which removes some of the largest CHs observed in this period from the study. This might be the reason why the extracted CH areas do not show the cycle dependence found by the Solar Cycle study by Lowder, Qiu, and Leamon (2017).
The mean signed magnetic field strength in our study shows a wider spread and higher average during the maximum phase than during the decaying and minimum phase. This was also previously stated by Harvey, Sheeley, and Harvey (1982), who studied 33 CHs at 63 occasions and found that CHs near solar minima have magnetic field strengths ranging from 1 to 7 G, while those detected near solar maxima, range from 3 to 36 G. In comparison, our values for the maximum (3.4±2.1 G) are significantly lower but for the minimum we are in good agreement. The difference may be due to the use of different instrumentation, as it has been shown that different instruments measure significantly different magnetic fields (e.g., Liu et al., 2012). Statistically, we find the mean absolute value of the mean magnetic field strength for all CHs under study to be 2.9 ± 1.9 G distributed from 0.4 to 14.0 G. Results from other studies are found inside this range (≈ 3 G: Bohlin andSheeley 1978, 1 − 5 G: Obridko andShelting 1989;Belenko 2001). Considering the property of the CHs open magnetic field configuration, we find that the flux balance, the ratio of the signed to the unsigned magnetic flux which is a measure of the percentage of open flux, is distributed from 10 to 87% which overlaps with the range found by Hofmeister et al. (2017) of 6 to 81%. A likely reason for the wide spread in the abundance of percentual open flux is that CHs of all evolutionary states are included in the dataset. The open magnetic field of the majority of CHs has been shown to possibly be due to the mean magnetic field strength which varies with the evolution of a CH (Heinemann et al., 2018b). This evolutionary process seems to be governed especially by interchange reconnection (Wang and Sheeley, 2004;Madjarska, Doyle, and van Driel-Gesztelyi, 2004;Krista, Gallagher, and Bloomfield, 2011;Ma et al., 2014;Kong et al., 2018) and flux emergence (Cranmer, 2009) and references therein).
Hofmeister et al.  (2018b) found that the abundance of the strong unipolar magnetic elements (flux tubes) is what defines the magnetic configuration of a CH. Notwithstanding that they cover only a small fraction of the CH area they contribute a major part of the total signed magnetic flux of the CH. Hofmeister et al. (2017) found that strong FTs cover 1% of the CHs area and contribute 38% to the signed flux. These values are slightly lower than the ones we found in our study with r A = 2.6 ± 1.8% and r Φ = 58 ± 13%. This might be due to the differences in the extraction and definition of the strong FTs. Our results are in better agreement with the study of Heinemann et al. (2018b) who found values of r A 5% and r Φ = 48 to 71%. The recent study by Hofmeister et al. (2019) found that these strong FTs have lifetimes larger that those of supergranular cells essentially making them the fundamental building blocks of CHs, and are not governed by the photospheric network motion.

Summary
In this comprehensive study we investigated the intensity gradient across the CH boundary to develop a new CH extraction method using an intensity-based threshold method as well as to estimate the uncertainties of the extracted CH boundaries. We successfully implemented the flexible and fast method into an easy-to-use GUI and applied it to the SDO-era to extract CHs. We created a CH catalogue of considerable size covering the time period from May 2010 to February 2019, which includes 707 non-polar CHs that were closely analyzed. Our major findings can be summarized as follows: i) By incorporating the principle of the maximum gradient into the intensitybased threshold method we were able to: • Create, for the first time, CH boundaries with reasonable estimates for the uncertainties • Achieve a high consistency between boundaries extracted by different users • Develop an objective as possible CH extraction method, without disregarding the advantages of manual user input ii) Changes in the threshold due to small scale variations in the vicinity of CHs as well as global intensity variations as a consequence of the Solar Cycle show the importance of the individual extraction of a CH.
iii) By implementing the code into an SSW-IDL GUI we provide an user friendly environment for more objectively extracting CHs for scientific analysis, including reasonable uncertainties. iv) Using CATCH we created an extensive catalogue for the CHs observed by SDO between its operational start in 2010 and February 2019. Over this era, we extracted and analyzed 707 non-polar CHs and found them to exist in sizes ranging from 1.6 × 10 9 to 1.8 × 10 11 km 2 . Small CHs (< 2 × 10 10 km 2 ) were found to be most abundant (56%). The strength of the photospheric magnetic field underlying the CHs is distributed around 2.9 ± 1.9 G which is in agreement with most results found in literature and shows that CHs are mostly covered by low magnetic field. v) We confirm previous studies (Hofmeister et al., 2017;Heinemann et al., 2018b; Hofmeister et al., 2019) that the magnetic configuration of CHs is highly dependent on the abundance and field strength of the small unipolar magnetic elements (flux tubes), that only cover a small fraction of the CH area.
We plan to continue to develop CATCH. Planned major upgrades are the (partial-) implementation in Python and the option to use synoptic magnetograms. Also compatibility with Parker Solar Probe and Solar Orbiter are planned. New functionalities and upgrades will be published on GitHub and future user manual versions. Figure 14 shows the Spearman correlation coefficients of all the CH properties derived in the statistical analysis. The top right side shows representative squares for all correlation coefficients and additionally the significance level is marked with black asterisks (*** indicates a significance level of p ≤ 0.001, ** indicates p ≤ 0.01 and * indicates p ≤ 0.05). The left bottom side shows the values of the correlation coefficient with insignificant values (cc with p > 0.05) were omitted. The values have been converted to percent in order to improve visualization. Positive values correspond to a correlation and negative values to an anti-correlation. The parameters listed from left to right and top to bottom are the following: the optimal threshold as described in Section 2.1 in percent of the median solar disk intensity (Thr) and in DN (Thr DN ); the CH area (A CH ); the mean CH intensity (Ī) and the mean intensities of the lowest 50% and 25% percentile of pixel intensities within the CH (Ī 50 ,Ī 25 ); the same for the median intensities ( I, I 50 , I 25 ); the longitudinal (|λ CoM |) and latitudinal (|ϕ CoM |) position in absolute values; the absolute value of the signed mean magnetic field strength (|B s |) and the unsigned mean magnetic field strength (B us ); the absolute value of the signed magnetic flux (|Φ s |) and the unsigned magnetic flux (Φ us ); the flux balance (R Φ ) and the absolute value of the skewness of the magnetic field (|γ B |); the FT number, area ratio and flux ratio for both strong and weak FTs (N FT,s , r A,s , r Φ,s , N FT,w , r A,w , r Φ,w ).    The sample represents the threshold value distribution as given in Figure 7. The optimal thresholds range form 25 to 65 DN and are primarily caused by a different intensity quiet Sun level rather than large changes in the CH intensity. The red boundary corresponds to the boundary derived by the optimal threshold and the blue shaded areas are the uncertainties (see Section 2.1.2). All images are equally scaled.  Figure 10. The superposed intensity profile of the longitudinal cross-sections at the CoM of the CHs under study (a) and its derivative (b). Before superposing, each intensity profile is scaled so that x ± 1 represents the CH boundaries. The black line is the mean profile and the shaded gray-blue area represents the 1σ standard deviation. The dotted vertical lines mark the location of the CH boundary. (c) Figure 12. In panel (a) the CH area is plotted against the absolute value of the signed mean magnetic field strength (|Bs|) in double logarithmic depiction. Panel (b) gives the temporal evolution of the absolute value of the signed mean magnetic field strength (y-axis is logarithmically scaled). Panel (c) shows the distribution of |Bs| for the whole dataset in cyan, for the solar maximum in red and for the declining phase and the minimum in blue.