Kanzelhöhe Observatory: Instruments, Data Processing and Data Products

Kanzelhöhe Observatory for Solar and Environmental Research (KSO) of the University of Graz (Austria) is in continuous operation since its foundation in 1943. Since the beginning, its main task was the regular observation of the Sun in full disc. In this long time span covering almost seven solar cycles, a substantial amount of data was collected, which is made available online. In this article we describe the separate processing steps from data acquisition to high level products for different observing wavelengths. First of all we present in detail the quality classification, which is important for further processing of the raw images. We show how we construct centre-to-limb variation (CLV) profiles and how we remove large scale intensity variations produced by the telescope optics in order to get images with uniform intensity and contrast. Another important point is an overview of the different data products from raw images to high contrast images with heliographic grids overlaid. As the data products are accessible via different sources, we also present how to get information about the availability and how to obtain these data. Finally, in an appendix, we describe in detail the information in the FITS headers, the file naming and the data hierarchy.


Introduction
Kanzelhöhe Observatory for Solar and Environmental Research (KSO) of the University of Graz is located on a mountain ridge at an altitude of 1526 m a.s.l. near Villach in southern Austria. The observatory was founded in 1943 as part of a network of solar observatories operated by the German Airforce (Pötzi et al., 2016;Jungmeier, 2017). The discovery of Mögel and Dellinger (Dellinger, 1935) that solar flares can cause ionospheric disturbances by a sudden increase of the ionisation of the D-layer in the ionosphere, such that blackouts of radio communications can occur, made it necessary to investigate such phenomena to arrive at a better understanding and predictions of these effects (Seiler, 2007). Solar observations at KSO date back to 1943. In the early years of the observatory, most observations were performed using a heliostat guiding the light beam down to a laboratory Extended author information available on the last page of the article in the basement, where a projection device for sunspot drawings and a spectroheliograph (Siedentopf, 1940;Comper, 1958) were installed. In a second tower a coronagraph was operated (Comper and Kern, 1957), which was transferred in 1948 to the top of the Gerlitzen mountain (1900 m a.s.l). First observations in the Hα spectral line were made visually at the spectroheliograph. In 1958, a Zeiss-Lyot monochromator acquired on occasion of the International Geophysical Year made photographic observations in the Hα line possible. In addition, a piggy-back mounted telescope on the coronagraph was used to record the photosphere on glass plates. A more detailed description about the early observations at KSO can be found in Kuiper (1946). Corona observations were suspended in 1964 as the seeing conditions on top of the mountain worsened. In 1973 all observations were transferred to the northern tower ( Figure 1) offering better seeing conditions (Pötzi et al., 2016).
In predigital times, i.e. up to the year 2000, observing was a combination of "manual" and "visual" work. Photographs were made on glass plates (later on sheet film and 36 mm film), which had to be manually exposed and afterwards developed. Observations at the coronagraph were on the one hand noted on special form sheets describing the intensity in the green (5303 Å) and red (6874 Å) corona lines (cf. Waldmeier, 1942) and on the other hand also photographs of the corona intensity in Hα (full width at half maximum, FWHM = 20 Å) were taken (Comper and Kern, 1957). White-light observations on photographic material date back to 1943, images were recorded irregularly, mostly on occasion of big sunspots. From 1989 on, every day at least three images were taken using the photoheliograph in the northern tower (Pettauer, 1990). Patrol observations of Hα images started in 1958 on an irregular basis and were already automated in 1973 at a cadence of one image every 4 minutes on 35 mm film rolls. Large parts of the data have been digitised like the white-light images on sheet film from 1989 to 2007 (Pötzi, 2010) and the Hα film rolls from 1973 to 2000 (Pötzi, 2007). Digitising the corona images and the Hα images before 1973 is not planned as the effort is too high and the quality of the data is often very low. Digitising the white-light images on glass plates between 1943 and 1970 is planned in the future. Sunspot drawings have always been made by hand and they will not be subject to automated procedures in order to contain long-term consistency, as is explained in detail by Clette et al. (2014). Nonetheless all drawings are scanned immediately and published in our archive. In 2010 full disc observations in the Ca II K line widened out the monitoring of the solar activity.

Instruments
The main instrument at KSO is the patrol instrument, which is in operation since 1973 in the northern tower. This instrument carries four telescopes ( Figure 2) on a parallactic mounting: (1) Ca II K: A refractor with a diameter of 100 mm and a focal length of 1500 mm. The system is protected by a dielectric energy reflection filter. The main filter was originally a DayStar etalon, which degraded in 2012 and was replaced by a Lunt B1800 Ca-K module in 2013. As the specifications required a F/20 configuration in order to guarantee the designed bandwidth (Hirtenfellner-Polanec et al., 2011) the aperture front lens was reduced to 70 mm.

Figure 2
The KSO solar patrol instrument in the northern tower carries four telescopes: (1) Ca II K, (2) whitelight, (3) Hα, (4) drawing device. The slit of the dome moves automatically with the telescopes.
(2) White-light: A refractor with an objective lens of 130 mm and a focal length of 2000 mm. The broadband filter with a FWHM of 100 Å is centred in the green range of the visual electromagnetic spectrum at 5460 Å (Otruba, Freislich, and Hanslmeier, 2008). The telescope is based on the Kanzelhöhe Photoheliograph (PhoKa) which went into operation in 1989 and was described in detail in Pettauer (1990). In order to compensate for thermal expansion the camera position is adapted each day. (3) Hα: A refractor with 100 mm diameter and a focal length of 1950 mm. The objective lens is coated with gold serving as heat filter and the Zeiss-Lyot Hα filter with a central wavelength of 6562.8 Å and a FWHM of 0.7 Å is placed behind a broadband Hα prefilter Pötzi et al., 2015). A beam-splitter provides the possibility to attach a second camera, which can be used, e.g. for ultra-high cadence observations with several images per second additional to the standard patrol observations. (4) Drawing Device: The drawing device is mounted near the declination axis of the telescope, which makes it comfortable to produce the drawing on the projected image (which has a diameter of 25 cm) and also minimises the forces applied to the telescope by the drawing procedure itself. The drawing is used for obtaining the relative sunspot number, which is sent to SILSO (Sunspot Index and Long-term Solar Observations 1 ) World Data Center at the Royal Observatory of Belgium for calculating the International Sunspot Number (ISN), more details can be found in Pötzi et al. (2016).
An overview of the telescope configurations is given in Table 1. In the current configuration the same CCD camera type (2048 × 2048 pixels with 4096 intensity levels) is used for data acquisition (further technical details on the camera specification can be found in Appendix A). Identical grabbing software and almost the same processing software is implemented for all observation types. This configuration reduces the workload in handling, maintaining and keeping up to date. As a measure of precaution we have acquired additional CCD cameras of the same type for immediate replacement if necessary.

Data Acquisition
The data acquisition software is written in C++ and makes use of commercial camera control libraries (Common Vision Blox from Stemmer Imaging). The user interface ( Figure 3) is kept as simple as possible but also offers many additional features, which are not used in the standard patrol mode. The observer marks a rectangular box on the solar disc; during enhanced solar activity this box is placed over an active region and in case of quiet solar conditions the box is placed at solar disc centre. This AOI (area of interest) is used for calculating the exposure time and the image contrast. The mean brightness in this AOI is set to a fixed value (600 counts for Hα in order to avoid saturation of the CCD chip in case of bright flares, 2000 for white light, and 1000 for Ca II K). Based on these predefined mean values, the exposure time for the camera is calculated. As the camera is much faster (6 to 7 images per second) than the typical image cadence (6 seconds) the system can use frame selection and profits from very short moments of better seeing conditions (Scharmer, 1989). For this purpose the image contrast in the AOI is calculated, i.e. the camera selects from a sequence of about 10 images the one that has the highest root mean square (rms) contrast. Before and after the daily observations, when the objective lens is protected with the dust cover, a dark current image is made. For this purpose the exposure time is fixed to 3 ms, which corresponds to an average exposure time in patrol mode. Dark current images during the observation time are not possible as there light always falls onto the CCD chip, since the cameras are not equipped with a mechanical shutter. The button "SnapFixExpTimes" starts a series of images at 5 ms, 20 ms, and 50 ms exposure time. It is used especially for the Hα camera in order to get an image of the lower corona in Hα light by overexposing the solar disc. These images are used as backup and additional data for the solar prominence catalogue of the Lomnický Štít Observatory (Rybák et al., 2011). All images are stored directly in a live archive on a RAID system as raw FITS files and JPEG files for a quick inspection. Figure 4 shows the data pipeline for images from their recording at the CCD camera to the main archive. The whole process can be divided into three steps: data acquisition, data Figure 4 Flow diagram of the data pipeline for solar images at KSO. Each of the three branches -acquisition, processing, and provision -is running on a separate system. All systems work in parallel, the whole process takes not more than 5 seconds for an individual image from acquisition to provision. processing, and data provision. Each step is performed by separate workstations, which work in parallel. The whole process from data acquisition to the provision of the processed data takes about 5 seconds per image. Hα images undergo a second step of processing (FITS files of quality 1) in order to automatically detect and characterise flares and filaments in the real-time observations. This procedure is elaborated in detail in Pötzi et al. (2015) and evaluated in Pötzi, Veronig, and Temmer (2018).

Data Pipeline
In the following we describe the standard processing pipeline which every image grabbed by one of the CCD cameras passes through. In order to keep the system simple and to reduce maintenance and work load for updates all following steps are performed by almost the same script that is executed three times in parallel, one instance for each of the three CCD cameras. This script calls individual programs, which process the images or manage the database. The scripts are started before sunrise and just wait for new images appearing in the live archive for processing until they stop after sunset.
Step 1: Before processing, the observed image is transferred from the live archive to the temporary archive, which can hold up to nearly one year of the raw observation data. In order to free space for new images, the oldest ones in this temporary archive are deleted successively. Normally in the live archive there is only the latest image. If there is some interruption or delay in the processing pipeline, a larger amount of images can accumulate there. But also in this case only the latest image is processed and transferred in order to prevent any delay in data provision. All images that remain on the live archive until the end of the observation day are processed later. The temporary archive is used as an emergency backup, in case the processing fails due to some unexpected reason or even if images of low quality can be of interest for inspection of some extraordinary solar event. It can also be of interest in developing algorithms for the parametrisation of image quality or classification purposes (Jarolim et al., 2020), see Section 3.2.
Step 2: A crucial step is the quality check (described in detail in Section 3.2) that selects the images that are processed and stored in the main data archive. This procedure should be fast and deliver stable results independently of the solar cycle phase or time of the day. The images are categorised as good (quality = 1), fair (quality = 2) and bad. Images of quality 1 are suitable for all further automatic processing steps, whereas quality 2 images can still be used for visual inspection.
Step 3: Images of quality 2 (fair image quality, suitable for visual inspections) and of quality 1 (good quality) get an updated FITS header containing all relevant image information (see Appendix C) and a normal contrast JPEG image is produced for real time display. The JPEG image is centred, contrast enhanced by unsharp masking and contains logos, date and time stamp, and is overlaid with a heliographic grid. The FITS file and the JPEG file are stored in the data archive.
Step 4: From images of quality 1 additionally high contrast JPEG images are generated, i.e. they are corrected for global anisotropies in order to provide a homogeneous contrast across the solar disc (details in Section 3.4). Like normal contrast images, they are overlaid with a heliographic grid, they contain logos and a date and time stamp. These images are also stored in the data archive.
Dark current is not subtracted from the images, This effect cannot be seen in the JPEG images as the dark current corresponds to a very low level of additional noise, which is removed by the JPEG compression algorithm. Flat field is also not applied for reasons explained in Section 3.4.3.

Image Quality Detection
In this section we describe the details of the image quality determination, which is in operation at KSO. The quality estimate makes use of a variety of properties of the images, which allow us to characterise the quality in both global and local scales: Since the exposure time is automatically controlled and set in a way that the AOI has a predefined mean count value, it is sensitive to clouds and to a low elevation of the Sun above the horizon. The mean intensity in the AOI can vary when clouds move very fast across the solar disc, in such cases the exposure time adjustment can be too slow, and the image can be under-or overexposed.

Detection of Solar Limb and Radius
This procedure is an enhanced version of the method published by Veronig et al. (2000) and is based on the limb detection by finding the limb points as the maximum gradient of intensity profiles in x and y directions. To get stable results the profiles are smoothed over 7 pixels before the gradient is applied via a morphological operation. For reducing the calculation time only 64 profiles from each direction are generated (see Figure 5). Through these limb points a circle is fitted using Taubins' method (Taubin, 1991). This method is very stable and fast and it needs only a limited number of points for the fit. The circle fit tries to minimise Illustration of the determination of the solar limb and centre. The maximum gradient (green dots) of the intensity in 64 profiles for each direction is calculated. A circle fit through these points gives the radius and the centre coordinates of the solar disc. In this white-light image from 6 September 2017 large sunspots lead to some outliers in the first step of the limb detection, which are automatically removed in the following iterative steps. where which is the geometric distance from each detected limb point (x i , y i ) on the solar limb to the fitting circle whose centre and radius are (a, b) and R , respectively. As a measure for the quality of the circle fit we obtain the value σ which is the rms of d i . In order to improve the circle fit and to obtain a robust solution, outliers are iteratively removed. For example, in the first iteration limb points with a distance from the fitted circle d ≥ R /30 are excluded and a new circle fit is performed. If σ is still ≥ 2 this procedure is repeated with d ≥ R /100 and if necessary also with d ≥ R /200. In Figure 6 two examples demonstrate the workflow of this process. The white-light image at good observing conditions in the first row needs just one iteration for finding sufficiently precise solar disc parameters, the outliers near the large sunspot group (red dots) are removed immediately. In the second row an Hα image is strongly disturbed by clouds. In this case three iteration steps are needed in order to obtain a good circle fit through the limb. As the profiles used for detection of the limb points go from the image border one fourth into the image (cf. Figure 5), some of the detected points are aligned in almost a straight line (red dots in the left image of the second row in Figure 6). These "outliers" are responsible for the bad circle fitting (overplotted green circle). Therefore in the next iterations outliers are removed and the circle fit is recalculated. The right image shows the result obtained in the last step, where the circle fit (indicated as a green line) is within the required limits (σ ≤ 2). We note that such image is not further processed. This is an example to demonstrate how robust this method is. We note that the result of this limb finding procedure is also used to fine-tuning the telescope guider. Thanks to this robustness, the telescope position can be maintained accurately to the centre of the solar disc also during cloudy periods.

Figure 6
Illustration of the determination of the solar limb and centre. Detected limb points (red dots) and limb fit (green circle) obtained after the first (left panels) and final (right panels) iterations. In the first row a white-light image from 6 September 2017 at good observation conditions shows already after one iteration a good fit for the solar limb. In the second row clouds disturbed the Hα observations on 31 December 2020 and therefore three iterations where necessary to obtain a good circle fit. Figure 7 shows the determined radius and the Hα image quality for a complete observation day for the Hα telescope (11 April 2020). In total 4204 Hα images were recorded on this day: 3098 of quality class 1, 738 of quality class 2, and 375 of quality 3. One can see from the plot, that the observations get worse over the day. The observation conditions are commonly the best in the morning. In case of fine and sunny weather, over the day the rising air causes turbulences that decrease the seeing conditions. In the afternoon occurring cumulus clouds interrupt the observations, in most cases these clouds dissolve until the evening. The determined radius of the solar disc (bottom panel) reveals in general very stable results, with an average R = 940.16 ± 0.52 pixels (determined from all images of quality class 1). The radius values in the morning and later in the afternoon are somewhat smaller due to refraction effects and less image motion. As one can also see, the radius distribution becomes wider when the image quality decreases. The radius determinations are coloured according to the corresponding image quality. This is a typical seeing situation during a fine sunny day at KSO, where the seeing conditions become worse during the day and cumulus clouds evolve in the afternoon. The data series plotted contains a total of 4204 images.

Parametrisation of Large-Scale Image Inhomogeneities
The detection of image inhomogeneities is normally a very computationally expensive task, which can be done with structural bandpass filters (cf. Pötzi et al., 2015) or also with neural networks (Jarolim et al., 2020, see also Section 3.3). For the real-time processing this method must be fast and effective. To accomplish this task, in the current implementation at KSO, the recorded and centred image is reduced to 2 × 2 pixels and the ratio of the maximum to the minimum value of these pixels, the so-called global quadrant intensity ratio (fpr), is computed. Images without any inhomogeneities should have a fpr of nearly 1, active regions or filaments have only a small influence on this ratio as their area is in the range of a few percent of the solar hemisphere. Clouds, however, can shift this ratio to values far larger than 1 as illustrated in Figure 8 top right. A drawback of this method is that clouds uniformly distributed also lead to a value of about 1 (Figure 8 top left). In such case other measures are needed to quantify the reduced image quality like the exposure time or the image sharpness.

Parametrisation of Image Sharpness
Normally for the image sharpness the rms of the intensity distribution is used, but in case of the Sun this value changes considerably with solar activity. The presence of filaments, active regions or flares produce substantially higher rms values compared to a quiet Sun. Therefore, for an alternative approach that works fully automatically, we use the central part of the solar disc, covering the central quarter of the image area, smooth it by convolving the subimage with a simple kernel consisting only of 1s, e.g. a kernel of size three looks like For KSO images the kernel k 15 is used (≈15 arcsec), which is the same as averaging pixels in a neighbourhood of 225 pixels. This smoothed subimage is then correlated with the original images. The full-disc images are reduced to 2 × 2 pixels (right to each solar disc image) and the ratio of the maximum to the minimum of the four pixels is computed (annotated in each image). The images that are partially obstructed by clouds lead to high ratios, far higher than the optimum case of 1.
subimage. Sharpness values S are then obtained by: where r is the Pearson's correlation coefficient between the smoothed and the original subimage. The factor of 1000 was introduced to obtain integer results in a typical range up to ≈100. The reasoning behind this method is the following: when the smoothing of the image does not substantially change the image, i.e. when the image was already unsharp before smoothing, the value of r would be 1 and therefore S = 0. Typical values are S > 50 in case of quality class 1 Hα images, whereas images in quality class 3 have S < 25. These boundaries are lower in case of white-light and Ca II K images.

Summary of Quality Determination
The three quality levels are determined by the above described single parameters or procedures or their combinations, e.g. for Hα images: high exposure times (exp ≥ 18 ms): quality class 3 rms of solar radius determination > 5 pixel: quality class 3 global quadrant intensity ratio > 1.08: quality class 3 exp > 5 ms and rms of solar radius > 3: quality class 3 mean intensity in AOI ≤ 400 or ≥ 1200 counts: quality class 3 image sharpness < 25: quality class 3 image sharpness < 50: quality class 2 The settings for white-light and Ca II K images differ slightly, similarly, the mean intensity in the AOI has a much higher default value for white-light images as we do not expect large changes of image brightness, in contrast to Hα images that may vary strongly during flares. The parameters defining the quality classes for each camera are listed in Appendix B.

Quality Parametrisation with Generative Adversarial Networks
A different approach for image quality assessment of ground-based solar observations was presented by Jarolim et al. (2020), where the authors used a neural network to identify quality degradation. The method is based on generative adversarial networks (GANs) to model the true image distribution of high quality KSO Hα observations and assessing deviations from it, such as large deviations caused by atmospheric influences (e.g. clouds). This metric is more objective in terms of the solar activity level. The method provides a human-like quality assessment that agrees in 98.5% of the cases with human assigned labels. The developed neural network provides a continuous image quality metric and a binary classification into high-and low-quality observations. In the upper panel of Figure 9 we compare the image sharpness (defined in Section 3.2.3) with the continuous image quality value of the neural network. For comparison, fixed thresholds in the image quality metric were set and additionally the binary classification to further distinguish between good (class 1) and fair (class 2) images was used (cf. Jarolim et al., 2020). The direct comparison shows that both assessment methods are in good agreement in general. The neural network shows a higher sensitivity to clouds, while it is less sensitive to the variations in seeing, in agreement with the objective for large scale image inhomogeneities. The fast performance of the neural network makes it suitable for the critical step of quality assessment in a real-time environment. The further extension to small scale quality estimations (e.g. seeing), multiple channels (e.g. white light, Ca II K), and the applicability to a real-time data pipeline are under investigation.

Centre-to-Limb Variation
In order to derive a centre-to-limb variation (CLV) profile we apply a median filter on each concentric ring of the solar disc with a width of one pixel. Features like sunspots, active regions or filaments do not have in general any effect on such a filter, as they are clearly smaller than the half length of such a concentric ring. However, close to the centre of the solar disc, these activity features can cover a substantial part of the rings. Thus, for robustness, for the inner rings a further post-processing should be applied, e.g. an interpolation via a CLV functional fitting as is described below. This procedure is computationally intensive, but as each ring can be handled independently the code can easily be parallelised.
According to Scheffler and Elsässer (1990) the CLV profile can theoretically be expressed as where I is the intensity, β is the centre-to-limb variation factor and θ is the exit angle of the light (disc centre: β = 0, limb: β = π/2). In order to use distances from the centre of the solar disc this equation can also be rewritten to with a as scale factor of the solar radius, b as centre-to-limb variation factor and x = r/R . This formula does not describe the whole profile as the numerator is always larger than or equal to 1 and the upper limit of the denominator is defined by 1 + b, which is in most cases smaller than 3. By fitting the calculated CLV profile by Equation 6, a complete smooth CLV profile can be obtained which is not affected by intensity fluctuations introduced by the individual rings. In order to avoid the potential effects of solar activity features (sunspots, flares, filaments) on the inner rings, we applied the fits to different subranges (avoiding the inner parts) and evaluated the robustness of the fit results, namely These evaluations were done with KSO white-light and Hα images recorded at times of very low activity (no sunspots present) and the resulting fit range was then also used for images obtained at times of high activity. The fit result is extrapolated on the full range using Equation 6 giving the new centre-to-limb variation (CLV new ) profile. This method also works for active solar conditions, i.e. images with magnetic features like sunspots in white-light or active regions in Hα. An example for this procedure, for very high solar activity with one of the largest sunspot groups in the last decade on 22 October 2014 and for a quiet Sun from 1 August 2017, is show in Figure 10. This approach assures that large sunspots, filaments, and flares close to the inner (smaller) rings, where they may produce a substantial effect on the obtained (median) ring intensities, do not affect the determined CLV profile. The application of this procedure to produce a smooth CLV is demonstrated in Figure 11. One can see that in the inner regions, the raw CLV profile calculated from the median intensities along the rings is affected by the flare (panel b). This effect is removed by the CLV fit and extrapolation for the inner parts r > 0.5 R (panel c). Also, the image contrast is improved considerably by dividing the original image with the CLV new intensity profile (panel d). We note that such strong deviations show up in the presence of large flares in Hα images, due to the large intensity enhancement during flares. Big sunspots in white-light images do not reveal in general a significant distortion of the inner CLV rings.
Finally, the parameters a and b of the CLV profile fit were also studied in their evolution over complete observing days and compared to different other image parameters, such as exposure time (exp), the global image quadrant intensity ratio (fpr), the image sharpness parameter, and the image quality (Q) from Jarolim et al. (2020). Figure 12 shows as an example the comparison of these parameters over about one hour of Hα observations. As can be seen, the CLV parameters evolution during the day also delivers an assessment of the image quality. In general, values deviating from average a and b are related to lower image quality.

Figure 10
Radial CLV intensity profile derived from KSO white-light (left) and Hα (right) images. In the upper panel the images were selected during quiet Sun conditions on 1 August 2017. The lower panel shows the profiles for 22 October 2014, when a very large sunspot group was present. The red curve is the CLV profile calculated from the median intensity of the rings around Sun centre, the blue crosses correspond to the CLV functional fit to the data according to Equation 6 for r > 0.5 R , and the dashed line is the extrapolation of the CLV fit to r < 0.5 R .

Global Anisotropies
Normally the telescope optics introduces additional intensity inhomogeneities. This can be due to dust, stray-light, filters or even shutters (a good example can be seen in Denker et al., 1999). In order to remove also such influences the method of obtaining the CLV profile has to be adapted by replacing the median of each concentric ring with a running median. This method was first used by K.A. Burlov-Vasiljev at Kiepenheuer Institute in Freiburg, Germany. For this purpose the filter width of the running median filter applied to each concentric ring has to be set in order to neglect solar features but still detect large scale inhomogeneities. A filter width of twice the radius of each concentric ring (≈ one third of the length of the ring) seems to be a choice that fulfils these criteria. The central part (r < R /20) is treated as for the CLV determination, where the median of the whole disc inside this boundary is taken. In the Hα images at KSO there is a small intensity gradient from east to west (left to right), this can be seen in the contour lines of the anisotropy map of Figure 13. The right panel illustrates this gradient more apparently. The red profile from centre to north looks like a standard CLV profile, whereas the green profile from centre to west drops down too rapidly.
To get a widely flat image -having a uniformly distributed intensity level and everywhere almost the same image contrast -the image has to be divided by the obtained anisotropic map. Figure 14 shows a comparison between the original image (left) and the flat image (right). Especially features at the limb, like the small active region in the west (right), become better visible. The radial intensity profiles shown beneath the image demonstrate the flatness of the image, the higher values and the variations in the central part (r/R ≈ 0) are a result of the small number of averaged pixels. The variations near r/R = 0.55 are introduced by the large active region. The peak at the limb of the corrected profile is due to the roughness of the solar limb caused by seeing effects. The jump at r/R > 1.0 in the intensity profile is a result of the scaling above the solar limb in order to enhance prominences.

Flat Field Determination
The Kuhn-Lin-Loranz (Kuhn, Lin, and Loranz, 1991) method is used to obtain a real flat field, that also includes smaller intensity variations, like dust particles. This method is based on the assumption that the Sun does not change its appearance within a few seconds. A set Figure 12 Evolution of KSO image parameters during one hour of Hα observations on 19 May 2020. From top to bottom: CLV parameters a and b, image quality parameter from GAN (Q), exposure time (exp), and global quadrant intensity ratio (fpr). Deviations of the parameter a and b are linked to lower image quality Q (i.e. green dots for quality class 1 and orange or red dots for quality classes 2 and 3, respectively).

Figure 13
Global anisotropies map derived from an Hα image of 26 January 2021 and the corresponding normalised radial intensity profiles. Due to the optical parts in the telescope the intensity distribution in the map is not radially symmetric, which can be clearly seen in the overlaid contour lines (blue). The red and green lines correspond to the intensity profiles shown in the right panel.
of Sun images shifted on the CCD is used to calculate the intensity for each CCD pixel. The more images are used and the less the Sun changes, good seeing conditions are a premise, the better the flat field becomes. A drawback of this method is, that this procedure needs manual interaction, and cannot be performed very often as it interrupts the patrol observations.

Figure 14
Hα images for comparison between normal contrast image (left) and high contrast image (right) divided by an anisotropy map from 28 November 2020. The radial intensity profile under each image shows that the corrected image is nearly flat after the correction. The wiggly structure near r/R ≈ 0 is due to the little number of averaged pixels. The jump at the limb in the right profile is a result of the extra treatment of the scaling inside the solar disc and above the solar limb.
At KSO for such a flat field one centre image plus 28 shifted images are taken. The shifted images are on three concentric rings, four images with a shift of 115 pixels (≈ R /8), 8 images with a shift of 230 pixels, and 16 images with a shift of 345 pixels. The exposure time is set to a fixed value and the duration of the whole procedure is kept as short as possible in order to have almost the "same" Sun for all images. Taking the full frameset takes about four minutes, most of the time is used for moving the telescope. Starting in 2021, flat field frames are produced for all telescopes (Hα, Ca II K, and white light), when the seeing conditions are exceptionally good or when there are any changes in the telescope setup. Figure 15 shows an example for flat field frames of all three telescopes. For Ca II K images the interpolation of the CLV according to Equation 6 does not work properly. The reason for this is the intensity distribution of the flat field, which seems to be produced mainly by the Ca II K filter. This flat field shows a ring-like intensity enhancement at R /3, the central intensity is even lower in this ring. The white-light flat field (centre panel of Figure 15) shows a relatively uniform distribution, some very small dust particles are visible. The Hα flat field (right panel of Figure 15) is dominated by three features: The interference pattern (dark and bright bands) are very likely produced by the Lyot filter, additionally there is a global intensity gradient from the centre to the left and to the right and there are some dust particles. We have to note that the contrast in these flat field frames was strongly enhanced to make all these features visible, the intensity difference within the solar disc is about 2%. The horizontal and vertical structures are an artefact due to the image motion caused by atmospheric conditions, which should be theoretically zero for applying the Kuhn-Lin-Loranz method. Also the small scale structures are a result of the seeing. To get rid of these artefacts a really exceptionally good seeing is necessary for getting a good flat field with this method. The most important application for such a flat field map is to detect dust particles on the optical parts and to see the influences of the telescope optics on the camera image. Applying such a flat field on the raw image can only be done after some smoothing steps in order to get rid of the fine structures introduced by the seeing.

KSO Data Archive
The image archive of KSO as presented in Pötzi, Hirtenfellner-Polanec, and Temmer (2013) underwent a number of changes and updates in the recent years. The most important improvement is the instant of time of availability of the data in the archive. Whereas the archive was normally updated at the end of an observing day, it is now updated continuously in quasi real-time simultaneously to the observations. Every minute, for each telescope one newly recorded image is transferred to the archive. As there are differences in the image cadence (10 images per minute for Hα and Ca II K and 3 images per minute for white light), additional recorded images are transferred to the archive after the observing day. The number of additional images to be transferred is selected depending on the camera type and solar activity. Normally three chromospheric (Hα and Ca II K) images per minute are stored and one photospheric (white light). In case of large solar flares 2 (defined as of GOES class ≥ M1 Figure 16 The data path for the full white-light data set. A data set of one daily processed image is in the synoptic path, the whole daily data is located in the archive path. or optical flare importance class ≥ 1) all images of good and fair quality recorded during flare activity are transferred to the archive.
The data archive consists of the synoptic and the main archive branch (see Figure 16). The synoptic archive consists of one higher level image data set for each observing day. The main archive holds all the observed raw images as FITS files and their corresponding processed JPEG quick look files. The data hierarchy in the synoptic branch goes from data type to yearly directories. In the main archive the data are stored in yearly directories containing the folders for each observing day. All archive data are mirrored in a local archive simultaneously. With a delay of a few hours, the data are also transferred to a mirror archive located physically at the University of Graz (Figure 17), which provides a higher bandwidth for faster data transfer. The archive can be accessed via the following ways: • KSO archive webpage http://cesar.kso.ac.at • KSO ftp ftp://ftp.kso.ac.at, instructions how to access the server can be found on http:// cesar.kso.ac.at/main/ftp.php • for the transfer of large data sets we suggest the use of the mirror http://kanzelhohe.unigraz.at ( Figure 17) as it guarantees faster download rates. • ESA Space Situational Awareness (SSA) Space Weather portal https://swe.ssa.esa.int, registration is necessary to access this data.

Synoptic Archive Data Products
The directory structure for the synoptic archive shown in Figure 16 (left) consists of a FITS and JPEG path for each camera containing a set of files for each observing day. The image

Figure 17
The mirror archive located physically at University of Graz is operated for fast data access. In case of large data transfers this archive should be favoured.
is selected manually by the observer and set of various images and profiles is processed for displaying on the observatory website. These data are for a quick daily overview of the situation on the Sun.
The following files are generated: •

Main Archive Data Products
The main archive contains all images of quality classes 1 and 2 in raw format (only FITS header updated) and as quick look JPEG files in various types as described below. The cadence of the data is three images per minute for Hα and Ca II K observations and one image per minute for white light. In case of flares this cadence is increased to 10 images per minute for chromospheric observations. On a day with good observing conditions and a very active Sun up to 10 000 images can be stored in the archive.
The archive consists of the following files in each daily directory [IdId] denotes a 4-digit internal number for identifying the flare region in the automatic flare detection system number. The colours in the coloured images are red for Hα and grey for white light. The sizes of the JPEG images are reduced to 1024 × 1024 pixels.

Derived Higher Level Data Products
The following higher level data products that are derived from the above data are also available: • Daily graphical observation logs are created after each observing day to present a quick overview with time marks for each archived image for all cameras (see Figure 18). These are created after each observing day and made available via the KSO archive webpage. • Sensitive filament maps are generated from the filament data produced by the automatic flare and filament detection system (Pötzi et al., 2015). These filament data are combined to hourly sensitive maps (Figure 19), where Hα images are overlaid with detected filaments. When the mouse pointer is moved over a filament, position, size, and area information is shown. Via the ESA SSA space weather portal also data from the database can be obtained in a JSON 3 format. • Flare data and movies which are also produced by the automatic flare detection system are available within the ESA SSA Space Weather portal and the KSO archive webpage. Each detected Hα flare is characterised by start time, peak time, end time, and flare importance class. Each flare is linked to its corresponding interactive JavaScript movie (described in the main archive products above). • In the Hα light curves a plot of the brightest pixel in each Hα image is overlaid by GOES full-Sun SXR flux profiles. The GOES data is obtained from NOAA/SWPC (National   Oceanic and Atmospheric Administration/Space Weather Prediction Center), using the long wavelength band (1.0 -8.0 Å). Additionally automatically detected flares together with their Hα importance class are annotated in the plot (see Figure 20).

Other Data Products and Former Data Products
The links to the different data products can be found in Table 2.
• Prominence images: Once a day, when there are no thin clouds, a set of Hα images is taken with the exposure times up to 50 ms. As a result the solar disc is completely saturated, but prominences above the limb become better visible. Due to telescope limitations, prominences can only be viewed up to 1.13 R . This data are used as complementary data to fill gaps of the Lomnický Štít prominence catalogue (Rybák et al., 2011). • The daily sunspot drawings are scanned immediately and available on the KSO archive webpage together with the derived sunspot relative numbers. This is the longest data set of KSO reaching back to 1944. • Hα photographic film data between 1973 and 2000 were scanned (Pötzi, 2007) and are available as full FITS and JPEG archives. • Hα CCD data observed with the former cameras (see Table 1) with less resolution and less image depth are also available as synoptic and full data archives. • White-light photographic images, of which normally three exposures on 13 × 18 cm flat film were taken per day between 1989 and 2007, have been scanned and processed (Pötzi, 2010). The archive consists of the raw FITS files and JPEG images with an overlaid heliographic grid.
• filesize in bytes of the FITS file • filename of FITS file including path as accessible via http • available JPEG file types The database has nearly 5 million entries of solar images. All of them are accessible through the archive. Most entries are Hα images (about 3.5 million). For each entry there exists a FITS file and at least one JPEG file, in case of good quality additionally a coloured JPEG file and a coloured high contrast JPEG file are available. The image database can be accessed via the archive page directly http://cesar.kso.ac.at/catalogue/search_obj.php (left side "Database Search"). Figure 21 shows the query page and the result of this query beneath.
• to: end date and time in the format yyyy-mm-ddTHH:MM:SS.
• ftype: (optional) can be one of all, jpeg, fits.
• color: (optional) set to 1 if coloured normal contrast and high contrast (global anisotropies corrected) images are wanted. • html: (optional) set to 1 if html output is requested, the default output is a list with comma separated entries (csv). • qual: (optional) choose the highest quality level to select, default is 3, i.e. 1, 2, and 3 are selected. • filesize: (optional) set to 1 if the filesize in bytes should be returned, this can be of interest for estimating the download time.
For example, to get all Hα image entries between 08:00:00 UT and 10:00:00 UT for 23 May 2021 including file sizes and coloured JPEGs the API looks like this: http:// cesar.kso.ac.at/database/get_halpha.php?from=2021-05-23T08:00:00&to=2021-05-23T10: 00:00&html=1 This gives in principle the same output as shown in the result in Figure 21. These APIs are also used, e.g. to populate the KSO data of the Virtual Solar Observatory (VSO, Davey, 2014), where all entries of our database are replicated back to 1973.

Conclusion
This article gives an overview of the data processing pipeline, the data archive, and the data products of the KSO solar observations. The Kanzelhöhe data pipeline could act as an example of how to process data fast and effectively. The time from data acquisition to data provision is outstandingly short, it can really be called "near real-time" data. This fast processing is due to the three separated tasks -acquisition, processing, and provision -running on independent servers. The processing part, e.g. is even implemented on two workstations to reduce the workload, but can in case of a hardware failure run also on one workstation alone. The data are never stored locally on camera controllers, all data beginning from data acquisition to processed data and temporary data are on network attached storages (NAS) that are accessible from all workstations. This separation of tasks on different workstations Figure 21 Example for the database query interface and result: Above the database query for Hα images between 08:00 UT and 10:00 UT for 23 May 2021. Below the result of this query, the file entries are links that can be viewed directly.
has the advantage that all steps can run in parallel, each step is just waiting for data produced by the previous step. The second important point is an almost immediate and easy access to the observed data. For this purpose data in lower cadence (one image per minute) is transferred immediately after processing to the archive and made accessible via the database. At the end of an observing day the archive and the database are filled up with the rest of the data. The data can then be retrieved via web interfaces or ftp access. Database queries can be done directly over the archive web page or automated via scripts using the described APIs. Both, the fast data processing and the fast data availability, can only exist due to the automated data processing pipeline. The basis for this pipeline is the quality determination of the incoming data, that decides whether an image is dismissed or further processed. The provided data products serve as direct input to automated solar feature detection algorithms and allow to study the Sun in high resolution and high cadence.

Appendix A: Camera Specifications
The cameras are equipped with an ON semiconductor KAI-4021 monochrome CCD sensor. The number of active pixels is 2048 × 2048, each with a size of 7.4 µm × 7.4 µm, resulting in a total sensor size of 15.1 mm × 15.1 mm. The peak quantum efficiency is 55% at 475 nm, i.e. the quantum efficiency for Ca II K (393 nm) is 45%, for white light (545 nm) ≈50%, and for Hα (656 nm) 30%. The maximum data rate is 40 MHz, which limits the frame rate to about 9 images per second in 12-bit mode. This frame rate is reduced to 7 frames per second in our software when frame selection is active. The sensor is an interline progressive scan CCD, which reads out all pixels simultaneously. The number of dead pixels is zero for the Hα camera, one for the white-light camera and three for the Ca II K camera. These dead pixels are replaced by the average intensity of the two neighbouring horizontal pixels. Detailed information about the image sensor can be found at https://www.onsemi.com/pdf/ datasheet/kai-4021-d.pdf.  and Williams, 2019). Most of the keywords are clear, but some need an additional explanation and several keywords having the same value or meaning are included for reasons of compatibility with older software.
DATE-OBS = DATE-BEG Date and time of image capture. DATE Date and time of FITS creation. EXP_TIME, XPOSURE Both are the exposure times but the first is in milliseconds and the latter in seconds. CDELT1, CDELT2 The angular aperture of pixels in arcsec, this angle refers to the viewing angle and not to heliographic coordinates CRVAL1, CRVAL2 The value of CRPIX1 and CRPIX2 in arcsec, it is for the horizontal coordinate (CRPIX1-CENTER_X) × CRDELT1. ANGLE, CROTA1 The inclination of solar north clockwise and counter clockwise, it is the sum of SOLAR_P0 and the camera tilt angle. RSUN_REF The reference radius of the Sun in meters based on the IAU (Prša et al., 2016) value for the photosphere and for the chromosphere transit measurements in Hα from KSO were taken, resulting in a thickness of 3600 km. RSUN_ARC The actual diameter of the Sun in arcsec depending on the distance of the sun and RSUN_REF. PRE_INT The requested intensity in the AOI for the frame selection procedure which defines the exposure time. A_O_INT The coordinates of the AOI rectangle.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.