1 Introduction

The Sun’s temperature and its variations over timescales from hours to decades have been determined since 1978 from satellite measurements of associated variations in Total Solar Irradiance (TSI). Since the deployment of the SORCE satellite in 2003, the Sun’s temperature has also been determined for a continuous range of wavelengths that span ultraviolet, visible and near-infrared wavelengths, from solar spectral irradiance (SSI) measurements across the peak of the SSI distribution. These are of great interest due to the fundamental role that solar variations play in understanding the variations of the Earth’s climate (Harder et al., 2005; Eddy, 2009). Beyond decadal timescales, the solar irradiance is key to estimating the Sun’s luminosity, and on the longest timescales it determines Earth’s lifetime, since it determines when the Sun will exhaust its energy from fusion of hydrogen in the core (Bahcall, 2000).

The average radiative temperature of the Earth is determined by an approximate balance between the amount of energy it receives from the Sun, which can be calculated from the TSI and the Earth’s albedo, and the amount of energy that Earth emits into space that depends on Earth’s emissivity (Stephens et al., 2015). Earth’s albedo is the fraction of solar energy reflected back into space, which averages about 30%, the remainder being absorbed by the atmosphere and surface (Wild et al., 2013). To determine how solar variations impact Earth’s atmosphere–ocean system at various heights, SSI must be monitored in addition to TSI.

The relationship between Earth’s temperature and the variability of solar irradiance was first speculated on by Herschel, and as observations have improved, so has the understanding of solar variability and its contribution to climate change (Gray et al., 2010; Bahcall, 2000). The variability of both TSI and SSI occurs due to variations in magnetic fields on the solar surface, which in turn cause the appearance of sunspots and faculae (Shapiro et al., 2015). Various models attempt to predict these changes over a wide range of timescales.

Prior to SORCE, limited observations of SSI made studying the variability of SSI and solar brightness temperature difficult, but databases over several years now enable these calculations that are important for both Heliospheric and Earth sciences (Rottman and Cahalan, 2002). In this article, we present a study of linear and quadratic analytic and statistical approximations of the solar brightness temperature, \(T\), using either a single “reference day” during a solar minimum, or using statistical properties over many days in the available record. The estimation of values using linear and quadratic approximations, both analytic and statistical, are of great help in simplifying the calculations of \(T\), and in interpreting its variability. In order to determine the accuracy of the approximated values, it is necessary to compare them with very nearly exact values of \(T\), calculated from the monochromatic exact analytic \(T\) equation, Equation D.2, derived from the Planck distribution D.1, or equivalently by applying root-finding techniques to the Equation D.3 that implicitly determines \(T\) from observed values of SSI.

This paper shows that the daily values of \(T\) over the SIM wavelengths are well determined from a polynomial that is quadratic in the observed daily values of SSI. The coefficients may be expressed as analytic functions of wavelength, with small RMS errors. Even smaller RMS errors are achieved with coefficients determined from statistical fits of the observed data. Advantages of the quadratic \(T\) over the exact \(T\) include ease of interpretation, and computational speed. Interpretation of the linear term is the sensitivity of \(T\) to changes in SSI, a concept widely used in climate studies. The speed gain will become important as time-dependent models of solar variations are further developed.

The article is structured as follows: Section 2 describes the data analyzed here, provides the online link to download it, and describes the temporal and wavelength range. Section 3 summarizes the methodology used for the analysis of the SSI spectral data, the calculation of exact values of solar brightness temperatures \(T\), as well as linear and quadratic analytic and statistical approximations of \(T\). Section 4 discusses the results of exact computations of \(T\), the time series of observed SSI, and the time-series comparisons of exact and approximate \(T\) values. Section 5 concludes by summarizing the key results and suggests future directions for research related to variations in solar irradiance and brightness temperature. Finally, the article contains nine appendices that derive results referenced in Sections 1 – 5. Several figures and tables are discussed throughout. Readers may interact with several of the plotted results by going to the following dashboard that was coded in Microsoft Power BI. Dashboard link: http://wayib.org/solar-temperature-variations-relative-to-a-quiet-sun-day-in-august-2008/.

2 Data

The TSI and SSI data were downloaded from the University of Colorado’s LASP Interactive Solar Irradiance Datacenter (LISIRD), based on measurements made by instruments onboard the Solar Radiation and Climate Experiment (SORCE) satellite. The data is free and publicly available here: https://lasp.colorado.edu/lisird/data/sorce_sim_ssi_l3/.

The SORCE Total Irradiance Monitor (TIM) instrument provides records of Total Solar Irradiance (TSI), while the Spectral Irradiance Monitor (SIM) instrument provides records of the Solar Spectral Irradiance (SSI). Both instruments provide daily averages, with TIM beginning 2003-02-25, and SIM beginning 2003-04-14 and both ending on 2020-02-25 when the SORCE instruments were passivated (i.e., turned off). We employ throughout the latest “final” data versions, v19 for TSI, and v27 for SSI, as discussed in Kopp (2020) and Harder (2020), respectively. All dates in this article are given in the format YYYY-MM-DD, in accord with https://www.iau.org/static/publications/stylemanual1989.pdf.

The SIM measures SSI as a function of wavelength over the range from 240 nm to 2416 nm. Though measurements of SSI were made prior to SORCE, for example by the UARS SOLSTICE (operating during 1991 – 2001), SIM was the first to provide SSI for a continuous range of wavelengths across the peak of the solar spectrum that occurs near 500 nm, and well into the near-infrared (IR) wavelengths, with sufficient precision to determine true solar variations (see, e.g., Harder et al., 2009; Lee, Cahalan, and Dong, 2016).

Note that all irradiance data from SORCE, including all TSI and SSI values, are adjusted to the mean Earth–Sun distance of one astronomical unit, 1 AU. Doppler corrections are also made to remove any variations due to the satellite orbit. Absolute and relative calibrations are enabled by a variety of laboratory measurements carried out at both the University of Colorado’s LASP (Laboratory for Atmospheric and Space Physics), and at NIST facilities. Onboard instrument degradation is monitored and corrected for. Our focus in this paper is on the day-to-day variability at near-ultraviolet, visible, and near-infrared wavelengths. For this, we rely primarily on the high precision and repeatability of TIM and SIM, more than on the absolute calibration. The high quality of TIM and SIM data has been amply documented in the literature.

Due to operational difficulties encountered, particularly after 2011 as SORCE aged, there are a limited number of days where the records are given as NA (not available) or no values were recorded. These were omitted in all calculations reported here. As an example of the SSI records measured by the SIM instrument, the time series of the solar spectrum from 2003 to 2020 is shown in Figure 4 for a fixed wavelength, 656.20 nm, which corresponds to the hydrogen alpha (\(\text{H}{\upalpha } \)) transition in the Balmer series.

For much of the data analysis, open-source \(R\) and Python software was used, as well as commercial software including Wolfram Mathematica, and Microsoft Excel. Mathematica enabled precise computation of the brightness temperatures of the SSI data, using efficient interpolation and root-finding methods, and provided a check on exact values computed from the analytic equation for \(T\) derived from the Planck distribution for the spectral irradiance, shown in Appendix D.

For more details on the TSI and SSI data used here, see the “release notes” for SORCE TIM v19, and for SORCE SIM v27, available from the NASA Goddard Space Flight Center Earth Sciences Data and Information Services Center, or from the University of Colorado’s LASP (Harder, 2020; Kopp, 2020).

3 Methodology

For the radiation from a blackbody, the irradiance spectrum may be computed theoretically using the Planck distribution. However, the Sun is not a perfect blackbody, due to wavelength-dependent processes in the Sun’s atmosphere. Large deviations from the Planck distribution are observed, as we show below. However, it is very useful for interpreting irradiance observations to define a solar “brightness temperature,” either for the TSI, integrating all wavelengths, or for the solar spectral irradiance, SSI, at each available wavelength. This is the temperature for which the irradiance computed from a Planck distribution coincides with the irradiance observed by an instrument outside Earth’s atmosphere, for example TIM for the wavelength-integrated irradiance, the TSI, or SIM for the wavelength spectrum of irradiance, SSI.

Computation of the brightness temperature from TSI, \(T_{\mathit{eff}}\), is simply a matter of explicitly solving the Stefan–Boltzmann Law for \(T_{\mathit{eff}}\), with a result proportional to the one-quarter power of TSI. Appendices A, B and C discuss the importance of TSI and related quantities. Appendix D displays the Equations D.2 and D.3 that determine the value of the spectral brightness temperature \(T\) as an explicit function of the observed SSI for each fixed wavelength. Equations D.1 and D.3 also determine \(T\) as an implicit function of the observed SSI, by solving Equation D.3 for \(T\) as a function of SSI at each fixed wavelength using a root-finding procedure. We employ a root-finding algorithm developed in Wolfram Mathematica, using the following initial condition \(T=5770~\mbox{K}\), where \(T \) is chosen near the effective radiative temperature computed using TSI = 1360.8 W/m2 as provided by the SORCE TIM (Kopp and Lean, 2011). These two approaches produce the same values of \(T\), referred to in this paper as the “exact” values, and each method provides a check on the other.

SORCE SIM provides a daily SSI record for each associated wavelength from 240 nm to 2416 nm, so that over the 17 year period there is a large amount of data. To handle the large number of records, algorithms were developed in R, Python and Mathematica, to provide approximate values of \(T\). These approximate alternatives allow more rapidly computed values of \(T\) for any date, given a fixed set of wavelengths. In this article, we investigate linear and quadratic analytic approximations as a function of the observed SSI values, derived in Appendix E. Below, it will be shown that these approximations bracket the exact values, which motivates the development of linear and quadratic fit approximations, that minimize the root-mean-square-error (RMSE) across a large range of days, which can include all available days. These fit approximations are developed in Appendix G.

For the development of the linear and quadratic analytic approximations, a Taylor expansion is used (see Appendix E). Having the derivatives of \(T\) with respect to the SSI, this expansion gives a representation of \(T \) in terms of polynomial functions of SSI. To keep the models simple, only the first and second terms of this expansion are considered.

To apply a Taylor expansion it is necessary to have a reference value around which to expand. For this, we choose the SSI on a single “reference” day during the 2008–2009 solar minimum of Cycle 23. Namely, we choose 2008-08-24, and label that day’s exact values (\(T_{o}\), \(\mathit{SSI}_{o}\)). With the observed value of \(\mathit{SSI}_{o}\) and associated computed value of \(T_{o}\) during a solar minimum, the linear and quadratic coefficients were calculated for the analytic approximation models. The remainder of this section discusses the time series of SSI and estimated \(T\) values. Section 4 then compares the approximate values with the exact values, and also compares the analytic approximations with analogous fit approximations that use coefficients obtained by minimizing RMSE (root-mean-square-errors) over all days, and also over two selected ranges of days.

To compare the estimation with the exact value of brightness temperature, we compute difference values, and relative differences, or delta values as:

$$ \mathit{Delta}= \frac{T_{\mathit{exact}} - T_{\mathit{estimate}}}{T_{\mathit{exact}}}. $$
(3.1)

In addition to the linear and quadratic analytic approximations obtained with the Taylor expansion, a linear and quadratic fit model is developed in Appendix G, with the help of \(R\) statistical software. The linear and quadratic fit models have coefficients that depend on a given temporal range of available data, and not only on the chosen reference day, as is the case with the analytic approximations. In Section 4 we report results for the full range of available days, as well as for two subranges, those of “early” and “late” days, R1 and R2, respectively.

The comparison between the brightness temperatures calculated with the linear and fit approximations are shown in the tables, along with a comparison between the linear coefficients. In order to make the computations very explicit, in Appendix H, an example of the calculation of the brightness temperature is given for the linear and quadratic analytic approximation methods, as well as for the linear and quadratic fit approximation methods, for a randomly selected day.

In Appendix I, a method of rapid interpolation is given for the linear analytic and fit coefficients, valid over a broad range of wavelengths that satisfies \(400~\mbox{nm}\leq \lambda \leq 1800~\mbox{nm}\).

4 Results

Before considering the temporal variations of SSI observed by the SORCE TIM and SIM instruments over the 17 years, 2003 – 2020, we first consider the wavelength variations of SSI on our chosen “reference day” 2008-08-24. Figure 1 shows this \(\mathit{SSI}_{o}\) wavelength dependence observed on the reference day, in green, and for comparison the Planck irradiance distributions computed for temperatures \(T =4500\) K, 5770 K, 6500 K using Equation D.1, in blue, tan, and red, respectively. The lower and upper Planck temperatures are seen to give computed SSI values that bracket the observations of \(\mathit{SSI}_{0}\) for this wavelength range, while the computed SSI for the intermediate 5770 K (tan) approximately follows the observed \(\mathit{SSI}_{o}\) (green). Although the observed value coincides with the computed 5770 K Planck value for a few wavelengths only, the observed values of \(\mathit{SSI}_{o}\) lie above or below the Planck curve. The measured value of TSI (historically “solar constant”) by the Total Irradiance Monitor (TIM) instrument on the reference day is \(\mathit{TSI}_{o} = 1360.4704\) W/ m2 and is associated with an effective radiative temperature of \(T_{o} =5771.2685\) K, close to \(T=5770\) K, used in computing the intermediate tan curve in Figure 1 (Kopp and Lean, 2011).

Figure 1
figure 1

Solar Spectral Irradiance (SSI) vs. wavelength for reference day 2008-08-24, plotted in green, as measured by the SIM instrument onboard SORCE. For comparison, we also show Planck distributions for 6500 K in red, 5770 K in tan, and 4500 K in blue. The Planck distributions use Equation D.1 for a fixed temperature, with wavelength as the independent variable, and transformed to spectral irradiance by multiplying by the factor \(\alpha _{s} =\pi * ( \frac{R_{s}}{\text{AU}} )^{2} = 6.79426* 10^{-5}\), with \(R_{s}\) the Sun’s mean radius, and AU the mean Earth–Sun distance, as in Equation D.3.

Figure 2 is a zoom of Figure 1 for the wavelength range 240 nm to 680 nm. The apparently irregular bumps in this plot, and in Figure 1, are due to well-known Fraunhofer lines in the solar spectrum, smoothed to the SIM instrument’s bandpass, which varies from about 1 nm width near wavelength 240 nm, up to almost 30 nm near 1000 nm, then decreases slightly (Harder et al., 2005). The width of a typical atomic Fraunhofer line is of order 1 Å, or 0.1 nm, so the observed bumps are smoothed clusters of several nearby lines. A few of the contributing atomic lines are indicated in the labels on the vertical dashed lines. For example, the green dashed line near 430 nm, is labeled CaFeg to indicate that lines of calcium, iron, and oxygen (g-band) are all included within the plotted bump in the green line. For identification of g-band lines (both atomic and molecular) and its variability related to magnetic field strength, see Shelyag et al. (2004).

Figure 2
figure 2

A zoom of Figure 1 for the wavelength range 240 nm to 680 nm. The apparently irregular bumps in this plot, and in Figure 1, are due to well-known Fraunhofer lines in the solar spectrum, smoothed to the SIM instrument’s bandpass, which varies from about 1 nm width near wavelength 240 nm, up to about 30 nm width near 1000 nm, then decreases slightly. The width of a typical atomic Fraunhofer line is of order 1 Å, or 0.1 nm, so many of the observed bumps are smoothed clusters of nearby lines. A few of the contributing atomic lines are indicated in the labels on the vertical dashed lines. For example, the tan dashed line near 430 nm, is labeled CaFeg to indicate that lines of calcium, iron, and oxygen (g-band) are all included within the plotted bump in the green line. Effects of ionization thresholds are also seen, such as just above the Ca ii H and K lines near 400 nm, which has photon energies near 3.1 eV.

Effects of ionization thresholds are also seen, such as just above the Ca ii H and K lines near 400 nm, which has photon energies near 3.1 eV.

TSI provides key observational data about the Sun and is needed to compute the Sun’s luminosity and lifetime (see Appendix B). TSI is not a solar constant, as had been assumed prior to the satellite era. Its value varies due to turbulent magnetic processes on the Sun. TSI variations amount to about 0.1% (1000 ppm) of the mean value over the four solar cycles so far observed by satellite (Cycles 21 through 24), since 1978. The average solar luminosity, and thus the TSI, is determined by nuclear processes in the Sun’s core. These change over a much longer timescale than the solar cycle, up to billions of years, as nuclear processes transform hydrogen into helium. The present value of TSI, and thus solar luminosity can provide a good estimate of the Sun’s lifetime, and thus the time that the Sun’s nuclear fuel will eventually run out. Such calculations are shown in Appendix B, where it is shown that the current best TSI value at solar minimum, \(1360.80 \pm 0.50\) W/ m2 (Kopp and Lean, 2011), gives the overall lifetime of the Sun as approximately 10.70 billion years. The current estimated age of the Sun, and of our solar system, is about equal to Earth’s estimated age of 4.54 billion years (±50 million years). Hence, this leaves about 6.2 billion years, more or less, before the Sun will expand into a Red Giant, leaving a white dwarf star behind.

The importance of TSI in climatic variability has been mentioned, for example in computing Earth’s global average effective radiative temperature. Appendix C estimates the effective temperature of the Earth as 255.48 K, using the TSI on the reference day, and Earth’s average albedo of 0.29 (Stephens et al., 2015).

TSI is the integral of SSI over all wavelengths, and SSI in turn determines the solar spectral brightness temperature \(T\) at each wavelength. Determining \(T\) as well as SSI is useful in understanding the physical and chemical processes that take place on the Sun. For example, Figure 3a, a plot of the brightness temperature, \(T_{o}\), on the reference day, shows a broad peak above 1600 nm. This is associated with transitions in hydrogen ions H (1 proton + 2 electrons). Photons with a wavelength \(\lambda <1644\) nm are dominated by the H bound–free transitions, while photons with \(\lambda >1644\) nm are absorbed and re-emitted in H free–free transitions (Wildt, 1939). The H ion is the major source of optical opacity in the Sun’s atmosphere, and thus the main source of visible light for the Sun and similar stars.

Figure 3
figure 3

Temperature vs. Wavelength on the reference day, 2008-08-24. The plot in 3a shows the same wavelength range as in Figure 1. Plot 3b is a zoom into the same short-wavelength range as in Figure 2. As in Figure 2, several bumps are labeled with contributing atomic lines, such as the green dashed line near 430 nm, labeled CaFeg (calcium, iron, oxygen g-band). As in Figure 2, the rise due to the ionization threshold is evident near 400 nm, just above the Ca ii H and K lines. In both plots, the temperature at each wavelength was computed using a Mathematica root-finding procedure to solve for \(T\) in Equation D.3, \(\mathit{SSI}= \alpha _{s} B \left ( \lambda ,T \right )\), with SSI the observed value.

Now, we consider the temporal variations of SSI. Figure 4 shows the time series of the irradiance corresponding to a fixed wavelength, in particular for \(\text{H}{\upalpha } \) (\(\text{H}{\upalpha }\) \(\text{wavelength} = 656.2\) nm), the longest wavelength in hydrogen’s Balmer Series. The variability of the SSI can be seen, with the deepest minimum occurring early in the record, during Oct–Nov 2003. The spike that goes below 1.523 W/m2/nm is associated with the Halloween solar storms, a series of solar flares and coronal mass ejections that occurred from mid-October to early November 2003, peaking around October 28 – 29. See, for example, https://en.wikipedia.org/wiki/Halloween_solar_storms,_2003.

Figure 4
figure 4

Time series of irradiance for all records of daily average data from the full 17 years of SIM data, version 27, downloaded from LISIRD. In this case, we have chosen the \(\text{H}_{\upalpha } \) wavelength, 656.2 nm. In this plot, it is evident that there is a minimum of solar activity in mid-2008. We choose as a reference day 2008-08-24, and consider variations about this day to approximate the temperatures on all other days.

This occurred during the declining phase of Solar Cycle 23. On the slower year-to-year timescale, the Sun’s activity declines into the much quieter period of the solar minimum during 2008 – 2009 (Kopp, 2016). The solar minimum implies about a 0.1% decrease in solar energy that arrives on Earth, causing the Earth’s temperature to decrease slightly (Gray et al., 2010). After this solar minimum, solar activity increases again, as Cycle 24 sunspots and other solar activity increase in intensity into a solar maximum in 2014 – 2015, before declining again, into a quieter minimum period of 2019 – 2020.

As can be seen in Figure 5, the brightness-temperature time series for \(\text{H}{\upalpha } \) is also similar to the temporal variability of the SSI for \(\text{H}{\upalpha } \). It is evident that they are in phase. As SSI data is extended beyond the end of SIM, by TSIS-1 and successor missions, the solar cycles will become more evident, as happened with TSI (Solanki, Krivova, and Haigh, 2013). It is important to emphasize that the spectral brightness temperatures are wavelength-dependent radiative temperatures of the Sun, the temperatures at which the SSI data measured by the satellite coincides with what is obtained using the Planck distribution (Trishchenko, 2005).

Figure 5
figure 5

Time series of the temperature \(T\) calculated in Wolfram Mathematica for all records of solar spectral data with fixed wavelength, \(\text{H}{\upalpha } = 656.20\) nm, using Equation D.3. We term the root-finding solution of Equation D.3 the “exact” value of the temperature, to distinguish it from the two analytic approximations (linear and quadratic) described in Equation E.18, with E.10 and E.17, and from the two statistical “fit” approximations (also linear and quadratic) described in Appendix G.

Figure 6 shows a plot of the linear analytic approximation of brightness temperature compared with the exact value, the value obtained by Equation D.2, or the root-finding solution of Equations D.1 and D.3. The linear analytic approximation is given by neglecting the quadratic term in Equation E.18, taking as reference the date during the solar minimum, 2008-08-24. Figure 6 shows that this approximation closely overlays the exact.

Figure 6
figure 6

Linear analytic approximation of the temperature for \(\text{H}{\upalpha } = 656.20\) nm, compared with the exact value (value obtained by the root-finding solution of Equation D3. The linear analytic approximation is given by Equation E.18, with E.10.

To more clearly see the difference between the exact and the linear analytic approximation, Figure 7a shows the difference, exact – approximation, in units of mK = 10−3 K, and Figure 7b the delta, difference/exact (Equation 3.1) in parts per million (ppm). The negative differences in Figures 7a and b show that the linear analytic approximation overestimates the exact value of the brightness temperature. The root-mean-square-error (RMSE) is \(412.4545 \times 10^{-6}\), i.e., very small, which explains why such differences are not evident in Figure 6. A significant increase in variability is seen in 2011 and afterwards, hence Figure 7a also displays the RMSE for both the earlier, quieter period, as well as the later, noisier period. Some of this increased noise is due to Solar Cycle 24, but some is likely also due to the aging of the satellite and the SIM instrument.

Figure 7
figure 7

(a) The difference between the exact value of \(T\) and the value obtained with the linear analytic approximation. (b) Delta linear approximation of the temperature using Equation 3.1 and expressed in parts per million (ppm). The negative values in this figure indicate that the linear analytic approximation overestimates the exact value of the temperature.

Figure 8 shows the plot obtained using the quadratic analytic approximation given by Equation E.18, with D.3, together with the exact values calculated from Equation D.2 or from root finding with D.1 and D.3. This looks nearly identical to the analogous Figure 6 for the linear analytic approximation. However, in the plot analogous to Figure 7, we plot in Figure 9 the difference between the exact and the quadratic analytic approximation, and here the results are quite different from the linear case. Figure 9a shows the difference, exact – approximation in units of μK = 10−6 K, and Figure 9b the delta, difference/exact in parts per million (ppm) in the quadratic case. The positive differences in Figures 9a and b show that the quadratic approximation underestimates the exact value of the brightness temperature, though are much closer than the linear, with RMSE reduced to \(0.3428 \times 10^{-6}\), more than 1000× smaller than the linear case in Figure 7, and Table 2 shows that the Mean Error (Bias) is also more than 1000× smaller than the linear. Comparing Figures 7 and 9 (and Table 2) indicates that the opposite signs of the bias suggests there may be a better approximation that lies “in between” the linear and quadratic approximations. Below, we will show that the “fit” approximations do typically provide such improvements.

Figure 8
figure 8

Quadratic analytic approximation of \(T\) compared with the exact value. The quadratic approximation model is given by Equation E.18, with E.17.

Figure 9
figure 9

(a) The difference between the exact value of \(T\) and the value obtained with the quadratic analytic approximation. (b) Delta quadratic approximation of \(T\) using Equation 3.1 and expressed in parts per million (ppm). The positive values in this figure indicate that the quadratic analytic approximation underestimates the exact value of the temperature. Combining this result with that of Figure 7 shows that the exact value lies between the linear and quadratic analytic approximations.

Figure 9a also shows that, in accord with intuition, the decrease in RMSE indicates that the approximate value is better the more terms are considered in the Taylor expansion. The improvement from RMSE in Figures 7a to 9a removes the most significant figures in RMSE in Figure 7a, suggesting a rapidly converging series. This indicates that finding an improved “in between” fit approximation will be a challenge, as the quadratic analytic approximation is excellent.

Table 2 supports this last point, comparing the RMSE for linear and quadratic analytic models, with the RMSE for the linear and quadratic fit models for the same \(\text{H}{\upalpha } \) wavelength used in Figures 7 and 9. Indeed, though the linear fit model RMSE is about 2.85× smaller than the linear analytic RMSE, the quadratic fit model RMSE is 2.81× times smaller again than the 1000× smaller quadratic analytic RMSE. Hence, at the \(\text{H}{\upalpha } \) wavelength, the quadratic fit model is more precise even than the very precise quadratic analytic model.

Tables 13 and 4 extend Table 2 to wavelengths 285.5 nm, 855.93 nm, and 1547.09 nm, respectively. As noted for \(\text{H}{\upalpha } \), at these near-ultraviolet and near-infrared wavelengths, the linear fit model also has smaller RMSE than the linear analytic. Also, if we compare the two quadratic models, then again for 255.5 nm, 855.3 nm, and 1547.09 nm, the quadratic fit model wins, and for 255.5 and 855.3 it is by an even larger factor than it does for \(\text{H}{\upalpha } \), by factors 10.34 and 7.78, respectively, while for 1547.09 nm the quadratic fit model wins over the quadratic analytic by a factor 2.00. If we take these four wavelengths as representative, then, the quadratic fit model is preferred, and nearly reproduces the exact values, despite the high precision of the quadratic analytic model.

Table 1 Root-mean-square-error (RMSE) and the mean error (ME) of the temperature values obtained with the different models compared with exact temperature (\(T\)) values, for 2003 to 2020, and a wavelength of 285.48 nm.
Table 2 RMSE and ME of the temperature values obtained with the different models compared with the exact temperature values, over the full range 2003 to 2020, for a wavelength of 656.20 nm (\(\text{H}{\upalpha } \)).

Some applications may not require such high precision. If we choose to restrict ourselves to linear models the fit model is still preferred, though it is a close call at 1547.09 nm, where the linear analytic model RMSE is 1.05× larger than the linear fit, so has only a 5% improvement. At that wavelength, the linear analytic may be sufficient, and indeed an analytic approach has some advantages. For example, it may be optimized for a particular range of dates of particular interest, and the single coefficient interpreted as a “linear sensitivity” of temperature to irradiance at this wavelength.

Note that the SIM instrument registers a higher variability of spectral irradiance for shorter wavelengths, i.e., 285.5 nm and 355.93 nm. This occurs because the more energetic photons (according to the Planck–Einstein relationship E = hc/\(\lambda \)) allow for more transition and ionization processes than at near-infrared wavelengths, such as those shown in Figure 10, 855.3 nm and 1547.09 nm.

Figure 10
figure 10figure 10figure 10

(a) Irradiance versus time during 2003 – 2008, for four chosen wavelengths, overlaid on a plot of temperature \(T\) versus wavelength on the reference day, and indicating the reference-day temperatures at those same four chosen wavelengths. (b) Irradiance versus time during 2003 – 2020, for the shortest chosen wavelength, 285.48 nm. (c) Irradiance versus time during 2003 – 2020, for the second chosen wavelength, 619.4 nm. (d) Irradiance versus time during 2003 – 2020, for the third chosen wavelength, 855.93 nm. (e) Irradiance versus time during 2003 – 2020, for the longest chosen wavelength, 1547.09 nm.

Continuing with the plans for simplifying the calculations of the brightness temperature, which is the central objective of the article, Figure 11 shows the plots of the quotients of the linear analytic coefficients for certain wavelengths. Looking at the behavior of the curve of the quotients \(\boldsymbol{a}'\), a polynomial interpolation was obtained, as discussed in Appendix I. This provides a simple mathematical expression useful in calculating the linear coefficients for any wavelength in the range from 400 nm to 1800 nm. With this, calculating the brightness temperature becomes simpler and faster than Equation E.18 with D.3, and valid for interpolating to wavelength bins not aligned with wavelength bins measured by the SIM instrument.

Figure 11
figure 11figure 11

For each wavelength there is an associated linear “fit” coefficient, calculated using Mathematica and \(R\) software. (a) The ratio obtained from the division of the linear coefficients between the respective wavelengths. (b) The differences between the ratios. (c) Comparison between the analytic ratio \(\boldsymbol{a}'\) and the value obtained with the polynomial interpolation in the same range of wavelengths, as in Equations I.1 and I.2.

To compare the linear analytic and linear fit models, Figures 12 and 13 show the differences between the coefficients of the linear analytic approximation model, Equations E.10 and E.18 omitting the quadratic term, or G.5, and that of the linear fit model, Equation G.1. Note that the fit coefficient \(\boldsymbol{a}\) in Equation G.1 is computed using \(R\) software, and depends on the range of days supplied. This can range over the full set of days available from SORCE SIM (17 years of daily data). For comparison we also compute \(\boldsymbol{a}\)R1 over the set of days in the first half of the data, that have the smaller or RSME values shown in Figure 7a, as well as \(\boldsymbol{a}\)R2 over the late-day range, with larger RMSE. In short, early and late-year ranges are R1 = 2003 – 2010, and R2 = 2011 – 2020. All three ranges, overall, R1 and R2 are shown in Figures 12 and 13. In the figures we can see that the values obtained with Equations G.1 and G.5 (with E.10) do not vary much for wavelengths less than 1400 nm and greater than 400 nm, therefore the brightness temperature values that are calculated in that range of wavelengths also do not differ much, using the linear analytic and linear fit models. Note that \(\boldsymbol{a}\)R1 and \(\boldsymbol{a}\)R2 values lie on either side of the overall difference value of \(\boldsymbol{a}\), which in every case lies in between, for each wavelength.

Figure 12
figure 12figure 12

Three curves where: Diff is the difference between the analytic linear coefficient \(\boldsymbol{a}'\) obtained with Equations G.5 or E.10, and the fit linear coefficient \(\boldsymbol{a}\) obtained with Equation G.1, for selected wavelengths, using either the complete data from 2003 to 2020, or only early data in range R1 to compute \(\boldsymbol{a}\)R1 or only late data in range R2 to compute \(\boldsymbol{a}\)R2. In brief, R1 = 2003 – 2010, while R2 = 2011 – 2020. Note that \(\boldsymbol{a}\)R1 and \(\boldsymbol{a}\)R2 values lie on either side of the overall difference value of \(\boldsymbol{a}\), for each wavelength.

Figure 13
figure 13figure 13

The relative error between the analytic linear coefficient \(\boldsymbol{a}'\) and the fit linear coefficient \(\boldsymbol{a}\), where the fit coefficient \(\boldsymbol{a}\) is calculated in the same three ways as in Figure 12, namely using the full available time period 2003 – 2020, or R1 = 2003 – 2010, or R2 = 2011 – 2020.

5 Summary and Conclusions

Our results and conclusions may be summarized as follows: (i) The linear and quadratic analytic approximation models, Equations E.18, with Equation E.10 for the linear term, and E.17 for the quadratic term, and E.3 to compute \(B\) from SSI, simplify calculations of solar brightness temperature \(T\) on any chosen day for a fixed wavelength, with \(B\) or SSI as a single variable. (ii) The linear analytic approximation overestimates the exact values of \(T\), while the quadratic analytic approximation underestimates the exact values, but has much smaller RMSE (rms error) than the linear. (iii) By using the full dataset to find coefficients that minimize the RMSE we find linear and quadratic “fit” approximations that lie closer to the exact values for representative wavelengths, as can be seen by the “fit” RMSE values in Tables 1 to 4, being smaller than the corresponding analytic RMSEs, i.e., (fit RMSE)/(analytic RMSE) < 1 for both linear and quadratic cases, for near-ultraviolet, visible, and near-infrared wavelengths. (iv) For wavelengths in between the tabulated ones, Equations I.1 and I.2 provide a smooth interpolating polynomial function of wavelength, which is simpler and faster to apply than Equation E.10 in the analytic case, or the \(R\) software in the fit case, and accurate for any wavelength within a broad range across the peak of the SSI, extending into near-infrared wavelengths that are of particular importance in modeling Earth’s climate.

The statistical measure used to understand the differences between values calculated by the linear and quadratic analytic approximation models with the exact values of \(T\) obtained from Equation D.2 (or root finding in Mathematica software), is the RMSE (root-mean-square-error). Figures 7a and 9a and Table 2 show that for the \(\text{H}{\upalpha } \) wavelength the RMSE for the linear (\(412.455 \times 10^{-6}\) K) and quadratic (\(0.3428 \times 10^{-6}\) K) analytic approximation models are small, and therefore the deviations between the estimated and exact values are small. Table 1 shows that for a wavelength of 285.5 nm the RMSEs for both analytic models remain small, though larger than for \(\text{H}{\upalpha } \). For both these wavelengths, the quadratic analytic model is superior to the linear analytic model. Tables 3 and 4 shows that for the longer near-infrared wavelengths 855.93 nm and 1547.09 nm this pattern continues, with the quadratic analytic model being superior to the linear analytic. The fact that at all four wavelengths the quadratic analytic RMSE is smaller than the linear analytic RMSE suggests that further terms in the Taylor expansion may converge towards the exact over the full wavelength range. However, we do not have proof of convergence. Even if the series does converge, there is only a suggestion, not a guarantee, that it will converge to the exact value given by Equation D.2.

Table 3 RMSE and ME of the temperature values obtained with the different models compared with the exact temperature values from 2003 to 2020 for the wavelength 855.93 nm.
Table 4 RMSE and ME of the temperature values obtained with the different models compared with the exact temperature values from 2003 to 2020 for the wavelength 1547.09 nm.

Comparisons of the linear analytic coefficient (Equation E.10 or G.5) with the coefficient of the linear least squares fit of the data performed with the statistical packages of \(R\) software are shown in Figures 11 and 12. The linear fit model shows the line that best represents the entire data set, whereas the linear analytic approximation model has its maximum accuracy on the chosen reference day. Gaps in the data, the primary one being that which occurs from 2013-07-20 and 2014-03-12 (Harder, Beland, and Snow, 2019) have a direct influence on the coefficient of linear fit, because the solar spectrum measurement instruments SIM A and SIM B showed significant differences from the spectrum measured at the beginning of 2011, as can be seen for example in Figure 1 of the article of Harder, Beland, and Snow (2019).

Despite the good quality of the two analytic approximations, we find that the two fit models provide better “in between” approximations. The most accurate of the four approximations considered here is the quadratic fit model. We have seen that the brightness temperatures that it produces are in most cases indistinguishable from the exact temperatures that are found as roots of the equation that defines the brightness temperature, \(\mathit{SSI}= \alpha _{s} B(T)\), where \(B\) is the Planck distribution, and \(\alpha _{s}\) is the solid angle subtended by the Sun at the mean Earth distance.

There will soon be new opportunities to apply and extend this study. Both TIM and SIM instruments are now acquiring daily data onboard the International Space Station. The new record, begun on 2018-03-14, had sufficient overlap with SORCE to enable the prior dataset to be adjusted to match TSIS-1 (https://lasp.colorado.edu/lisird/data/sorce_sim_tav_l3b/). Currently, TSIS-1 extends to 2021-07-20 and continues to be extended. TSIS-1 will be succeeded by TSIS-2, which is expected to continue the record beyond the peak of Solar Cycle 25. We look forward to testing and applying the approximations studied here to future solar-cycle data, to enable improved understanding of the Sun’s irradiance and temperature variations.