Introduction

Luteinising hormone (LH) plays a complex role in human ovarian physiology. During the follicular phase, it stimulates thecal cells to produce androgens, which are converted into oestrogens by the granulosa cells. During the luteal phase, LH has a major role in maintaining the full function of the corpus luteum. However, regarding fertility control, its most significant role is to induce ovulation at mid-cycle [1]. During ovulation, LH triggers the completion of the maturational process of the oocyte to metaphase II, and its release from the follicle. Finding reliable ways to determine the moment of ovulation is a critical task in the control of human conception and contraception. It also has implications in reproductive medicine, such as timed intercourse, artificial insemination and the diagnosis of anovulation [2].

Understanding the underlying population variability with regard to ovulation is extremely important to appropriately design methodologies to prospectively predict ovulation; for example, for the purposes of timing intercourse to achieve pregnancy. The measurement of reproductive hormones in daily urine samples during the menstrual cycle, especially LH, combines an easy way to collect specimens, and an objective reference to determine the onset of ovulation.

The level of LH suddenly surges, then peaks mid-cycle causing the dominant follicle to rupture and release a mature ovum. Ovulation typically occurs approximately 28–48 h after the beginning of the LH surge [3] and approximately 0.5 days after the peak in levels of LH [4]. Determination of peak LH is simple, but as peak urinary LH levels coincide with, or occur after ovulation, calculation of the peak LH day does not provide the advanced warning of ovulation that is needed for timing of intercourse, or fertility procedures [3, 5, 6]. In addition, urinary peak LH day can differ according to the assay used to measure LH in urine, with those detecting beta-core LH (LH-βc) identifying a peak day at least 1 day after assays that do not detect LH-βc [4]. Identification of the day of the urinary LH surge does not appear to be influenced by the assay used, and provides 24–36 h warning of impending ovulation [710], therefore making it an excellent parameter to measure in order to provide prospective information regarding ovulation. Therefore, it is important to have a well-validated method for assignment of the LH surge and to examine retrospective datasets to provide information on population variability of LH surge.

Detecting the day of the LH surge in urine, when the first sustained rise in the level of this hormone occurs, is not straightforward. The onset of the LH surge in women of reproductive age has been investigated by many researchers, who have developed different approaches for surge identification [3, 6, 7, 1018]. Nevertheless, no comparison has been made between these different methods to date, to determine whether they are equally valid, or whether there is one method that is more effective than the others.

The aim of this study was to compare the different methods reported in published studies that have been developed to estimate the onset of the LH surge in urine during the menstrual cycle. The study sought to identify the different methods used, and subsequently compare them on a sufficiently large number of complete LH profiles from ovulatory cycles of premenopausal fertile women, to determine the most appropriate method to use for the retrospective analysis of study populations. The results of these analyses will lead to further understanding of LH profiles in women and may assist in improving the accuracy of prospective methods of predicting ovulation based on the detection of the LH surge.

Methods

Selection of peer-reviewed papers

To identify published methods used to detect the onset of the LH surge in urine during the menstrual cycle, a thorough search of the scientific literature of peer-reviewed papers in biomedical journals was completed. PubMed and Web of Science databases were searched using the following terms: LH (all spelling variations and abbreviations); surge (all equivalent terms e.g. rise, increase). All abstracts were examined and full papers obtained if the LH surge (or equivalent) was a parameter measured in the study. Citations within identified papers were also investigated to identify further relevant papers. Out of these studies, only those in human subjects describing full methodologies to detect the LH surge during the menstrual cycle were selected for inclusion in this analysis.

Testing the methodology

From the selected studies, the algorithms employed to detect the LH surge in each case were investigated, the methodology was compared, and a classification of the different methods was established. The identified methods were then compared by analysing their performance in complete urinary hormonal profiles of a reliable sample of menstrual cycles.

Menstrual cycle collection

Complete menstrual profiles (from menses to menses) of urinary levels of LH and pregnanediol-3-glucuronide (PdG) were used to compare the different methods designed for detecting the LH surge during the cycle. Daily urine samples (first morning void) were collected by 227 premenopausal women, aged 18–45 years, attempting to become pregnant, with regular cycles of normal duration (21–42 days), and no known fertility problem. Volunteers were required to have two natural cycles prior to study entry and could not be using any fertility medications, breast feeding or have diagnosed polycystic ovarian syndrome. The volunteers collected daily urine samples into sample pots containing sodium azide as a preservative. Volunteers were required to refrigerate samples on collection and return them to the study site, where samples were frozen at −80 °C prior to analysis.

Sample analysis

Hormone analyses were conducted as batch analyses, ensuring complete cycles were analysed on single assay plates. Samples were brought to room temperature and mixed prior to analysis. It had previously been determined that up to five freeze–thaw cycles had no effect on analyte concentration.

Hormonal analysis of urinary LH and PdG was conducted using AutoDELFIA (Perkin Elmer, Waltham, USA) as previously described [19]. The in-house LH assay consists of immobilised biotinylated antibody (antibody #2119; SPD Development Co., Ltd, Bedford, UK) that recognises the alpha LH subunit bound to streptavidin plates, and a second, europium-labelled antibody that recognises the beta subunit (antibody #2301; SPD Development Co., Ltd, Bedford, UK), thus it is only able to measure intact LH. Assay sensitivity was 0.1 mIU/ml and inter- and intra-assay percentage coefficient of variation (CV) was less than 5 %; linearity was seen in dilutions up to 1 in 20 and no high-dose hook was observed when testing up to 1000 mIU/ml. The PdG assay used was also an in-house competitive immunoassay based on competition between sample and europium-labelled PdG for binding by a high-affinity antibody (antibody #5806; SPD Development Co., Ltd, Bedford, UK). Assay sensitivity was 0.021 µg/ml and intra- and inter-assay percentage CV was below 10 % for the standards tested (0.16, 0.80, 4.00, 20.00, 100.00 µg/ml); linearity was seen in up to a 1 in 20 dilution of urine sample.

Cycle inclusion and analysis of methods identified in the literature for determining the urinary LH surge

A total of 254 reference cycles with complete hormonal profiles of LH and PdG were included in this study. Out of these cycles, 148 ended in conception. Only hormonal profiles showing these unambiguous features of ovulation were included in the study. The existence of the LH surges followed by peaks, as well as a consistent rise in the levels of PdG, was proved visually on the plotted hormonal profiles (LH and PdG). The LH surge day was assigned following abundant visual analysis of the graphical data by the authors. This method has been shown repeatedly to be effective and reliable in the interpretation of basal body temperature charts for assigning reference point in menstrual cycles [20, 21]. LH and PdG profiles corroborate the correctness of this visually assigned LH surge; rise of urinary LH surge occurs 0.81 ± 0.89 days prior to ovulation and the rise of urinary PdG from baseline has been demonstrated to occur 3.20 ± 1.07 days after ovulation (Roos et al., submitted), using the assay previously reported and employed [4]. This LH surge was considered the reference surge day against which modelled surges were compared for agreement. The algorithms defined by each method were evaluated in the cycles using SAS 9.3 software. LH baseline was determined for each method for each cycle, then surge assigned for each method identified in the literature. The surge was determined as the first sustained rise in LH above the baseline, with a sustained rise defined as a level of at least 2.5-fold of the standard deviation (SD) over the mean baseline level.

Agreement between the reference surge and the results from the modelled surges from the different literature-identified methods were examined to the exact day.

Results

Analysis of the scientific literature

Altogether, 12 studies investigating the onset of the LH surge in the menstrual cycle, containing a well-defined, reliable, methodology for detecting the surge in urine were identified (Table 1) [3, 6, 7, 1018]. From the methodology described in these 12 studies, 16 different methods used for detecting the LH surge were identified. Analysis of these 16 methods found that they could be separated into three major methodological categories.

Table 1 List of publications identified by searching literature databases for studies on the detection of the LH surge [3, 6, 7, 1018]

All the identified methods defined the LH surge as ‘the first sustained rise in the level of LH, from the first day of the cycle’. Similarly, all methods used the mean value of 4 or 5 days to estimate baseline LH values.

The definition of the magnitude of the rise in the level of LH to identify the occurrence of the surge was defined differently in the various studies. Some studies took a multiple value of the baseline level (e.g. twice or three times the baseline level), others considered it as a number of SD values over the mean baseline level.

However, the key difference between the algorithms developed is the method used to determine the baseline level of LH during the follicular phase of the cycle from where the rise occurs and, specifically, the reference points used to estimate this baseline value.

Methods identified for the detection of the onset of the LH surge in the menstrual cycle

Three major categories of methods, based on estimation of the baseline values of urinary LH can be distinguished:

  1. 1.

    Using fixed days during the cycle. The basal level of urinary LH is determined by calculating the mean LH value from day 5 (or 6) of the cycle, and up to days 8, 9 or 10. Four different variations of this method have been used: mean LH levels from days 5 to 8, from days 5 to 9, from days 6 to 9 and from days 6 to 10. This method can be applied both retrospectively and prospectively, as it does not need data of the complete menstrual cycle.

  2. 2.

    Using the day of the LH peak in the cycle. Urinary LH peak day of the cycle under investigation is identified, then 3, 4, 5 or 6 days prior to this is taken as a reference day, and the mean LH value on the reference day is then calculated, plus the level on the previous 3 or 4 days. In total, eight variations of this method were observed; estimating the mean LH value of, either, the previous 3 or 4 days plus the reference day, which are days 3, 4, 5 or 6 before the day of the LH peak. This method can only be applied retrospectively as data from the complete menstrual cycle are necessary for its application.

  3. 3.

    Using an estimate of the LH surge. This approach makes a rough estimation of the day of urinary LH surge by examination of the data of the cycle under investigation. It then takes days 1 or 2 before the estimated LH surge, as a reference level, plus the level on the previous 3 or 4 days, to calculate the mean value of LH. Four variants of this approach are documented. This method also can only be applied retrospectively as it requires data from the complete menstrual cycle.

Comparison of the methods used to determine baseline value of LH

For all of the methods identified above, the baseline LH level was calculated. The method-dependent surge was then determined as the first sustained rise in LH above the baseline, with a sustained rise defined as a level of at least 2.5-fold of the SD over the mean baseline level. Figure 1 shows an example of complete hormonal profiles of LH during the menstrual cycle, highlighting the references used by each of the methods to determine the baseline value of LH in the follicular phase of the cycle.

Fig. 1
figure 1

Application of methods to determine baseline for LH surge identification to an example menstrual cycle. Each individual method is numbered: a using fixed days to calculate baseline, b using LH peak to calculate baseline, c using estimated LH surge to calculate baseline

Performance of the identified methods for identifying the LH surge in complete urinary hormonal profiles of a reliable sample of menstrual cycles

Table 2 shows the results of the analysis evaluating the performance of all the different methods identified in the scientific literature for determining the onset of the LH surge in the 254 complete hormonal profiles analysed, whereby the surge day calculated by the various methods identified in the literature was compared with the reference surge day (as determined by an examination of the graphical cycle images by an expert panel).

Table 2 Summary of the identified methods developed to determine basal LH levels, to detect the onset of the LH surge in the menstrual cycle

Method category 1 based on fixed days to estimate the baseline levels of LH

Four variants are reported in the literature that use fixed days during the cycle to estimate baseline urinary LH levels. These methods provided correct surge detection in 44.5–52.8 % of cycles (dependent on exact method). The values obtained for each specific method were: 44.5 % when using cycle days 5–8; 48.4 % when using days 5–9; 48.0 % when using days 6–9; 52.8 % when using days 6–10. The performance of these methods for identifying the surge on ±1 day of the reference surge ranged between 52.0 and 63.8 %.

Method category 2 based on the day of the LH peak to estimate baseline LH levels

When the reference to determine the baseline values was the urinary LH peak, the approaches identified performed better than the fixed days method, detecting 63.4–70.5 % of the surges correctly, depending on which set of the eight variants of this category were applied.

When the reference was taken on day 3 before the LH peak, plus the previous 3 days to determine baseline values, the surge was correctly identified in 63.4 % of the cases; however, when it was day 3 before the peak plus the previous 5 days, it was identified correctly in 65.4 % of cases.

When the reference started on day 4 before the peak plus the previous 3 days, the surge was correctly identified in 70.5 % of cases, and if taking the previous 4 days instead, it correctly identified 68.5 % of the cases. When the reference was day 5 before the peak, plus the previous 3 days, it correctly identified the surge in 67 % of the cases, whereas if it took the previous 4 days instead, it correctly identified 64.6 % of the cases.

Finally, when the reference started on day 6 before the peak, plus the previous 3 days, it correctly identified the surge in 59.1 % of the cases, whereas when taking the previous 4 days it correctly identified 60.2 % of cases.

When considering the performance at determining the surge ±1 day of the reference surge, the range rose to 73.3–90.5 %.

Method category 3 based on the day of the LH surge to estimate baseline LH levels

Methods using the estimated urinary LH surge as a reference to determine the baseline LH value were observed to offer superior performance in correctly detecting the surge, finding them correctly ranged from 74.4 to 78.4 %.

When the reference to estimate baseline values started from day 1 before the estimated surge plus the previous 3 days, it correctly identified the surge in 78.4 % of the cases, whereas taking the previous 4 days instead identified 76.4 % of the surges correctly. When the reference used was day 2 before the estimated surge plus the previous 3 days, it correctly identified 74.4 % of the cases, whereas using 4 days, produced 75.2 % of correct identifications.

The performance of these methods at finding the surge within ±1 day of the reference surge ranged from 94.5 to 97.6 %.

Discussion

Prediction of ovulation can be important for women who are trying to conceive, in order for them to time intercourse to their fertile period to maximise their chances of conception. It can also have clinical importance for appropriate timing of procedures, such as intrauterine insemination, frozen embryo transfer or donor sperm insemination.

Over the years, several methods have been developed to predict ovulation. One of the most common methods used in fertility awareness is the sympto-thermal method (STM) because abundant evidence of its reliability has been presented [2225]. However, the STM requires training in use, detailed recording of symptoms of self-observation and the application of several calculation rules, all lowering general acceptance rates. With the advancement of technology, other reliable, objective and less user-dependent methods have been introduced, which rely on the analysis of the levels of reproductive hormones measured in urine [26]. The study of hormonal profiles in urine during the menstrual cycle has proved to be very valuable in understanding menstrual cycle dynamics, as well as variability in ovulation [19, 27, 28]. Likewise, urinary hormone testing has become an established application for personal monitoring of fertility awareness [9]. Both temperature and hormonal analysis using repeated measurements in menstrual cycles are currently under intensive research as part of the evolving field of digital signal tracking in gynaecological endocrinology [21].

In premenopausal women, the menstrual cycle has a recognisable general pattern, and an average duration; however, there are important variations in cycle length and dynamics, both between and within women [2933]. Likewise, there are differences in hormonal patterns, even in ovulatory cycles, which introduce a further source of difficulty into the analysis [34, 35]. This compromises the utility of all fertility awareness methods that solely rely on statistical parameters or calculation rules of the menstrual cycle (i.e. monitoring cycle days), and strengthens the importance of looking at specific events of physiological relevance leading to ovulation, such as the LH surge. It is the LH surge which always precedes ovulation, rather than the LH peak, and which is a signal of impending ovulation and peak fertility. Therefore, it is important to have a reliable algorithm to determine the onset of the LH surge in population studies. A standardised algorithm for determining the timing of the urinary LH surge in studies that have examined the hormone profiles in women is still not available.

This study examined the scientific literature, searching for studies focusing on the detection of the onset of the LH surge during the cycle, in urine samples, and found 12 relevant studies. In these papers, there were three major methodological categories designed and used to determine the onset of the surge. The key difference between these methods was the reference used for estimating the baseline values of urinary LH during the follicular phase. Important differences in the performance of these methods were identified. Methods that used the LH surge as a reference to estimate baseline values of LH clearly had the highest level of reliability in this respect, detecting on average up to 76.1 % of the surges, compared with only 64.8 % when the LH peak was used as a reference and 48.4 % when using fixed days of the cycle. The findings of this study might help to improve the analysis of the human menstrual cycle for monitoring to assist with conception and contraception, as well as for biomedical research. In addition, a common methodology would facilitate comparisons between studies.

Many studies report LH peak [29, 36]; however, as a prospective measure, peak LH has less value as it often occurs post ovulation [37]. In addition, assignment of the day of peak urinary LH is dependent on the assay used [4]. Some urinary LH assays use antibodies that recognise the beta-core fragment of LH (LH-βcf), which is a by-product of LH metabolism, and the predominant molecular form of LH in the urine [38]. Levels of LH-βcf continue to rise after the level of the physiologically relevant intact LH reaches peak concentration, sometimes achieving maximal levels up to 5 days later [4].

The comparatively poor performance of this prospectively applicable approach that uses fixed days during the menstrual cycle to determine baseline levels of LH is somewhat expected, since this approach does not take into consideration important factors, such as the variability in the length of the cycle, which mostly affects the follicular phase and exerts an influence over the hormonal dynamics [30]. Ovulation has been seen as early as day 8 of the menstrual cycle and, as the fixed days method generally includes day 8, it is clear that this method will fail under such circumstances.

The higher reliability of a mid-cycle reference (method #2) to determine baseline LH was anticipated, because data from the complete menstrual cycle have to be considered for its application, and due to its proximity with the fertile window, which is a very stable stage of the cycle, and unlike the early follicular phase, is not affected by age [39, 40]. It is also clear why the approach that uses the estimated surge (method #3) as a reference performs better than the method using the peak (method #2) because in almost 25 % of normal menstrual cycles the peak may occur postovulatory (Roos et al. submitted); differences in the LH dynamics around the time of the surge and peak in LH account for this. Some complex LH peaks have been described when researchers have used an assay that recognises LH-βcf, with a longer duration than normal, sometimes described as ‘biphasic’ or in ‘plateau’ [6, 11]; this often presents a significant delay between the surge and the peak of LH, compared with the ‘spike’ type of peaks. It is possible that when the reference is taken starting 3 or 4 days before the peak, in this type of cycle, levels of LH are already high, decreasing the accuracy of the estimation of the baseline levels. Likewise, when using a large number of days away from the peak (such as 6 or 5), in short cycles or those of normal duration, the levels estimated might be closer to those found in the fixed approach.

For fertility monitors, however, a retrospective approach with analysis of data from a complete menstrual cycle is not applicable. Therefore, learning monitors should consider the location reference points from previous cycles for higher reliability of surge detection. Current research focuses on a formal mathematical approach (signal tracking), finding distinct patterns in hormonal and temperature curves to predict change points and distinguish them from the random ‘noise’ (change-point analysis) [21].

A further way to improve the results includes other hormonal markers in urine, in addition to LH. These are estradiol and its metabolites, since a rise in the levels of estradiol triggers the rise in the levels of LH [38], and peak estrone-3-glucuronide (E3G) levels occur on the day of ovulation, as do those of follicle-stimulating hormone (FSH). However, this level of detail would add considerable complexity and additional cost to the analysis, especially by repeated measurement.

Ultrasound-observed ovulation is the gold-standard method used to determine the timing of ovulation and it would be of interest to observe how different approaches to LH surge definition compare to this reference. However, there is variability between women in the timing of the LH surge in relation to ultrasound-observed ovulation [6], thus further work could consider this relationship.

In conclusion, this study found that there are many methods for calculating the LH surge in urine reported in the literature, and their effectiveness at correctly identifying the surge varies between methods. Some of these methods are in use in current fertility monitors. Care should be taken when comparing between studies reporting urinary LH surge data, as results will be dependent on the method employed for surge identification. This study supports the recommendation for using methods that involve an initial estimation of the LH surge to establish which days should be used to determine baseline.