Introduction

Polycystic ovary syndrome affects 8 to 13% of women of reproductive age worldwide, making it the most common endocrine problem. This condition can cause menstrual disorders, lack of ovulation, obesity, acne, hirsutism, hair loss, and baldness. Long-term complications include endometrial cancer, infertility, insulin resistance, type 2 diabetes, high blood pressure, heart disease, depression, and stress. Short-term consequences and complications may also arise [1]. Early diagnosis of the disease is very important and can reduce the duration of the disease and the mortality rate. Studies show that in many diseases, early diagnosis is difficult for health care providers. On the other hand, patients do not have the information related to self-management and do not have the necessary knowledge to obtain this information. Social withdrawal may become more prevalent as physical symptoms like acne, hair loss, and depression manifest [2,3,4]. Following diagnosis, implementing lifestyle modifications to address symptoms like elevated cholesterol and insulin resistance is recognized as an innovative therapeutic approach. Given the prevalent overweight status among women with this syndrome, the significance of adhering to a nutritious diet and engaging in physical activity has been underscored. Consequently, the imperative of devising strategies to promote adherence to healthy dietary habits and facilitate weight loss among affected individuals is deemed essential and inevitable [1].

The use of information technology greatly facilitates the prevention, diagnosis and treatment of chronic diseases and increases their accuracy. Among these technologies, it can be mentioned decision support systems, mobile-based applications, virtual reality, augmented reality, and intelligent decision-making systems. These technologies largely solve the challenge of accessing data and evidence-based information for both patients and medical providers [5,6,7,8,9,10].

In the healthcare domain, recommender systems represent a practical technology enabling self-care through tailored recommendations. Ultrasound imaging can support early disease detection, alleviating healthcare provider workload and expediting diagnosis. This strategy not only conserves resources and reduces expenses but also leverages mobile phone platforms to enhance awareness, disseminate evidence-based knowledge, and foster beneficial shifts in individual behaviors and habits. By harnessing artificial intelligence, a recommender system can enhance dietary practices among patients with PCOS, leveraging the widespread adoption of smartphones to improve access to medical professionals and reliable information [2, 4].

The study conducted by Jan et al. in India in 2022 analyzed six AI models for diagnosing PCOS. They compared these models based on the number of ultrasound images, segmentation, and classification methods. The evaluation highlighted that the Bayesian classifier achieved the highest accuracy of 93.93%. This study underscores the significant potential of AI in diagnosing PCOS and recommends further research to implement this technology effectively [11].

Boyle et al. conducted a study in Australia in 2018 to assess the need for assistance among individuals with PCOS and evaluate mobile health applications in this area. The results show that 98% of participants owned smartphones, 72% had previously used an application for self-care, and 91% expressed willingness to use a PCOS-specific app if available for managing this syndrome. Accurate, evidence-based information was deemed essential in this study, and all the assessed applications met the required quality standards [12].

The research conducted by Portugal et al. in Canada in 2017 focused on the use of machine learning techniques in recommender systems. The study aimed to identify associated issues and assist researchers in implementing these systems more effectively. The findings of the study highlighted various machine learning techniques and their applications, as well as primary and alternative performance criteria [13].

In 2019, Abhari et al. conducted a study in Iran to assess the characteristics of nutritional recommender systems. The study revealed that if these systems are properly designed, implemented, and evaluated, they can serve as effective tools to improve nutrition and promote a healthy lifestyle [14].

The study conducted in India in 2021 by Kaur and colleagues aimed to develop a method for classifying food images to track patients’ meals and provide guidance to nutritionists on recommended tactics and image classification. The research focused on utilizing deep learning approaches, particularly convolutional neural networks (CNNs), to classify Indian food images accurately [15].

After conducting research, we found no systematic review that assesses nutritional recommendations for individuals with PCOS. Given the significance of utilizing companion health and AI in managing chronic conditions like PCOS, this study aims to explore the effects and uses of AI-based systems for PCOS.

Motivation

The advancement of AI in healthcare has made it necessary to use AI-powered recommender systems for predicting, diagnosing, treating, and managing chronic ailments like PCOS. There are several reasons why this is important. Ultrasound images are necessary to diagnose PCOS. However, inaccuracies in counting follicles, high diagnostic test costs in developing countries, time-consuming tests, doctor workload, and diagnostic errors can lead to inaccurate diagnoses. AI techniques can automatically diagnose diseases using ultrasound images, overcoming challenges [16,17,18]. Additionally, since this disease is intricate and has no definitive treatment, the current approach involves a combination of medication and lifestyle changes for disease management. Hence, utilizing recommender systems or self-care systems that prioritize healthy nutrition could be beneficial in mitigating symptoms and lessening the likelihood of associated mental health issues [19, 20]. Besides disease prediction and probability estimation, recommender systems can detect suspicious cases based on AI and take action to prevent disease occurrence or early detection [21].

Contribution

The article aims to explore the use of artificial recommender systems in polycystic ovary syndrome research. We also examine the challenges and limitations of using these systems and algorithms. This article attracts researchers to conduct studies in the field of recommender systems. The sections of this article are as follows:

  • Review of studies from the perspective of publication.

  • Review studies of characteristics.

  • Examining the limitations and challenges of designing recommender systems.

Materials and methods

Study design

The current systematic study was designed and implemented based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement.

Data sources

In this study, we were six researchers. The first one determined the search strategy and performed the search in PubMed, Scopus, and Web of Science databases from inception until 6 June 2023. To conduct a search for relevant articles, we utilized a combination of keywords from the article abstracts, including “polycystic ovary syndrome”, “recommender system”, “application program”, “artificial intelligence” and “nutritional program”. We also incorporated Medical Subject Headings (Mesh) into our search strategy. For the complete search strategy, please refer to the supplementary file provided (Table S1-S3).

Selection criteria

One of the researchers determined the inclusion and exclusion criteria based on similar studies, and finally criteria were approved by the supervisor’s opinion. Inclusion and exclusion criteria are as follows:

Inclusion criteria:

  1. 1)

    Studies about recommender systems

  2. 2)

    Articles related to PCOS

  3. 3)

    Articled related to diet management in PCOS

  4. 4)

    Studies that developed an AI system or application

Exclusion criteria:

  1. 1)

    Review articles, meta-analysis, conference abstracts, letters to the editor, book chapter.

  2. 2)

    Articles that are unrelated to the goals of the current research.

  3. 3)

    Articles whose full text is written in non-English language.

  4. 4)

    Articles whose full text is not available for data extraction.

Study selection

In this stage, one of the researchers entered all the articles retrieved from the three databases (PubMed, Web of Science, and Scopus) into the Endnote X9 (Thomson Reuters, Toronto, Ontario, Canada) software. Another researcher removed duplicates. Two of researchers separately, checked the title and abstract of the articles. The items that did not match the inclusion and exclusion criteria were excluded from the study. Finally, by studying and examining the full text of the remaining articles, he selected the articles related to the topic as the final articles. In cases where the two researchers had differing opinions, the supervisor provided the final decision.

Data extraction

A researcher created an Excel form with the guidance of our supervisor. The research team reviewed articles and extracted necessary data elements. The Excel form included data such as the number of articles, publication year and country.

Quality assessment

Two independent reviewers assessed the quality of studies using the Newcastle-Ottawa quality assessment criteria proposed by Malhotra [17], Wen et al. [16], and Kitchenham et al. [18]. The quality assessment criteria consist of eleven questions: Q1) Are the aims of the study clearly defined?; Q2) Are all study questions answered?; Q3) Are the variables used in the study clearly stated?; Q4) Are AI techniques, such as machine learning, clearly defined?; Q5) Is the data set size appropriate?; Q6) Is the data collection method clearly stated?; Q7) Is the study methodology repeatable?; Q8) Are the results and findings clearly presented?; Q9) Are the performance measures used to assess the model(s) clearly stated?; Q10) Are the limitations of the study stated?; Q11) Does the research have value for the academic or industry community? The questions were ranked based on three values: “Yes = 2”, “Partial = 1”, or “No = 0”. Each study could obtain a maximum score of 22 and a minimum score of 0. Criteria used to rank the quality assessment of each study include: i) ≤ 49% = Low quality; ii) 50% and 69% = Moderate quality; iii) above 70% = High quality.

Results

Study selection

Figure 1 shows the process of searching and selecting articles based on the PRISMA flowchart. We found a total of 15,064 articles by searching in PubMed, Web of Science, and Scopus databases. After removing duplicates (n = 6537), We took three steps: (1) screening the article titles, (2) reviewing the article abstracts, and (3) reviewing the full text of the articles and extracting the data by the second group. Based on the predetermined criteria, we eliminated 8,408 studies during the one and two-stage process. In the third stage, from 119 articles unrelated studies (n = 86), articles with unavailable full text (n = 8), review studies (n = 4), and book chapters (n = 1) were excluded. Finally, 20 articles entered the third stage, i.e., a review of the full text of the articles. The research team extracted required data elements such as publication year, country, journal/conference, purpose, study design, sample size, sample age range, results, tools, challenges/limitations, Relevance to the study and system target. We recorded the extracted data in an Excel (Table 1).

Fig. 1
figure 1

PRISMA flow diagram indicating results of identification and screening process for included and excluded papers

Table 1 Summary of included contributions (n=20)

Publication analysis

Distribution of studies by year

The studies were conducted from 1997 to 2023. Figure 2 shows the results. 2021 (n = 5) has the most frequency. The second frequency was related to 2018 and 2022. (n = 3).

Fig. 2
figure 2

Distribution of studies by country and year

Distribution of studies by country

The reviewed studies were conducted in eight different countries. The frequency of these studies in each country is depicted in Figs. 2 and 3. India had the highest number of studies (n = 9), followed by Indonesia (n = 3), the United States of America, and China (each with 2), ranking second and third, respectively.

Fig. 3
figure 3

Distribution of studies by country

Distribution of articles based on journal/conference name, publisher and impact factor

The articles were appeared in 11 different journals and seven conferences. The journal “Frontiers in Endocrinology” had the highest number (n = 2), while all the other journals had only published 1 article. All conferences, except the “International Conference on Data and Information Science,” presented a paper on the topic. Tables 2 and 3 display the distribution of articles in this field.

Table 2 Distribution of articles based on journal/conference name, publisher and impact factor
Table 3 Distribution of articles based on conference name

Study specifications

Frequency of studies based on AI/application

Based on the survey, most (95%) of the studies focused on models and systems utilizing AI technology [2, 16, 21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37], while only one study resulted in the creation of an AI-based application [18].

Frequency of studies based on the type of system application: When it comes to systems and applications designed for various purposes, they can be classified into three types: prediction, diagnosis, and management. Among all the conducted studies, 60% focused on diagnosis [2, 16, 24,25,26, 29, 30, 32, 33, 35,36,37], 30% on prediction [21,22,23, 27, 31, 34], and none on management. Additionally, 10% of studies focused on both diagnosis and prediction [18, 28]. However, none of the studies examined the role of nutrition in managing PCOS.

Specifications of performance metrics for model evaluation

Table 4 displays the metrics utilized in the articles. MAE [23], RMSE [23, 33], and RRSE [23] calculate three types of errors in implemented models, so the lowest value is considered for an ideal model. MAE in mathematics is the arithmetic equivalent of absolute errors. This criterion only measures the magnitude of the error and does not give a significant indication of the direction of the error. These three criteria have been used in a study to evaluate the model’s performance [23, 33].

The Kappa statistic is a tool that evaluates the effectiveness of a model’s reliability and ensures an accurate representation of changes in collected data. It can range from − 1 to + 1. Despite being a common statistic, there is no consensus on measures of it in health studies. It has been utilized solely in one study [38].

Most studies commonly used Sensitivity [2, 18, 28,29,30, 32,33,34,35,36,37], Specificity [2, 28], and Accuracy [2, 18, 26, 28,29,30, 32,33,34,35,36,37] as criteria. These metrics measures by True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN). F1 score or 𝐹 − 𝑀𝑒𝑎𝑠𝑢𝑟e as a harmonic average of accuracy and recall. Where there is a balance between accuracy and recall, it is a better metric [2, 18, 26, 28,29,30, 32,33,34,35,36,37].

Table 4 Performance metrics for model evaluation

Specifications of decision support systems for PCOS prediction

As presented in Table 1, Eight studies [18, 21,22,23, 27, 28, 31, 34] designed a model to predict disease. These studies used Decision trees [22, 31], Topology-preserving forward network [21], multi-layer perceptron [21], NFRS [23], Artificial neural network [23], Apriori algorithm [27], NB classifier method [28], LR [18, 28], KNN [18, 28, 31], CART [28], RF Classifier [18, 28], Gaussian Naive Bayes [18], Fuzzy [31], CNN [34], GCN techniques [34] and SVM [18, 28, 31].

Specifications of decision support systems for PCOS diagnosis

Automated detection models based on database: Eight studies [2, 18, 28,29,30, 32, 36, 37] have implemented automated detection models based on a database to diagnose or screen patients automatically. These studies aimed to diagnose diseases automatically by creating an AI-based model using readily available data or data from those who seek treatment at health centers. The studies all followed a similar methodology. They first collected data from healthy and sick individuals. After that, they performed pre-processing to identify parameters and characteristics. They designed the model using selected techniques and evaluated its performance using model evaluation metrics. The studies utilized various methods such as LR [2, 28,29,30], Bayesian classifier [2], DT [29, 37], SVM [18, 28,29,30, 37], CNN [32], KNN [18, 28], quadratic discriminant classifier [18, 29], RF [18, 28,29,30], CART [28, 30], Gaussian naive Bayes [18], and K-means clustering [36] to develop the automated diagnosis model.

Classification models based on images: Four studies [24,25,26, 35] have developed a diagnostic model for this disease using ultrasound image classification. Ultrasound images were used to train and test the model., and different methods such as CNN [24,25,26], and SVM [26] were used in these studies.

Follicle segmentation models: Two studies [16, 33] created a model to segment follicles in ultrasound images for automatic disease diagnosis. The model specifically diagnoses through follicle segmentation, reducing the time needed for follicle counting. The process began with the publication of images, followed by image processing to segment the follicles. After extracting features, classifiers were used to design the model. Techniques used during the pre-processing stage included histogram equalization, contrast enhancement, and the Wiener filter for noise reduction of the images. The segmentation stage utilized the Fuzzy logicis, Hybrid Intelligent, Water Drop (IWD), KNN, SVM [33] and K-means clustering [16].

Limitations and challenges mentioned in the studies

As presented in Tables 1 and 10 studies pointed out the limitations and refinements. Accordingly, the small volume of sample size and features in 4 studies, the need to conduct more studies in 3 studies, the increase in the time of automatic diagnosis by the system by using more data in one study [26], the unwillingness of patients to disclose reports and clinical data in one [19] and the need to improve accuracy using other classifiers is mentioned in one study [2].

Quality assessment of included studies

The quality assessment of the included studies is detailed in the Supplementary file (Table S4), with 17 studies rated as high quality and three as moderate qualities.

Discussion

The purpose of this study was to conduct a thorough review of recommendation systems for women with PCOS. Specifically, we focused on models or applications that utilized artificial intelligence. We collected information from various sources such as publication year, country, journal or conference, sample size, age of participants, limitations and challenges, and results. During systematic review, we found five studies that shared a similar approach [2, 11, 12, 14, 17, 19]. . We found reasons for using mobile or AI-based recommender systems in PCOS disease management. We will now delve into the study’s findings and other studies.

Abhari et al. investigated nutritional recommendation systems without considering a specific disease. In this study, we reviewed the proposed recommender systems for polycystic ovary syndrome with its various applications [14].

Based on the study results, obesity in people with PCOS, with the escalation of symptoms, increases the cost of treatment and reduces it, especially in infertility. The expenditure of lifestyle modification with the help of health and weight loss is lower than drug therapy. Modifying lifestyle and nutrition using mobile phones is considered a low-cost intervention with a lower percentage of invasion [20]. The results of a 2018 study by Jacqueline A. Boyle and colleagues in Australia showed that a quality disease management application met the needs of patients; however, none of the applications reviewed had quality [12].

As mentioned, early diagnosis of the disease in the early stages is associated with risk reduction of disease consequences. Therefore, we may need recommender systems to reduce risk reduction and time of the diagnosis and increase accuracy [32]. One study by Naila Jan and colleagues in 2023 investigated AI techniques for PCOS diagnosis. The results of this study show that early diagnosis of this disease is difficult despite different symptoms in people, so automatic detection systems can be used as an accurate solution in this field [17]. As with our results, the existence of a limited amount of data is considered as one of the obstacles to the implementation of this type of study [17, 27].

Two studies conducted by Naila Jan and colleagues in India in 2022 reported a PCOS rate of 3.7 to 22.5%, which was higher in urban than rural women [11, 17]. Based on the review, 40.9% of studies have been conducted in India, which justifies the high rate of PCOS. Also, unhealthy lifestyles, including unhealthy eating, can be a reason for most urban women to do this.

Among the models designed to predict PCOS, the best accuracy belongs to a fuzzy logic-based model with an accuracy of 98.2% [31]. In the field of PCOS diagnosis, a hybrid model based on ANN, CNN and InceptionV3 has the best performance among the designed models with an accuracy of 98.12 [35]. RF and SVM were two widely used algorithms with acceptable performance, but the performance of the CNN-based model with 97% accuracy is better than these two algorithms [32]. Based on the study by Naila Jan and colleagues, the best performance belonged to a CNN-based system with an average of 76.36% and a micro-average f1-score of 100%. KNN, ANN, and Fuzzy logic with an accuracy of 97%, 97.5%, and 97.30 were the best classification techniques among the reviewed articles [11, 17].

Another study in 2018 by Jue Xie and colleagues in Australia aimed to investigate the AskPCOS application and the steps involved in creating it. The results showed that the mentioned program is one of the best evidence-based user programs to manage PCOS disease. Due to the support of 5 common languages of the world, it can eliminate the inequality of lack of access to information in developing countries. According to the evaluation, 80% of people were satisfied. The application’s usefulness was 70%. Jacqueline A. Boyle’s study showed that evidence-based application programs positively affect the patient’s awareness [19].

Study limitations

The study has several limitations. (1) only two studies addressed the aspect of concomitant health management in PCOS. (2) there is a lack of access to several paid articles that can provide better results by reviewing them. 3, the case was a similar study in other developing countries such as Iran, which may cause problems in the results of this study in this country. Therefore, we recommend carrying out more specialized studies about recommender systems. We should use AI to modify lifestyles to nutritional patterns, such as providing a dedicated budget, creating motivation, and creating a culture among researchers.

Conclusion

We conducted a systematic review to explore the use of AI and companion health systems in managing polycystic ovary syndrome, with a focus on nutrition. Although AI has primarily been used for disease diagnosis, the positive impact of AI and companion health systems in providing nutrition-based treatment solutions is significant. Therefore, we recommend that countries, particularly those with a high number of affected individuals, prioritize policies that encourage further studies to evaluate the effectiveness of recommendation systems on the nutrition of people with polycystic ovary syndrome. We can work on the quality of life for individuals affected by this condition.