Skip to main content

CoSSC—Comparative Study of Stellar Classification Using DR-16 and DR-17 Datasets

  • Conference paper
  • First Online:
Third Congress on Intelligent Systems (CIS 2022)

Abstract

The domain of astrophysics involves the documentation of celestial bodies with the means of procuring supporting data which have evolved with technological advancements; the present procedure involving spectral data has received a healthy response with two recent prominent datasets. The collaboration of Computer Science and Astrophysics has been a blessing in disguise which has helped advance the research stagnant at the developing process. Observing an unexplored plethora of opportunities, the team has undertaken the task of inspecting the accuracy provided by the collected SDSS datasets to the sought-after problem of classification. The fundamentals of Astronomy within Astrophysics lie in the study of heavenly bodies under observation, and the query regarding the birth of Milky Ways has become prominent under curious scrutiny. The research implements comparative study between DR-16 and DR-17 datasets through usage of prominent machine learning algorithms—logistic regression, support vector machine, multilayer perceptron, and decision trees—to arrive at respective results. The mentioned models have been opted as they have a history of arriving at optimal solutions pertaining to the issue of classification. The efficiency of the process is promoted through data filtration. The filtered data is partitioned into training and testing sets, which is finally used to train the algorithm models, and the other half is utilized in the process of predicting the outcomes. Finally, the result with highest accuracy is declared as the most suitable algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Xing F, Guo P (2004) Classification of stellar spectral data using SVM. In: International symposium on neural networks. Springer, Berlin, Heidelberg, pp 616–621

    Google Scholar 

  2. York DG, Adelman J, Anderson Jr JE, Anderson SF, Annis J, Bahcall NA, Bakken JA, Barkhouser R, Bastian S, Berman E, Boroski WN (2000) The Sloan Digital Sky Survey: technical summary. Astron J 120(3):1579

    Google Scholar 

  3. Beers TC (2020) The 16th data release of the Sloan Digital Sky Surveys. Astrophys J Suppl Ser 249:3 (21pp)

    Google Scholar 

  4. SDSS-IV Data Release 17 (DR17) [Online]. Available at: https://risweb.st-andrews.ac.uk/portal/en/datasets/sdssiv-data-release-17-dr17(6cc5bcc9-4dd8-4a37-8c9e-09d23064ad84).html

  5. Kim J (2018) Machine learning approaches to star-galaxy classification. PhD diss., University of Illinois at Urbana-Champaign

    Google Scholar 

  6. Iyer GR, Vastare KC (2017) Deep learning for star-galaxy classification

    Google Scholar 

  7. Caniglia G, Krokos M, Becciani U, Gheller C, Nichol R, Comparato M, Costa A et al (2009) Visual discovery in large-scale astrophysical datasets; experiences using the Sloan Digital Sky Survey. In: 2009 second international conference in visualisation. IEEE, pp 10–15

    Google Scholar 

  8. Bai Y, Liu J, Wang S, Yang F (2018) Machine learning applied to Star–Galaxy–QSO classification and stellar effective temperature regression. Astron J 157(1):9

    Article  Google Scholar 

  9. Brice M, Andonie R (2019) Classification of stars using stellar spectra collected by the Sloan Digital Sky Survey. In: 2019 international joint conference on neural networks (IJCNN). IEEE, pp 1–8

    Google Scholar 

  10. https://pweb.cfa.harvard.edu/research/sloan-digital-sky-survey-sdss

  11. https://www.sdss.org/dr16/algorithms/qso_catalog/

  12. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M et al (2011) Scikit-learn: machine learning in Python. J Mach Learn Res 12:2825–2830

    MathSciNet  MATH  Google Scholar 

  13. Breiman L (2001) Random forests. Mach Learn 45(1):5–32

    Article  MATH  Google Scholar 

  14. Pal M (2005) Random forest classifier for remote sensing classification. Int J Remote Sens 26(1):217–222

    Article  Google Scholar 

  15. Masetic Z, Subasi A (2016) Congestive heart failure detection using random forest classifier. Comput Methods Programs Biomed 130:54–64

    Article  Google Scholar 

  16. Jedari E, Wu Z, Rashidzadeh R, Saif M (2015) Wi-Fi based indoor location positioning employing random forest classifier. In: 2015 international conference on indoor positioning and indoor navigation (IPIN). IEEE, pp 1–5

    Google Scholar 

  17. Wan S, Liang Y, Zhang Y, Guizani M (2018) Deep multi-layer perceptron classifier for behavior analysis to estimate Parkinson’s disease severity using smartphones. IEEE Access 6:36825–36833

    Google Scholar 

  18. Zhao C, Gao Y, He J, Lian J (2012) Recognition of driving postures by multiwavelet transform and multilayer perceptron classifier. Eng Appl Artif Intell 25(8):1677–1686

    Article  Google Scholar 

  19. Vashisth S, Dhall I, Saraswat S (2020) Chronic kidney disease (CKD) diagnosis using multi-layer perceptron classifier. In: 2020 10th international conference on cloud computing, data science & engineering (Confluence). IEEE, pp 346–350

    Google Scholar 

  20. Li B, Friedman J, Olshen R, Stone C (1984) Classification and regression trees (CART). Biometrics 40(3):358–361

    Google Scholar 

  21. Yoo SH, Geng H, Chiu TL, Yu SK, Cho DC, Heo J, Choi MS et al (2020) Deep learning-based decision-tree classifier for COVID-19 diagnosis from chest X-ray imaging. Front Med 7:427

    Google Scholar 

  22. Chern CC, Lei WU, Huang KL, Chen SY (2021) A decision tree classifier for credit assessment problems in big data environments. IseB 19(1):363–386

    Article  Google Scholar 

  23. Chien C, Pottie GJ (2012) A universal hybrid decision tree classifier design for human activity classification. In: 2012 Annual international conference of the IEEE Engineering in Medicine and Biology Society. IEEE, pp 1065–1068

    Google Scholar 

  24. Wright RE (1995) Logistic regression

    Google Scholar 

  25. Shipe ME, Deppen SA, Farjah F, Grogan EL (2019) Developing prediction models for clinical use using logistic regression: an overview. J Thoracic Dis 11(Suppl 4)

    Google Scholar 

  26. Lombardo L, Mai PM (2018) Presenting logistic regression-based landslide susceptibility results. Eng Geol 244:14–24

    Article  Google Scholar 

  27. Shen L, Tan EC (2005) Dimension reduction-based penalized logistic regression for cancer classification using microarray data. IEEE/ACM Trans Comput Biol Bioinf 2(2):166–175

    Article  Google Scholar 

  28. Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press

    Google Scholar 

  29. Zendehboudi A, Baseer MA, Saidur R (2018) Application of support vector machine models for forecasting solar and wind energy resources: a review. J Clean Prod 199:272–285

    Article  Google Scholar 

  30. Sethy PK, Barpanda NK, Rath AK, Behera SK (2020) Deep feature based rice leaf disease identification using support vector machine. Comput Electron Agric 175:105527

    Article  Google Scholar 

  31. Singh V, Poonia RC, Kumar S, Dass P, Agarwal P, Bhatnagar V, Raja L (2020) Prediction of COVID-19 corona virus pandemic based on time series data using Support Vector Machine. J Discrete Math Sci Cryptogr 23(8):1583–1597

    Article  MATH  Google Scholar 

  32. Turner CR, Fuggetta A, Lavazza L, Wolf AL (2019) A conceptual basis for feature engineering. J Syst Softw 49(1):3–15

    Article  Google Scholar 

  33. Li J, Cheng K, Wang S, Morstatter F, Trevino RP, Tang J, Liu H (2017) Feature selection: a data perspective. ACM Comput Surv (CSUR) 50(6):1–45

    Article  Google Scholar 

  34. Bai Y, Chen M, Zhou P, Zhao T, Lee J, Kakade S, Wang H, Xiong C (2021) How important is the train-validation split in meta-learning? In: International conference on machine learning. PMLR, pp 543–553

    Google Scholar 

  35. Scogin S, Karreth J, Beger A, Williams R (2019) BayesPostEst: an r package to generate postestimation quantities for Bayesian MCMC estimation. J Open Source Softw 4(42):1722

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Bhuvaneshwari .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bhuvaneshwari, R., Karthika Devi, M.S., Vishal, R., Sarvesh, E., Sanjeevaditya, T.M. (2023). CoSSC—Comparative Study of Stellar Classification Using DR-16 and DR-17 Datasets. In: Kumar, S., Sharma, H., Balachandran, K., Kim, J.H., Bansal, J.C. (eds) Third Congress on Intelligent Systems. CIS 2022. Lecture Notes in Networks and Systems, vol 608. Springer, Singapore. https://doi.org/10.1007/978-981-19-9225-4_45

Download citation

Publish with us

Policies and ethics