Research Article
BibTex RIS Cite

Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects

Year 2024, Volume: 7 Issue: 5, 960 - 970, 15.09.2024
https://doi.org/10.34248/bsengineering.1517904

Abstract

In recent times, astronomy has entered a new era with rapidly growing data sources and advanced observation techniques. The construction of powerful telescopes has enabled the collection of spectral data from millions of celestial objects. However, the increasing number and variety of data have made it challenging to categorize these celestial objects. This study employs machine learning methods to address the fundamental problem of classifying stars, galaxies, and quasars in astronomy. The dataset underwent detailed preprocessing to identify effective features for classification. KNIME Analytics Platform was used for data analysis and visualization, facilitating rapid and efficient data analysis through its drag-and-drop interface. Among the machine learning methods used in our study—Decision Trees, Random Forest, and Naive Bayes—the highest accuracy rate of 97.86% was achieved with the Random Forest model. Notably, despite its lower overall performance compared to other models, the Naive Bayes classifier exhibited superior performance in distinguishing the STAR class, which is one of the study's interesting findings. Future studies aim to enhance model accuracy by using larger and more diverse datasets and exploring different machine learning algorithms. Additionally, the impact of deep learning methods on classification performance will be investigated.

References

  • Brice MJ. 2019. Classification of stars from redshifted stellar spectra utilizing machine learning. MSc Thesis, Central Washington University, Computational Science, Washington, US, pp: 73.
  • Chen YC. 2018. Lecture 6: Density Estimation: Histogram and Kernel Density Estimator. URL= http://faculty.washington.edu/yenchic/18W_425/Lec6_hist_KDE.pdf (accessed date: May 10, 2024).
  • Clarke AO, Scaife AMM, Greenhalgh R, Griguta V. 2020. Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra. Astronomy Astrophys, 639: A84.
  • Erickson BJ, Kitamura F. 2021. Magician’s corner: 9. Performance metrics for machine learning models. Radiol Artif Intel, 3(3): e200126.
  • Fedesoriano. 2022. Stellar Classification Dataset-SDSS17. URL= https://www.kaggle.com/fedesoriano/stellar-classification-dataset-sdss17 (accessed date: May 15, 2024).
  • Fillbrunn A, Dietz C, Pfeuffer J, Rahn R, Landrum GA, Berthold MR. 2017. KNIME for reproducible cross-domain analysis of life science data. J Biotechnol, 261: 149-156.
  • Haghighi MHZ. 2023. Analyzing astronomical data with machine learning techniques. arXiv Preprint, arXiv: 2302.11573.
  • Hughes AC, Bailer-Jones CA, Jamal S. 2022. Quasar and galaxy classification using Gaia EDR3 and CatWise2020. Astronomy Astrophys, 668: A99.
  • Huichaqueo MO, Orrego RM. 2022. Automatic spectral classification of stars using machine learning: An approach based on the use of unbalanced data. Machine Learn Appl, 9(4): 01-16.
  • Kumar A, Gharat S. 2023. Star cassification: A deep learning approach for ıdentifying binary and exoplanet stars. arXiv Preprint, arXiv: 2301.13115.
  • Li J, Zhu Q, Wu Q, Zhang Z, Gong Y, He Z, Zhu F. 2021. SMOTE-NaN-DE: Addressing the noisy and borderline examples problem in imbalanced classification by natural neighbors and differential evolution. Know Based Syst, 223: 107056.
  • Mehta T, Bhuta N, Shinde S. 2022. Experimental analysis of stellar classification by using different machine learning algorithms. 2022 International Conference on Industry 4.0 Technology (I4Tech), September 23-24, Pune, India, pp: 1-8.
  • Mouchel‐Vallon C, Hodzic A. 2023. Toward emulating an explicit organic chemistry mechanism with random forest models. J Geophys Res Atmospheres, 128(10): e2022JD038227.
  • Omat D, Otey J, Al-Mousa A. 2022. Stellar objects classification using supervised machine learning techniques. International Arab Conference on Information Technology (ACIT), November 22-24, Abu Dhabi, United Arab Emirates, pp: 1-8.
  • Ramana PV. 2022. Naïve Bayes to machine learning approach for structural dynamic complications. ASPS Conf Proc, 1(4): 1283-1291.
  • Savyanavar AS, Mhala N, Sutar SH. 2023. Star-galaxy classification using machine learning algorithms and deep learning. Int J Info Technol Secur, 15(2): 87-96.
  • Thomas T, Vijayaraghavan P, Emmanuel A, Thomas S, Vijayaraghavan TP, Emmanuel S. 2020. Applications of decision trees. Machine Learn Appr Cyber Secur Analyt, 2020: 157-184.

Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects

Year 2024, Volume: 7 Issue: 5, 960 - 970, 15.09.2024
https://doi.org/10.34248/bsengineering.1517904

Abstract

In recent times, astronomy has entered a new era with rapidly growing data sources and advanced observation techniques. The construction of powerful telescopes has enabled the collection of spectral data from millions of celestial objects. However, the increasing number and variety of data have made it challenging to categorize these celestial objects. This study employs machine learning methods to address the fundamental problem of classifying stars, galaxies, and quasars in astronomy. The dataset underwent detailed preprocessing to identify effective features for classification. KNIME Analytics Platform was used for data analysis and visualization, facilitating rapid and efficient data analysis through its drag-and-drop interface. Among the machine learning methods used in our study—Decision Trees, Random Forest, and Naive Bayes—the highest accuracy rate of 97.86% was achieved with the Random Forest model. Notably, despite its lower overall performance compared to other models, the Naive Bayes classifier exhibited superior performance in distinguishing the STAR class, which is one of the study's interesting findings. Future studies aim to enhance model accuracy by using larger and more diverse datasets and exploring different machine learning algorithms. Additionally, the impact of deep learning methods on classification performance will be investigated.

References

  • Brice MJ. 2019. Classification of stars from redshifted stellar spectra utilizing machine learning. MSc Thesis, Central Washington University, Computational Science, Washington, US, pp: 73.
  • Chen YC. 2018. Lecture 6: Density Estimation: Histogram and Kernel Density Estimator. URL= http://faculty.washington.edu/yenchic/18W_425/Lec6_hist_KDE.pdf (accessed date: May 10, 2024).
  • Clarke AO, Scaife AMM, Greenhalgh R, Griguta V. 2020. Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra. Astronomy Astrophys, 639: A84.
  • Erickson BJ, Kitamura F. 2021. Magician’s corner: 9. Performance metrics for machine learning models. Radiol Artif Intel, 3(3): e200126.
  • Fedesoriano. 2022. Stellar Classification Dataset-SDSS17. URL= https://www.kaggle.com/fedesoriano/stellar-classification-dataset-sdss17 (accessed date: May 15, 2024).
  • Fillbrunn A, Dietz C, Pfeuffer J, Rahn R, Landrum GA, Berthold MR. 2017. KNIME for reproducible cross-domain analysis of life science data. J Biotechnol, 261: 149-156.
  • Haghighi MHZ. 2023. Analyzing astronomical data with machine learning techniques. arXiv Preprint, arXiv: 2302.11573.
  • Hughes AC, Bailer-Jones CA, Jamal S. 2022. Quasar and galaxy classification using Gaia EDR3 and CatWise2020. Astronomy Astrophys, 668: A99.
  • Huichaqueo MO, Orrego RM. 2022. Automatic spectral classification of stars using machine learning: An approach based on the use of unbalanced data. Machine Learn Appl, 9(4): 01-16.
  • Kumar A, Gharat S. 2023. Star cassification: A deep learning approach for ıdentifying binary and exoplanet stars. arXiv Preprint, arXiv: 2301.13115.
  • Li J, Zhu Q, Wu Q, Zhang Z, Gong Y, He Z, Zhu F. 2021. SMOTE-NaN-DE: Addressing the noisy and borderline examples problem in imbalanced classification by natural neighbors and differential evolution. Know Based Syst, 223: 107056.
  • Mehta T, Bhuta N, Shinde S. 2022. Experimental analysis of stellar classification by using different machine learning algorithms. 2022 International Conference on Industry 4.0 Technology (I4Tech), September 23-24, Pune, India, pp: 1-8.
  • Mouchel‐Vallon C, Hodzic A. 2023. Toward emulating an explicit organic chemistry mechanism with random forest models. J Geophys Res Atmospheres, 128(10): e2022JD038227.
  • Omat D, Otey J, Al-Mousa A. 2022. Stellar objects classification using supervised machine learning techniques. International Arab Conference on Information Technology (ACIT), November 22-24, Abu Dhabi, United Arab Emirates, pp: 1-8.
  • Ramana PV. 2022. Naïve Bayes to machine learning approach for structural dynamic complications. ASPS Conf Proc, 1(4): 1283-1291.
  • Savyanavar AS, Mhala N, Sutar SH. 2023. Star-galaxy classification using machine learning algorithms and deep learning. Int J Info Technol Secur, 15(2): 87-96.
  • Thomas T, Vijayaraghavan P, Emmanuel A, Thomas S, Vijayaraghavan TP, Emmanuel S. 2020. Applications of decision trees. Machine Learn Appr Cyber Secur Analyt, 2020: 157-184.
There are 17 citations in total.

Details

Primary Language English
Subjects Communications Engineering (Other)
Journal Section Research Articles
Authors

Maide Feyza Er 0000-0003-2580-1309

Turgay Tugay Bilgin 0000-0002-9245-5728

Early Pub Date September 4, 2024
Publication Date September 15, 2024
Submission Date July 18, 2024
Acceptance Date September 3, 2024
Published in Issue Year 2024 Volume: 7 Issue: 5

Cite

APA Er, M. F., & Bilgin, T. T. (2024). Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. Black Sea Journal of Engineering and Science, 7(5), 960-970. https://doi.org/10.34248/bsengineering.1517904
AMA Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. September 2024;7(5):960-970. doi:10.34248/bsengineering.1517904
Chicago Er, Maide Feyza, and Turgay Tugay Bilgin. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science 7, no. 5 (September 2024): 960-70. https://doi.org/10.34248/bsengineering.1517904.
EndNote Er MF, Bilgin TT (September 1, 2024) Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. Black Sea Journal of Engineering and Science 7 5 960–970.
IEEE M. F. Er and T. T. Bilgin, “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”, BSJ Eng. Sci., vol. 7, no. 5, pp. 960–970, 2024, doi: 10.34248/bsengineering.1517904.
ISNAD Er, Maide Feyza - Bilgin, Turgay Tugay. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science 7/5 (September 2024), 960-970. https://doi.org/10.34248/bsengineering.1517904.
JAMA Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. 2024;7:960–970.
MLA Er, Maide Feyza and Turgay Tugay Bilgin. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science, vol. 7, no. 5, 2024, pp. 960-7, doi:10.34248/bsengineering.1517904.
Vancouver Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. 2024;7(5):960-7.

                                                24890