Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects

Maide Feyza Er; Turgay Tugay Bilgin

doi:10.34248/bsengineering.1517904

Research Article

Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects

Year 2024, Volume: 7 Issue: 5, 960 - 970, 15.09.2024

Maide Feyza Er , Turgay Tugay Bilgin

https://doi.org/10.34248/bsengineering.1517904

Abstract

In recent times, astronomy has entered a new era with rapidly growing data sources and advanced observation techniques. The construction of powerful telescopes has enabled the collection of spectral data from millions of celestial objects. However, the increasing number and variety of data have made it challenging to categorize these celestial objects. This study employs machine learning methods to address the fundamental problem of classifying stars, galaxies, and quasars in astronomy. The dataset underwent detailed preprocessing to identify effective features for classification. KNIME Analytics Platform was used for data analysis and visualization, facilitating rapid and efficient data analysis through its drag-and-drop interface. Among the machine learning methods used in our study—Decision Trees, Random Forest, and Naive Bayes—the highest accuracy rate of 97.86% was achieved with the Random Forest model. Notably, despite its lower overall performance compared to other models, the Naive Bayes classifier exhibited superior performance in distinguishing the STAR class, which is one of the study's interesting findings. Future studies aim to enhance model accuracy by using larger and more diverse datasets and exploring different machine learning algorithms. Additionally, the impact of deep learning methods on classification performance will be investigated.

Keywords

Machine learning, Classification, Decision tree, Naive Bayes, Random forest

References

Brice MJ. 2019. Classification of stars from redshifted stellar spectra utilizing machine learning. MSc Thesis, Central Washington University, Computational Science, Washington, US, pp: 73.
Chen YC. 2018. Lecture 6: Density Estimation: Histogram and Kernel Density Estimator. URL= http://faculty.washington.edu/yenchic/18W_425/Lec6_hist_KDE.pdf (accessed date: May 10, 2024).
Clarke AO, Scaife AMM, Greenhalgh R, Griguta V. 2020. Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra. Astronomy Astrophys, 639: A84.
Erickson BJ, Kitamura F. 2021. Magician’s corner: 9. Performance metrics for machine learning models. Radiol Artif Intel, 3(3): e200126.
Fedesoriano. 2022. Stellar Classification Dataset-SDSS17. URL= https://www.kaggle.com/fedesoriano/stellar-classification-dataset-sdss17 (accessed date: May 15, 2024).
Fillbrunn A, Dietz C, Pfeuffer J, Rahn R, Landrum GA, Berthold MR. 2017. KNIME for reproducible cross-domain analysis of life science data. J Biotechnol, 261: 149-156.
Haghighi MHZ. 2023. Analyzing astronomical data with machine learning techniques. arXiv Preprint, arXiv: 2302.11573.
Hughes AC, Bailer-Jones CA, Jamal S. 2022. Quasar and galaxy classification using Gaia EDR3 and CatWise2020. Astronomy Astrophys, 668: A99.
Huichaqueo MO, Orrego RM. 2022. Automatic spectral classification of stars using machine learning: An approach based on the use of unbalanced data. Machine Learn Appl, 9(4): 01-16.
Kumar A, Gharat S. 2023. Star cassification: A deep learning approach for ıdentifying binary and exoplanet stars. arXiv Preprint, arXiv: 2301.13115.
Li J, Zhu Q, Wu Q, Zhang Z, Gong Y, He Z, Zhu F. 2021. SMOTE-NaN-DE: Addressing the noisy and borderline examples problem in imbalanced classification by natural neighbors and differential evolution. Know Based Syst, 223: 107056.
Mehta T, Bhuta N, Shinde S. 2022. Experimental analysis of stellar classification by using different machine learning algorithms. 2022 International Conference on Industry 4.0 Technology (I4Tech), September 23-24, Pune, India, pp: 1-8.
Mouchel‐Vallon C, Hodzic A. 2023. Toward emulating an explicit organic chemistry mechanism with random forest models. J Geophys Res Atmospheres, 128(10): e2022JD038227.
Omat D, Otey J, Al-Mousa A. 2022. Stellar objects classification using supervised machine learning techniques. International Arab Conference on Information Technology (ACIT), November 22-24, Abu Dhabi, United Arab Emirates, pp: 1-8.
Ramana PV. 2022. Naïve Bayes to machine learning approach for structural dynamic complications. ASPS Conf Proc, 1(4): 1283-1291.
Savyanavar AS, Mhala N, Sutar SH. 2023. Star-galaxy classification using machine learning algorithms and deep learning. Int J Info Technol Secur, 15(2): 87-96.
Thomas T, Vijayaraghavan P, Emmanuel A, Thomas S, Vijayaraghavan TP, Emmanuel S. 2020. Applications of decision trees. Machine Learn Appr Cyber Secur Analyt, 2020: 157-184.

Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects

Year 2024, Volume: 7 Issue: 5, 960 - 970, 15.09.2024

Maide Feyza Er , Turgay Tugay Bilgin

https://doi.org/10.34248/bsengineering.1517904

Abstract

Keywords

Machine learning, Classification, Decision tree, Naive Bayes, Random forest

References

Brice MJ. 2019. Classification of stars from redshifted stellar spectra utilizing machine learning. MSc Thesis, Central Washington University, Computational Science, Washington, US, pp: 73.
Chen YC. 2018. Lecture 6: Density Estimation: Histogram and Kernel Density Estimator. URL= http://faculty.washington.edu/yenchic/18W_425/Lec6_hist_KDE.pdf (accessed date: May 10, 2024).
Clarke AO, Scaife AMM, Greenhalgh R, Griguta V. 2020. Identifying galaxies, quasars, and stars with machine learning: A new catalogue of classifications for 111 million SDSS sources without spectra. Astronomy Astrophys, 639: A84.
Erickson BJ, Kitamura F. 2021. Magician’s corner: 9. Performance metrics for machine learning models. Radiol Artif Intel, 3(3): e200126.
Fedesoriano. 2022. Stellar Classification Dataset-SDSS17. URL= https://www.kaggle.com/fedesoriano/stellar-classification-dataset-sdss17 (accessed date: May 15, 2024).
Fillbrunn A, Dietz C, Pfeuffer J, Rahn R, Landrum GA, Berthold MR. 2017. KNIME for reproducible cross-domain analysis of life science data. J Biotechnol, 261: 149-156.
Haghighi MHZ. 2023. Analyzing astronomical data with machine learning techniques. arXiv Preprint, arXiv: 2302.11573.
Hughes AC, Bailer-Jones CA, Jamal S. 2022. Quasar and galaxy classification using Gaia EDR3 and CatWise2020. Astronomy Astrophys, 668: A99.
Huichaqueo MO, Orrego RM. 2022. Automatic spectral classification of stars using machine learning: An approach based on the use of unbalanced data. Machine Learn Appl, 9(4): 01-16.
Kumar A, Gharat S. 2023. Star cassification: A deep learning approach for ıdentifying binary and exoplanet stars. arXiv Preprint, arXiv: 2301.13115.
Li J, Zhu Q, Wu Q, Zhang Z, Gong Y, He Z, Zhu F. 2021. SMOTE-NaN-DE: Addressing the noisy and borderline examples problem in imbalanced classification by natural neighbors and differential evolution. Know Based Syst, 223: 107056.
Mehta T, Bhuta N, Shinde S. 2022. Experimental analysis of stellar classification by using different machine learning algorithms. 2022 International Conference on Industry 4.0 Technology (I4Tech), September 23-24, Pune, India, pp: 1-8.
Mouchel‐Vallon C, Hodzic A. 2023. Toward emulating an explicit organic chemistry mechanism with random forest models. J Geophys Res Atmospheres, 128(10): e2022JD038227.
Omat D, Otey J, Al-Mousa A. 2022. Stellar objects classification using supervised machine learning techniques. International Arab Conference on Information Technology (ACIT), November 22-24, Abu Dhabi, United Arab Emirates, pp: 1-8.
Ramana PV. 2022. Naïve Bayes to machine learning approach for structural dynamic complications. ASPS Conf Proc, 1(4): 1283-1291.
Savyanavar AS, Mhala N, Sutar SH. 2023. Star-galaxy classification using machine learning algorithms and deep learning. Int J Info Technol Secur, 15(2): 87-96.
Thomas T, Vijayaraghavan P, Emmanuel A, Thomas S, Vijayaraghavan TP, Emmanuel S. 2020. Applications of decision trees. Machine Learn Appr Cyber Secur Analyt, 2020: 157-184.

There are 17 citations in total.

Details

Primary Language	English
Subjects	Communications Engineering (Other)
Journal Section	Research Articles
Authors	Maide Feyza Er 0000-0003-2580-1309 Turgay Tugay Bilgin 0000-0002-9245-5728
Early Pub Date	September 4, 2024
Publication Date	September 15, 2024
Submission Date	July 18, 2024
Acceptance Date	September 3, 2024
Published in Issue	Year 2024 Volume: 7 Issue: 5

Cite

APA	Er, M. F., & Bilgin, T. T. (2024). Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. Black Sea Journal of Engineering and Science, 7(5), 960-970. https://doi.org/10.34248/bsengineering.1517904
AMA	Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. September 2024;7(5):960-970. doi:10.34248/bsengineering.1517904
Chicago	Er, Maide Feyza, and Turgay Tugay Bilgin. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science 7, no. 5 (September 2024): 960-70. https://doi.org/10.34248/bsengineering.1517904.
EndNote	Er MF, Bilgin TT (September 1, 2024) Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. Black Sea Journal of Engineering and Science 7 5 960–970.
IEEE	M. F. Er and T. T. Bilgin, “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”, BSJ Eng. Sci., vol. 7, no. 5, pp. 960–970, 2024, doi: 10.34248/bsengineering.1517904.
ISNAD	Er, Maide Feyza - Bilgin, Turgay Tugay. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science 7/5 (September 2024), 960-970. https://doi.org/10.34248/bsengineering.1517904.
JAMA	Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. 2024;7:960–970.
MLA	Er, Maide Feyza and Turgay Tugay Bilgin. “Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects”. Black Sea Journal of Engineering and Science, vol. 7, no. 5, 2024, pp. 960-7, doi:10.34248/bsengineering.1517904.
Vancouver	Er MF, Bilgin TT. Performance Comparison of Supervised Machine Learning Methods in Classifying Celestial Objects. BSJ Eng. Sci. 2024;7(5):960-7.

Download Cover Image

Article Files

Full Text

24890