Internet users frequently prefer digital journalism to acquire information. However, the content produced by malicious news sources leads to various issues for users. One of these issues is clickbait headlines, which are used to capture users' attention and direct them to specific content. Clickbait headlines exploit users' curiosity, causing them to navigate to targeted content and spend more time on it. Such content, which can be malicious, is one of the main problems for today's internet users. In the literature, artificial intelligence-based approaches using machine learning and deep learning models have been developed for the problem of clickbait detection. However, there is a need for studies on the explainability of artificial intelligence models developed in this field. Explainable artificial intelligence (XAI) aims to explain the transparency, understandability and decision-making processes of machine learning models. This study aims to develop explainable artificial intelligence-based models for the clickbait detection problem. In this context, a Turkish dataset compiled from different news sources was used. Initially, data preprocessing activities including feature engineering, missing data handling, stemming, normalization and term frequency-inverse document-frequency (TF-IDF) transformation were performed. Subsequently, k-nearest neighbors, Naive Bayes, logistic regression, decision tree, random forest, extreme gradient boosting (XGBoost), support vector machine and multi-layer perceptron (MLP) models were developed using the dataset. Hyperparameter optimization was applied to determine the most suitable parameter values for each model. The performances of the applied models were comparatively evaluated. Finally, to ensure the explainability of artificial intelligence models in clickbait detection, the SHAP method was used for identifying the factors affecting the classification results.
Clickbait Detection Natural Language Processing SHAP Explainable Artificial Intelligence
Birincil Dil | İngilizce |
---|---|
Konular | Doğal Dil İşleme |
Bölüm | Research Article |
Yazarlar | |
Yayımlanma Tarihi | 21 Kasım 2024 |
Gönderilme Tarihi | 13 Ekim 2024 |
Kabul Tarihi | 21 Kasım 2024 |
Yayımlandığı Sayı | Yıl 2024 Cilt: 7 Sayı: 1 |
AI Research and Application Center, Sakarya University of Applied Sciences, Sakarya, Türkiye.