Sentiment analysis is considered the process to extract the overall expression, opinions, or feelings from reviews about something such as products, services, or movies. Pre-processing is considered a crucial phase in sentiment analysis for text mining because it allows us to analyze the reviews according to their intended meaning by removing all of the appendages which are the words that do not affect the semantics of sentences. And therefore, the number of features will decrease and thus accuracy will increase. Accordingly, we have decided to evaluate our experiment in identifying the best influencing technique of pre-processing for several features by making a comparison between the features and by combining them together to reach the best result based on the feature number for each pre-processing technique and classification accuracy. this comparison was done by using three algorithms for classification SVM, NB, and DT after applying tools for feature selection and feature extraction with three techniques for tokenization. We concluded that there are some of these techniques that have a negative effect like lemmatization and the part of them is not due to any difference, other, which a little part, have an effect
Altinbas university
Thanks
Primary Language | English |
---|---|
Subjects | Computer Software |
Journal Section | Research Article |
Authors | |
Publication Date | December 31, 2022 |
Submission Date | October 12, 2022 |
Acceptance Date | November 3, 2022 |
Published in Issue | Year 2022 |