Sentiment Analysis on Youtube Comments Using Machine Learning and Deep Learning with PCA- and LDA-Based Feature Selection
| dc.contributor.author | ÇiÇek, Gulay | |
| dc.contributor.author | Buldag, Nazli | |
| dc.contributor.author | Aydin, Elif | |
| dc.date.accessioned | 2026-01-31T15:04:24Z | |
| dc.date.available | 2026-01-31T15:04:24Z | |
| dc.date.issued | 2025 | |
| dc.department | İstanbul Beykent Üniversitesi | |
| dc.description.abstract | This study presents a comprehensive Sentiment Analysis (SA) framework applied to a novel, real-time collected YouTube comment dataset categorized into five distinct emotional classes: happiness, sadness, fear, anger, and surprise. Unlike conventional studies that rely on pre-existing or simplistic binary datasets, we employ dynamic data acquisition and rigorously evaluate the performance of eleven distinct classifiers, including traditional Machine Learning (ML) algorithms (K-Nearest Neighbors, Naïve Bayes (NB), Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM)) and advanced Deep Learning (DL) models (Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), Gated Recurrent Unit (GRU), Bidirectional Gated Recurrent Unit (BiGRU), and a CNN-LSTM hybrid). A central contribution involves the systematic comparison of Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) as feature selection techniques across all models. The findings demonstrate that dimensionality reduction techniques significantly impact model efficacy, with the LSTM model achieving the highest performance (88% accuracy) on the PCA-processed dataset, matched only by LR on the LDA-processed dataset. This work provides critical insights into optimizing classifier choice based on feature processing methods for multi-class sentiment analysis using dynamic social media data. © Bharati Vidyapeeth's Institute of Computer Applications and Management 2025. | |
| dc.identifier.doi | 10.1007/s41870-025-02938-7 | |
| dc.identifier.issn | 2511-2104 | |
| dc.identifier.scopus | 2-s2.0-105024328121 | |
| dc.identifier.scopusquality | Q1 | |
| dc.identifier.uri | https://doi.org/10.1007/s41870-025-02938-7 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12662/10537 | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | en | |
| dc.publisher | Springer Science and Business Media B.V. | |
| dc.relation.ispartof | International Journal of Information Technology (Singapore) | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/closedAccess | |
| dc.snmz | KA_Scopus_20260128 | |
| dc.subject | Classification algorithms | |
| dc.subject | Deep learning | |
| dc.subject | Hybrid modeling | |
| dc.subject | Linear discriminant analysis | |
| dc.subject | Machine learning | |
| dc.subject | Principal component analysis | |
| dc.subject | Sentiment analysis | |
| dc.subject | YouTube comments | |
| dc.title | Sentiment Analysis on Youtube Comments Using Machine Learning and Deep Learning with PCA- and LDA-Based Feature Selection | |
| dc.type | Article |












