Sentiment Analysis on Youtube Comments Using Machine Learning and Deep Learning with PCA- and LDA-Based Feature Selection

dc.contributor.author: Çiçek, Gulay
dc.contributor.author: Buldag, Nazli
dc.contributor.author: Aydin, Elif
dc.date.accessioned: 2026-01-31T15:04:24Z
dc.date.available: 2026-01-31T15:04:24Z
dc.date.issued: 2025
dc.department: İstanbul Beykent Üniversitesi
dc.description.abstract: This study presents a comprehensive Sentiment Analysis (SA) framework applied to a novel, real-time collected YouTube comment dataset categorized into five distinct emotional classes: happiness, sadness, fear, anger, and surprise. Unlike conventional studies that rely on pre-existing or simplistic binary datasets, we employ dynamic data acquisition and rigorously evaluate the performance of eleven distinct classifiers, including traditional Machine Learning (ML) algorithms (K-Nearest Neighbors, Naïve Bayes (NB), Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM)) and advanced Deep Learning (DL) models (Long Short-Term Memory (LSTM), Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (BiLSTM), Gated Recurrent Unit (GRU), Bidirectional Gated Recurrent Unit (BiGRU), and a CNN-LSTM hybrid). A central contribution involves the systematic comparison of Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) as feature selection techniques across all models. The findings demonstrate that dimensionality reduction techniques significantly impact model efficacy, with the LSTM model achieving the highest performance (88% accuracy) on the PCA-processed dataset, matched only by LR on the LDA-processed dataset. This work provides critical insights into optimizing classifier choice based on feature processing methods for multi-class sentiment analysis using dynamic social media data. © Bharati Vidyapeeth's Institute of Computer Applications and Management 2025.
dc.identifier.doi: 10.1007/s41870-025-02938-7
dc.identifier.issn: 2511-2104
dc.identifier.scopus: 2-s2.0-105024328121
dc.identifier.scopusquality: Q1
dc.identifier.uri: https://doi.org/10.1007/s41870-025-02938-7
dc.identifier.uri: https://hdl.handle.net/20.500.12662/10537
dc.indekslendigikaynak: Scopus
dc.language.iso: en
dc.publisher: Springer Science and Business Media B.V.
dc.relation.ispartof: International Journal of Information Technology (Singapore)
dc.relation.publicationcategory: Article - International Refereed Journal - Institutional Faculty Member
dc.rights: info:eu-repo/semantics/closedAccess
dc.snmz: KA_Scopus_20260128
dc.subject: Classification algorithms
dc.subject: Deep learning
dc.subject: Hybrid modeling
dc.subject: Linear discriminant analysis
dc.subject: Machine learning
dc.subject: Principal component analysis
dc.subject: Sentiment analysis
dc.subject: YouTube comments
dc.title: Sentiment Analysis on Youtube Comments Using Machine Learning and Deep Learning with PCA- and LDA-Based Feature Selection
dc.type: Article
