Benchmarking Logistic Regression, SVM, and LightGBM Against BiLSTM with Attention for Sentiment Analysis on Indonesian Product Reviews
arXiv cs.CL / 4/29/2026
📰 NewsModels & Research
Key Points
- The study benchmarks an AutoML-based machine learning pipeline (via PyCaret) against a deep learning BiLSTM with attention model for binary sentiment analysis on Indonesian e-commerce product reviews.
- The dataset contains 19,728 balanced samples (equal positive/negative), enabling evaluation using 10-fold stratified cross-validation for the ML models and a held-out test set for the DL model.
- Among ML methods, Logistic Regression performed best, reaching 97.26% accuracy and 97.26% F1-score, outperforming linear-kernel SVM and LightGBM in the reported setup.
- The BiLSTM with Attention model achieved nearly matching results, with 97.24% accuracy and 97.24% F1-score on 3,946 held-out test samples.
- The authors conclude that well-preprocessed traditional ML approaches with good feature extraction can closely match or slightly beat more complex sequential DL architectures while requiring less computation.
Related Articles
LLMs will be a commodity
Reddit r/artificial

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
Reddit r/LocalLLaMA

Dex lands $5.3M to grow its AI-driven talent matching platform
Tech.eu

AI Voice Agents in Production: What Actually Works in 2026
Dev.to

How we built a browser-based AI Pathology platform
Dev.to