Benchmarking PyCaret AutoML Against IndoBERT Fine-Tuning for Sentiment Analysis on Indonesian IKN Twitter Data
arXiv cs.CL / 4/29/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The study benchmarks PyCaret AutoML using classical models against IndoBERT fine-tuning for binary sentiment analysis on Indonesian IKN-related Twitter comments.
- The dataset comprises 1,472 manually labeled samples (780 negative, 692 positive), and classical models were assessed with 10-fold cross-validation.
- Logistic Regression performed best among the classical baselines, reaching 77.57% accuracy and 77.17% F1-score.
- IndoBERT fine-tuning (indobenchmark/indobert-base-p1) achieved substantially higher results, with 89.59% test accuracy and 89.37% F1-score after five epochs.
- The findings indicate that Transformer-based contextual representations are highly effective for sentiment classification of informal Indonesian social media text, outperforming AutoML-style baselines.
Related Articles
LLMs will be a commodity
Reddit r/artificial

Indian Developers: How to Build AI Side Income with $0 Capital in 2026
Dev.to

What it feels like to have to have Qwen 3.6 or Gemma 4 running locally
Reddit r/LocalLLaMA

Dex lands $5.3M to grow its AI-driven talent matching platform
Tech.eu

AI Citation Registry: Why Daily Updates Leave No Time for Data Structuring
Dev.to