IndoBERT-Sentiment: Context-Conditioned Sentiment Classification for Indonesian Text

arXiv cs.CL / 4/9/2026


Key Points

  • The paper introduces IndoBERT-Sentiment, a context-conditioned model that uses both topical context and Indonesian text to improve sentiment classification versus context-free approaches.
  • IndoBERT-Sentiment is built on IndoBERT Large (335M parameters) and trained on 31,360 labeled context-text pairs spanning 188 topics.
  • The model reports strong performance, achieving an F1 macro of 0.856 and accuracy of 88.1% on its evaluation.
  • In comparisons on the same test set, it outperforms the best of three widely used general-purpose Indonesian sentiment baselines by 35.6 F1 points.
  • The authors argue that context-conditioning, previously used for relevancy classification, transfers effectively to sentiment analysis, correcting systematic errors made by context-free models.

Abstract

Existing Indonesian sentiment analysis models classify text in isolation, ignoring the topical context that often determines whether a statement is positive, negative, or neutral. We introduce IndoBERT-Sentiment, a context-conditioned sentiment classifier that takes both a topical context and a text as input, producing sentiment predictions grounded in the topic being discussed. Built on IndoBERT Large (335M parameters) and trained on 31,360 context-text pairs labeled across 188 topics, the model achieves an F1 macro of 0.856 and accuracy of 88.1%. In a head-to-head evaluation against three widely used general-purpose Indonesian sentiment models on the same test set, IndoBERT-Sentiment outperforms the best baseline by 35.6 F1 points. We show that context-conditioning, previously demonstrated for relevancy classification, transfers effectively to sentiment analysis and enables the model to correctly classify texts that are systematically misclassified by context-free approaches.
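The context-conditioning described above typically maps onto BERT's standard two-segment input format, where the topical context occupies the first segment and the text to classify occupies the second. A minimal sketch of that input packing (the special tokens follow standard BERT conventions; the paper's exact tokenizer, label set, and label order are assumptions, not taken from the source):

```python
# Sketch of context-conditioned input packing in BERT's two-segment
# format: [CLS] context [SEP] text [SEP]. With the Hugging Face
# transformers library this corresponds to tokenizer(context, text),
# which sets token_type_ids to distinguish the two segments.

# Assumed three-way label set; the paper's label order is unknown.
LABELS = ["negative", "neutral", "positive"]

def pack_pair(context: str, text: str) -> str:
    """Join a topical context and a text into one sequence,
    so the classifier's prediction is grounded in the topic."""
    return f"[CLS] {context} [SEP] {text} [SEP]"

def pack_isolated(text: str) -> str:
    """Context-free baseline input: the text alone."""
    return f"[CLS] {text} [SEP]"

# The same text can carry different sentiment under different topics,
# which is why the context segment is fed to the model at all.
example = pack_pair("kenaikan harga BBM", "Akhirnya naik juga!")
```

In practice the string concatenation above would be handled by the tokenizer's sentence-pair mode rather than built by hand; the sketch only illustrates why a context-conditioned model sees strictly more signal than an isolation-based one operating on `pack_isolated` inputs.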