Environmental, Social and Governance Sentiment Analysis on Slovene News: A Novel Dataset and Models

arXiv cs.CL / 4/9/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces the first publicly available Slovene ESG sentiment dataset, built from MaCoCu Slovene news using LLM-assisted filtering plus human annotation of company-related ESG content.
  • It evaluates multiple automated ESG sentiment detection approaches, including monolingual SloBERTa, multilingual XLM-R, embedding-based TabPFN classifiers, hierarchical ensembles, and several large language model setups.
  • Results indicate LLMs perform best for Environmental and Social aspect classification, while a fine-tuned SloBERTa model achieves the strongest performance for Governance classification.
  • A small case study demonstrates how the best-performing classifier (gpt-oss) can be used to analyze ESG aspects for selected companies over a long time horizon.
  • The work targets a key gap in reliable ESG ratings for smaller companies and emerging markets by enabling scalable, language-specific ESG sentiment analysis from news.

Abstract

Environmental, Social, and Governance (ESG) considerations are increasingly integral to assessing corporate performance, reputation, and long-term sustainability. Yet, reliable ESG ratings remain limited for smaller companies and emerging markets. We introduce the first publicly available Slovene ESG sentiment dataset and a suite of models for automatic ESG sentiment detection. The dataset, derived from the MaCoCu Slovene news collection, combines large language model (LLM)-assisted filtering with human annotation of company-related ESG content. We evaluate the performance of monolingual (SloBERTa) and multilingual (XLM-R) models, embedding-based classifiers (TabPFN), hierarchical ensemble architectures, and large language models. Results show that LLMs achieve the strongest performance on Environmental (Gemma3-27B, F1-macro: 0.61) and Social aspects (gpt-oss 20B, F1-macro: 0.45), while fine-tuned SloBERTa is the best model on Governance classification (F1-macro: 0.54). We then show in a small case study how the best-preforming classifier (gpt-oss) can be applied to investigate ESG aspects for selected companies across a long time frame.