AI Navigate

[P] XGBoost + TF-IDF for emotion prediction — good state accuracy but struggling with intensity (need advice)

Reddit r/MachineLearning / 3/19/2026

💬 Opinion · Tools & Practical Usage

Key Points

  • The project uses ~1200 samples to predict a 6-class emotional state and an intensity score (1–5), combining TF-IDF text features with engineered metadata and XGBoost models for classification and regression.
  • Emotion classification achieves about 66–67% accuracy, with most confusion between neighboring classes and the dataset described as small and noisy where text features dominate over metadata.
  • Intensity was initially tackled as a classification task with only ~21% accuracy, then switched to regression, achieving an MAE of around 1.22 after rounding predictions to 1–5.
  • TF-IDF feature tuning shows a trade-off: reducing features to 500 hurts accuracy, increasing to 1000–1500 yields slight improvements, but the optimal balance remains unclear and feature engineering has had limited impact.

Hey everyone,

I’m working on a small ML project (~1200 samples) where I’m trying to predict:

  1. Emotional state (classification — 6 classes)
  2. Intensity (1–5) of that emotion

The dataset contains:

  • journal_text (short, noisy reflections)
  • metadata like:
    • stress_level
    • energy_level
    • sleep_hours
    • time_of_day
    • previous_day_mood
    • ambience_type
    • face_emotion_hint
    • duration_min
    • reflection_quality

🔧 What I’ve done so far

1. Text processing

Using TF-IDF:

  • max_features = 500 → tried 1000+ as well
  • ngram_range = (1,2)
  • stop_words = 'english'
  • min_df = 2

Resulting shape:

  • ~1200 samples × 500–1500 features
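For reference, the TF-IDF setup above can be sketched like this (the toy journal entries are hypothetical stand-ins for the real data, so the resulting vocabulary is tiny):

```python
from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical short, noisy journal reflections
texts = [
    "felt anxious and tired before the meeting",
    "great run this morning, lots of energy",
    "tired and low energy, slept badly",
    "calm evening, felt relaxed",
]

vectorizer = TfidfVectorizer(
    max_features=500,      # cap vocabulary size (post tried 1000+ too)
    ngram_range=(1, 2),    # unigrams + bigrams
    stop_words="english",
    min_df=2,              # drop terms appearing in fewer than 2 documents
)
X_text = vectorizer.fit_transform(texts)
print(X_text.shape)  # (n_samples, n_features), n_features <= max_features
```

With ~1200 real samples this yields the sparse ~1200 × 500–1500 matrix described above; `min_df=2` is what prunes rare noise terms before `max_features` caps the vocabulary.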

2. Metadata

  • Converted categorical (face_emotion_hint) to numeric
  • Kept others as numerical
  • Handled missing values (NaN can be left for XGBoost to handle natively, or filled simply)

Also added engineered features:

  • text_length
  • word_count
  • stress_energy = stress_level * energy_level
  • emotion_hint_diff = stress_level - energy_level

Scaled metadata using StandardScaler
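Putting the engineered features and scaling together, a minimal sketch (metadata values are made up; column and feature names follow the post):

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Toy metadata frame with hypothetical values
meta = pd.DataFrame({
    "stress_level": [3, 7, 5, 2],
    "energy_level": [6, 2, 4, 8],
    "sleep_hours":  [7.5, 5.0, 6.0, 8.0],
})
texts = ["felt anxious today", "so tired", "an ok day overall", "great mood"]

# Engineered features described in the post
meta["text_length"] = [len(t) for t in texts]
meta["word_count"] = [len(t.split()) for t in texts]
meta["stress_energy"] = meta["stress_level"] * meta["energy_level"]
meta["emotion_hint_diff"] = meta["stress_level"] - meta["energy_level"]

# Standardize so tree-external models and distance-based checks behave sanely
X_meta = StandardScaler().fit_transform(meta)
print(X_meta.shape)  # (n_samples, n_metadata_features)
```

Note that tree ensembles like XGBoost are scale-invariant, so the `StandardScaler` step mainly matters if you later try linear models on the same combined matrix.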

Combined with text using:

```python
from scipy.sparse import hstack

X_final = hstack([X_text, X_meta_sparse]).tocsr()
```

3. Models

Emotional State (Classification)

Using XGBClassifier:

  • accuracy ≈ 66–67%

The classification report looks decent; confusion is mostly between neighboring classes.

Intensity (Initially Classification)

  • accuracy ≈ 21% (very poor)

4. Switched Intensity → Regression

Used XGBRegressor:

  • predictions rounded to 1–5

Evaluation:

  • MAE ≈ 1.22

Current Issues

1. Intensity is not improving much

  • Even after feature engineering + tuning
  • MAE stuck around 1.2
  • Small improvements only (~0.05–0.1)

2. TF-IDF tuning confusion

  • Reducing features (500) → accuracy dropped
  • Increasing (1000–1500) → slightly better

Not sure how to find the optimal balance.

3. Feature engineering impact is small

  • Added multiple features but no major improvement
  • Unsure what kind of features actually help intensity

Observations

  • Dataset is small (1200 rows)
  • Labels are noisy (subjective emotion + intensity)
  • Model confuses nearby classes (expected)
  • Text seems to dominate over metadata

Questions

  1. Are there better approaches for ordinal prediction (instead of plain regression)?
  2. Any ideas for better features specifically for emotional intensity?
  3. Should I try different models (LightGBM, linear models, etc.)?
  4. Any better way to combine text + metadata?

Goal

Not just to maximize accuracy, but to build something that:

  • handles noisy data
  • generalizes well
  • reflects real-world behavior

Would really appreciate any suggestions or insights 🙏

submitted by /u/Udbhav96