How do you use AI agents for EDA/data analysis and for getting data ready for ML model training? [D]

Reddit r/MachineLearning / 5/1/2026

💬 Opinion / Developer Stack & Infrastructure / Tools & Practical Usage

Key Points

  • The post asks how AI agents can autonomously replicate a typical manual data analysis workflow, including inspecting data, handling missing values, removing outliers, and applying transformations for better model accuracy.
  • It focuses specifically on using AI agents for EDA and preparing datasets so they are ready for machine learning model training.
  • The user already has an OpenCode + VS Code setup and wants step-by-step guidance on how to set up and use Claude/OpenCode to drive the process.
  • The post also requests help on having the agent handle not just preprocessing, but also training-related local setup (at least partially).

In a manual workflow, I would study the given data column by column with functions like df.info(), then remove nulls, outliers, etc., transforming the dataset so that the model gives the best accuracy/result. How can an AI agent do this, and the overall workflow, autonomously? I already have an opencode + VS Code setup, but I don't know how to go about it. I have claude and opencode go, so I need your advice and steps, please.
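For reference, the manual steps described above can be sketched in pandas roughly as follows. This is only an illustration of the kind of script an agent might be asked to produce, not a recommended recipe: the toy columns, the helper names `summarize` and `clean`, the choice to drop rows with nulls, and the 1.5 × IQR outlier rule are all assumptions that do not come from the post. Note that `info()` is a method on a DataFrame (`df.info()`), not on the `pd` module.

```python
import numpy as np
import pandas as pd


def summarize(df: pd.DataFrame) -> pd.DataFrame:
    """Per-column overview: dtype, missing count, number of distinct values."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "n_missing": df.isna().sum(),
        "n_unique": df.nunique(),
    })


def clean(df: pd.DataFrame, k: float = 1.5) -> pd.DataFrame:
    """Drop rows with nulls, then drop rows outside k * IQR on any numeric column.

    Both rules are illustrative assumptions; imputing or clipping may suit
    a real dataset better.
    """
    out = df.dropna().copy()
    keep = pd.Series(True, index=out.index)
    for col in out.select_dtypes(include="number").columns:
        q1, q3 = out[col].quantile([0.25, 0.75])
        iqr = q3 - q1
        keep &= out[col].between(q1 - k * iqr, q3 + k * iqr)
    return out[keep]


# Hypothetical toy data with one missing value and one obvious outlier.
df = pd.DataFrame({
    "age": [25, 31, 29, np.nan, 27, 30, 28, 400],
    "income": [42, 52, 48, 50, 45, 51, 47, 49],
    "city": ["A", "B", "A", "B", "A", "B", "A", "B"],
})

df.info()                # dtypes and non-null counts per column
report = summarize(df)   # compact table to inspect or hand to an agent
cleaned = clean(df)
print(report)
print(cleaned)
```

Keeping inspection (`summarize`) separate from mutation (`clean`) makes it easier to review what an agent intends to change before it changes anything.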

I also want it to train the model for me, or at least do all the setup for training locally.
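A local training setup of the kind requested could look something like the sketch below, assuming scikit-learn is installed. The synthetic data, the median imputation, and the logistic regression model are placeholders chosen for illustration; nothing in the post specifies the dataset or the model type.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for a real dataset (assumption, not from the post).
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)
X[rng.random(X.shape) < 0.05] = np.nan  # inject some missing values

# Hold out a test set before fitting anything, so preprocessing
# statistics are learned from the training rows only.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Preprocessing and model in one pipeline: fit() learns the imputation
# and scaling parameters on the training data and reuses them at test time.
model = Pipeline([
    ("impute", SimpleImputer(strategy="median")),
    ("scale", StandardScaler()),
    ("clf", LogisticRegression()),
])
model.fit(X_train, y_train)

accuracy = model.score(X_test, y_test)
print(f"held-out accuracy: {accuracy:.3f}")
```

Bundling preprocessing into the pipeline is one way to avoid leaking test-set statistics into training, which is an easy mistake to miss when the cleaning code is generated automatically.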

submitted by /u/Arceus918