[P] Create datasets from TikTok videos

Reddit r/MachineLearning / 3/28/2026

💬 Opinion | Developer Stack & Infrastructure | Tools & Practical Usage

Key Points

  • Tikkocampus is presented as a tool to convert TikTok creator timelines into timestamped, searchable video segments for ML experiments and RAG workflows.
  • The generated segments can be used to build datasets derived from TikTok videos, enabling retrieval-augmented generation and related analysis tasks.
  • The post emphasizes that the pipeline supports both dataset creation for experimentation and broader video analytics use cases.
  • A GitHub repository is linked so users can try or integrate the approach into their own RAG/dataset generation projects.

For ML experiments and RAG projects: Tikkocampus converts creator timelines into timestamped, searchable segments and then uses them to perform RAG. It's useful for creating datasets from TikTok videos or simply for analysis. Repo: https://github.com/ilyasstrougouty/Tikkocampus
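To make "timestamped, searchable segments" concrete, here is a minimal Python sketch of the general idea. It is not Tikkocampus's actual code or API: the `Segment` class, the `search` function, and the sample transcripts are all hypothetical, and the simple TF-IDF-style keyword scoring stands in for whatever retrieval the tool really uses.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical record for one timestamped slice of a video transcript."""
    video_id: str
    start: float  # seconds from the start of the video
    end: float
    text: str


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def search(segments, query, top_k=3):
    """Rank segments by a simple TF-IDF-style keyword score (illustrative only)."""
    docs = [Counter(tokenize(s.text)) for s in segments]
    n = len(docs)
    # Document frequency: in how many segments each token appears.
    df = Counter()
    for doc in docs:
        df.update(doc.keys())
    scored = []
    for seg, doc in zip(segments, docs):
        score = sum(
            doc[t] * (math.log((1 + n) / (1 + df[t])) + 1.0)
            for t in tokenize(query)
        )
        if score > 0:
            scored.append((score, seg))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [seg for _, seg in scored[:top_k]]


# Made-up sample data, not taken from any real TikTok video.
segments = [
    Segment("vid1", 0.0, 12.5, "Today we talk about learning rate schedules"),
    Segment("vid1", 12.5, 30.0, "Warmup helps training stay stable"),
    Segment("vid2", 0.0, 8.0, "My favorite pasta recipe"),
]

hits = search(segments, "learning rate")
for seg in hits:
    print(f"{seg.video_id} [{seg.start:.1f}s to {seg.end:.1f}s]: {seg.text}")
```

In a RAG setup, the retrieved segments, together with their video IDs and timestamps, would be passed to a language model as context, so answers can point back to the exact moment in a video.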

submitted by /u/Ilyastrou