Semantic-Driven Topic Modeling for Analyzing Creativity in Virtual Brainstorming

arXiv cs.CL / 3/23/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • It presents a semantic-driven topic modeling framework for analyzing creativity in virtual brainstorming, integrating transformer-based embeddings (Sentence-BERT), dimensionality reduction (UMAP), clustering (HDBSCAN), and refined topic extraction.
  • The approach captures semantic similarity at the sentence level to extract coherent themes from brainstorming transcripts while filtering noise and outliers.
  • Empirical results show higher topic coherence (CV) of 0.687, outperforming LDA, ETM, and BERTopic on structured Zoom sessions.
  • The framework provides interpretable insights into the depth and diversity of topics, supporting both convergent and divergent dimensions of group creativity in synchronous meetings.

Abstract

Virtual brainstorming sessions have become a central component of collaborative problem solving, yet the large volume and uneven distribution of ideas often make it difficult to extract valuable insights efficiently. Manual coding of ideas is time-consuming and subjective, underscoring the need for automated approaches to support the evaluation of group creativity. In this study, we propose a semantic-driven topic modeling framework that integrates four modular components: transformer-based embeddings (Sentence-BERT), dimensionality reduction (UMAP), clustering (HDBSCAN), and topic extraction with refinement. The framework captures semantic similarity at the sentence level, enabling the discovery of coherent themes from brainstorming transcripts while filtering noise and identifying outliers. We evaluate our approach on structured Zoom brainstorming sessions involving student groups tasked with improving their university. Results demonstrate that our model achieves higher topic coherence compared to established methods such as LDA, ETM, and BERTopic, with an average coherence score of 0.687 (CV), outperforming baselines by a significant margin. Beyond improved performance, the model provides interpretable insights into the depth and diversity of topics explored, supporting both convergent and divergent dimensions of group creativity. This work highlights the potential of embedding-based topic modeling for analyzing collaborative ideation and contributes an efficient and scalable framework for studying creativity in synchronous virtual meetings.

Semantic-Driven Topic Modeling for Analyzing Creativity in Virtual Brainstorming | AI Navigate