SUMMIR: A Hallucination-Aware Framework for Ranking Sports Insights from LLMs
arXiv cs.AI / 4/8/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces SUMMIR, a framework for extracting pre-game and post-game sports insights from news articles and ranking them by user-specific interests.
- It builds a dataset of 7,900 sports news articles spanning 800 matches across Cricket, Soccer, Basketball, and Baseball, and uses a two-step validation pipeline with both open-source and proprietary LLMs.
- The approach generates insights using multiple state-of-the-art LLMs (GPT-4o, Qwen2.5-72B-Instruct, Llama-3.3-70B-Instruct, and Mixtral-8x7B-Instruct-v0.1) and evaluates factuality with FactScore.
- Hallucination awareness is addressed by applying SummaC (Summary Consistency) with GPT-4o, enabling more reliable ranking and comparison across models.
- Results show SUMMIR can produce relevant, high-quality insights while also exposing meaningful differences across LLMs in factual consistency and perceived interestingness, and the code is released on GitHub.
Related Articles
[N] Just found out that Milla Jovovich is a dev, invested in AI, and just open sourced a project
Reddit r/MachineLearning

ALTK‑Evolve: On‑the‑Job Learning for AI Agents
Hugging Face Blog

Context Windows Are Getting Absurd — And That's a Good Thing
Dev.to
Google isn’t an AI-first company despite Gemini being great
Reddit r/artificial

GitHub Weekly: Copilot SDK Goes Public, Cloud Agent Breaks Free
Dev.to