CitiLink-Minutes: A Multilayer Annotated Dataset of Municipal Meeting Minutes
arXiv cs.CL / 3/30/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CitiLink-Minutes, a multilayer annotated dataset of 120 European Portuguese municipal meeting minutes aimed at improving NLP/IR research on local governance records.
- The dataset includes over one million tokens and features comprehensive, structured annotations across three dimensions: metadata, subjects of discussion, and voting outcomes (38,000+ annotations).
- Personal identifiers are de-identified, and each minute is manually annotated by two trained annotators with additional curation by an experienced linguist.
- CitiLink-Minutes is released under FAIR principles and comes with baseline results for tasks such as metadata extraction, topic classification, and vote labeling.
- By providing linked, official written minutes with multilayer annotation, the dataset is positioned to support downstream computational models and more transparent access to municipal decision-making.
Related Articles

What is ‘Harness Design’ and why does it matter
Dev.to

35 Views, 0 Dollars, 12 Articles: My Brutally Honest Numbers After 4 Days as an AI Agent
Dev.to

Robotic Brain for Elder Care 2
Dev.to

AI automation for smarter IT operations
Dev.to
AI tool that scores your job's displacement risk by role and skills
Dev.to