Team Fusion@ SU@ BC8 SympTEMIST track: transformer-based approach for symptom recognition and linking

arXiv cs.CL / 4/9/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces a transformer-based system to perform SympTEMIST named entity recognition (NER) and entity linking (EL) for symptom data using a RoBERTa-based token classifier.
For NER, the approach fine-tunes a RoBERTa model augmented with BiLSTM and CRF layers, leveraging an augmented training set to improve token-level entity extraction.
For entity linking, it generates cross-lingual candidates using SapBERT XLMR-Large and ranks them by cosine similarity to entries in a knowledge base.
The authors report that the selection of the knowledge base is the most influential factor for improving EL (and overall) accuracy.
The work is presented as a new arXiv release, positioning it as a research-method contribution for symptom-related biomedical/NLP pipelines.

Abstract

This paper presents a transformer-based approach to solving the SympTEMIST named entity recognition (NER) and entity linking (EL) tasks. For NER, we fine-tune a RoBERTa-based (1) token-level classifier with BiLSTM and CRF layers on an augmented train set. Entity linking is performed by generating candidates using the cross-lingual SapBERT XLMR-Large (2), and calculating cosine similarity against a knowledge base. The choice of knowledge base proves to have the highest impact on model accuracy.

Black Hat Asia

AI Business

Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual shareholder letter

TechCrunch

Why Anthropic’s new model has cybersecurity experts rattled

Reddit r/artificial

Does the AI 2027 paper still hold any legitimacy?

Reddit r/artificial

Why Most Productivity Systems Fail (And What to Do Instead)

Dev.to

Team Fusion@ SU@ BC8 SympTEMIST track: transformer-based approach for symptom recognition and linking

Key Points

Abstract

Related Articles

Black Hat Asia

Amazon CEO takes aim at Nvidia, Intel, Starlink, more in annual shareholder letter

Why Anthropic’s new model has cybersecurity experts rattled

Does the AI 2027 paper still hold any legitimacy?

Why Most Productivity Systems Fail (And What to Do Instead)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer