XekRung Technical Report

arXiv cs.AI / 5/4/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

XekRungは、サイバーセキュリティに特化した「フロンティア級」の大規模言語モデルで、幅広いセキュリティ機能を提供することを目的としています。
サイバーセキュリティ領域に合わせた多様なデータ合成パイプラインを開発し、高品質な学習データを大規模に構築できる土台を整えています。
学習は、継続事前学習（CPT）、教師あり微調整（SFT）、強化学習（RL）を含む一貫したパイプラインで行い、モデル能力の拡張を図っています。
ドメイン特化能力と汎用能力の双方を反復的に改善するための多次元評価システムも提案しています。
実験の結果、同規模のモデル群においてサイバーセキュリティのベンチマークで最先端性能を示しつつ、汎用ベンチマークでも強い性能を維持しています。

Abstract

We present XekRung, a frontier large language model for cybersecurity, designed to provide comprehensive security capabilities. To achieve this, we develop diverse data synthesis pipelines tailored to the cybersecurity domain, enabling the scalable construction of high-quality training data and providing a strong foundation for cybersecurity knowledge and understanding. Building on this foundation, we establish a complete training pipeline spanning continued pre-training (CPT), supervised fine-tuning (SFT), and reinforcement learning (RL) to further extend the model's capabilities. We further introduce a multi-dimensional evaluation system to guide the iterative improvement of both domain-specific and general-purpose abilities. Extensive experiments demonstrate that XekRung achieves state-of-the-art performance on cybersecurity-specific benchmarks among models of the same scale, while maintaining strong performance on general benchmarks.