A Korean-Specialized Language Model with Balanced Multilingual Performance
Offline-Ready Evaluation Frameworks for Korean AI Models
Development of an LLM-Based Forex Trader Model for Enhanced Financial Market Predictions
A benchmark dataset designed to evaluate LLM's proficiency in the korean language
A Refined Extension of KO-BENCH for Korean LLM Evaluation
A Benchmark Aligned with Korean Educational Standards for Evaluating LLMs
Korean Evaluation Datasets and Task Integration for LLMs Using Lighteval
A Human-Verified Instruction-Following Benchmark for Korean LLMs