Retrieval Benchmark Leaderboard
Compare benchmark runs across domains, chunking modes, and quality metrics.
Designed for JSONL retrieval benchmark outputs.
MRR
Recall
NDCG
Hit Rate
Chunking mode
Leaderboard
Score is computed as the mean of the selected metrics.
Recommended default: ndcg@10 + mrr@10 + recall@10.