contamination-free benchmark

LiveBench

LiveBench stands as the leading contamination-free LLM benchmark in late 2025, using regularly refreshed questions from recent sources and objective ground-truth scoring. It challenges top models like GPT-5.1 and Claude 4.5 across reasoning, math, coding, and more—providing fair, reproducible results trusted for research and development.