Web Apps

💻
TerminalBench Explorer
89 real-world terminal tasks with full execution traces across 64 agent/model combos
🌐
AppWorld Explorer
585 API-driven tasks with step-by-step agent trajectories across 12 model/method combos
🔍
GAIA Explorer
507 multi-step reasoning questions with agent traces, tool calls, and web browsing
📊
Benchmark Sampler
Random samples from GPQA, SWE-Bench, MMMU, HumanEval, and other ML evals
📚
BibTeX Lookup
Instant citations from arXiv IDs, DOIs, or paper titles
📅
Conference Deadlines
Countdowns for NeurIPS, ICML, ICLR, ACL, and more
📋
Instruction Datasets
Browse samples from IFEval, Arena-Hard, WildBench, and other LLM eval sets
🖥
SLURM Generator
Build sbatch scripts with cluster presets and common options
📊
NIW Visa Bulletin
EB-2 priority date charts with historical trends
Tao Te Ching
All 81 chapters with six parallel translations
Interval Timer
Color-coded intervals for HIIT and custom workouts
Visual Timer
Shrinking-disk countdown that shows time remaining at a glance
🎤
Talk Timer
Traffic-light segments for conference talks and meetings