view article Article "The Child That Surpassed Both Parents Through MRI-Guided Evolutionary Merge" 3 days ago โข 13
view article Article Introducing WM Bench: A Benchmark for Cognitive Intelligence in World Models 5 days ago โข 13
view article Article MARL: Runtime Middleware That Reduces LLM Hallucination Without Fine-Tuning 26 days ago โข 15
view article Article Structural Problems in AI Benchmarking and the Case for a Unified Evaluation Framework 27 days ago โข 12
view article Article ๐๏ธ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do 24 days ago โข 38