Hacker News
Benchmark for measuring code erosion under iterative specification refinement (scbench.ai)
1 point by dnw 23 days ago
SlopCodeBench: Benchmarking How Coding Agents Degrade over Long-Horizon Tasks (scbench.ai)
2 points by matt_d 24 days ago
