Nikhil
|
84a79ba0a1
|
feat: refactor eval pipeline workflow (#875)
* feat(eval): add suite variant config bridge
* feat(eval): add stable run artifacts
* refactor(eval): add shared grader contract
* feat(eval): persist grader artifacts
* refactor(eval): rename runner layers
* refactor(eval): add executor backend boundary
* refactor(eval): split clado backend
* feat(eval): add workflow compatible cli
* feat(eval): add r2 publisher module
* ci(eval): migrate weekly workflow to eval cli
* docs(eval): document suite pipeline
* chore(eval): verify pipeline refactor
* fix: address review feedback for PR #875
* docs(eval): add env example
* docs(eval): explain suites and variants
* chore(eval): organize config layouts
* chore(eval): colocate grader python evaluators
|
2026-04-29 17:21:02 -07:00 |
|