rafa-ruiz | fe43cd6fa9 | scripts documentation | 2026-03-26 07:51:01 -07:00
rafa-ruiz | ccd9073a52 | feat(dataset): add ADR-0006 and scaffold reward algorithm pipeline | 2026-03-25 22:19:19 -07:00
rafa-ruiz | 90857e1b0a | UPDATE: Modified LRM and generate_mbap.py to ensure better samples | 2026-03-11 20:09:05 -07:00
rafa-ruiz | b5167b71e3 | UPDATE: Sample generator now includes a new key in each item. | 2026-03-11 12:22:08 -07:00
rafa-ruiz | 35ca56118d | feat: add MBPP-style dataset generator and evaluation docs | 2026-03-10 13:37:19 -07:00