WangResearchLab/verification-1.5b
2B
•
Updated
•
27
None defined yet.
Predicting Task Performance with Context-aware Scaling Laws
Budget-aware Test-time Scaling via Discriminative Verification