
Pull requests: huggingface/lighteval


Pull requests list

Fix callable type hint in parallelism helper
#1239 opened May 20, 2026 by GoparapukethaN
docs: fix custom model examples
#1237 opened May 15, 2026 by MukundaKatta
fix typo
#1235 opened May 8, 2026 by fpetrakov
Popotest patch 1
#1231 opened May 5, 2026 by popotest
test: style-bot trigger
#1221 opened May 4, 2026 by paulinebm (Contributor)
Add Bayes@N metric
#1219 opened Apr 29, 2026 by mohsenhariri
Log per-sample details as trackio.Trace in push_to_wandb
#1217 opened Apr 27, 2026 by abidlabs (Member)
Add LICA-Bench: graphic design VLM evaluation (39 tasks, 7 domains)
#1212 opened Apr 15, 2026 by purvanshi
3 of 4 tasks
POLLUX LLM-Judge metric
#1210 opened Apr 10, 2026 by ulyanaisaeva
catch task has no docs instead of throw
#1207 opened Apr 8, 2026 by BuiHoangTu
add multilingual flag to vllm
#1206 opened Apr 8, 2026 by BuiHoangTu
Add --load-tasks-multilingual and fix --custom-tasks for inspect backend
#1199 opened Mar 25, 2026 by dzautner
4 tasks done
[Bugfix] Check all responses when n>1 instead of only the first one
#1197 opened Mar 23, 2026 by eldarkurtic (Contributor)
[Litellm Enhancement] Enable extra sampling args for litellm backend
#1195 opened Mar 20, 2026 by eldarkurtic (Contributor)
Fix litellm connection pool limiting concurrent_requests
#1190 opened Mar 18, 2026 by sihyeonn