-
Notifications
You must be signed in to change notification settings - Fork 201
Pull requests: UKGovernmentBEIS/inspect_evals
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Restrict shuffle parameter to Literal type in personality_TRAIT #733
#743
opened Dec 4, 2025 by
Sidhi-03
Loading…
Add CLAUDE.md workflow for testing an evaluation to the evaluation checklist
#741
opened Dec 4, 2025 by
Jay-Bailey
Loading…
NoveltyBench benchmark implementation
implementation
An implementation of a new eval
#717
opened Dec 1, 2025 by
iphan
Loading…
Fix/issue 685 abstention types
enhancement
New feature or request
#695
opened Nov 21, 2025 by
mjbroerman
Loading…
Add emergent misalignment evals
implementation
An implementation of a new eval
#682
opened Nov 17, 2025 by
dtch1997
Loading…
14 of 17 tasks
Fix issues uncovered when running all pre-commit hooks & add GH action to enforce it
enhancement
New feature or request
Remove New feature or request
as_posix() calls & add custom POSIX-check pre-commit
enhancement
#666
opened Nov 11, 2025 by
AnselmC
Loading…
GDPval Implementation
implementation
An implementation of a new eval
#598
opened Oct 11, 2025 by
jeqcho
Loading…
Swe Lancer implementation
implementation
An implementation of a new eval
#352
opened May 26, 2025 by
NelsonG-C
Loading…
2 of 6 tasks
ProTip!
Mix and match filters to narrow down what you’re looking for.