Research Judgment Checklist
When reviewing a claim, ask:
- What exactly changed?
- What exactly stayed fixed?
- Is the baseline strong?
- Is the metric aligned with the claim?
- Are compute and data controlled?
- Are results stable across seeds?
- Are slice results reported?
- Are there failure cases?
- Is there any leakage or contamination risk?
- What is the strongest conclusion the evidence actually supports?