Implement MCQEval
Our built-in MCQEval framework serves as an evaluation task factory. To implement a multiple-choice eval, define two components and let openbench take care of the rest!

Component 1: Map Your Dataset to MCQ Samples
Define a mapping for dataset records to fit the expected MCQSample format:

evals/openbookqa.py
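As a rough illustration of what such a mapping might look like, the sketch below converts one OpenBookQA-style record into a prompt plus a target letter. It is not the contents of evals/openbookqa.py: the `MCQSample` dataclass here is a hypothetical stand-in so the example runs without openbench installed, and the record fields (`question_stem`, `choices`, `answerKey`) are assumed from the public OpenBookQA dataset layout.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class MCQSample:
    """Hypothetical stand-in for openbench's MCQSample (fields are assumed)."""
    input: str                      # full prompt shown to the model
    target: str                     # correct answer letter, e.g. "A"
    id: Optional[str] = None
    metadata: dict = field(default_factory=dict)


def record_to_mcq_sample(record: dict[str, Any]) -> MCQSample:
    """Map one raw dataset record to an MCQ sample."""
    labels = record["choices"]["label"]
    texts = record["choices"]["text"]
    # Render the options as "A) text", one per line.
    options = "\n".join(f"{label}) {text}" for label, text in zip(labels, texts))
    prompt = f"{record['question_stem']}\n\n{options}"
    return MCQSample(input=prompt, target=record["answerKey"], id=record.get("id"))


# Made-up record in the assumed OpenBookQA shape.
example = {
    "id": "demo-1",
    "question_stem": "Which of these is a source of light?",
    "choices": {"label": ["A", "B"], "text": ["the sun", "a rock"]},
    "answerKey": "A",
}
sample = record_to_mcq_sample(example)
```

In a real eval you would import MCQSample from openbench rather than redefining it; check the installed version for the exact fields it accepts.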
Component 2: Define Your Eval Task
Given a record_to_mcq_sample function, MCQEval will spin up an InspectAI Task.

bench eval!
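To make the "task factory" idea concrete, here is a toy sketch of the pattern: a factory takes a name, some records, and a mapping function, and returns something that can be scored. Everything in it (`make_mcq_task`, `accuracy`, the dict-based sample shape) is hypothetical and written only for illustration. It does not reproduce openbench's MCQEval or its signature, which also handles dataset loading, prompting, and scoring through Inspect AI.

```python
from typing import Any, Callable, Iterable

# Hypothetical sample shape for this sketch: {"input": prompt, "target": letter}
Sample = dict[str, str]


def make_mcq_task(
    name: str,
    records: Iterable[dict[str, Any]],
    record_to_mcq_sample: Callable[[dict[str, Any]], Sample],
) -> dict[str, Any]:
    """Toy factory: apply the mapping to every record and bundle the result."""
    return {"name": name, "samples": [record_to_mcq_sample(r) for r in records]}


def accuracy(task: dict[str, Any], answer: Callable[[str], str]) -> float:
    """Fraction of samples where the answer function returns the target letter."""
    samples = task["samples"]
    correct = sum(answer(s["input"]).strip().upper() == s["target"] for s in samples)
    return correct / len(samples)


def to_sample(record: dict[str, Any]) -> Sample:
    # The mapping function is the only dataset-specific piece.
    return {"input": record["question"], "target": record["answer"]}


records = [
    {"question": "2 + 2 = ?\nA) 3\nB) 4", "answer": "B"},
    {"question": "Capital of France?\nA) Paris\nB) Rome", "answer": "A"},
]
task = make_mcq_task("toy_mcq", records, to_sample)
always_b = accuracy(task, lambda prompt: "B")  # a "model" that always answers B
```

The point of the design is that only the mapping function changes from one dataset to the next, while the surrounding task machinery is reused as is.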