Skip to main content

📏

Module 25Benchmarking & Experimental Methodology

Module 25: Benchmarking & Experimental Methodology - How to Run Fair Experiments and Not Fool Yourself

The difference between science and tinkering is methodology. This module teaches rigorous benchmarking: BBOB/COCO, statistical testing, performance profiles, and how to avoid the common sins of EC experimentation.

Learning Objectives

Use COCO/BBOB to benchmark an algorithm on 24 functions
Generate ECDF plots and interpret them
Perform statistical tests (Mann-Whitney, Wilcoxon signed-rank)
Understand effect sizes and performance profiles
Know the No Free Lunch theorem and its practical implications

Concept Explanation

Coming soon.

Code Examples

Coming soon.

Exercises

Coming soon.

Milestone Checklist

Ran a COCO benchmark experiment
Generated proper ECDF plots
Performed statistical comparison of two algorithms
Understand why cherry-picked benchmarks are misleading

Was this page helpful?

Learning Objectives
Concept Explanation
Code Examples
Exercises
Milestone Checklist