AI competitions and benchmarks: the science behind the contests