Badge, STARS and STRESS-DES evaluation. Total evaluation time used: 1h 0m.
This study did a lot of things right (well-structured code, seeds, and provision of some code towards making figures, to name a few), but I still didn’t manage to reproduce most parts. This doesn’t mean it was bad or worse than other studies! It is just likely that some mismatched parameters are causing everything to be out of sync. It shows how important it is, for enabling reproduction of results, that the model is parametrised to match the paper (including for each scenario).
As not yet able to reach consensus on reproduction, moving on to evaluation.
15.07-15.12: Badges evaluation
As in previous evaluations, marked as not having included a license, as it was added on request.
Not a complete set of materials, as I had to write some code myself (e.g. scenarios, processing results).