I attempt to compute the mAP of YOLO_v2 and SSD on the VOC2007 test set using VOCevaldet.m, but in both cases I reach lower mAPs than the values reported in the papers.
To produce the detection txt files in VOC format (a file per class), I use the command ./darknet detector valid cfg/voc.data cfg/yolo-voc.cfg weights/yolo-voc.weights for YOLO in darknet, and the python script score_ssd_pascal.py for SSD in Caffe.
What am I missing? Why don't I get the same results as in the papers?
Thanks.