[Dataset] the MIT LabelMe
[Scene] Street; Collected from 17 Folders (Table 3)
[dataset selection] Table 1
[6 object classes] building, tree, sky, car, road, body (person).
[Image preprocessing] downscale to less than 400*300 while keeping the aspect ratio.
[Accuracy] Table 2; (the classification performance of human body, cars and
trees is NOT good).
[Testing results] [download 24M] (up left: input image; up right: ground truth; down left: boosting result; down right: probability map of ground truth;)
[Parameter settings] For Cascaded boosting: 15 layers; 800 weak classifiers per layer; DetectRate = 0.998; False Positive Rate = 0.7;
[Reference] Long Zhu, Yuanhao Chen, Yuan Lin, Chenxi Lin, Alan Yuille:
Recursive Segmentation and Recognition Templates for 2D Parsing. NIPS 2008.
Table 1: Dataset
Dataset |
Dataset A |
Dataset B |
Dataset C |
Dataset D |
Selection |
10% labeled |
randomly selected from A |
80% labeled; randomly selected from {A-B} |
80% labeled; {A-B-C} |
Number of images |
1942 |
500 |
300 |
400 |
Usage |
|
boosting training |
Boosting testing; HIM training |
|
Processing Time |
|
training: 30 hours |
testing: 3 minutes /image |
|
Accuracy |
|
|
86.0 |
|
Table 2: Testing Results.
Global Accuracy : 86.0 |
Average Accuracy : 70.1 |
Confusion Matrix: |
Table 3: 17 Folders containing street scene from the MIT LabelMe dataset; http://labelme.csail.mit.edu/
05june05_static_street_porter |
boston_static_street |
database_static_street |
dec_static_street |
paris_static_street |
static_barcelona_street_city_outdoor_2_2005 |
static_boston_street_april |
static_harvard_outdoor_street |
static_madrid_outdoor_august_2005 |
static_newyork_city_urban |
static_outdoor_street_berkeley |
static_outdoor_street_campus_maryland_usa |
static_outdoor_street_city_cambridge_uk |
static_outdoor_street_village_puigpunyent_mallorca_spain |
static_outdoor_urban_city_london_uk |
static_outdoor_urban_city_street_boston |
static_outdoor_urban_city_street_coruna_spain |