Objective Analysis |
We compare between objective measures and the collected user data by correlating their rankings. Kendall tau distance (Equation 3 in the paper) is used to measure the degree of correlation between the rankings induced by the two measures. This page shows results of this analysis.
|
Analysis images
We examine the bias of each metric by simulating a voting process based on the calculated distances. For each operator, we count the number of times its result is smaller than another result, and accumulate over all images and metrics. The following figure shows the distribution of metric “votes” among the operators for the different metrics we tested.
![](figures/metricVotes_ref_37.png)
The following tables show the correlation of objective and subjective measures for the full ranking, and with respect to the k top-ranked results, for k=2,3,4. For each k we also show the pairwise metric correlations (measured by correlating the rankings produced by each pair of metrics for each image).
|
Full ranking (k=inf)
Metric |
Attribute |
Total |
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.040 |
0.190 |
0.060 |
0.167 |
-0.004 |
-0.012 |
0.083 |
0.268 |
0.017 |
BDS-PM |
0.054 |
0.162 |
0.083 |
0.167 |
0.063 |
-0.024 |
0.097 |
0.232 |
0.013 |
BDW |
0.031 |
0.048 |
-0.048 |
0.060 |
0.004 |
0.119 |
0.046 |
0.181 |
0.869 |
SIFTflow |
0.097 |
0.252 |
0.119 |
0.218 |
0.085 |
0.071 |
0.145 |
0.262 |
0.031 |
EMD |
0.220 |
0.262 |
0.107 |
0.226 |
0.237 |
0.500 |
0.251 |
0.272 |
0.000 |
EH |
0.043 |
-0.076 |
-0.060 |
-0.079 |
0.103 |
0.298 |
0.004 |
0.334 |
0.641 |
CL |
-0.023 |
-0.181 |
-0.071 |
-0.183 |
-0.009 |
0.214 |
-0.068 |
0.301 |
0.384 |
Correlation with users
In each column the mean correlation coefficient is shown,
calculated over all images in the dataset with the corresponding attribute. The last three columns show the mean score, standard deviation,
and respective p-value over all image types.
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.77 |
0.48 |
0.68 |
0.14 |
-0.01 |
-0.33 |
BDS-PM |
0.77 |
1.00 |
0.53 |
0.77 |
0.15 |
0.01 |
-0.37 |
BDW |
0.48 |
0.53 |
1.00 |
0.45 |
0.29 |
0.15 |
0.04 |
SIFTflow |
0.68 |
0.77 |
0.45 |
1.00 |
0.21 |
-0.03 |
-0.36 |
EMD |
0.14 |
0.15 |
0.29 |
0.21 |
1.00 |
0.24 |
0.31 |
EH |
-0.01 |
0.01 |
0.15 |
-0.03 |
0.24 |
1.00 |
0.31 |
CL |
-0.33 |
-0.37 |
0.04 |
-0.36 |
0.31 |
0.31 |
1.00 |
Correlation between metrics
Each cell indicates the
mean rank coefficient between metric i and metric j over all images
in the dataset. |
k=2
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
-0.240 |
0.286 |
-0.208 |
0.155 |
-0.375 |
-0.500 |
-0.134 |
0.827 |
0.000 |
BDS-PM |
-0.120 |
0.279 |
-0.208 |
0.233 |
-0.078 |
-0.208 |
-0.008 |
0.800 |
0.000 |
BDW |
0.184 |
0.073 |
0.042 |
0.047 |
0.053 |
0.750 |
0.147 |
0.735 |
0.000 |
SIFTflow |
-0.126 |
0.386 |
-0.542 |
0.355 |
0.037 |
-0.151 |
0.078 |
0.831 |
0.000 |
EMD |
0.220 |
0.369 |
0.000 |
0.155 |
0.078 |
0.433 |
0.224 |
0.726 |
0.000 |
EH |
-0.150 |
-0.467 |
-0.458 |
-0.514 |
0.016 |
-0.167 |
-0.236 |
0.810 |
0.000 |
CL |
-0.676 |
-0.794 |
-0.792 |
-0.861 |
-0.688 |
-0.234 |
-0.673 |
0.593 |
0.000 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.83 |
0.11 |
0.64 |
0.20 |
-0.44 |
-0.86 |
BDS-PM |
0.83 |
1.00 |
0.17 |
0.71 |
0.18 |
-0.40 |
-0.86 |
BDW |
0.11 |
0.17 |
1.00 |
-0.09 |
0.51 |
-0.18 |
-0.26 |
SIFTflow |
0.64 |
0.71 |
-0.09 |
1.00 |
0.25 |
-0.36 |
-0.97 |
EMD |
0.20 |
0.18 |
0.51 |
0.25 |
1.00 |
-0.01 |
-0.25 |
EH |
-0.44 |
-0.40 |
-0.18 |
-0.36 |
-0.01 |
1.00 |
0.01 |
CL |
-0.86 |
-0.86 |
-0.26 |
-0.97 |
-0.25 |
0.01 |
1.00 |
Correlation between metrics |
k=3
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.062 |
0.280 |
0.134 |
0.249 |
-0.025 |
-0.247 |
0.108 |
0.532 |
0.005 |
BDS-PM |
0.165 |
0.299 |
0.273 |
0.350 |
0.221 |
-0.052 |
0.210 |
0.464 |
0.000 |
BDW |
0.213 |
0.141 |
0.123 |
0.115 |
0.212 |
0.439 |
0.200 |
0.395 |
0.002 |
SIFTflow |
0.241 |
0.428 |
0.312 |
0.442 |
0.303 |
0.002 |
0.298 |
0.483 |
0.000 |
EMD |
0.301 |
0.416 |
0.216 |
0.295 |
0.226 |
0.534 |
0.326 |
0.496 |
0.000 |
EH |
-0.036 |
-0.207 |
-0.331 |
-0.177 |
0.111 |
0.294 |
-0.071 |
0.593 |
0.013 |
CL |
-0.307 |
-0.336 |
-0.433 |
-0.519 |
-0.366 |
0.088 |
-0.320 |
0.543 |
0.000 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.80 |
0.38 |
0.69 |
0.27 |
-0.21 |
-0.64 |
BDS-PM |
0.80 |
1.00 |
0.35 |
0.76 |
0.24 |
-0.25 |
-0.76 |
BDW |
0.38 |
0.35 |
1.00 |
0.25 |
0.42 |
0.14 |
-0.03 |
SIFTflow |
0.69 |
0.76 |
0.25 |
1.00 |
0.30 |
-0.24 |
-0.76 |
EMD |
0.27 |
0.24 |
0.42 |
0.30 |
1.00 |
0.12 |
0.09 |
EH |
-0.21 |
-0.25 |
0.14 |
-0.24 |
0.12 |
1.00 |
0.27 |
CL |
-0.64 |
-0.76 |
-0.03 |
-0.76 |
0.09 |
0.27 |
1.00 |
Correlation between metrics |
k=4
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.063 |
0.236 |
0.122 |
0.146 |
-0.010 |
-0.057 |
0.097 |
0.392 |
0.052 |
BDS-PM |
0.156 |
0.278 |
0.215 |
0.262 |
0.148 |
0.018 |
0.187 |
0.338 |
0.003 |
BDW |
0.077 |
0.084 |
0.005 |
0.082 |
0.030 |
0.164 |
0.092 |
0.287 |
0.102 |
SIFTflow |
0.218 |
0.416 |
0.354 |
0.360 |
0.214 |
0.148 |
0.272 |
0.376 |
0.000 |
EMD |
0.299 |
0.429 |
0.154 |
0.297 |
0.243 |
0.676 |
0.338 |
0.436 |
0.000 |
EH |
0.031 |
-0.014 |
-0.049 |
-0.030 |
0.108 |
0.349 |
0.021 |
0.401 |
0.003 |
CL |
-0.112 |
-0.230 |
-0.137 |
-0.277 |
-0.158 |
0.155 |
-0.155 |
0.394 |
0.000 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.79 |
0.47 |
0.68 |
0.18 |
-0.11 |
-0.47 |
BDS-PM |
0.79 |
1.00 |
0.45 |
0.79 |
0.18 |
-0.11 |
-0.59 |
BDW |
0.47 |
0.45 |
1.00 |
0.37 |
0.30 |
0.13 |
-0.07 |
SIFTflow |
0.68 |
0.79 |
0.37 |
1.00 |
0.24 |
-0.13 |
-0.57 |
EMD |
0.18 |
0.18 |
0.30 |
0.24 |
1.00 |
0.22 |
0.23 |
EH |
-0.11 |
-0.11 |
0.13 |
-0.13 |
0.22 |
1.00 |
0.30 |
CL |
-0.47 |
-0.59 |
-0.07 |
-0.57 |
0.23 |
0.30 |
1.00 |
Correlation between metrics |
|
All user study images
When considering all images, we normalize by the number of times each result was chosen by the number of times it was shown. This number might vary between images with different number of results as different designs are used.
![](figures/metricVotes_ref.png)
|
Full ranking (k=inf)
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.069 |
0.204 |
0.080 |
0.164 |
-0.018 |
0.106 |
0.081 |
0.305 |
0.056 |
BDS-PM |
0.048 |
0.157 |
0.021 |
0.135 |
0.030 |
0.041 |
0.069 |
0.265 |
0.038 |
BDW |
0.045 |
0.089 |
-0.077 |
0.069 |
0.002 |
0.088 |
0.056 |
0.197 |
0.191 |
SIFTflow |
0.111 |
0.249 |
0.071 |
0.214 |
0.080 |
0.167 |
0.124 |
0.317 |
0.078 |
EMD |
0.269 |
0.341 |
0.098 |
0.222 |
0.278 |
0.462 |
0.278 |
0.325 |
0.000 |
EH |
0.000 |
-0.108 |
-0.015 |
-0.112 |
0.012 |
0.095 |
-0.031 |
0.339 |
0.477 |
CL |
-0.035 |
-0.170 |
-0.083 |
-0.219 |
-0.034 |
-0.003 |
-0.053 |
0.342 |
0.560 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.78 |
0.52 |
0.70 |
0.16 |
-0.06 |
-0.33 |
BDS-PM |
0.78 |
1.00 |
0.57 |
0.76 |
0.17 |
-0.06 |
-0.36 |
BDW |
0.52 |
0.57 |
1.00 |
0.48 |
0.29 |
0.07 |
0.00 |
SIFTflow |
0.70 |
0.76 |
0.48 |
1.00 |
0.20 |
-0.08 |
-0.36 |
EMD |
0.16 |
0.17 |
0.29 |
0.20 |
1.00 |
0.18 |
0.30 |
EH |
-0.06 |
-0.06 |
0.07 |
-0.08 |
0.18 |
1.00 |
0.36 |
CL |
-0.33 |
-0.36 |
0.00 |
-0.36 |
0.30 |
0.36 |
1.00 |
Correlation between metrics |
k=2
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
-0.053 |
0.322 |
0.107 |
0.202 |
-0.225 |
-0.071 |
-0.023 |
0.794 |
0.000 |
BDS-PM |
-0.017 |
0.288 |
0.107 |
0.249 |
-0.031 |
0.104 |
0.005 |
0.777 |
0.000 |
BDW |
0.180 |
0.162 |
-0.071 |
0.064 |
0.140 |
0.536 |
0.133 |
0.737 |
0.000 |
SIFTflow |
-0.023 |
0.311 |
-0.232 |
0.253 |
0.024 |
0.206 |
0.034 |
0.838 |
0.000 |
EMD |
0.250 |
0.384 |
-0.023 |
0.143 |
0.152 |
0.384 |
0.224 |
0.719 |
0.000 |
EH |
-0.226 |
-0.533 |
-0.259 |
-0.528 |
-0.107 |
-0.357 |
-0.300 |
0.783 |
0.000 |
CL |
-0.613 |
-0.738 |
-0.411 |
-0.854 |
-0.585 |
-0.369 |
-0.616 |
0.634 |
0.000 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.85 |
0.24 |
0.65 |
0.25 |
-0.52 |
-0.80 |
BDS-PM |
0.85 |
1.00 |
0.28 |
0.67 |
0.19 |
-0.48 |
-0.88 |
BDW |
0.24 |
0.28 |
1.00 |
0.08 |
0.51 |
-0.35 |
-0.33 |
SIFTflow |
0.65 |
0.67 |
0.08 |
1.00 |
0.24 |
-0.45 |
-0.90 |
EMD |
0.25 |
0.19 |
0.51 |
0.24 |
1.00 |
-0.14 |
-0.22 |
EH |
-0.52 |
-0.48 |
-0.35 |
-0.45 |
-0.14 |
1.00 |
0.10 |
CL |
-0.80 |
-0.88 |
-0.33 |
-0.90 |
-0.22 |
0.10 |
1.00 |
Correlation between metrics |
k=3
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.079 |
0.316 |
0.026 |
0.205 |
-0.050 |
-0.015 |
0.093 |
0.562 |
0.000 |
BDS-PM |
0.093 |
0.288 |
0.117 |
0.290 |
0.086 |
0.094 |
0.139 |
0.520 |
0.000 |
BDW |
0.180 |
0.169 |
-0.023 |
0.102 |
0.159 |
0.319 |
0.172 |
0.411 |
0.003 |
SIFTflow |
0.197 |
0.395 |
0.171 |
0.383 |
0.195 |
0.188 |
0.220 |
0.527 |
0.000 |
EMD |
0.305 |
0.489 |
0.183 |
0.301 |
0.236 |
0.497 |
0.328 |
0.444 |
0.000 |
EH |
-0.098 |
-0.221 |
-0.223 |
-0.173 |
-0.030 |
0.043 |
-0.127 |
0.567 |
0.006 |
CL |
-0.281 |
-0.368 |
-0.225 |
-0.542 |
-0.331 |
-0.162 |
-0.294 |
0.551 |
0.000 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.81 |
0.46 |
0.71 |
0.28 |
-0.25 |
-0.63 |
BDS-PM |
0.81 |
1.00 |
0.44 |
0.76 |
0.25 |
-0.29 |
-0.72 |
BDW |
0.46 |
0.44 |
1.00 |
0.36 |
0.41 |
0.03 |
-0.09 |
SIFTflow |
0.71 |
0.76 |
0.36 |
1.00 |
0.27 |
-0.27 |
-0.73 |
EMD |
0.28 |
0.25 |
0.41 |
0.27 |
1.00 |
0.03 |
0.10 |
EH |
-0.25 |
-0.29 |
0.03 |
-0.27 |
0.03 |
1.00 |
0.32 |
CL |
-0.63 |
-0.72 |
-0.09 |
-0.73 |
0.10 |
0.32 |
1.00 |
Correlation between metrics |
k=4
Metric |
Attribute |
Total
|
Lines/Edges |
Faces/People |
Texture |
Foreground Objects |
Geometric Structures |
Symmetry |
Mean |
Std |
p-value |
BDS |
0.077 |
0.255 |
0.084 |
0.168 |
-0.028 |
0.087 |
0.099 |
0.399 |
0.000 |
BDS-PM |
0.114 |
0.271 |
0.100 |
0.242 |
0.074 |
0.064 |
0.143 |
0.364 |
0.000 |
BDW |
0.066 |
0.119 |
-0.051 |
0.090 |
0.005 |
0.111 |
0.087 |
0.282 |
0.695 |
SIFTflow |
0.206 |
0.384 |
0.246 |
0.342 |
0.168 |
0.255 |
0.224 |
0.416 |
0.000 |
EMD |
0.330 |
0.471 |
0.172 |
0.305 |
0.294 |
0.600 |
0.349 |
0.416 |
0.000 |
EH |
-0.015 |
-0.066 |
0.043 |
-0.066 |
0.002 |
0.139 |
-0.033 |
0.407 |
0.001 |
CL |
-0.118 |
-0.228 |
-0.104 |
-0.319 |
-0.147 |
-0.050 |
-0.139 |
0.438 |
0.002 |
Correlation with users
Metric |
BDS |
BDS-PM |
BDW |
SIFTflow |
EMD |
EH |
CL |
BDS |
1.00 |
0.80 |
0.51 |
0.71 |
0.20 |
-0.16 |
-0.49 |
BDS-PM |
0.80 |
1.00 |
0.52 |
0.77 |
0.20 |
-0.17 |
-0.57 |
BDW |
0.51 |
0.52 |
1.00 |
0.43 |
0.32 |
0.03 |
-0.11 |
SIFTflow |
0.71 |
0.77 |
0.43 |
1.00 |
0.23 |
-0.18 |
-0.56 |
EMD |
0.20 |
0.20 |
0.32 |
0.23 |
1.00 |
0.13 |
0.22 |
EH |
-0.16 |
-0.17 |
0.03 |
-0.18 |
0.13 |
1.00 |
0.34 |
CL |
-0.49 |
-0.57 |
-0.11 |
-0.56 |
0.22 |
0.34 |
1.00 |
Correlation between metrics |
|