Data set 1:

| | cf | df | avg | std | Exactly 1 | P(exactly 1) | Exactly 2 | At least 2 | P(exactly 2) | P(at least 2) | Exactly 3 | At least 3 | P(exactly 3) | P(at least 3) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cf | 1 | 0.9179 | 0.0176 | 0.8918 | 0.7674 | -0.021 | 0.8435 | 0.9696 | 0.0132 | 0.021 | 0.867 | 0.8748 | 0.0074 | 0.015 |
| df | 0.9179 | 1 | -0.0047 | 0.6617 | 0.9554 | -0.003 | 0.9035 | 0.8321 | 0.0096 | 0.003 | 0.7822 | 0.619 | -0.0004 | -0.0056 |
| avg | 0.0176 | -0.0047 | 1 | 0.0375 | -0.0217 | -0.7748 | 0.011 | 0.0285 | 0.1797 | 0.7748 | 0.0275 | 0.0357 | 0.3136 | 0.8615 |
| std | 0.8918 | 0.6617 | 0.0375 | 1 | 0.4486 | -0.0316 | 0.6004 | 0.8958 | 0.0112 | 0.0316 | 0.7497 | 0.9392 | 0.0112 | 0.0313 |
| Exactly 1 | 0.7674 | 0.9554 | -0.0217 | 0.4486 | 1 | 0.0153 | 0.8134 | 0.6313 | 0.0007 | -0.0153 | 0.608 | 0.3759 | -0.0084 | -0.0212 |
| P(exactly 1) | -0.021 | -0.003 | -0.7748 | -0.0316 | 0.0153 | 1 | -0.0306 | -0.0367 | -0.6725 | -1 | -0.0367 | -0.034 | -0.4292 | -0.6698 |
| Exactly 2 | 0.8435 | 0.9035 | 0.011 | 0.6004 | 0.8134 | -0.0306 | 1 | 0.8454 | 0.0367 | 0.0306 | 0.8394 | 0.5688 | 0.007 | 0.0043 |
| At least 2 | 0.9696 | 0.8321 | 0.0285 | 0.8958 | 0.6313 | -0.0367 | 0.8454 | 1 | 0.0239 | 0.0367 | 0.9127 | 0.9202 | 0.0148 | 0.0253 |
| P(exactly 2) | 0.0132 | 0.0096 | 0.1797 | 0.0112 | 0.0007 | -0.6725 | 0.0367 | 0.0239 | 1 | 0.6725 | 0.0164 | 0.0099 | -0.0641 | -0.0991 |
| P(at least 2) | 0.021 | 0.003 | 0.7748 | 0.0316 | -0.0153 | -1 | 0.0306 | 0.0367 | 0.6725 | 1 | 0.0367 | 0.034 | 0.4292 | 0.6698 |
| Exactly 3 | 0.867 | 0.7822 | 0.0275 | 0.7497 | 0.608 | -0.0367 | 0.8394 | 0.9127 | 0.0164 | 0.0367 | 1 | 0.7902 | 0.0338 | 0.0329 |
| At least 3 | 0.8748 | 0.619 | 0.0357 | 0.9392 | 0.3759 | -0.034 | 0.5688 | 0.9202 | 0.0099 | 0.034 | 0.7902 | 1 | 0.0176 | 0.0358 |
| P(exactly 3) | 0.0074 | -0.0004 | 0.3136 | 0.0112 | -0.0084 | -0.4292 | 0.007 | 0.0148 | -0.0641 | 0.4292 | 0.0338 | 0.0176 | 1 | 0.6414 |
| P(at least 3) | 0.015 | -0.0056 | 0.8615 | 0.0313 | -0.0212 | -0.6698 | 0.0043 | 0.0253 | -0.0991 | 0.6698 | 0.0329 | 0.0358 | 0.6414 | 1 |

Data set 2:

| | cf | df | avg | std | Exactly 1 | P(exactly 1) | Exactly 2 | At least 2 | P(exactly 2) | P(at least 2) | Exactly 3 | At least 3 | P(exactly 3) | P(at least 3) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| cf | 1 | 0.8674 | 0.0423 | 0.7897 | 0.6844 | -0.0522 | 0.7991 | 0.9384 | 0.0264 | 0.0522 | 0.8468 | 0.942 | 0.0241 | 0.045 |
| df | 0.8674 | 1 | 0.0128 | 0.4456 | 0.936 | -0.0396 | 0.9311 | 0.8989 | 0.0311 | 0.0396 | 0.8673 | 0.7781 | 0.0214 | 0.0229 |
| avg | 0.0423 | 0.0128 | 1 | 0.0571 | -0.0066 | -0.6177 | 0.0182 | 0.0346 | 0.103 | 0.6177 | 0.0268 | 0.0432 | 0.169 | 0.7444 |
| std | 0.7897 | 0.4456 | 0.0571 | 1 | 0.2793 | -0.0384 | 0.3568 | 0.5721 | 0.0107 | 0.0384 | 0.4254 | 0.6721 | 0.0131 | 0.042 |
| Exactly 1 | 0.6844 | 0.936 | -0.0066 | 0.2793 | 1 | -0.0178 | 0.8069 | 0.687 | 0.0231 | 0.0178 | 0.6599 | 0.5234 | 0.0115 | 0.001 |
| P(exactly 1) | -0.0522 | -0.0396 | -0.6177 | -0.0384 | -0.0178 | 1 | -0.0557 | -0.0596 | -0.6879 | -1 | -0.0544 | -0.0561 | -0.4045 | -0.676 |
| Exactly 2 | 0.7991 | 0.9311 | 0.0182 | 0.3568 | 0.8069 | -0.0557 | 1 | 0.9174 | 0.0474 | 0.0557 | 0.9101 | 0.7568 | 0.0256 | 0.0284 |
| At least 2 | 0.9384 | 0.8989 | 0.0346 | 0.5721 | 0.687 | -0.0596 | 0.9174 | 1 | 0.0353 | 0.0596 | 0.9686 | 0.9545 | 0.0299 | 0.0461 |
| P(exactly 2) | 0.0264 | 0.0311 | 0.103 | 0.0107 | 0.0231 | -0.6879 | 0.0474 | 0.0353 | 1 | 0.6879 | 0.0286 | 0.0225 | -0.0398 | -0.0698 |
| P(at least 2) | 0.0522 | 0.0396 | 0.6177 | 0.0384 | 0.0178 | -1 | 0.0557 | 0.0596 | 0.6879 | 1 | 0.0544 | 0.0561 | 0.4045 | 0.676 |
| Exactly 3 | 0.8468 | 0.8673 | 0.0268 | 0.4254 | 0.6599 | -0.0544 | 0.9101 | 0.9686 | 0.0286 | 0.0544 | 1 | 0.9084 | 0.0387 | 0.0457 |
| At least 3 | 0.942 | 0.7781 | 0.0432 | 0.6721 | 0.5234 | -0.0561 | 0.7568 | 0.9545 | 0.0225 | 0.0561 | 0.9084 | 1 | 0.0299 | 0.0543 |
| P(exactly 3) | 0.0241 | 0.0214 | 0.169 | 0.0131 | 0.0115 | -0.4045 | 0.0256 | 0.0299 | -0.0398 | 0.4045 | 0.0387 | 0.0299 | 1 | 0.5963 |
| P(at least 3) | 0.045 | 0.0229 | 0.7444 | 0.042 | 0.001 | -0.676 | 0.0284 | 0.0461 | -0.0698 | 0.676 | 0.0457 | 0.0543 | 0.5963 | 1 |

Notes:

At least 1 is the same as cf (corpus frequency) and is thus omitted.

P(at least 1 | occurs) = 1 and is thus omitted.

P(exactly 1 | occurs) is perfectly negatively correlated (r = -1) with P(at least 2 | occurs), because the two probabilities sum to 1.
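
The -1 entries in both tables follow from that complement relation, and a small sketch can make it concrete. The counts below are synthetic and purely hypothetical (they are not the data behind the tables), and the sketch assumes that "Exactly k" counts the documents in which a term occurs exactly k times and that each P(...) value divides by the number of documents containing the term.

```python
import math
import random


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


random.seed(0)

# Hypothetical per-term document counts (assumption, for illustration only):
# documents with exactly 1, exactly 2, and at least 3 occurrences of the term.
terms = [
    (random.randint(1, 50), random.randint(0, 20), random.randint(0, 10))
    for _ in range(200)
]

p_exactly_1 = []
p_at_least_2 = []
for exactly_1, exactly_2, at_least_3 in terms:
    docs_with_term = exactly_1 + exactly_2 + at_least_3
    p_exactly_1.append(exactly_1 / docs_with_term)
    p_at_least_2.append((exactly_2 + at_least_3) / docs_with_term)

# The two conditional probabilities are complements, so r is -1
# up to floating-point rounding.
r = pearson(p_exactly_1, p_at_least_2)
print(round(r, 4))
```

Because one column is an exact linear function of the other with slope -1, the result does not depend on how the synthetic counts are drawn, as long as they vary across terms.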