Data set 1:
cf df avg std Exactly 1 P(exactly 1) Exactly 2 At least 2 P(exactly 2) P(at least 2) Exactly 3 At least 3 P(exactly 3) P(at least 3)
cf 1 0.9179 0.0176 0.8918 0.7674 -0.021 0.8435 0.9696 0.0132 0.021 0.867 0.8748 0.0074 0.015
df 0.9179 1 -0.0047 0.6617 0.9554 -0.003 0.9035 0.8321 0.0096 0.003 0.7822 0.619 -0.0004 -0.0056
avg 0.0176 -0.0047 1 0.0375 -0.0217 -0.7748 0.011 0.0285 0.1797 0.7748 0.0275 0.0357 0.3136 0.8615
std 0.8918 0.6617 0.0375 1 0.4486 -0.0316 0.6004 0.8958 0.0112 0.0316 0.7497 0.9392 0.0112 0.0313
Exactly 1 0.7674 0.9554 -0.0217 0.4486 1 0.0153 0.8134 0.6313 0.0007 -0.0153 0.608 0.3759 -0.0084 -0.0212
P(exactly 1) -0.021 -0.003 -0.7748 -0.0316 0.0153 1 -0.0306 -0.0367 -0.6725 -1 -0.0367 -0.034 -0.4292 -0.6698
Exactly 2 0.8435 0.9035 0.011 0.6004 0.8134 -0.0306 1 0.8454 0.0367 0.0306 0.8394 0.5688 0.007 0.0043
At least 2 0.9696 0.8321 0.0285 0.8958 0.6313 -0.0367 0.8454 1 0.0239 0.0367 0.9127 0.9202 0.0148 0.0253
P(exactly 2) 0.0132 0.0096 0.1797 0.0112 0.0007 -0.6725 0.0367 0.0239 1 0.6725 0.0164 0.0099 -0.0641 -0.0991
P(at least 2) 0.021 0.003 0.7748 0.0316 -0.0153 -1 0.0306 0.0367 0.6725 1 0.0367 0.034 0.4292 0.6698
Exactly 3 0.867 0.7822 0.0275 0.7497 0.608 -0.0367 0.8394 0.9127 0.0164 0.0367 1 0.7902 0.0338 0.0329
At least 3 0.8748 0.619 0.0357 0.9392 0.3759 -0.034 0.5688 0.9202 0.0099 0.034 0.7902 1 0.0176 0.0358
P(exactly 3) 0.0074 -0.0004 0.3136 0.0112 -0.0084 -0.4292 0.007 0.0148 -0.0641 0.4292 0.0338 0.0176 1 0.6414
P(at least 3) 0.015 -0.0056 0.8615 0.0313 -0.0212 -0.6698 0.0043 0.0253 -0.0991 0.6698 0.0329 0.0358 0.6414 1
Data set 2:
cf df avg std Exactly 1 P(exactly 1) Exactly 2 At least 2 P(exactly 2) P(at least 2) Exactly 3 At least 3 P(exactly 3) P(at least 3)
cf 1 0.8674 0.0423 0.7897 0.6844 -0.0522 0.7991 0.9384 0.0264 0.0522 0.8468 0.942 0.0241 0.045
df 0.8674 1 0.0128 0.4456 0.936 -0.0396 0.9311 0.8989 0.0311 0.0396 0.8673 0.7781 0.0214 0.0229
avg 0.0423 0.0128 1 0.0571 -0.0066 -0.6177 0.0182 0.0346 0.103 0.6177 0.0268 0.0432 0.169 0.7444
std 0.7897 0.4456 0.0571 1 0.2793 -0.0384 0.3568 0.5721 0.0107 0.0384 0.4254 0.6721 0.0131 0.042
Exactly 1 0.6844 0.936 -0.0066 0.2793 1 -0.0178 0.8069 0.687 0.0231 0.0178 0.6599 0.5234 0.0115 0.001
P(exactly 1) -0.0522 -0.0396 -0.6177 -0.0384 -0.0178 1 -0.0557 -0.0596 -0.6879 -1 -0.0544 -0.0561 -0.4045 -0.676
Exactly 2 0.7991 0.9311 0.0182 0.3568 0.8069 -0.0557 1 0.9174 0.0474 0.0557 0.9101 0.7568 0.0256 0.0284
At least 2 0.9384 0.8989 0.0346 0.5721 0.687 -0.0596 0.9174 1 0.0353 0.0596 0.9686 0.9545 0.0299 0.0461
P(exactly 2) 0.0264 0.0311 0.103 0.0107 0.0231 -0.6879 0.0474 0.0353 1 0.6879 0.0286 0.0225 -0.0398 -0.0698
P(at least 2) 0.0522 0.0396 0.6177 0.0384 0.0178 -1 0.0557 0.0596 0.6879 1 0.0544 0.0561 0.4045 0.676
Exactly 3 0.8468 0.8673 0.0268 0.4254 0.6599 -0.0544 0.9101 0.9686 0.0286 0.0544 1 0.9084 0.0387 0.0457
At least 3 0.942 0.7781 0.0432 0.6721 0.5234 -0.0561 0.7568 0.9545 0.0225 0.0561 0.9084 1 0.0299 0.0543
P(exactly 3) 0.0241 0.0214 0.169 0.0131 0.0115 -0.4045 0.0256 0.0299 -0.0398 0.4045 0.0387 0.0299 1 0.5963
P(at least 3) 0.045 0.0229 0.7444 0.042 0.001 -0.676 0.0284 0.0461 -0.0698 0.676 0.0457 0.0543 0.5963 1
Notes:
At least 1 is the same as cf (corpus frequency) and thus deleted.
P(at least 1 | occurs) = 1, and thus deleted
P(exactly 1 | occurs) is negatively correlated with P(at least 2|occurs)