| Data set 1: |
| | cf | df | avg | std | Exactly 1 | P(exactly 1) | Exactly 2 | At least 2 | P(exactly 2) | P(at least 2) | Exactly 3 | At least 3 | P(exactly 3) | P(at least 3) | |
| cf | 1 | 0.9179 | 0.0176 | 0.8918 | 0.7674 | -0.021 | 0.8435 | 0.9696 | 0.0132 | 0.021 | 0.867 | 0.8748 | 0.0074 | 0.015 | |
| df | 0.9179 | 1 | -0.0047 | 0.6617 | 0.9554 | -0.003 | 0.9035 | 0.8321 | 0.0096 | 0.003 | 0.7822 | 0.619 | -0.0004 | -0.0056 | |
| avg | 0.0176 | -0.0047 | 1 | 0.0375 | -0.0217 | -0.7748 | 0.011 | 0.0285 | 0.1797 | 0.7748 | 0.0275 | 0.0357 | 0.3136 | 0.8615 | |
| std | 0.8918 | 0.6617 | 0.0375 | 1 | 0.4486 | -0.0316 | 0.6004 | 0.8958 | 0.0112 | 0.0316 | 0.7497 | 0.9392 | 0.0112 | 0.0313 | |
| Exactly 1 | 0.7674 | 0.9554 | -0.0217 | 0.4486 | 1 | 0.0153 | 0.8134 | 0.6313 | 0.0007 | -0.0153 | 0.608 | 0.3759 | -0.0084 | -0.0212 | |
| P(exactly 1) | -0.021 | -0.003 | -0.7748 | -0.0316 | 0.0153 | 1 | -0.0306 | -0.0367 | -0.6725 | -1 | -0.0367 | -0.034 | -0.4292 | -0.6698 | |
| Exactly 2 | 0.8435 | 0.9035 | 0.011 | 0.6004 | 0.8134 | -0.0306 | 1 | 0.8454 | 0.0367 | 0.0306 | 0.8394 | 0.5688 | 0.007 | 0.0043 | |
| At least 2 | 0.9696 | 0.8321 | 0.0285 | 0.8958 | 0.6313 | -0.0367 | 0.8454 | 1 | 0.0239 | 0.0367 | 0.9127 | 0.9202 | 0.0148 | 0.0253 | |
| P(exactly 2) | 0.0132 | 0.0096 | 0.1797 | 0.0112 | 0.0007 | -0.6725 | 0.0367 | 0.0239 | 1 | 0.6725 | 0.0164 | 0.0099 | -0.0641 | -0.0991 | |
| P(at least 2) | 0.021 | 0.003 | 0.7748 | 0.0316 | -0.0153 | -1 | 0.0306 | 0.0367 | 0.6725 | 1 | 0.0367 | 0.034 | 0.4292 | 0.6698 | |
| Exactly 3 | 0.867 | 0.7822 | 0.0275 | 0.7497 | 0.608 | -0.0367 | 0.8394 | 0.9127 | 0.0164 | 0.0367 | 1 | 0.7902 | 0.0338 | 0.0329 | |
| At least 3 | 0.8748 | 0.619 | 0.0357 | 0.9392 | 0.3759 | -0.034 | 0.5688 | 0.9202 | 0.0099 | 0.034 | 0.7902 | 1 | 0.0176 | 0.0358 | |
| P(exactly 3) | 0.0074 | -0.0004 | 0.3136 | 0.0112 | -0.0084 | -0.4292 | 0.007 | 0.0148 | -0.0641 | 0.4292 | 0.0338 | 0.0176 | 1 | 0.6414 | |
| P(at least 3) | 0.015 | -0.0056 | 0.8615 | 0.0313 | -0.0212 | -0.6698 | 0.0043 | 0.0253 | -0.0991 | 0.6698 | 0.0329 | 0.0358 | 0.6414 | 1 | |
| Data set 2: |
| | cf | df | avg | std | Exactly 1 | P(exactly 1) | Exactly 2 | At least 2 | P(exactly 2) | P(at least 2) | Exactly 3 | At least 3 | P(exactly 3) | P(at least 3) | |
| cf | 1 | 0.8674 | 0.0423 | 0.7897 | 0.6844 | -0.0522 | 0.7991 | 0.9384 | 0.0264 | 0.0522 | 0.8468 | 0.942 | 0.0241 | 0.045 | |
| df | 0.8674 | 1 | 0.0128 | 0.4456 | 0.936 | -0.0396 | 0.9311 | 0.8989 | 0.0311 | 0.0396 | 0.8673 | 0.7781 | 0.0214 | 0.0229 | |
| avg | 0.0423 | 0.0128 | 1 | 0.0571 | -0.0066 | -0.6177 | 0.0182 | 0.0346 | 0.103 | 0.6177 | 0.0268 | 0.0432 | 0.169 | 0.7444 | |
| std | 0.7897 | 0.4456 | 0.0571 | 1 | 0.2793 | -0.0384 | 0.3568 | 0.5721 | 0.0107 | 0.0384 | 0.4254 | 0.6721 | 0.0131 | 0.042 | |
| Exactly 1 | 0.6844 | 0.936 | -0.0066 | 0.2793 | 1 | -0.0178 | 0.8069 | 0.687 | 0.0231 | 0.0178 | 0.6599 | 0.5234 | 0.0115 | 0.001 | |
| P(exactly 1) | -0.0522 | -0.0396 | -0.6177 | -0.0384 | -0.0178 | 1 | -0.0557 | -0.0596 | -0.6879 | -1 | -0.0544 | -0.0561 | -0.4045 | -0.676 | |
| Exactly 2 | 0.7991 | 0.9311 | 0.0182 | 0.3568 | 0.8069 | -0.0557 | 1 | 0.9174 | 0.0474 | 0.0557 | 0.9101 | 0.7568 | 0.0256 | 0.0284 | |
| At least 2 | 0.9384 | 0.8989 | 0.0346 | 0.5721 | 0.687 | -0.0596 | 0.9174 | 1 | 0.0353 | 0.0596 | 0.9686 | 0.9545 | 0.0299 | 0.0461 | |
| P(exactly 2) | 0.0264 | 0.0311 | 0.103 | 0.0107 | 0.0231 | -0.6879 | 0.0474 | 0.0353 | 1 | 0.6879 | 0.0286 | 0.0225 | -0.0398 | -0.0698 | |
| P(at least 2) | 0.0522 | 0.0396 | 0.6177 | 0.0384 | 0.0178 | -1 | 0.0557 | 0.0596 | 0.6879 | 1 | 0.0544 | 0.0561 | 0.4045 | 0.676 | |
| Exactly 3 | 0.8468 | 0.8673 | 0.0268 | 0.4254 | 0.6599 | -0.0544 | 0.9101 | 0.9686 | 0.0286 | 0.0544 | 1 | 0.9084 | 0.0387 | 0.0457 | |
| At least 3 | 0.942 | 0.7781 | 0.0432 | 0.6721 | 0.5234 | -0.0561 | 0.7568 | 0.9545 | 0.0225 | 0.0561 | 0.9084 | 1 | 0.0299 | 0.0543 | |
| P(exactly 3) | 0.0241 | 0.0214 | 0.169 | 0.0131 | 0.0115 | -0.4045 | 0.0256 | 0.0299 | -0.0398 | 0.4045 | 0.0387 | 0.0299 | 1 | 0.5963 | |
| P(at least 3) | 0.045 | 0.0229 | 0.7444 | 0.042 | 0.001 | -0.676 | 0.0284 | 0.0461 | -0.0698 | 0.676 | 0.0457 | 0.0543 | 0.5963 | 1 | |
| Notes: |
| "At least 1" is the same as cf (corpus frequency) and is therefore omitted. |
| P(at least 1 | occurs) is always 1 and is therefore omitted. |
| P(exactly 1 | occurs) is perfectly negatively correlated (-1) with P(at least 2 | occurs), since the two sum to 1. |
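The -1 entries between P(exactly 1 | occurs) and P(at least 2 | occurs) are what one would expect from complementary quantities: given that a term occurs, it occurs either exactly once or at least twice, so one value is 1 minus the other. A minimal Python sketch of this, assuming the tables report Pearson correlation (the source does not say which coefficient was used) and using made-up probabilities rather than values from either data set:

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical values of P(exactly 1 | occurs), one per term (illustration only).
p_exactly_1 = [0.9, 0.75, 0.6, 0.82, 0.5]

# Complement: P(at least 2 | occurs) = 1 - P(exactly 1 | occurs).
p_at_least_2 = [1.0 - p for p in p_exactly_1]

r = pearson(p_exactly_1, p_at_least_2)
print(round(r, 4))
```

Because the second list is an exact decreasing linear function of the first, the coefficient is -1 regardless of which probabilities are chosen, as long as they are not all identical. The same argument explains why every other entry in the P(exactly 1) row is the negation of the corresponding entry in the P(at least 2) row.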