A look at AVG (CF/DF)
Probability distribution for different averages:
Data set 1
Data set 1 -- LOG
Data set 2
Data set 2 -- LOG
Comparison of the probability of occuring exactly once, twice, and three times, given the term occurs, across different averages:
Pr( once ) v. Pr( twice )
(
log
)
Pr( once ) v. Pr( thrice )
(
log
)
Pr( twice ) v. Pr( thrice )
(
log
)