Kyle Lahnakoski

This document has moved

December 2003

Correlated Words

Review

The Co-relation Coefficient

Analysis of the Co-relation Coefficient

Notes on (the lack of) Symmetry

    1. The form does not help indicate what word is a better choice for indicating spam. The PAB = PA (n PB) formula includes the assumption that A is being used to indicate spam, and we are solving the adjustments needed to include B in our Bayesian filter.
    2. The value for n when PAB = PA is complicated, and does not reveal the important conclusion that B is worth relatively nothing.

Implementation Challenges

Conclusion

kyle@lahnakoski.com