Challenges
A document model for clustering?
Fast clustering algorithms?
Cluster in SVD space
- SVD gives topics, not clusters
- but might yield better similarity metrics for clusters
Principled choice of clusters’ descriptive terms
- even harder after SVD, since can’t “print” topics