Implementing Vector Space
Goal:
- linear preprocessing
- interactive query processing
Want maximum entries in product q^T A
Best known method: inverted index
- for each query term, list documents containing it
- accumulate dot products
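
A minimal sketch of the inverted-index method, assuming sparse vectors stored as Python dicts (term -> weight). The names build_inverted_index and top_k and the sample data are illustrative, not from the original notes. Each query term pulls its posting list, and per-document accumulators collect the partial dot products, so only the nonzero entries of q^T A are ever touched.

```python
from collections import defaultdict
import heapq

def build_inverted_index(docs):
    """Preprocessing: one pass over all nonzero (document, term) weights.

    docs maps doc_id -> {term: weight}. Returns term -> [(doc_id, weight)].
    """
    index = defaultdict(list)
    for doc_id, weights in docs.items():
        for term, w in weights.items():
            if w != 0:
                index[term].append((doc_id, w))
    return index

def top_k(index, query, k):
    """Query time: accumulate dot products, then return the k largest."""
    scores = defaultdict(float)  # one accumulator per candidate document
    for term, qw in query.items():
        # Only documents containing this query term are visited.
        for doc_id, dw in index.get(term, ()):
            scores[doc_id] += qw * dw
    return heapq.nlargest(k, scores.items(), key=lambda kv: kv[1])

# Hypothetical toy collection for illustration only.
docs = {
    "d1": {"vector": 2, "space": 1},
    "d2": {"space": 4, "index": 1},
    "d3": {"index": 2},
}
index = build_inverted_index(docs)
print(top_k(index, {"vector": 1, "space": 1}, k=2))
```

Documents sharing no term with the query (d3 here) never get an accumulator, which is where the savings over a dense product come from.
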
Nothing better known than brute force (inverted index only exploits sparsity)
Challenge: high-dimensional near neighbors