Data Structures
Inverted File Structure (IFS)
Document File (DF) - assign unique document ids and identify all processing tokens
Dictionary - sorted list of all tokens (words)
Inversion List (IL) - pointers from tokens to documents using them
Document-by-Term Matrix
A = [aij], where aij denotes the frequency of term j in document i