MG = Managing Gigabytes; MIR = Modern Information Retrieval
| date | topic | resources | |||||
| Oct 22 | inverted index | ppt | MG Ch.s 3.2, 8.2 | ||||
| syllabus | CS276a at Stanford | ||||||
| Oct 25 | project ideas | pdf1 | ppt1 | ||||
| project resources | pdf2 | ppt2 | |||||
| lucene | pdf3 | ppt3 | |||||
| Oct 29 | more on indexes | ppt | MG Ch.s 3.6, 4.3 | ||||
| Nov 5 | index compression | html | 4in1 | ppt | MG Ch.s 3.3, 3.4 | ||
| Nov 12 | bioinformatics | html | ppt | sxi | |||
| Nov 15 | problem sets | gamma codes | |||||
| Nov 19 | naive bayes | html | ppt | sxi | |||
| Nov 22 | problem sets | ||||||
| Nov 26 | naive bayes | ||||||
| Dec 3 | question answering | 4in1 | |||||
| Dec 6 | evaluation | html | 4in1 | ppt | |||
| problem sets | |||||||
| Dec 15 | ranking I | 4in1 | |||||
| Dec 17 | ranking II | html | 4in1 | ppt | sxi | ||
| Dec 20 | problem sets | ||||||
| Jan 10 | problem sets | ||||||
| Jan 14 | web IR I | html | ppt | sxi | web IR bibliography | ||
| Jan 17 | web IR II | html | ppt | sxi | web IR bibliography | ||
| Jan 21 | web IR III | html | 4in1 | ppt | sxi | web IR bibliography | |
| Jan 28 | latent semantic indexing | html | ppt | sxi | LSI / SVD | ||
| Jan 31 | clustering | html | ppt | sxi | |||
| Feb 7 | spam classifiers | ||||||
| Feb 11 | semantic text mining | ||||||
| Feb 18 | advanced QA |