Information Retrieval and Text Mining

Information retrieval resources

MG = Managing Gigabytes; MIR = Modern Information Retrieval

Lectures

date topic resources
Oct 22 inverted index pdf ppt MG Chs. 3.2, 8.2
syllabus CS276a at Stanford
Oct 25 project ideas pdf1 ppt1
project resources pdf2 ppt2
lucene pdf3 ppt3
Oct 29 more on indexes pdf ppt MG Chs. 3.6, 4.3
Nov 5 index compression html pdf 4in1 ppt MG Chs. 3.3, 3.4
Nov 12 bioinformatics html pdf ppt sxi
Nov 15 problem sets pdf gamma codes
Nov 19 naive bayes html pdf ppt sxi
Nov 22 problem sets pdf
Nov 26 naive bayes
Dec 3 question answering pdf 4in1
Dec 6 evaluation html pdf 4in1 ppt
problem sets pdf
Dec 15 ranking I pdf 4in1
Dec 17 ranking II html pdf 4in1 ppt sxi
Dec 20 problem sets pdf
Jan 10 problem sets pdf
Jan 14 web IR I html pdf ppt sxi web IR bibliography
Jan 17 web IR II html pdf ppt sxi web IR bibliography
Jan 21 web IR III html pdf 4in1 ppt sxi web IR bibliography
Jan 28 latent semantic indexing html pdf ppt sxi LSI / SVD
Jan 31 clustering html pdf ppt sxi
Feb 7 spam classifiers pdf
Feb 11 semantic text mining pdf
Feb 18 advanced QA