Week | Topic |
1 | Introduction, Inverted indexes, Issues in building such indexes, Course administrivia, Inverted index storage, Boolean queries, Wild-card queries, Positional/phrase queries, Evaluating IR systems. |
1 | Section: IR project, VDK software, Index construction, Dynamic indices (updating), Term weighting, Vector space indices. |
1 | Computing cosine-based ranking, Speeding up cosine ranking (Sampling and pre-grouping, Latent semantic indexing, Random projection), Section: IR project 2, VDK software. |
1 | Generalized query operators, Bayesian nets for Text Retrieval, Structured + Unstructured queries, Link-based ranking in web search engines. |
1 | Rest of web ranking, Peer-to-peer search, Search deployment models, Review of search topics, Document Clustering. |
1 | Automatic document classification, Centroid/Nearest-neighbor classification, Bayesian classification, Link-based classification, Document summarization. |
1 | Link-based clustering, Enumerative clustering/trawling, Recommendation systems, Web characterization; Research problems. |
1 | Distributed databases - Introductory topics; Fragmentation; Allocation, Query processing in distributed databases - localization, distributed query operators, optimization. |
1 | Concurrency Control (Schedules, Serializability, Locking, Timestamp control); Reliability (Failure models, 2-phase commit). |
1 | Reliability (3-phase commit, Majority 3PC); Network partitions. |
1 | Fuzzy retrieval model. Coordination level matching. Vector space retrieval model. |
1 | Probabilistic retrieval models, indexing, Latent Semantic Indexing. |
1 | Language models. Retrieval evaluation. |
1 | Introduction to Web retrieval, Web crawling. |