This course studies the theory, design, and implementation of text-based information systems. The Information Retrieval core components of the course include statistical characteristics of text, repre-sentation of information needs and documents, several important retrieval models (boolean, vector space, probabilistic, inference net, and language modeling), clustering algorithms, collaborative fil-tering, automatic text categorization, and experimental evaluation. The software architecture compo-nents include design and implementation of high-capacity text retrieval and text filtering systems. It also introduces web search including crawling, link-based algorithms, and Web metadata; text/Web clustering, classification; text mining.
Bachelor of Computer Science - 144 CRs
Croft B., Metzler D., and Strohman T., Search Engines: Information Retrieval in Practice, Addison-Wesley
content serial | Description |
---|
1 | Introduction |
2 | What is Information Retrieval |
3 | Building Dictionary |
4 | TF-IDF |
5 | TF-IDF-2 |
6 | Vector Space |
7 | 7th week exam |
8 | Naïve Bayes-Text Classification |
9 | KNN-Text Classification |
10 | Rocchio-Text Classification |
11 | SVM-Text Classification |
12 | 12th week exam |
13 | Link Analysis |
14 | Crawling |
15 | Revision |
16 | Final exam |
Start your application