This course studies the theory, design, and implementation of text-based information systems. The Information Retrieval core components of the course include statistical characteristics of text, representation of information needs and documents, several important retrieval models (boolean, vector space, probabilistic, inference net, and language modeling), clustering algorithms, collaborative filtering, automatic text categorization, and experimental evaluation. The software architecture components include design and implementation of high-capacity text retrieval and text filtering systems. It also introduces web search including crawling, link-based algorithms, and Web metadata; text/Web clustering, classification; text mining.
Bachelor of Computer Science - 132 CRs
Croft B., Metzler D., and Strohman T., Search Engines: Information Retrieval in Practice, Addison-Wesley
content serial | Description |
---|
1 | Introduction |
2 | What is Information Retrieval |
3 | Building Dictionary |
4 | TF-IDF |
5 | TF-IDF-2 |
6 | Vector Space |
7 | Naïve Bayes |
8 | Naïve Bayes-Text Classification |
9 | KNN-Text Classification |
10 | Rocchio-Text Classification |
11 | SVM-Text Classification |
12 | Support Vector Machine |
13 | Link Analysis |
14 | Crawling |
15 | Revision |
Start your application