UNIT 1:
Practical Issues on the Web and How People Search
The Retrieval and Ranking Processes
The Web , The e-Publishing Era and How the web changed Search
Information versus Data Retrieval ,The IR System and The Software Architecture of the IR System
Early Developments and The IR Problem , The User’s Task
Information Retrieval Techniques
UNIT 2:
Basic IR Models and Boolean Model
TF-IDF (Term Frequency/Inverse Document Frequency) Weighting
Latent Semantic Indexing Model
Neural Network Model in IRT
Retrieval Evaluation and Retrieval Metrics
Precision and Recall and Reference Collection
Relevance Feedback and Query Expansion
UNIT 3:
A Characterization of Text Classification – Unsupervised Algorithms: Clustering – Naïve Text Classification
Accuracy and Error – Organizing the classes – Indexing and Searching – Inverted Indexes – Sequential Searching – Multi-dimensional Indexing.
UNIT 4:
The Web – Search Engine Architectures – Cluster based Architecture
Distributed Architectures
Search Engine Ranking – Link based Ranking
UNIT 5:
IR applications-Information Extraction-Question answering