Home » Uncategorized

Information Retrieval Document Search Engine in R


In this post, we learn about building a basic search engine or document retrieval system using Vector space model. This use case is widely used in information retrieval systems. Given a set of documents and search term(s)/query we need to retrieve relevant documents that are similar to the search query. 

Problem statement:

The problem statement explained above is represented as in below image. 

3Document retrieval system

High level design of document search system is shown below :

Information Retrieval Document Search Engine in R

The content of the post is as follows:

  • Explaining various techniques used in Information retrieval  such as vector space models, term document matrix, similarity score calculation
  • Data description 
  • High level design of the document search system
  • Code implementation in R

Please go thorough the complete blog at below location: