In this post, we build a basic search engine (a document retrieval system) using the vector space model, a technique widely used in information retrieval. Given a set of documents and a search query of one or more terms, the goal is to retrieve the documents that are most similar to the query.
The image below illustrates this problem statement.
The high-level design of the document search system is shown below:
The content of the post is as follows:
- An explanation of techniques used in information retrieval, such as the vector space model, the term-document matrix, and similarity score calculation
- Data description
- High-level design of the document search system
- Code implementation in R
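As a rough preview of the techniques listed above, here is a minimal base R sketch of vector space retrieval. It is not the implementation from the full post: the toy documents, the query, the helper names (`tokenize`, `term_counts`, `cosine`), and the choice of TF-IDF weighting with cosine similarity are all assumptions made for illustration.

```r
# Toy corpus and query; both are made up for this sketch.
docs <- c(
  d1 = "the cat sat on the mat",
  d2 = "dogs and cats are pets",
  d3 = "stock markets fell sharply today"
)
query <- "cat on a mat"

# Lowercase the text and split on runs of non-letters (no stemming or stop words).
tokenize <- function(text) {
  tokens <- strsplit(tolower(text), "[^a-z]+")[[1]]
  tokens[nzchar(tokens)]
}

doc_tokens <- lapply(docs, tokenize)
vocab <- sort(unique(unlist(doc_tokens)))

# Count occurrences of each vocabulary term; terms outside the vocabulary are ignored.
term_counts <- function(tokens) {
  as.numeric(table(factor(tokens, levels = vocab)))
}

# Term-document matrix: one row per term, one column per document.
tdm <- sapply(doc_tokens, term_counts)
rownames(tdm) <- vocab

# Inverse document frequency: terms found in fewer documents get more weight.
idf <- log(ncol(tdm) / rowSums(tdm > 0))

# TF-IDF vectors for the documents and for the query, in the same term space.
doc_vecs <- tdm * idf
query_vec <- term_counts(tokenize(query)) * idf

# Cosine similarity between two vectors; defined as 0 if either has zero length.
cosine <- function(a, b) {
  denom <- sqrt(sum(a^2)) * sqrt(sum(b^2))
  if (denom == 0) return(0)
  sum(a * b) / denom
}

# Score every document against the query and rank from most to least similar.
scores <- apply(doc_vecs, 2, cosine, b = query_vec)
ranking <- sort(scores, decreasing = TRUE)
print(round(ranking, 3))
```

In this toy run, `d1` ranks first because it is the only document that shares terms with the query. Note that "cat" and "cats" are treated as different terms here, since the sketch does no stemming.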
Please go through the complete post at the location below: