Subscribe to DSC Newsletter


What clustering method is required for text documents

Let's say a set of documents 'S' has a large set of 'pure' texts.

On all documents in S, I am spelling normalisation method, which yields a normalised set S'.

Then I use the chosen method M (which method? ) to make clusters in S, obtaining a clustering result C.

Then I use the same method M to make clusters in S', obtaining a clustering results C'.

Finally I need to compare if there are statistically significant differences between C and C'.

Any help in identifying…


Added by MUSHTAQ AHMAD on May 25, 2015 at 11:48am — 3 Comments

Blog Topics by Tags

Monthly Archives



  • Add Videos
  • View All

© 2020   Data Science Central ®   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service