It all started as a little data science project, possibly a job interview question for applicants: How would you compute the number of entries on Wikipedia. The idea was to use large keyword lists (say 5,000,000) and check how many keywords from these lists have a Wikipedia entry, using a web crawler to run 5,000,000 searches on Wikipedia. Based on the number of Wikipedia entries found in your li…
Most Popular Content on DSC
To not miss this type of content in the future, subscribe to our newsletter.
Other popular resources
Most popular articles