Is there any way (using any tools/medium etc) to make a dataset containing content on a particular topic from all the available resources?

For example, I want to collect question-answer pair for a particular field/topic of economics and want to collect all the possible question-answer pairs in that field/topic from wherever I can collect the data say books/online forums etc.

Any lead will be helpful.TIA, Priya

Tags: data-science

Views: 275

Reply to This

Replies to This Discussion

If you are looking to get Q&A from forums, you might want to try some screen scraping.  You can use beautiful soup in python or rvest in R. 

If you don't want to do your own screen scraping, there are some R libraries ppl developed which scrape particular sites for you: https://www.computerworld.com/article/3109890/data-analytics/these-...

Good luck!


© 2021   TechTarget, Inc.   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service