Subscribe to DSC Newsletter

Is there any way (using any tools/medium etc) to make a dataset containing content on a particular topic from all the available resources?

For example, I want to collect question-answer pair for a particular field/topic of economics and want to collect all the possible question-answer pairs in that field/topic from wherever I can collect the data say books/online forums etc.

Any lead will be helpful.TIA, Priya

Views: 82

Reply to This

Replies to This Discussion

If you are looking to get Q&A from forums, you might want to try some screen scraping.  You can use beautiful soup in python or rvest in R. 

If you don't want to do your own screen scraping, there are some R libraries ppl developed which scrape particular sites for you: https://www.computerworld.com/article/3109890/data-analytics/these-...

Good luck!

Reply to Discussion

RSS

Follow Us

Videos

  • Add Videos
  • View All

Resources

© 2018   Data Science Central™   Powered by

Badges  |  Report an Issue  |  Privacy Policy  |  Terms of Service