
Hi! I am reading data from scanned medical documents (provider notes) using Pytesseract OCR. The resulting text contains some noise and misspellings. My ultimate goal is to extract useful medical information from the data. Right now I'm stuck on how to correct both medical and English misspellings. I have to create a dictionary that contains both medical and English words. I'm looking for direction on what steps I need to perform.
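To make the idea concrete, here is a minimal sketch of what I mean by a combined dictionary. It uses only Python's standard library (`difflib`), and the word lists, the `correct_token`/`correct_text` names, and the 0.8 similarity cutoff are all placeholders I made up for illustration. A real vocabulary would have to be loaded from an English word list plus some medical terminology source.

```python
import difflib
import re

# Hypothetical, tiny word lists for illustration only; real lists would be
# loaded from an English dictionary file and a medical vocabulary source.
ENGLISH_WORDS = {"patient", "reports", "pain", "and", "was", "given", "the", "with", "chest"}
MEDICAL_WORDS = {"hypertension", "metformin", "diabetes", "ibuprofen"}

VOCAB_SET = ENGLISH_WORDS | MEDICAL_WORDS
VOCAB = sorted(VOCAB_SET)


def correct_token(token, cutoff=0.8):
    """Return the closest vocabulary word, or the token unchanged if none is close enough."""
    lower = token.lower()
    if lower in VOCAB_SET:
        return token
    # get_close_matches ranks candidates by difflib's similarity ratio.
    matches = difflib.get_close_matches(lower, VOCAB, n=1, cutoff=cutoff)
    return matches[0] if matches else token


def correct_text(text):
    """Correct each alphabetic token in the text, leaving digits and punctuation untouched."""
    return re.sub(r"[A-Za-z]+", lambda m: correct_token(m.group(0)), text)


if __name__ == "__main__":
    print(correct_text("pateint with hypertensoin"))
```

This sketch returns the lowercase dictionary form for corrected words and does no context checking, so it would likely need a stricter cutoff or a manual review step before being trusted on drug names and dosages.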


© 2019   Data Science Central ®   Powered by
