topbanner_forum
  *

avatar image

Welcome, Guest. Please login or register.
Did you miss your activation email?

Login with username, password and session length
  • Saturday December 7, 2024, 5:20 am
  • Proudly celebrating 15+ years online.
  • Donate now to become a lifetime supporting member of the site and get a non-expiring license key for all of our programs.
  • donate

Author Topic: *NIX; tesseract OCR experiences  (Read 4683 times)

ewemoa

  • Honorary Member
  • Joined in 2008
  • **
  • Posts: 2,922
    • View Profile
    • Donate to Member
*NIX; tesseract OCR experiences
« on: December 01, 2013, 10:07 PM »
Recently I've been trying out the tesseract OCR option (both via gimagereader and via the command line) with mixed (but tolerably good IMHO) results at least for English text.

In my usage, I notice occasional recognition results such as:


It seems to me that for some of these cases, there is little point in accepting the results as-is (e.g. "vv" seems like it's seldom used).  I'm about to go through a page with a description of tesseract configuration parameters in hope of turning up something applicable -- but anyone have any relevant tesseract experience to share?



I'm using tesseract 3.02.02.