Android Tesseract手写与字典训练
声明:本页面是StackOverFlow热门问题的中英对照翻译,遵循CC BY-SA 4.0协议,如果您需要使用它,必须同样遵循CC BY-SA许可,注明原文地址和作者信息,同时你必须将它归于原作者(不是我):StackOverFlow
原文地址: http://stackoverflow.com/questions/12310287/
Warning: these are provided under cc-by-sa 4.0 license. You are free to use/share it, But you must attribute it to the original authors (not me):
StackOverFlow
Tesseract handwriting with dictionary training
提问by Ruel
I have a dictionary of words in a text file, separated by newlines. And I want to recognize the handwriting using Tesseract, and output the nearest matching line in the text file.
我有一个文本文件中的单词字典,用换行符分隔。并且我想使用 Tesseract 识别笔迹,并在文本文件中输出最近的匹配行。
This is the first time I'll be using Tesseract, and it's already in my project workspace, I just need the training data.
这是我第一次使用 Tesseract,它已经在我的项目工作区中,我只需要训练数据。
Is it possible to train Tesseract to do this?
是否可以训练 Tesseract 来做到这一点?
采纳答案by Leopd
It's possible to train tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract
可以训练 tesseract 来识别笔迹。以下是说明:https: //tesseract-ocr.github.io/tessdoc/Training-Tesseract
But don't expect very good results. Academics have typically gotten accuracy results topping out about 90%. Here are a couple references for wordsand numbers. So if your use case can deal with at least 1/10 errors, this might work for you.
但是不要期望非常好的结果。学术界通常获得的准确度结果高达 90% 左右。这里有几个关于单词和数字的参考。因此,如果您的用例可以处理至少 1/10 的错误,这可能对您有用。