Original question: http://stackoverflow.com/questions/1813881/
Note: this content is provided under the CC BY-SA 4.0 license. You are free to use and share it, but you must attribute it to the original authors (not me) on Stack Overflow.
Java OCR implementation
Asked by rat
This is primarily just curiosity, but are there any OCR implementations in pure Java? I'm curious how this would perform purely in Java, and OCR in general interests me, so I'd love to see how it's implemented in a language I thoroughly understand. Naturally, this would require that the implementation is open source, but I'm still interested in proprietary solutions, as I could at least check out the performance in that case.
I've seen a couple which can be used in Java (like Asprise) but it doesn't seem that these are pure Java implementations... are there any?
Answer by jitter
I just found this one (I don't know it and haven't tested it, so check it yourself).
As you only need this for curiosity you could look into the source of this applet.
It does OCR of handwritten characters with a neural network.
Answer by Joshua Fox
There are a variety of OCR libraries out there. However, my experience is that the major commercial implementations (ABBYY, OmniPage, and Readiris) far outdo the open-source or other minor implementations. These commercial libraries are not primarily designed to work with Java, though of course it is possible to use them from Java.
Of course, if your interest is to learn the code, the open-source implementations will do the trick.
Answer by davetapley
If you are looking for a very extensible option or have a specific problem domain, you could consider rolling your own using the Java Object Oriented Neural Engine. Another JOONE reference.
I used it successfully in a personal project to identify the letter in an image such as this. You can find all the source for the OCR component of my application on GitHub, here.
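To make the "roll your own" idea concrete, here is a minimal sketch in plain Java. It is not JOONE's API and not the code from the project mentioned above: it is a hand-written one-vs-rest perceptron trained on three made-up 5x5 bitmaps. The class name `TinyGlyphClassifier`, the glyphs, and the epoch count are all assumptions invented for this illustration.

```java
/** Hypothetical illustration: a tiny one-vs-rest perceptron over small glyph bitmaps. */
class TinyGlyphClassifier {
    private final char[] labels;
    private final double[][] weights; // one weight vector per label; last entry is the bias

    TinyGlyphClassifier(char[] labels, int pixelCount) {
        this.labels = labels.clone();
        this.weights = new double[labels.length][pixelCount + 1];
    }

    /** Converts rows of '#' (ink) and '.' (background) into a 0/1 pixel vector. */
    static double[] toPixels(String... rows) {
        int width = rows[0].length();
        double[] pixels = new double[rows.length * width];
        for (int r = 0; r < rows.length; r++) {
            for (int c = 0; c < width; c++) {
                pixels[r * width + c] = rows[r].charAt(c) == '#' ? 1.0 : 0.0;
            }
        }
        return pixels;
    }

    /** Weighted sum of the pixels plus the bias for one label. */
    private double score(int label, double[] pixels) {
        double[] w = weights[label];
        double sum = w[pixels.length]; // bias term
        for (int i = 0; i < pixels.length; i++) {
            sum += w[i] * pixels[i];
        }
        return sum;
    }

    /** Perceptron rule: adjust a label's weights whenever its yes/no decision is wrong. */
    void train(double[][] samples, char[] targets, int epochs) {
        for (int e = 0; e < epochs; e++) {
            for (int s = 0; s < samples.length; s++) {
                for (int l = 0; l < labels.length; l++) {
                    int desired = labels[l] == targets[s] ? 1 : -1;
                    int actual = score(l, samples[s]) > 0 ? 1 : -1;
                    if (actual != desired) {
                        for (int i = 0; i < samples[s].length; i++) {
                            weights[l][i] += desired * samples[s][i];
                        }
                        weights[l][samples[s].length] += desired;
                    }
                }
            }
        }
    }

    /** Returns the label whose perceptron gives the highest score. */
    char classify(double[] pixels) {
        int best = 0;
        for (int l = 1; l < labels.length; l++) {
            if (score(l, pixels) > score(best, pixels)) {
                best = l;
            }
        }
        return labels[best];
    }

    public static void main(String[] args) {
        double[][] samples = {
            toPixels("#....", "#....", "#....", "#....", "#####"), // L
            toPixels("#####", "..#..", "..#..", "..#..", "..#.."), // T
            toPixels("#####", "#...#", "#...#", "#...#", "#####")  // O
        };
        char[] targets = {'L', 'T', 'O'};
        TinyGlyphClassifier clf = new TinyGlyphClassifier(targets, 25);
        clf.train(samples, targets, 20);
        // An 'L' with one stray ink pixel in the middle.
        double[] noisy = toPixels("#....", "#....", "#.#..", "#....", "#####");
        System.out.println(clf.classify(noisy));
    }
}
```

A real recognizer would also need character segmentation, many training samples per character, and a stronger model; this only shows the shape of the idea.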
Answer by Ron
I recommend trying the Java OCR project on sourceforge.net. I originally developed it, and I have a blog post about it.
Since I put it up on SourceForge, its functionality has been expanded and improved quite a bit through the great work of a volunteer researcher/developer.
Give it a try, and if you don't like it, you can always improve it!
Answer by Blue Sky
We have tested a few OCR engines with Java, such as Tesseract, Asprise, and ABBYY. In our analysis, ABBYY gave the best results.