Question
I recently came across Tesseract and OpenCV. It looks like Tesseract is a full-fledged OCR engine and OpenCV can be used as a framework to create an OCR application/service.
I tried using Tesseract on some of my images and its accuracy seems decent. Later, I came across a very simple tutorial on using OpenCV to perform OCR using Python and was impressed. In a few minutes, I finished training the system and its accuracy was good. But of course, taking this approach means I need to train my system extensively using a large training set.
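My specific questions are:
- How do I choose between Tesseract and building a custom OCR application with OpenCV?
- There are trained data sets for Tesseract covering different languages. Is there something similar for OpenCV, so that I don't have to implement OCR from scratch?
- Which is better suited to a commercial application?
Any suggestions?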
Recommended answer
Tesseract is an OCR engine. It's used, worked on and funded by Google specifically to read text from images, perform basic document segmentation and operate on specific image inputs (a single word, line, paragraph, page, limited dictionaries, etc.).
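To make the "specific image inputs" point concrete, here is a minimal sketch of calling Tesseract from Python for a page, a block, a line, or a single word. It assumes the `pytesseract` wrapper, Pillow, and the Tesseract binary are installed; the helper names `build_config` and `read_text` are made up for this illustration. The character whitelist only loosely stands in for "limited dictionaries": it restricts the character set, not the vocabulary.

```python
# Hypothetical helper names; assumes pytesseract, Pillow and the Tesseract
# binary are installed on the machine.

# Page segmentation modes (--psm) for the kinds of input mentioned above.
PSM_BY_INPUT = {
    "page": 3,       # fully automatic page segmentation (Tesseract's default)
    "paragraph": 6,  # a single uniform block of text
    "line": 7,       # a single text line
    "word": 8,       # a single word
}


def build_config(input_kind, whitelist=None):
    """Build the Tesseract config string for a given kind of input."""
    config = f"--psm {PSM_BY_INPUT[input_kind]}"
    if whitelist:
        # Restrict recognition to a limited set of characters.
        config += f" -c tessedit_char_whitelist={whitelist}"
    return config


def read_text(image_path, input_kind="page", lang="eng", whitelist=None):
    """Run Tesseract on one image and return the recognised text."""
    # Imported lazily so build_config works without the OCR stack installed.
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as image:
        return pytesseract.image_to_string(
            image, lang=lang, config=build_config(input_kind, whitelist)
        )
```

For example, `read_text("invoice_number.png", input_kind="line", whitelist="0123456789")` would treat the image as one line of digits (the file name is a placeholder). The `lang` argument selects one of the pre-trained language data files, which is what saves you from training anything yourself.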
OpenCV, on the other hand, is a computer vision library that includes functions for feature extraction and data classification. You can build a simple letter segmenter and classifier that performs basic OCR, but it will not be a very good OCR engine: I once wrote one from scratch in Python, and it was really inaccurate on input that deviated from the training data.
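A toy illustration of why such a classifier is brittle: the sketch below is a 1-nearest-neighbour template matcher in plain Python, not OpenCV code and not the classifier I wrote. The 5x3 glyphs and function names are invented for the example. OpenCV's `cv2.ml.KNearest` applies the same idea to real feature vectors.

```python
# Toy 5x3 glyph templates; a real system would train on many labelled samples.
TEMPLATES = {
    "I": ["010", "010", "010", "010", "010"],
    "L": ["100", "100", "100", "100", "111"],
    "T": ["111", "010", "010", "010", "010"],
}


def to_pixels(rows):
    """Flatten rows of '0'/'1' characters into a list of ints."""
    return [int(ch) for row in rows for ch in row]


def hamming(a, b):
    """Count the pixel positions where two glyphs differ."""
    return sum(x != y for x, y in zip(a, b))


def classify(rows):
    """Return the label of the template closest to the input glyph."""
    pixels = to_pixels(rows)
    return min(
        TEMPLATES,
        key=lambda label: hamming(pixels, to_pixels(TEMPLATES[label])),
    )


# A "T" with one stray pixel is still closest to the "T" template.
print(classify(["111", "010", "010", "010", "011"]))  # prints T

# An "I" shifted one column to the left is now closest to "L".
print(classify(["100", "100", "100", "100", "100"]))  # prints L
```

A little noise is tolerated, but a one-pixel shift already flips the answer. Handling that means adding normalisation, better features, and far more training data, which is the work an OCR engine like Tesseract has already done.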
If you want to get a basic understanding of how hard OCR is, try OpenCV. Tesseract is for real OCR.