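I'm new to programming and I'm trying to read text from an image with Tesseract OCR, but I can't get it to work! I installed tesseract_OCR, pytesseract and Pillow in my environment. Does anyone have a tip?
Input: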
from PIL import Image
import pytesseract
print( pytesseract.image_to_string( Image.open('phrase.jpg') ) )
Output:
C:\Anaconda2\envs\ambiente36\python.exe C:/Users/Simone/Desktop/curso_programacao/Ler_imagens/ler_imagens
Traceback (most recent call last):
  File "C:\Anaconda2\envs\ambiente36\lib\site-packages\pytesseract\pytesseract.py", line 194, in run_and_get_output
    run_tesseract(**kwargs)
  File "C:\Anaconda2\envs\ambiente36\lib\site-packages\pytesseract\pytesseract.py", line 165, in run_tesseract
    proc = subprocess.Popen(command, **subprocess_args())
  File "C:\Anaconda2\envs\ambiente36\lib\subprocess.py", line 709, in __init__
    restore_signals, start_new_session)
  File "C:\Anaconda2\envs\ambiente36\lib\subprocess.py", line 997, in _execute_child
    startupinfo)
FileNotFoundError: [WinError 2] O sistema não pode encontrar o arquivo especificado

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "C:/Users/Simone/Desktop/curso_programacao/Ler_imagens/ler_imagens", line 6, in <module>
    phrase = pytesseract.image_to_string(Image.open('phrase.jpg'))
  File "C:\Anaconda2\envs\ambiente36\lib\site-packages\pytesseract\pytesseract.py", line 286, in image_to_string
    return run_and_get_output(image, 'txt', lang, config, nice)
  File "C:\Anaconda2\envs\ambiente36\lib\site-packages\pytesseract\pytesseract.py", line 201, in run_and_get_output
    raise TesseractNotFoundError()
pytesseract.pytesseract.TesseractNotFoundError: tesseract is not installed or it's not in your path
Best answer
These are the steps to follow to configure Tesseract in your environment:
First, install Python and pip.
Then install Pillow and pytesseract.
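Before going further, it can help to confirm that both packages are actually importable from the interpreter that runs the script (here, the `ambiente36` environment). The following is a minimal sketch using only the standard library; `missing_packages` is a hypothetical helper name, not part of Pillow or pytesseract.

```python
import importlib.util


def missing_packages(import_names):
    """Return the names from import_names that cannot be imported here."""
    return [name for name in import_names
            if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Pillow is imported under the name "PIL"; pytesseract uses its own name.
    missing = missing_packages(["PIL", "pytesseract"])
    if missing:
        print("Not installed in this environment:", ", ".join(missing))
    else:
        print("Pillow and pytesseract are importable.")
```

Note that this only checks the Python packages. pytesseract is a wrapper, so the Tesseract program itself still has to be installed separately.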
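from PIL import Image
# Note: pytesser is a different wrapper from the pytesseract package used in the question.
from pytesser.pytesser import *
image_file = "FULL/PATH/TO/YOUR/IMAGE/image.png"
# Option 1: run OCR on an already opened PIL image.
im = Image.open(image_file)
text = image_to_string(im)
# Option 2: run OCR directly from the file path.
text = image_file_to_string(image_file)
# Option 3: the same call with graceful_errors=True.
text = image_file_to_string(image_file, graceful_errors=True)
# print() with a single argument works under both Python 2 and Python 3.
print("=====output=======\n")
print(text)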
See the pytesseract project page for details.
A complete example is available for download.
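The `TesseractNotFoundError` in the question's traceback means pytesseract could not launch the `tesseract` executable: either it is not installed, or it is not on `PATH`. A minimal sketch of one way to handle this is shown below. It looks for the executable and then points pytesseract at it through `pytesseract.pytesseract.tesseract_cmd`. The helper names `locate_tesseract` and `configure_pytesseract` are hypothetical, and the Windows path is only an assumed install location that may differ on your machine.

```python
import os
import shutil


def locate_tesseract(name="tesseract", extra_paths=()):
    """Return a path to the Tesseract executable, or None if not found.

    Searches PATH first, then each candidate file in extra_paths.
    """
    found = shutil.which(name)
    if found:
        return found
    for candidate in extra_paths:
        if os.path.isfile(candidate):
            return candidate
    return None


def configure_pytesseract(executable):
    """Tell pytesseract which executable to launch."""
    # Imported lazily so locate_tesseract can be used without pytesseract.
    import pytesseract
    pytesseract.pytesseract.tesseract_cmd = executable


if __name__ == "__main__":
    # Assumed install location; adjust to wherever Tesseract was installed.
    candidates = [r"C:\Program Files\Tesseract-OCR\tesseract.exe"]
    exe = locate_tesseract(extra_paths=candidates)
    if exe is None:
        print("Tesseract executable not found; install it or add it to PATH.")
    else:
        print("Tesseract executable found at:", exe)
```

Once a path is found, calling `configure_pytesseract(exe)` before the original `pytesseract.image_to_string(Image.open('phrase.jpg'))` line should let pytesseract start Tesseract. Adding the install folder to `PATH` is an alternative that needs no code change.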