Adjusting the strength of the dictionary in Tesseract 3
Question
How do I increase or decrease the strength of the dictionary in Tesseract 3?
The FAQ says I need to change the values of "NON_WERD" and "GARBAGE_STRING", but they do not exist in Tesseract 3.
Recommended answer
According to http://code.google.com/p/tesseract-ocr/wiki/FAQ, you can change these variables:
enable_new_segsearch 1
language_model_penalty_non_freq_dict_word 0.2
language_model_penalty_non_dict_word 0.3
Increase the two penalty values to bias Tesseract more strongly toward dictionary words; decrease them to weaken the dictionary's influence.
Note: You must set enable_new_segsearch to 1 as well; otherwise the two penalty variables have no effect.
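One way to apply these variables is to put them in a config file and pass that file's name as the last argument on the tesseract command line. The following is a minimal Python sketch under that assumption. The helper names (`write_config`, `build_command`, `run_ocr`), the config file name `dict_strength`, and the `eng` language are illustrative choices, not part of the original answer, and it assumes the `tesseract` executable is on your PATH.

```python
import subprocess
import tempfile
from pathlib import Path

# Values from the answer; raise the two penalties to favour dictionary words.
DICT_PARAMS = {
    "enable_new_segsearch": "1",
    "language_model_penalty_non_freq_dict_word": "0.2",
    "language_model_penalty_non_dict_word": "0.3",
}


def write_config(path, params):
    """Write one 'name value' pair per line and return the text written."""
    text = "".join(f"{name} {value}\n" for name, value in params.items())
    Path(path).write_text(text)
    return text


def build_command(image, output_base, config_path, lang="eng"):
    """Build the CLI call: image, output base, language, then the config file."""
    return ["tesseract", str(image), str(output_base), "-l", lang, str(config_path)]


def run_ocr(image, output_base, params=DICT_PARAMS):
    """Hypothetical helper: write a temporary config file and run tesseract with it."""
    with tempfile.TemporaryDirectory() as tmp:
        config_path = Path(tmp) / "dict_strength"  # illustrative file name
        write_config(config_path, params)
        subprocess.run(build_command(image, output_base, config_path), check=True)
```

Calling `run_ocr("page.png", "out")` would then write the recognized text to `out.txt`. To experiment with the dictionary strength, pass a copy of `DICT_PARAMS` with different penalty values and compare the results on the same image.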