问题描述
如何自定义Plone搜索引擎以便对Excel文件进行全文索引?我已经安装了pdftotext和wv for pdf,word文件全文索引。如果添加到您的实例蛋,并将其安装在Plone中,您可以至少索引现代Office格式,至少为.docx和.xlsx。对于普通旧的Excel(.xls)文件,这不起作用。
我在几周前在Plone 4.3.2 buildout配置中尝试过:
[instance]
eggs =
...
Products.OpenXml
[版本]
#你需要一个比默认Plone更近的lxml,一些3.x版本
lxml = 3.3.3
Products.OpenXml = 1.1.1
或者或另外使用的。我只尝试过与Products.OpenXml结合使用,但如果您只对旧版本的Excel表单感兴趣,则自己就可以使用Products.AROfficeTransforms。在buildout配置中:
[instance]
eggs =
...
产品.AROfficeTransforms
[版本]
Products.AROfficeTransforms = 0.11.0
它需要在系统上安装xlhtml二进制文件。这是一个古老的二进制文件,2002年最后更改。我没有尝试自己安装。
how can I customize Plone search engine in order to actvate full text indexing of excel files? I have already installed pdftotext and wv for pdf, word files full text indexing.
If you add Products.OpenXml to your instance eggs and install it in Plone you can index modern Office formats, at least .docx and .xlsx. For plain old Excel (.xls) files this does not work.
I tried it in a Plone 4.3.2 buildout config a few weeks ago:
[instance]
eggs =
...
Products.OpenXml
[versions]
# You need a more recent lxml than default Plone, some 3.x version
lxml = 3.3.3
Products.OpenXml = 1.1.1
Alternatively or additionally, use Products.AROfficeTransforms. I have only tried it in combination with Products.OpenXml, but Products.AROfficeTransforms on its own is sufficient if you are only interested in old-style excel sheets, .xls. In a buildout config:
[instance]
eggs =
...
Products.AROfficeTransforms
[versions]
Products.AROfficeTransforms = 0.11.0
It requires the xlhtml binary to be installed on your system. This is an ancient binary, last changed in 2002. I did not try to install it myself.
这篇关于Plone全文索引Excel文件的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!