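I ran into this problem while trying to install textract on an Amazon EMR instance. I have tried almost every solution I could find online, but none of them worked for me. Can anyone help?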


  Command: sudo pip install textract




Error:

sleading-indentation
    deps/sphinxbase/src/libsphinxad/ad_pulse.c:44:30: fatal error: pulse/pulseaudio.h: No such file or directory
     #include <pulse/pulseaudio.h>
                                  ^
    compilation terminated.
    error: command 'gcc' failed with exit status 1

    ----------------------------------------
Command "/usr/bin/python2.7 -u -c "import setuptools, tokenize;__file__='/mnt/tmp/pip-build-3dsfXm/pocketsphinx/setup.py';f=getattr(tokenize, 'open', open)(__file__);code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, __file__, 'exec'))" install --record /tmp/pip-lW6lBv-record/install-record.txt --single-version-externally-managed --compile" failed with error code 1 in /mnt/tmp/pip-build-3dsfXm/pocketsphinx/

Best answer

This is a late answer, but it may still help.

I faced a similar problem and solved it by installing the following dependencies:

sudo apt-get install python-dev libxml2-dev libxslt1-dev antiword unrtf poppler-utils pstotext tesseract-ocr flac ffmpeg lame libmad0 libsox-fmt-mp3 sox libjpeg-dev libpulse-dev swig


The one dependency missing from the textract documentation is libpulse-dev:

sudo apt-get install libpulse-dev
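
The build fails because gcc cannot find pulse/pulseaudio.h, so it can help to confirm that the header is present before running pip again. Here is a minimal sketch of such a check. The function name find_header and the default directory list are illustrative assumptions, not part of textract or pocketsphinx, and a header may live elsewhere on some systems.

```python
import os

# Assumption: common locations for distro-installed C headers.
# Adjust this tuple if your system keeps headers somewhere else.
DEFAULT_INCLUDE_DIRS = ("/usr/include", "/usr/local/include")


def find_header(relative_path, include_dirs=DEFAULT_INCLUDE_DIRS):
    """Return the full path of the first matching header, or None if absent."""
    for base in include_dirs:
        candidate = os.path.join(base, relative_path)
        if os.path.isfile(candidate):
            return candidate
    return None


if __name__ == "__main__":
    # The header named in the gcc error from the question.
    header = "pulse/pulseaudio.h"
    found = find_header(header)
    if found:
        print("found %s" % found)
    else:
        print("%s is missing; install the PulseAudio development package first" % header)
```

If the script reports the header as missing, the pocketsphinx build will most likely fail the same way again. Note that the name of the development package differs between distributions.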


The problem was also resolved in the following question:
Can't install textract (for python 2.x) on Ubuntu because of pocketsphinx
