I am trying to download courses from edX, following this README (https://github.com/coursera-dl/edx-dl/blob/master/README.md). I installed Anaconda for Windows (I run Windows 10 in Parallels Desktop).

In the last step, I run the following command: C:\edx-dl-master\edx-dl-master> edx-dl -u [email protected] COURSE_URL. After entering my edX password, I get the following output:

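Building initial headers for future requests.
Getting initial CSRF token.
Found CSRF token.
Logging into Open edX site: https://courses.edx.org/login_ajax
Extracting course information from dashboard.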

Traceback (most recent call last):
  File "c:\programdata\anaconda3\lib\runpy.py", line 193, in _run_module_as_main
    "__main__", mod_spec)
  File "c:\programdata\anaconda3\lib\runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "C:\ProgramData\Anaconda3\Scripts\edx-dl.exe\__main__.py", line 9, in <module>
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\edx_dl.py", line 1011, in main
    for selected_course in selected_courses}
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\edx_dl.py", line 1011, in <dictcomp>
    for selected_course in selected_courses}
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\edx_dl.py", line 186, in get_available_sections
    sections = page_extractor.extract_sections_from_html(page, BASE_URL)
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\parsing.py", line 403, in extract_sections_from_html
    for i, section_soup in enumerate(sections_soup, 1)]
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\parsing.py", line 403, in <listcomp>
    for i, section_soup in enumerate(sections_soup, 1)]
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\parsing.py", line 392, in _make_subsections
    for i, s in enumerate(subsections_soup, 1)]
  File "c:\programdata\anaconda3\lib\site-packages\edx_dl\parsing.py", line 392, in <listcomp>
    for i, s in enumerate(subsections_soup, 1)]
AttributeError: 'NoneType' object has no attribute 'string'


I am new to Python and am not sure how to fix this.

Best answer

Try

git clone https://github.com/coursera-dl/edx-dl.git


Install git first.
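
As background on the error itself: "AttributeError: 'NoneType' object has no attribute 'string'" means that some lookup returned None and the code then read .string from it. A plausible cause (an assumption, not confirmed from the edx-dl source) is that the course page markup no longer matched what the parser expected. The sketch below reproduces that failure mode and shows a guarded alternative. Node, find_title, title_or_default, and reproduce_error are hypothetical names made up for illustration; they are not part of edx-dl or any library.

```python
from typing import Optional


class Node:
    """Hypothetical stand-in for a parsed HTML element (not an edx-dl class)."""

    def __init__(self, string: str) -> None:
        self.string = string


def find_title(page: dict) -> Optional[Node]:
    """Return the title node, or None when the page has no title entry."""
    text = page.get("title")
    return Node(text) if text is not None else None


def reproduce_error(page: dict) -> str:
    """Read .string without checking for None and report the resulting error."""
    try:
        return find_title(page).string
    except AttributeError as exc:
        return str(exc)


def title_or_default(page: dict, default: str = "untitled") -> str:
    """Read .string only after confirming that the lookup succeeded."""
    node = find_title(page)
    if node is None:
        return default
    return node.string


if __name__ == "__main__":
    print(reproduce_error({}))                     # lookup failed, error text
    print(title_or_default({}))                    # falls back to the default
    print(title_or_default({"title": "Week 1"}))   # lookup succeeded
```

If this is what happens here, the fix belongs in the parser rather than in your command line, which is why running a newer copy of the code may help.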

Regarding "python - Python web crawling (edX)", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/51348286/
