问题描述
from urllib.request import urlopen
from bs4 import BeautifulSoup
html= urlopen("http://www.pythonscraping.com/pages/page3.html")
soup= BeautifulSoup(html.read())
print(soup.find("img",{"src":"../img/gifts/img1.jpg"
}).parent.previous_sibling.get_text())
上面的代码可以正常工作,但是下面的代码不能正常工作,它给出了如上所述的属性错误.谁能告诉我原因?
The above code works fine but not the one below.It gives an attribute error as stated above. Can anyone tell me the reason?
from urllib.request import urlopen
from bs4 import BeautifulSoup
html= urlopen("http://www.pythonscraping.com/pages/page3.html")
soup= BeautifulSoup(html.read())
price =soup.find("img",{"src=":"../img/gifts/img1.jpg"
}).parent.previous_sibling.get_text()
print(price)
谢谢! :)
推荐答案
如果比较第一个版本和第二个版本,您会注意到:
If you compare the first and the second version, you'll notice that:
第一: soup.find("img",{"src":"../img/gifts/img1.jpg"}).parent.previous_sibling.get_text()
- 注意:
"src"
第二: soup.find("img","src=":"../img/gifts/img1.jpg"}).parent.previous_sibling.get_text()
- 注意:
"src="
第二个代码返回Attribute Error:'NoneType' object has no attribute 'parent'
,因为它在提供的汤中找不到src=="../img/gifts/img1.jpg"
.
The second code returns Attribute Error:'NoneType' object has no attribute 'parent'
because it couldn't find src=="../img/gifts/img1.jpg"
in the provided soup.
因此,如果您删除第二个版本中的=
,它应该可以工作.
So, if you remove the =
in the second version, it should work.
顺便说一句,您应该明确地要使用哪个解析器,否则bs4
将返回以下警告:
Btw, you should explicitly which parser you want to use, otherwise bs4
will return the following warning:
要消除此警告,请更改如下代码:
To get rid of this warning, change code that looks like this:
BeautifulSoup([您的标记])
BeautifulSoup([your markup])
对此:
BeautifulSoup([您的标记],"lxml")
BeautifulSoup([your markup], "lxml")
因此,如警告消息中所述,您只需要将soup = BeautifulSoup(html.read())
更改为soup = BeautifulSoup(html.read(), 'lxml')
.
So, as stated in the warning message, you just have to change soup = BeautifulSoup(html.read())
to soup = BeautifulSoup(html.read(), 'lxml')
, for example.
这篇关于属性错误:"NoneType"对象没有属性“父级"的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!