python - 使用beautifulsoup美化html文档的一部分

我有一个html文档，我想从中提取表格并整理表格。到目前为止，我有：

with open('html.txt','r') as file1:
    read_f=file1.read()
soup = BeautifulSoup(read_f)

the_soup=soup.findAll('table', {'id': 'table_id'})
with open('prettified.txt','w') as f2:
    f2.write(the_soup.prettify())

但是我得到一个错误，美化是没有属性的。

最佳答案

soup.findAll将返回所有表元素的列表。您应该遍历此列表并打印每个匹配表的美化版本：

with open('prettified.txt','w') as f2:
    for table in the_soup:
        f2.write(table.prettify())

关于python - 使用beautifulsoup美化html文档的一部分，我们在Stack Overflow上找到一个类似的问题：https://stackoverflow.com/questions/29990366/