I have this HTML containing a series of images, structured like this:
<portrait a>
<img class = "image" data = "random stuff" src = "image_link">
<portrait b>
<img class = "image" data = "random stuff" src = "image_link">
I want to get image_link. I tried the following, but it doesn't work. I don't understand how to use get() to retrieve the second attribute.
i = d.find_all("img", class_="image")
for item in i:
    img = item.get("src")
    print(img)
Thanks.
Try it like this:
from bs4 import BeautifulSoup
htmlcontent = """
<portrait a>
<img class = "image" data = "random stuff" src = "image_link1">
<portrait b>
<img class = "image" data = "random stuff" src = "image_link2">
"""
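# The "lxml" parser requires the lxml package; "html.parser" is the built-in alternative.
soup = BeautifulSoup(htmlcontent, "lxml")
for item in soup.find_all("img", class_="image"):
    # get() looks the attribute up by name, so its position in the tag does not matter.
    print(item.get("src"))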
Output:
image_link1
image_link2
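
On the "second attribute" part of the question: get() looks attributes up by name, not by position, so the same call works for src, data, or any other attribute. Here is a minimal sketch of that, assuming the built-in "html.parser" instead of lxml and a slightly modified sample in which the second img has no src (both are my assumptions, not part of the original HTML). The "<no src>" default is just a placeholder string chosen for illustration.

```python
from bs4 import BeautifulSoup

# Modified sample (assumption): the second <img> deliberately lacks src.
html = """
<portrait a>
<img class="image" data="random stuff" src="image_link1">
<portrait b>
<img class="image" data="random stuff">
"""

# "html.parser" ships with Python, so this sketch does not need lxml installed.
soup = BeautifulSoup(html, "html.parser")

srcs = []
for img in soup.find_all("img", class_="image"):
    # get(name, default) returns the default (None if omitted) when the
    # attribute is absent, whereas img["src"] would raise KeyError.
    srcs.append(img.get("src", "<no src>"))
    print(img.get("data"), img.get("src", "<no src>"))

print(srcs)  # ['image_link1', '<no src>']
```

If every img is guaranteed to have a src, item["src"] works as well; get() is the safer choice when some tags may be missing the attribute.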