BeautifulSoup：'Response'类型的对象没有len（）

Question

问题：当我尝试执行脚本时，BeautifulSoup(html, ...)给出错误消息“TypeError：类型'对象的对象'没有len（）。我尝试将实际的html作为参数传递，但它仍然无效。

import requests

url = 'http://vineoftheday.com/?order_by=rating'
response = requests.get(url)
html = response.content

soup = BeautifulSoup(html, "html.parser")

Answer 1

你得到response.content。但它将响应体返回为字节（docs）。但是你应该将str传递给BeautifulSoup构造函数（docs）。所以你需要使用response.text而不是获取内容。

Answer 2

尝试直接传递HTML文本

soup = BeautifulSoup(html.text)

Answer 3

如果你使用requests.get('https://example.com')来获取HTML，你应该使用requests.get('https://example.com').text。

Answer 4

你只得到'响应'中的响应代码并始终使用浏览器标题来保证安全，否则你将面临许多问题

在调试器控制台网络部分“header”UserAgent中查找标头

尝试

import requests
from bs4 import BeautifulSoup

from fake_useragent import UserAgent

url = 'http://www.google.com'
headers = {'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_13_6) 
AppleWebKit/537.36 (KHTML, like Gecko) Chrome/71.0.3578.98 Safari/537.36'}

response = requests.get(quote_page, headers=headers).text

soup = BeautifulSoup(response, 'html.parser')
print(soup.prettify())

Answer 5

它对我有用：

soup = BeautifulSoup(requests.get("your_url").text)

现在，下面的代码更好（使用lxml解析器）：

import requests
from bs4 import BeautifulSoup

soup = BeautifulSoup(requests.get("your_url").text, 'lxml')

BeautifulSoup：'Response'类型的对象没有len（）

问题描述投票：18回答：5

5个回答

最新问题

BeautifulSoup：'Response'类型的对象没有len（）

问题描述 投票：18回答：5

5个回答

最新问题

问题描述投票：18回答：5