使用 Selenium 查找 URL 无法正确打印

问题描述 投票:0回答:1

我正在尝试编译页面内的链接列表。然而,当打印列表时,输出是一堆随机数

links = driver.find_elements(By.CSS_SELECTOR, "meta[content*='www.airbnb.com.au/rooms/']")

print(links)

输出示例:

[<selenium.webdriver.remote.webelement.WebElement (session="faf70ce53ba59d6f6995883b0edfc006", element="f.CD9BA18B85DC3350F27BC5BF71FF60B2.d.5C798985F0B6BFD4214898A37DFC2A30.e.81")>, <selenium.webdriver.remote.webelement.WebElement (session="faf70ce53ba59d6f6995883b0edfc006", element="f.CD9BA18B85DC3350F27BC5BF71FF60B2.d.5C798985F0B6BFD4214898A37DFC2A30.e.82")>, <selenium.webdriver.remote.webelement.WebElement (session="faf70ce53ba59d6f6995883b0edfc006", element="f.CD9BA18B85DC3350F27BC5BF71FF60B2.d.5C798985F0B6BFD4214898A37DFC2A30.e.83")>, <selenium.webdriver.remote.webelement.WebElement (session="faf70ce53ba59d6f6995883b0edfc006", element="f.CD9BA18B85DC3350F27BC5BF71FF60B2.d.5C798985F0B6BFD4214898A37DFC2A30.e.84")>]

我正在尝试抓取的网站:

https://www.airbnb.com.au/s/黄金海岸--QLD/homes?place_id=ChIJt2BdK0cakWsRcK_e81qjAgM&refinement_paths%5B%5D=%2Fhomes&checkin=2024-12-22&checkout=2024-12-28&date_picker_type=calendar&adults= 9&children=2&pets=1&search_type=user_map_move&tab_id=home_tab&query=Gold%20Coast%2C%20QLD&flexible_trip_lengths%5B%5D=one_week&monthly_start_date=2024-08-01&monthly_length=3&monthly_end_date=2024-11-01&search_mode=regular_search&price_filter_ input_type=2&price_filter_num_nights=6&channel=探索&ne_lat=-27.8312213554841&ne_lng= 153.85466017727208&sw_lat=-28.314709414353157&sw_lng=153.3574718233147&zoom=10.257561001998951&zoom_level=10.257561001998951&search_by_map=true&price_min =8683&最大价格=14459&最小卧室=5

python selenium-webdriver screen-scraping
1个回答
0
投票
print(links)

您只是打印具有 Web 元素的

list
对象。要从目标元素获取 URL,您应该捕获
content
属性的值。

试试这个:

for link in links:
    print(link.get_attribute("content"))

输出:

www.airbnb.com.au/rooms/50961691?adults=9&children=2&pets=1&search_mode=regular_search&check_in=2024-12-22&check_out=2024-12-28&source_impression_id=p3_1720094600_P3E3TFbSLV_F8Hv-&previous_page_section_name=1000
www.airbnb.com.au/rooms/10732858?adults=9&children=2&pets=1&search_mode=regular_search&check_in=2024-12-22&check_out=2024-12-28&source_impression_id=p3_1720094600_P3xqAumg9A_T_K_Z&previous_page_section_name=1000
www.airbnb.com.au/rooms/25083963?adults=9&children=2&pets=1&search_mode=regular_search&check_in=2024-12-22&check_out=2024-12-28&source_impression_id=p3_1720094600_P3LxjqyKDwEQn8FV&previous_page_section_name=1000
www.airbnb.com.au/rooms/1112833302463251442?adults=9&children=2&pets=1&search_mode=regular_search&check_in=2024-12-22&check_out=2024-12-28&source_impression_id=p3_1720094600_P3NtINEuiRiA5r4F&previous_page_section_name=1000

Process finished with exit code 0
© www.soinside.com 2019 - 2024. All rights reserved.