file_get_html 错误,不起作用

问题描述 投票:0回答:3

我正在使用 Simple HTML Dom 尝试抓取 HTML 表格。 我按照他们的说明进行操作并查看了许多其他代码示例,但 file_get_html 似乎不起作用。

这是我的代码:

<?php

// Simple HTML Dom Parser
include('simple_html_dom.php');

//$worlds = ["Amera", "Antica", "Astera", "Aurera", "Aurora", "Bellona", "Belobra", "Beneva", "Calmera", "Calva", "Calvera", "Candia", "Celesta", "Chrona", "Danera", "Dolera", "Efidia", "Eldera", "Ferobra", "Fidera", "Fortera", "Garnera", "Guardia", "Harmonia", "Honera", "Hydera", "Inferna", "Iona", "Irmada", "Julera", "Justera", "Kenora", "Kronera", "Laudera", "Luminera", "Magera", "Menera", "Morta", "Mortera", "Neptera", "Nerana", "Nika", "Olympa", "Osera", "Pacera", "Premia", "Pythera", "Quilia", "Refugia", "Rowana", "Secura", "Serdebra", "Shivera", "Silvera", "Solera", "Tavara", "Thera", "Umera", "Unitera", "Veludera", "Verlana", "Xantera", "Xylana", "Yanara", "Zanera", "Zeluna"];

//foreach ($worlds as $world) {
  // All HTML from the online list
  $html = file_get_html('https://secure.tibia.com/community/?subtopic=worlds&world=Antica');

  // Search for the online list table content
  foreach ($html->find('tr[class=Table2]') as $row) {
    $name = $row->find('td', 0)->plaintext;
    $level = $row->find('td', 1)->plaintext;
    $vocation = $row->find('td', 2)->plaintext;

    echo $name . ' | ' . $level . ' | ' . $vocation . '<br>';
  }
//}

?>

我收到这些错误:

Warning: file_get_contents(): stream does not support seeking in D:\xampp\htdocs\simple_html_dom.php on line 76

Warning: file_get_contents(): Failed to seek to position -1 in the stream in D:\xampp\htdocs\simple_html_dom.php on line 76

Fatal error: Uncaught Error: Call to a member function find() on boolean in D:\xampp\htdocs\index.php:13 Stack trace: #0 {main} thrown in D:\xampp\htdocs\index.php on line 13

我做错了什么? 我试图抓取的表格是“在线玩家”表格: https://secure.tibia.com/community/?subtopic=worlds&world=Antica

php html web-scraping dom simple-html-dom
3个回答
4
投票

试试这个:

$html = str_get_html(file_get_contents($url));

4
投票

这是最新版本 PHP 的 simple_html_dom 库问题。 要纠正它,只需将“simple_html_dom.php

”文件中“
file_get_html”函数的参数中的“
$offset = -1,
”更改为“$offset = 0,”即可。


0
投票

我对 simpledom 不太了解,但我认为你可能需要使用更强大的库,例如 https://github.com/FriendsOfPHP/Goutte

© www.soinside.com 2019 - 2024. All rights reserved.