从外部html源读取时出现问题
我只想在我的例子中读取一个自定义对象'HSDPA 2100'
但我的实际代码是从外部源读取所有nfo类。
外部html:

<table cellspacing="0">
<tbody><tr>
<th rowspan="8" scope="row">General</th>
<td class="ttl"><a href="network-bands.php3">2G Network</a></td>
<td class="nfo">CDMA 800 / 1900 </td>
</tr><tr>
<td class="ttl">&nbsp;</td>
<td class="nfo">GSM 850 / 900 / 1800 / 1900 </td>
</tr>
<tr>
<td class="ttl"><a href="network-bands.php3">3G Network</a></td>
<td class="nfo">HSDPA 2100 </td>
</tr>
<tr>
<td class="ttl">&nbsp;</td>
<td class="nfo">CDMA2000 1xEV-DO </td>
</tr>
<tr>
<td class="ttl"><a href="network-bands.php3">4G Network</a></td>
<td class="nfo">LTE 800 </td>
</tr>
<tr>
<td class="ttl"><a href="glossary.php3?term=sim">SIM</a></td>
<td class="nfo">Mini-SIM</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_year.htm');">Announced</a></td>
<td class="nfo">2013, January</td>
</tr>
<tr>
<td class="ttl"><a href="#" onclick="helpW('h_status.htm');">Status</a></td>
<td class="nfo">Coming soon. Exp. release 2013, February</td>
</tr>
</tbody></table><table cellspacing="0">
<tbody><tr>
<th rowspan="2" scope="row">Body</th>
<td class="ttl"><a href="#" onclick="helpW('h_dimens.htm');">Dimensions</a></td>
<td class="nfo">-</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_weight.htm');">Weight</a></td>
<td class="nfo">&nbsp;</td>
</tr>

</tbody></table><table cellspacing="0">
<tbody><tr>
<th rowspan="4" scope="row">Display</th>
<td class="ttl"><a href="glossary.php3?term=display-type">Type</a></td>
<td class="nfo">TFT capacitive touchscreen, 16M colors</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_dsize.htm');">Size</a></td>
<td class="nfo">1080 x 1920 pixels, 5.9 inches (~373 ppi pixel density)</td>
</tr>
<tr>
<td class="ttl"><a href="glossary.php3?term=multitouch">Multitouch</a></td>
<td class="nfo">Yes</td>
</tr>
<tr><td class="ttl">&nbsp;</td><td class="nfo">- Flux UX UI</td>

我正在尝试使用以下代码:
  <?php
  include_once('/simple_html_dom.php');
  $dom = file_get_html("http://www.site.com/pantech_vega_no_6-5268.php");
  // alternatively use str_get_html($html) if you have the html string already...
  foreach ($dom->find('td[class=nfo]') as $node)
{
$result = $node->innertext;
$price = explode(",", $result);
echo $price[0];
}
?>

我收到这个:CDMA 800 / 1900 GSM 850 / 900 / 1800 / 1900 HSDPA 2100 CDMA2000 1xEV-DO LTE 800 Mini-SIM2013Coming soon. Exp. r... etc
我想要的是HSDPA 2100但是对于其他型号的手机来说,HSDPA 1900的值可以是HSPDA或者其他,总是稳定的,并且是第一位的。

最佳答案

所有td都有相同的类名“nfo”,并且循环遍历所有元素,这样得到的结果与预期的一样。
如果所需的数据始终位于第三行,则可以填充数组而不是获取变量,然后获取第三个值。就像这个$result[2]
更新:如果HSDPA总是存在,只需检查它。

     <?php
      include_once('/simple_html_dom.php');
      $dom = file_get_html("http://www.site.com/pantech_vega_no_6-5268.php");
      // alternatively use str_get_html($html) if you have the html string already...
      foreach ($dom->find('td[class=nfo]') as $node)
    {
    $result = $node->innertext;
if (strpos($result, 'HSDPA') === false)
{
continue;
}
    $price = explode(",", $result);
    echo $price[0];
break;
    }
    ?>

关于php - 从外部html源自定义对象读取,我们在Stack Overflow上找到一个类似的问题:https://stackoverflow.com/questions/14571961/

10-09 19:58