php 通过php中的属性值获取HTML元素

Question

提问by stillenat

I need to extract some data from a webpage with php. The part that I'm interested in is structured similarly to this:

我需要使用 php 从网页中提取一些数据。我感兴趣的部分的结构与此类似：

<a href="somepath" target="fruit">apple</a>
<a href="somepath" target="animal">cat</a>
<a href="somepath" target="fruit">orange</a>
<a href="somepath" target="animal">dog</a>
<a href="somepath" target="fruit">mango</a>
<a href="somepath" target="animal">monkey</a>

First, I want to extract all fruits, and then all animals, so that I have them nicely grouped.

首先，我想提取所有水果，然后提取所有动物，以便将它们很好地分组。

I figured out how to loop through all attribute values. Here's the code:

我想出了如何遍历所有属性值。这是代码：

$dom = new DOMDocument();
$html = file_get_contents('example.html');

@$dom->loadHTML($html);

$a = $dom->getElementsByTagName('a');

for ($i; $i < $a->length; $i++) {
$attr = $a->item($i)->getAttribute('target');

echo $attr . "\n";
}

So I get:

所以我得到：

fruit animal fruit animal fruit animal

I also found out how to get the elements' text content:

我还发现了如何获取元素的文本内容：

$a->item($i)->textContent

So, if included in loop and echoed, I get:

所以，如果包含在循环中并得到回应，我得到：

apple cat orange dog mango monkey

I feel like I'm very close, but I can't get what I want. I need something like this:

我觉得我很接近，但我不能得到我想要的。我需要这样的东西：

if ( target = "fruit") then give me "apple, orange, mango".

如果（目标=“水果”）然后给我“苹果，橙子，芒果”。

Can someone please point me in the right direction?

有人可以指出我正确的方向吗？

Thanks.

谢谢。

Answer 1

回答by alex

Just continueon targetattributes which aren't fruit, and then add the textContentof the elements to an array.

只是continue在target不是的属性上fruit，然后将textContent元素的添加到数组中。

$nodes = array();

for ($i; $i < $a->length; $i++) {
    $attr = $a->item($i)->getAttribute('target');

    if ($attr != 'fruit') {
        continue;
    }

    $nodes[] = $a->item($i)->textContent;
}

$nodesnow contains all the nodes of the elements which have their targetattribute set to fruit.

$nodes现在包含target属性设置为的元素的所有节点fruit。

Answer 2

回答by fardjad

use DOMXPathand queries:

使用DOMXPath和查询：

$doc = new DOMDocument();
$doc->Load('yourFile.html');

$xpath = new DOMXPath($doc);

$fruits = $xpath->query("//a[@target='fruit']");
foreach($fruits as $fruit) {
    // ...
}

$animals = $xpath->query("//a[@target='animal']");
foreach($animals as $animal) {
    // ...
}

See thisdemo.

看到这个演示。

Answer 3

回答by XMen

Make two array

制作两个数组

$fruits=array();
$animals=array();

t and in loop when you get .

t 并在循环中获得 .

if(target=='fruit') {
   array_push($fruits,$valueofelement);

} else if ($target=='animal') {
   array_push($animals,$valueofelement);
}

php 通过php中的属性值获取HTML元素

提问by stillenat

回答by alex

回答by fardjad

回答by XMen

相关推荐

最近更新

标签

php 通过php中的属性值获取HTML元素

提问by stillenat

回答by alex

回答by fardjad

回答by XMen

相关推荐

php utf-8 特殊字符不显示

php Wordpress：WP_Query 如何使用自定义帖子类型应用搜索条件

php PHP刷新窗口？相当于 F5 页面重新加载？

php 如何从 Yahoo Finance 等网站获取数据？

相关推荐

最近更新

标签