Wyodrębnij zawartość tabeli za pomocą Perla

Question

Oct 23, 2012, 08:53 PM

Wyodrębnij zawartość tabeli za pomocą Perla

Próbuję wyodrębnić zawartość tabeli z pliku html za pomocą HTML :: TableExtract. Moim problemem jest to, że mój plik HTML ma następującą strukturę:

<!DOCTYPE html>
<html>
<body>

    <h4>One row and three columns:</h4>

    <table border="1">
      <tr>
        <td>
        <p> 100 </p></td>
        <td>
        <p> 200 </p></td>
        <td>
        <p> 300 </p></td>
        </tr>
      <tr>
        <td>
        <p> 100 </p></td>
        <td>
        <p> 200 </p></td>
        <td>
        <p> 300 </p></td>
        </tr>
    </table>
</body>
</html>

Z powodu tej struktury moje dane wyjściowe wyglądają tak:

Zamiast tego, co chcę:

   100|200|300|
   400|500|600|

Czy możesz pomóc? Oto mój kod Perla

use strict;
use warnings;
use HTML::TableExtract;

my $te = HTML::TableExtract->new();
$te->parse_file('Table_One.html');

open (DATA2, ">TableOutput.txt")
    or die "Can't open file";

foreach my $ts ($te->tables()) {

    foreach my $row ($ts->rows()) {

        my $Final = join('|', @$row );
    print DATA2 "$Final";
    }
}
close (DATA2);