Consider the following commands and their results:
$ echo "<br/> <br/>a<br/>b<br/>c<br/><br/> <br/>"|lynx -dump -stdin
a
b
c
$ echo "<br/> <br/>a<br/>b<br/>c<br/><br/> <br/>"|elinks -dump
a
b
c
Neither prints the correct number of lines: elinks skips the first white space, and both skip blank lines and trailing lines containing only white space.
Is there a way to force lynx or elinks to interpret all spaces and line breaks? I didn't see anything obvious in their man pages.
(I mean, besides using a temporary character to be suppressed by sed or tr or whatever after the display.)
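For reference, the placeholder workaround mentioned above could be sketched roughly as follows. This is only an illustration, not a built-in feature of either browser: the marker string `@@EOL@@` and the helper names `mark_breaks` and `strip_marks` are made up for the example, and the marker is assumed never to occur in the real data. It should keep otherwise empty lines from being dropped, but it does not try to preserve the spaces themselves.

```shell
#!/bin/sh
# Hypothetical marker; assumed not to appear anywhere in the real input.
MARK='@@EOL@@'

# Put the marker in front of every <br/> (or <br />) so that each rendered
# line contains at least one visible token and is not discarded.
mark_breaks() {
    sed "s|<br */>|${MARK}&|g"
}

# Remove the marker again once the text has been rendered.
strip_marks() {
    sed "s/${MARK}//g"
}

# Only call lynx if it is installed; elinks -dump could be swapped in here.
if command -v lynx >/dev/null 2>&1; then
    echo "<br/> <br/>a<br/>b<br/>c<br/><br/> <br/>" \
        | mark_breaks | lynx -dump -stdin | strip_marks
fi
```

The marker is inserted before each break rather than after it, so the line that ends at the break is the one that gets the visible token. Any left margin added by the renderer is left untouched and would still need trimming.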
[...] ~ ). – Skippy le Grand Gourou Nov 30 '14 at 14:26
[...] paste, and finally I use psql \copy to fill the DB. Usually I use hxselect "td:nth-child()" to get columns, but for one particular column it gives me HTML code which needs to be interpreted, which is why I use lynx; the issue then arises when rows are empty. – Skippy le Grand Gourou Nov 30 '14 at 14:32
[...] sed through it. I find bash tools quite well suited to text processing, so for now I don't see the need for anything else (though I needed perl's HTML::TableExtract once). – Skippy le Grand Gourou Nov 30 '14 at 15:39