I am extracting a page from website using cURL command.
curl "www.google.com" -s | w3m -dump -T text/html > foo.txt
The w3m command outputs the HTML page in a much more simpler format so that the string manipulation in the foo.txt
is simpler now.
Now, I have some values in my foo.txt
which get extracted as below.
Assistant director at Hollywood studios
Student at University of Texas at Arlington
Now, I need to extract only the values after at to store in my database. How can I do it? For example, for the above input, I need the values as,
Hollywood Studios
University of Texas at Arlington
at
Arlington. – jasonwryan Feb 10 '14 at 23:04