I have logs with this rows:
in24.inetnebr.com - - [01/Aug/1995:00:00:01 -0400] "GET /shuttle/missions/sts-68/news/sts-68-mcc-05.txt HTTP/1.0" 200 1839
where the first column (in24.inetnebr.com
) is the host, the second (01/Aug/1995:00:00:01 -0400
) is the timestamp, the 3rd (GET /shuttle/missions/sts-68/news/sts-68-mcc-05.txt HTTP/1.0
) is the downloaded page.
How can I find the last two downloaded page for every hosts with Pig?
Thank you very much for your help!
I've solved the problem, FYI: