How to get all the results from a web page, as the browser shows them after scrolling to the bottom

I'm trying to get all the video results from a web page:

$ curl -qs https://ok.ru/video/c335170 | pup '.video-card_lk attr{href}' | wc -l
24

Another method returns the same result:

$ wget --config="/dev/null" -qO- https://ok.ru/video/c335170 | grep -oP '/video/\d+' | sort -u | wc -l
24

EDIT 1: I scrolled the web page to the end in Firefox, saved it as c335170.html, and still get the same result:

$ cat c335170.html | grep -oP '/video/\d+' | sort -u | wc -l
24

However, after scrolling to the end in the web browser, the page shows 81 results.

The same problem occurs with YouTube, where the "Load more" button hides results from command-line HTTP clients:

$ curl -qs https://www.youtube.com/user/impacttvouaga/videos | grep -oP "/watch\?v=[\w-]+" | uniq | wc -l
21

EDIT 2: I've just saved this web page with Firefox as "Web Page, HTML only" into RMC_IMPACTV__YouTube.html, and then:

$ cat RMC_IMPACTV__YouTube.html | grep -oP "/watch\?v=[\w-]+" | uniq | wc -l
21

How can I get the remote HTTP server to give me all the results?

There is 1 answer

SebMa

To download the expanded HTML, I installed Save Page WE, and to scroll down the page I installed Scroll it!
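
Once the expanded page has been saved, the counting pipeline from the question can be pointed at that file to check whether all results made it in. The following is only a minimal sketch: the function name `count_video_links` is made up for illustration, and it assumes GNU grep (for `-P`), as the commands in the question do.

```shell
#!/usr/bin/env bash
# Hypothetical helper: count the distinct /video/<id> links in a saved HTML file.
# Usage: count_video_links FILE
count_video_links() {
    # -o prints only the matching part of each line; -P enables Perl-style \d (GNU grep).
    # sort -u removes duplicate links even when they are not on adjacent lines.
    grep -oP '/video/\d+' "$1" | sort -u | wc -l
}
```

For example, `count_video_links c335170_expanded.html`, where the file name stands for whatever name the page was saved under. If the saved file really contains everything the browser displayed, the count should presumably match the number of results seen after scrolling. For the YouTube case the pattern would need to be swapped for the `/watch\?v=[\w-]+` one from the question.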