I hope all is well with you.
I need a solution to a time-out problem when web scraping a URL. The program is running in the cloud, at Hetzner.
This effect can be simulated with the curl command below:
curl https://www.fnde.gov.br/distribuicaosimadnet/confirmarCancelar
Failure Message:
curl https://www.fnde.gov.br/distribuicaosimadnet/confirmarCancelar curl: (28) Failed to connect to www.fnde.gov.br port 443 after 131364 ms: Connection timed out
The curious thing is that this access works correctly in our local machine and in AWS, both on Linux.
Another fact that draws attention is that the access below, via curl, can work on Hetzner:
The original application is written in Python, but I used curl as an alternative to simulate the problem.
It seems to me that
www.fnde.gov.br
site admin restricted the access of this website outside the country (Brazil here). This happens with many countries restricting their websites for access.I assume the Hetzner VM is somewhere in Germany same as where I am. The
traceroute
command fails after few hops. You could verify the same from ping.euNevertheless,
traceroute
works from perfops.net Brazil nodes. Their web application seems to be running when I checked withpingdom
tools. Test results for you here.As far as the timeouts are concerned, it may not work for you from Hetzner infrastructure as it's either based in Europe & North America only.
Another option could be using
vpn
/Cloudflare ZTNA
connecting to a server located inBrazil