how to crowd source my web crawling

Question

409 views Asked by hoju At 24 October 2009 at 11:17

My web application requires downloading content from a URL that the user specifies. Currently these requests go through my server, which is inefficient and could get my server's IP blocked.

Is there a way to let the user download the URL content directly? The same-origin policy seems to prevent using AJAX or an iframe to download and reuse this content.

Any ideas? For example, is there a way to download and reuse URL content via Flash?

Original Q&A

There are 2 answers

**Martin v. Löwis** · Answer 1 · 2009-10-24T11:22:38+00:00

If it's a specific website, I recommend talking to the website operators rather than trying to crawl anonymously.

**Paul Dixon** · Answer 2 · 2009-10-24T11:22:43+00:00

You could use Tor to mask your requests, but if you're having to go to such lengths to crawl a website, perhaps you shouldn't be doing it?

Also, with your approach the iframe request will include your page URL as the referrer, which makes identifying these requests at the server end pretty straightforward...
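To illustrate the referrer point, here is a rough sketch of how a site operator might flag such requests. It assumes the operator has each request's headers as a dictionary; the function name `referred_by` and the host `crawler-app.example` are made up for the example and do not come from any real server or library.

```python
from urllib.parse import urlsplit


def referred_by(headers, embedding_host):
    """Return True if the request's Referer header points at embedding_host."""
    # HTTP header names are case-insensitive, so normalise before lookup.
    normalised = {name.lower(): value for name, value in headers.items()}
    # The header is spelled "Referer" in HTTP; it may be absent entirely.
    referer = normalised.get("referer", "")
    # urlsplit().hostname is lowercased, or None when there is no host.
    host = urlsplit(referer).hostname or ""
    return host == embedding_host.lower()


# Hypothetical requests as the target site might see them.
requests_seen = [
    {"Referer": "http://crawler-app.example/fetch?u=1", "User-Agent": "Mozilla/5.0"},
    {"Referer": "http://search.example/results", "User-Agent": "Mozilla/5.0"},
    {"User-Agent": "Mozilla/5.0"},
]

flagged = [referred_by(h, "crawler-app.example") for h in requests_seen]
print(flagged)
```

Only the first request is flagged, because its referrer names the embedding page's host. A real operator would more likely apply the same check to access logs, but the idea is the same: every iframe load initiated from your page carries the same identifying referrer.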
