Company name matching Common Crawl using mrjob

240 views Asked by At

I have a list of company name and details like ph.no, address, email etc.,. I want to get their company_url. We thought of using google API to make requests but it turns out to be costly.

After searching I found Common_Crawl which was somewhat close to google in website dumb data wise.

I found a website to actually map our phone number with the available phone numbers in Common_Crawl.

I need to find a way to match them using Company name.

Is there any way my which I can map by Company name with Common_crawl data. I don't want to look through 3.25 billion common_crawl records for each company name.

0

There are 0 answers