I want to scrape links to patents from a Google Patents Search using BeautifulSoup, but I'm not sure if Google converts their html into javascript, which cannot be parsed through BeautifulSoup, or what the issue is.
Here is some simple code:
url = 'https://patents.google.com/?assignee=Roche&after=priority:20110602&type=PATENT&num=100'
soup = BeautifulSoup(requests.get(url).content, 'html.parser')
links = []
for link in soup.find_all('a', href=True):
    print(link['href'])
I also wanted to append the links into the list, but nothing is printed because there are no 'a' tags from the soup. Is there any way to grab the links to all of the patents?
 
                        
Data is dynamically render so its hard to get from
bs4so what you can try go to chrome developer mode.Then go to Network tab you can now find xhr tab reload your web page so there will be links under Name tab from that one link is containing all data as json format
so you can copy the address of that link and you can use
requestsmodule make call and now you can extract what so ever data you wantalso if you want individual link so it is made of publication_number and you can join it with old link to get links of publications.
Output:
Image: