因此,在下面的代码中,我想打开一个公寓网站的网址并抓取该网页。 唯一的问题是,Beautiful Soup不会等到整个网页都呈现出来。 直到将它们加载到页面上之前,这些公寓才会在html中呈现,这需要几秒钟的时间。我该如何解决?
from urllib.request import urlopen as uReq
from bs4 import BeautifulSoup as soup
my_url = 'https://xxxxx.com/properties/?sort=latest'
uClient = uReq(my_url)
page_html = uClient.read()
uClient.close()
page_soup = soup(page_html,"html.parser")
containers = page_soup.findAll("div",{"class":"grid-item"})
#len(containers) is empty since the contents haven't been loaded yet!