我想,您希望返回0
属性中的content
,这可能很接近:
import re
import requests
from bs4 import BeautifulSoup
list3 = []
r = requests.get('https://bama.ir/car/peugeot/')
soup = BeautifulSoup(r.text,'html.parser')
res2 = soup.find_all('div',attrs={'class': 'listdata'})
# print(res2)
for item in res2:
z = re.findall(r'content="(\d*)"',str(item))
list3 += z
for item in list3:
print(item)
输出
145000000
145000000
77000000
77000000
0
116000000
116000000
106000000
106000000
105000000
105000000
0
58000000
3600000
0
0
142800000
142800000
35000000
35000000
0
如果您希望简化/修改/探索表达式,请在regex101.com的右上角进行说明。如果愿意,您还可以在this link中查看它如何与某些示例输入匹配。
,
我最终使用此代码获取零数字
import re
import requests
from bs4 import BeautifulSoup
list3 = []
r = requests.get('https://bama.ir/car/peugeot/')
soup = BeautifulSoup(r.text,attrs={'class': 'overview'})
for item in res2:
z1 = re.findall(r'content="([0-9])"',str(item))
list3 += z1
z2 = re.findall(r'<!-- <p class="cost"><span itemprop="price" content="(\d*)">',str(item))
list3 += z2
for item in list3:
print(item)
感谢您的帮助。
本文链接:https://www.f2er.com/3100973.html