当前位置: 首页 > 后端技术 > Python

爬虫--百度图片

时间:2023-03-26 18:52:20 Python

制作爬虫一般分为以下几个步骤:分析需求分析网页源码,用开发者工具编写正则表达式或XPath表达式,正式编写python爬虫代码代码#-*-coding:utf-8-*-importreimportrequestsdefdowmloadPic(html,keyword,limit):pic_url=re.findall('"objURL":"(.*?)",',html,re.S)i=1print('找到关键字:'+keyword+'图片,现在开始下载图片...')foreachinpic_url:print('Downloading'+str(i)+'图片地址:'+str(each))try:pic=requests.get(each,timeout=10)exceptrequests.exceptions.ConnectionError:print('[Error]当前图片无法下载')continuedir='./images/'+keyword+'_'+str(i)+'.jpg'fp=open(dir,'wb')fp.write(pic.content)fp.close()i+=1ifi>limit:breakif__name__=='__main__':word=input("请输入搜索关键字:")limit=int(input("请输入下载次数:"))url='http://image.baidu.com/search/flip?tn=baiduimage&ie=utf-8&word='+word+'&ct=201326592&v=flip'result=requests.get(url)下载Pic(result.text,word,limit)结果