fake_useragent库在pycharm中安装要注意下划线 _ 换成减号 –
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 |
import requests from fake_useragent import UserAgent import re url = 'https://www.qiushibaike.com/text/' headers = { 'User-agent':UserAgent().random } <em># 构造请求</em> response = requests.get(url,headers=headers) info = response.text <em># 正则表达式解析数据</em> infos = re.findall(r'<div class=""content"">\s*<span>\s*(.+)\s*</span>',info) <em># 保存文件</em> with open('qiushi.txt','w',encoding='utf-8') as f: for text in infos: text = re.sub(r'<br/>','',text) f.write(text+'\n\n\n') |
近期评论