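At this point everything still works (although viewing the page source directly in the browser seems to show less than what print outputs... why is that?)
Then I added this: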
with open('/html_bd.txt','a') as f:
    f.write(html)
After that it raised an error. The traceback is:
Traceback (most recent call last):
  File "D:\Documents\notepad\webscrab.py", line 16, in <module>
    f.write(html)
UnicodeEncodeError: 'gbk' codec can't encode character '\xbb' in position 28678: illegal multibyte sequence
I searched online for a long time, but I'm still confused and haven't gotten it working.
How do I fix this? =-=
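
For reference, here is a minimal sketch of what seems to be happening, under some assumptions. On a Chinese-locale Windows, open() in text mode without an encoding argument usually falls back to the locale codec (gbk, as the traceback shows), and '\xbb' has no gbk representation. Passing encoding='utf-8' explicitly should avoid that. The html string and the temporary path below are stand-ins for illustration, not the real page or the real file path.

```python
import os
import tempfile

# Stand-in for the scraped page (assumption); '\xbb' is the character
# named in the traceback.
html = "<html>\xbb</html>"

# Hypothetical output path in a temp directory, so the sketch runs anywhere.
path = os.path.join(tempfile.mkdtemp(), "html_bd.txt")

# Reproduce the failure by forcing the codec from the traceback.
try:
    with open(path, "a", encoding="gbk") as f:
        f.write(html)
    gbk_failed = False
except UnicodeEncodeError:
    gbk_failed = True

# Writing with an explicit UTF-8 encoding accepts the same text.
with open(path, "a", encoding="utf-8") as f:
    f.write(html)

# Read it back with the same encoding to check nothing was lost.
with open(path, encoding="utf-8") as f:
    saved = f.read()
```

If this is indeed the cause, the only change needed in the original snippet would be adding encoding='utf-8' to the open() call. Whatever reads the file later would then also have to open it as UTF-8.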