Won't run, but no error is reported either
When I click Run, the console says the program finished in under a second, but there is no error and no output is generated... Please help, thanks!
2016-01-16
Change it to http://baike.baidu.com/view/4072022.htm
# coding: utf8
from spirder import manager, downloader, parser, output


class spirdermain(object):
    def __init__(self):
        self.manager = manager.UrlManager()
        self.downloader = downloader.HtmlDownloader()
        self.parser = parser.HtmlParser()
        self.output = output.HtmlOutput()

    def craw(self, root_url):
        count = 1
        self.manager.add_new_url(root_url)
        while self.manager.has_new_url():
            try:
                new_url = self.manager.get_new_url()
                print 'craw %d:%s' % (count, new_url)
                html_cont = self.downloader.downloader(new_url)
                new_urls, new_data = self.parser.parser(new_url, html_cont)
                self.manager.add_new_urls(new_urls)
                self.output.collect_data(new_data)
                # Stop after 1000 pages.
                if count == 1000:
                    break
                count = count + 1
            except Exception as e:
                # Print the reason so that failures are not silent.
                print 'craw failed: %s' % e
        self.output.output_html()


# The post had "_main_" (single underscores) here; that comparison is never
# true, so the block below was skipped and the script ended immediately.
if __name__ == "__main__":
    root_url = "http://baike.baidu.com/link?url=sP2Dq8raiXUsDeUd8GbiC1C1HcvMO8I8dkoUi5UcIiDwFckEBG9G4KHTKVsCPWoPh1y4LDZZKtEeIA59EYISx_"
    obj_spirder = spirdermain()
    obj_spirder.craw(root_url)
Uh, this is the code... I misspelled the English and typed "spirder" instead of "spider", but that shouldn't be a problem.
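Judging only from the code as posted, one likely reason for the instant exit with no error is the main guard: as originally written it compared `__name__` against "_main_" with single underscores, while Python sets `__name__` to "__main__" (double underscores) when a file is run directly. The comparison is simply false, so nothing inside it runs and nothing is reported. A minimal sketch of that behaviour follows; `run_guard` is a hypothetical helper made up for illustration and is not part of the spider.

```python
# Minimal sketch: why a misspelled main guard makes a script exit silently.
# `run_guard` is a hypothetical helper used only for this illustration.

def run_guard(module_name, expected):
    """Return True if a guard `module_name == expected` would let the block run."""
    return module_name == expected


# When a file is executed directly, Python sets __name__ to "__main__"
# (two underscores on each side).
script_name = "__main__"

wrong = run_guard(script_name, "_main_")    # single underscores: never matches
right = run_guard(script_name, "__main__")  # double underscores: matches

print("guard with '_main_' runs: %s" % wrong)
print("guard with '__main__' runs: %s" % right)
```

If the guard turns out not to be the only problem, printing the exception inside the `except` branch of `craw` should show what fails next.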