CORC  > 厦门大学  > 航空航天-会议论文
Design and implementation of the topic-focused crawler based on scrapy
Xie, Dong Xiang ; Xia, Wen Feng ; Xie DX(谢东祥)
2014
关键词Data mining Testing Websites
英文摘要Conference Name:2013 International Forum on Materials Analysis and Testing Technology, IFMATT 2013. Conference Address: Qingdao, China. Time:December 9, 2013 - December 10, 2013.; E-commerce websites has abundant commercial data. Some very beneficial information to the analysis and prediction of the market can be discovered from these data by applying data mining techniques. The topic-focused web crawler can crawl and gather the subject-related web pages as soon as possible. This thesis has designed and realized the topic-focused crawler based on Scrapy. It firstly introduces the design idea of the crawler and highlights the functions of Scrapy's every part. Then, it uses this topic-focused crawler to realize the capture of information from the C2C e-commerce platform, for example TaoBao. At last, it obtains the running result and comparisons of crawling performance between Scrapy based crawler and general crawler. ? (2014) Trans Tech Publications, Switzerland.
语种英语
出处http://dx.doi.org/10.4028/www.scientific.net/AMR.850-851.487
出版者TRANS TECH PUBLICATIONS LTD
内容类型其他
源URL[http://dspace.xmu.edu.cn/handle/2288/85367]  
专题航空航天-会议论文
推荐引用方式
GB/T 7714
Xie, Dong Xiang,Xia, Wen Feng,Xie DX. Design and implementation of the topic-focused crawler based on scrapy. 2014-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace