Design and implementation of the topic-focused crawler based on scrapy | |
Xie, Dong Xiang ; Xia, Wen Feng ; Xie DX(谢东祥) | |
2014 | |
Keywords | Data mining; Testing; Websites |
Abstract (English) | Conference Name: 2013 International Forum on Materials Analysis and Testing Technology, IFMATT 2013. Conference Address: Qingdao, China. Time: December 9, 2013 - December 10, 2013. E-commerce websites hold abundant commercial data. Applying data mining techniques to these data can reveal information that is valuable for market analysis and prediction. A topic-focused web crawler gathers subject-related web pages as quickly as possible. This work designs and implements a topic-focused crawler based on Scrapy. It first introduces the design of the crawler and describes the function of each Scrapy component. It then uses the crawler to capture information from a C2C e-commerce platform, taking TaoBao as an example. Finally, it reports the running results and compares the crawling performance of the Scrapy-based crawler with that of a general crawler. © (2014) Trans Tech Publications, Switzerland. |
Language | English |
Source | http://dx.doi.org/10.4028/www.scientific.net/AMR.850-851.487 |
Publisher | TRANS TECH PUBLICATIONS LTD |
Content type | Other |
Source URL | http://dspace.xmu.edu.cn/handle/2288/85367 |
Collection | Aerospace - Conference Papers |
Recommended citation (GB/T 7714) | Xie, Dong Xiang, Xia, Wen Feng, Xie DX. Design and implementation of the topic-focused crawler based on scrapy. 2014-01-01. |