基于标签信息的协同过滤算法研究

CORC > 自动化研究所 > 中国科学院自动化研究所 > 毕业生 > 硕士学位论文

题名	基于标签信息的协同过滤算法研究
作者	彭景
学位类别	工学硕士
答辩日期	2011-05-29
授予单位	中国科学院研究生院
授予地点	中国科学院自动化研究所
导师	曾大军 ; 赵惠民
关键词	推荐系统协同过滤社会书签系统协同标注模型资源推荐 recommender systems collaborative filtering social bookmarking systems collaborative tagging model item recommendation
其他题名	Tag-based Recommendation in Social Bookmarking Systems
学位专业	计算机应用技术
中文摘要	社会书签系统允许用户用自由形式的文字或短语（即标签）对感兴趣的资源（如网页、商品、电影、音乐、相片等）进行标注。众多用户的标注数据聚合在一起就构成了一个协同标注平台，在该平台上用户可以基于标签进行信息的社会化管理和检索。社会书签系统对好友或群组关系的支持为自身的发展注入了非常核心的社交功能，进一步增强了其社会化特性。所有这些都为基于群体智慧的信息组织和发现方式提供了良好的基础，而基于标签信息的资源推荐算法是这种社会化信息发现机制获得成功的关键。目前基于标签信息的资源推荐算法研究才刚刚开始，还远未达到成熟。现有算法的不足主要体现在两个方面：一是协同标注数据内在结构的理解上；一是协同标注数据内所蕴含交互关系的全面利用。当前社会书签领域的绝大部分资源推荐算法都显式或隐式地假定协同标注数据内在是一种简单的三厢图结构，而这种结构在解释用户既具个性又具协同性的标注行为时存在本质上的困难。因此，在三厢图结构基础上提出的资源推荐算法都具有先天的局限性。另一方面，当前基于标签的资源推荐算法对于协同标注数据中可用信息的利用都很不充分，往往只能考虑多种交互关系中的两种，使得标签信息在推荐中的作用未能得到最大化。此外，如何在一个系统的框架下，全面有效地利用标签信息和社会网络信息进行社会化推荐也是一个亟待解决的问题。本文的工作主要针对以上几个问题进行，研究的主要内容如下： 1) 研究了协同标注数据的内在结构，提出了两种新的模型（以主题为中心的协同标注模型和统一的协同标注模型）对协同标注数据进行更加合理而完整的表述。这两种模型为全面地利用标签信息打下了坚实的基础。 2) 提出了资源-标签联合推荐框架，将资源推荐和标签推荐两个原本独立的任务整合到了同一个框架中。对资源和标签进行联合推荐的一大好处是可以充分利用各种信息对联合推荐结果进行调整。 3) 设计了利用未收藏资源所携带的信息缓解协同标注数据稀疏性的方法。该方法在解决稀疏性的同时将社会书签领域的资源推荐问题转化为了一个有关权重分配的带约束最优化问题。 4) 初步尝试了同时利用标签信息和社会网络信息进行社会化资源推荐，为后续的研究工作打下了一定的基础。
英文摘要	Social bookmarking systems encourage users to annotate Web resources of interest (e.g., URLs, products, movies, music, photos, etc.) with free-style words or phrases. The tagging data of all users constitutes a collaborative tagging platform, and individual users can manage and retrieve information items socially on top of this platform. The support of friendship or group membership equips social bookmarking systems with a critical social utility and further enhances its social aspect. All the above mentioned has laid promising foundations for crowd source driven social information organization and discovery. Going beyond these foundations, effective tag-based information item recommendation is the key to the success of such social information discovery mechanism. The research on tag-based item recommendation is just emerging and yet far from matures. Current research in this area mainly suffers from two problems: the precise understanding of the underlying structure of tagging data and the comprehensive use of all kinds of interaction appeared in the tagging data. So far, most tag-aware item recommendation methods either explicitly or implicitly assume a simple tripartite graph structure, whereas such a structure faces essential difficulties in explaining the personalized and yet collaborative tagging behavior of users. Thus, the algorithms designed on top of the tripartite graph structure have intrinsic limitations. On the other hand, most item recommendation methods in social bookmarking systems only take advantage of two types of all possible interaction available in tagging data, rendering the potential of tag-based recommendation far from being fully realized. In addition, how to make full use of both tagging information and social network information to sustain social recommendations is also an important problem remains to be explored. The major work of this thesis is as follows: a) Studied the underlying structure of collaborative tagging data and proposed two novel models (subject-centered model and integrated tagging model) to model tagging data in a more reasonable and comprehensive manner. Both models lay the foundations to make full use of tagging information. b) Proposed the joint item-tag recommendation framework, which integrates the item recommendation and tag recommendations tasks into the same framework. One advantage of joint item-tag recommendation is that all kinds of information can be easily incorporated to refine the joint recom...
语种	中文
其他标识符	200728017029223
内容类型	学位论文
源URL	[http://ir.ia.ac.cn/handle/173211/7593]
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	彭景. 基于标签信息的协同过滤算法研究[D]. 中国科学院自动化研究所. 中国科学院研究生院. 2011.

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

[发表评论/异议/意见]

暂无评论

评论
权益异议
反馈意见

评注功能仅针对注册用户开放，请您登录

您对该条目有什么异议，请向管理员反馈。
内容：
Email：	*
单位:
验证码：	刷新

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接

CORC

联系我们