Automatic Text Classification for Label Imputation of Medical Diagnosis Notes based on Random Forest
Yujie Yang; Bokai Yang; Guangzhe Dai; Darong Tang; Qi Li; Denan Lin; Jing Zheng; Yunpeng Cai
2018
会议日期2018
会议地点Cairns, Queensland, Australia
英文摘要Electronic medical records (EMRs) contain many information of pa- tients, which are of great value for data mining for various clinical applications. However, information missing, including label missing, is pervasive in nature EMRs which would bring lots of obstacles for processing of the medical text contents. The aim of this study is to adopt automatic text classification technolo- gies to recover missing medical text labels for EMRs and support downstream analyses. A combination of word-embedding technology and random forest clas- sifiers are applied to identify multiple medical note labels including disease types and examination types, from short texts of medical imaging diagnosis notes. The results show that the average binary classification accuracies are 91%. Our re- search results indicate that using advanced NLP techniques for EMRs can reach high classification accuracies.
语种英语
内容类型会议论文
源URL[http://ir.siat.ac.cn:8080/handle/172644/14151]  
专题深圳先进技术研究院_数字所
推荐引用方式
GB/T 7714
Yujie Yang,Bokai Yang,Guangzhe Dai,et al. Automatic Text Classification for Label Imputation of Medical Diagnosis Notes based on Random Forest[C]. 见:. Cairns, Queensland, Australia. 2018.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace