
An Improved CHI Text Feature Selection Method Based on the Location and Word Frequency Information (基于位置及词频信息的优化CHI文本特征选择方法)



Computer Science and Application (计算机科学与应用), 2015, 5(9), 322-330
Published Online September 2015 in Hans. /journal/csa /10.12677/csa.2015.59040

An Improved CHI Text Feature Selection Method Based on the Location and Word Frequency Information

Aling Song (宋阿羚), Haifeng Liu (刘海峰), Shousheng Liu (刘守生)
Institute of Sciences, PLA University of Science and Technology, Nanjing, Jiangsu
Email: hfliu1962@

Received: Oct. 2nd, 2015; accepted: Oct. 16th, 2015; published: Oct. 21st, 2015

Copyright © 2015 by authors and Hans Publishers Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). /licenses/by/4.0/

Abstract

Text feature selection is a core technology of automatic text categorization. To address the shortcomings of the classical CHI model, we first screen the feature set according to whether each feature is positively or negatively correlated with the categories. Second, we adjust the feature weights according to the class-deflection conditions. Third, starting from word frequency, we progressively improve the model using the feature's specific location in the text and the distribution of its information between classes. Finally, we propose an optimized CHI feature selection method. Text classification experiments demonstrate the effectiveness of the optimized CHI model.

Keywords

Feature Selection, Chi-Square, Relevance, Location Distribution, Class Deflection

Cite this article: Aling Song, Haifeng Liu, Shousheng Liu. An Improved CHI Text Feature Selection Method Based on the Location and Word Frequency Information [J]. Computer Science and Application, 2015, 5(
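
For orientation, the classical CHI (chi-square) statistic that the abstract takes as its starting point can be sketched as below. This is only the textbook form computed from a 2x2 document-count table; the paper's optimized model (location, word frequency, and class-deflection adjustments) is not given in this excerpt and is not reproduced here. The function names `chi_square` and `correlation_sign` are hypothetical, chosen for illustration.

```python
def chi_square(a: int, b: int, c: int, d: int) -> float:
    """Textbook chi-square score for one (term, category) pair.

    a: documents in the category that contain the term
    b: documents outside the category that contain the term
    c: documents in the category that lack the term
    d: documents outside the category that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        # Degenerate table (an empty row or column): no evidence either way.
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def correlation_sign(a: int, b: int, c: int, d: int) -> int:
    """Return +1, -1, or 0 for positive, negative, or no correlation.

    The squared term in chi_square hides this sign, so a term that is
    typical of the category and one that is typical of its complement
    can receive the same score.
    """
    diff = a * d - c * b
    return (diff > 0) - (diff < 0)


if __name__ == "__main__":
    # Hypothetical counts: the term appears mostly inside the category.
    print(chi_square(40, 10, 10, 40), correlation_sign(40, 10, 10, 40))
    # Mirror case: the term appears mostly outside the category.
    print(chi_square(10, 40, 40, 10), correlation_sign(10, 40, 40, 10))
```

Both calls yield the same score but opposite signs, which illustrates why the sign of the correlation has to be examined separately from the score itself.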





