一种改进的K-近邻分类法

An Improved K-nearest Neighbor Cassification Method

摘要: 介绍现有K-近邻分类法的基本思想和研究现状，并针对此方法在分类各类数据集分布不平衡时容易造成分类精度低的问题作相应的改进。改进的K-近邻分类法中引入类代表度和样本代表度，使得K-近邻分类法在相似度计算时选出的近邻样本更能代表其所在类，从而减小误判率。实验证明改进方法有效。

Abstract: This paper introduces the basic ideas and research status of the existing K-nearest neighbor method, and improve the low classification accuracy when all kinds of data sets are distributed unbalanced. In the improved K-nearest neighbor method, the class representation and sample representation are introduced, so that the nearest neighbor samples, which were selected by K-nearest neighbor classification in the similarity calculation, were more representative of its class, thus reducing the false positive rate. The validity of the improved method is proved by experiments.