Large Quantity of Text Classification Based on the Improved Feature-Line Method

Abstract

The feature-line method holds that a line between two points of the same class represents the feature space of that class better than a single point does. However, because it classifies by distance alone, it can produce errors in the classification results. Here, a coefficient is introduced to eliminate the influence of off-group points (outliers) on classification and is combined with the distance to the class center to form an improved algorithm, which is applied to two document repositories of different sizes. The experimental results show that the improved algorithm supports large document repositories very well and can be used for large-scale text classification and text retrieval.
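
The abstract does not give the formulas, so the following is only a minimal sketch of the general idea, not the paper's algorithm. The feature-line distance is the standard point-to-line projection distance. The weighted sum with the class-center distance and the weight `alpha` are hypothetical stand-ins for the paper's coefficient, and the names `feature_line_distance`, `centroid_distance`, `combined_score`, and `classify` are illustrative assumptions.

```python
import math
from itertools import combinations


def feature_line_distance(x, x1, x2):
    """Distance from query x to the (infinite) line through prototypes x1 and x2."""
    direction = [b - a for a, b in zip(x1, x2)]
    offset = [q - a for q, a in zip(x, x1)]
    denom = sum(d * d for d in direction)
    if denom == 0.0:
        # Degenerate line (identical prototypes): fall back to point distance.
        return math.sqrt(sum(o * o for o in offset))
    mu = sum(o * d for o, d in zip(offset, direction)) / denom
    residual = [o - mu * d for o, d in zip(offset, direction)]
    return math.sqrt(sum(r * r for r in residual))


def centroid_distance(x, prototypes):
    """Euclidean distance from x to the mean of a class's prototypes."""
    n = len(prototypes)
    centroid = [sum(col) / n for col in zip(*prototypes)]
    return math.dist(x, centroid)


def combined_score(x, prototypes, alpha=0.5):
    """Hypothetical score: weighted sum of the nearest feature-line distance
    and the class-center distance. Assumes at least two prototypes per class.
    This weighting is an assumption, not the paper's formula."""
    nearest_line = min(
        feature_line_distance(x, p, q) for p, q in combinations(prototypes, 2)
    )
    return alpha * nearest_line + (1.0 - alpha) * centroid_distance(x, prototypes)


def classify(x, classes, alpha=0.5):
    """Return the label whose prototypes give the smallest combined score."""
    return min(classes, key=lambda label: combined_score(x, classes[label], alpha))


# Toy data: the query sits next to class B but almost on the extension of A's line.
classes = {
    "A": [(0.0, 0.0), (1.0, 0.0)],
    "B": [(10.0, 0.5), (11.0, 0.5)],
}
query = (10.0, 0.1)

line_only = classify(query, classes, alpha=1.0)   # feature-line distance alone
with_center = classify(query, classes, alpha=0.5)  # adds the class-center term
```

In this toy setup, the line through class A's prototypes extends far beyond them and passes close to the query, so distance to the line alone assigns the query to A even though it lies among B's prototypes. Adding the class-center term penalizes that extrapolation and the query is assigned to B.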