ABSTRACT

With the development of Internet, especially cloud computing database center, it becomes increasingly dicult for the traditional algorithm of dimensional calculation to meet the requirements which is due to the high-dimensional text vector and other swelling text data such as web pages. As arrangement of text data and an important way of organization, text classification can decrease dimension by the eectiveness of the feature selection, which can not only reduce the cost of system and run-time, but also can improve the accuracy of classification.