Abstract:
This article addresses topic clustering of large collections of scientific publications. The requirements for developing such methods are considered. A method and an algorithm for topic clustering of large volumes of scientific publications are presented, and the proposed method is compared with classical clustering approaches.
Keywords: text clustering, text classification, lexical descriptors, text spectral index, inverted spectral index, TF, IDF, topic importance characteristic, assessment of clustering methods.