Abstract:
We present a matrix model of texts on natural languages and a model of quantitative assessment of similarity of text contents. An application of the model to search for the texts with similar content is considered. We discuss the difference of the proposed matrix models and commonly used approaches to analyze and model natural language texts.
Keywords:natural language texts, similarity of text contents, similarity assessment, text
models, text information retrieval.