Abstract:
Mathematical basis and original methodology for developing evaluation systems of semantic proximity of information objects (IO) in natural language are presented. A probabilistic statistical representation of the compared IO is introduced. The information theory is used to estimate the semantic proximity of IO. The methodology can be used for synthesis of computer-based systems. The results of the practical testing of the methodology effectiveness are presented.
Keywords:information objects; natural language; semantic adequacy; probabilistic model; information theory.