Abstract:
At the present stage of development of the scientific and technological potential of the Russian Federation, the efficiency of research and development work needs to be improved. Decisions to that end should be based on objective analytical data on development trends in the world scientific and technological complex. Such data can be obtained through comprehensive analysis of the global flow of scientific and technical information. The sources of this information include foreign and domestic sectoral and departmental portals, websites of scientific institutions, journals, scientific conferences and communities, electronic mass media, and other Internet sources of scientific and technical information. Semantic analysis of multilingual, heterogeneous, distributed information sources makes it possible to obtain new knowledge and to determine the priority areas of activity of domestic and foreign research teams. New principles are proposed for developing a system for monitoring and analyzing the global flow of scientific and technical information. These principles are based on the modern understanding of the conceptual structure of text, which in turn rests on the modern conception of phraseological conceptual analysis of texts. Methods and tools for formalizing the semantic structure of text information are considered in detail, together with methods for adapting the declarative means used by the procedures of linguistic processing and semantic analysis of text information. The architecture of the system and the list of functions of each of its subsystems are given.
Keywords: automated text processing; semantic analysis; formal description of text; semantic structure; data extraction; linguistic software.