Sistemy i Sredstva Informatiki [Systems and Means of Informatics]

Sistemy i Sredstva Inform., 2015 Volume 25, Issue 3, Pages 235–250 (Mi ssi428)

This article is cited in 17 papers

The system of facts extraction from historical texts

I. M. Adamovich, O. I. Volkov

Institute of Informatics Problems, Federal Research Center "Computer Science and Control" of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation

Abstract: Text surfing is described as a distinct subclass of Internet search, itself an important part of biographic investigation. It is the search for useful information whose character cannot be foreseen, so that an appropriate web search query cannot be formulated. A technology of automatic fact extraction is proposed for text surfing, and its implementation is described. Special attention is paid to the problem of anaphora resolution, where the interpretation of an expression depends on another expression in the context. A new hierarchical view of a biographical fact is proposed and analyzed. The applicability of the proposed technology to memoir and historical literature was verified experimentally; the article reports the results of these experiments, which confirm the applicability and promise of the approach. The technology is intended for a wide range of users who are not professional historians or biographers, which matters today because public interest in family history is increasing.

Keywords: biographic investigation; facts extraction from texts; anaphora resolution; hierarchy of facts.

Received: 13.08.2015

DOI: 10.14357/08696527150315




