Abstract:
This paper deals with a software platform prototype for extraction of Linked Open Data (LOD) from a given collection of mathematical scholarly papers. The problem of obtaining the semantic representation of a collection in the chosen subject area is of topical interest since the LOD cloud currently lacks up-to-date data on professional mathematics. We believe that the main reason for that is the absence of appropriate tools that could analyze the underlying semantics in mathematical papers and effectively build their consolidated representation. In this article, we describe a complex approach to the analysis of these documents for representing their content and metadata in RDF format. We also consider methods and techniques based on special ontologies for extracting semantic data from mathematical papers and describe experiments on integration of the constructed RDF-set into the existing datasets on the Internet.