RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2016 Volume 10, Issue 1, Pages 119–128 (Mi ia409)

BioNLP ontology extraction from a restricted language corpus with context-free grammars

D. A. Alexeyevsky

National Research University Higher School of Economics; 20 Myasnitskaya Str., Moscow 101000, Russian Federation

Abstract: BioNLP is an emerging area of NLP that brings new challenging objects for language processing and new valuable resources for bioinformatics and medicine. One notable task in BioNLP is creating de-novo ontologies. This is generally a tedious process; however, in some cases, it is possible to automate it to some extent. One such case is when a corpus of texts in a restricted subset of natural language is available. This paper presents a simple approach to automate ontology creation in such cases. The approach is aimed to simplify mapping of entities in natural texts to predefined ontologies wherever possible. The paper discusses which properties of the corpus enable the approach presented.

Keywords: BioNLP; ontology creation; context-free grammar.

Received: 23.09.2015

DOI: 10.14357/19922264160111



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024