RUS  ENG
Full version
JOURNALS // Intelligent systems. Theory and applications // Archive

Intelligent systems. Theory and applications, 2021 Volume 25, Issue 4, Pages 243–249 (Mi ista457)

Part 4: Natural Language Processing

NEREL: A Russian dataset with nested named entities and relations

I. V. Denisov, I. S. Rozhkov, N. V. Lukashevich

Lomonosov Moscow State University

Abstract: NEREL is a Russian publicly available dataset for solving named entity recognition problem and relation extraction problem. The dataset contains more than 56K tagged entities and more than 39K relationships. An important difference between NEREL and previous datasets is the presence of markup for nested named entities.
The methods of extracting nested named entities differ from the methods of extracting "flat" named entities primarily by the architecture of the solution. Since NEREL provides annotations for nested entities, the article compared various approaches to solving this problem with the transfer to Russian language domain.

Keywords: NER, nested NER, named entity recognition, nested named entity recognition, dataset.



© Steklov Math. Inst. of RAS, 2025