RUS  ENG
Full version
JOURNALS // Vestnik Sankt-Peterburgskogo Universiteta. Seriya 10. Prikladnaya Matematika. Informatika. Protsessy Upravleniya // Archive

Vestnik S.-Petersburg Univ. Ser. 10. Prikl. Mat. Inform. Prots. Upr., 2019 Volume 15, Issue 2, Pages 235–244 (Mi vspui404)

This article is cited in 1 paper

Computer science

Semantic Textual Similarity on Brazilian Portuguese: An approach based on language-mixture models

A. Silvaa, A. Lozkinsb, L. R. Bertoldia, S. Rigoa, V. M. Bureb

a University of Vale do Rio dos Sinos, 950, Av. Unisinos, São Leopoldo, RS, 93020-190, Brazil
b St. Petersburg State University, 7-9, Universitetskaya nab., St. Petersburg, 199034, Russian Federation

Abstract: The literature describes the Semantic Textual Similarity (STS) area as a fundamental part of many Natural Language Processing (NLP) tasks. The STS approaches are dependent on the availability of lexical-semantic resources. There are several efforts to improve the lexical-semantics resources for the English language, and the state-of-art report a large amount of application for this language. Brazilian Portuguese linguistics resources, when compared with English ones, do not have the same availability regarding relation and contents, generation a loss of precision in STS tasks. Therefore, the current work presents an approach that combines Brazilian Portuguese and English lexical-semantics ontology resources to reach all potential of both language linguistic relations, to generate a language-mixture model to measure STS. We evaluated the proposed approach with a well-known and respected Brazilian Portuguese STS dataset, which brought to light some considerations about mixture models and their relations with ontology language semantics.

Keywords: Semantic Textual Similarity, natural language processing, computational linguistics, ontologies.

UDC: 004.912

MSC: 68T50

Received: November 18, 2018
Accepted: March 15, 2019

Language: English

DOI: 10.21638/11701/spbu10.2019.207



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2024