A. Kazakov, S. Denisova, I. Barsola, E. Kalugina, I. Molchanova, I. Egorov, A. Kosterina, E. Tereshchenko, L. Shutikhina, I. Doroshchenko, N. Sotiriadi, S. Budennyy, “ESGify: Automated classification of environmental, social and corporate governance risks”, Dokl. RAN. Math. Inf. Proc. Upr., 2023, Volume 514, Number 2,Pages <nobr>417

This article is cited in 4 papers

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

ESGify: Automated classification of environmental, social and corporate governance risks

A. Kazakov^a, S. Denisova^a, I. Barsola^a, E. Kalugina^a, I. Molchanova^a, I. Egorov^a, A. Kosterina^a, E. Tereshchenko^a, L. Shutikhina^a, I. Doroshchenko^a, N. Sotiriadi^a, S. Budennyy^ab

^a Sber AI Lab, Moscow, Russian Federation
^b Artificial Intelligence Research Institute, Moscow, Russian Federation

Abstract: The growing recognition of environmental, social, and governance (ESG) factors in financial decisionmaking has spurred the need for effective and comprehensive ESG risk assessment tools. In this study, we introduce an open-source Natural Language Processing (NLP) model, “ESGify”12, based on MPNet-base architecture and aimed to classify texts within the frames of ESG risks. We also present a hierarchical and detailed methodology for ESG risk classification, leveraging the expertise of ESG professionals and global best practices. Anchored by a manually annotated multilabel dataset of 2,000 news articles and dosmain adaptation with texts of sustainability reports, ESGify is developed to automate ESG risk classification following the established methodology. We compare augmentation techniques based on back translation and Large Language Models (LLMs) to improve the model quality and achieve 0.5 F1-weighted model quality in the dataset with 47 classes. This result outperforms ChatGPT 3.5 with a simple prompt. The model weights and documentation is hosted on Github https://github.com/sb-ai-lab/ESGify under the Apache 2.0 license.

Presented: A. A. Shananin
Received: 24.08.2023
Revised: 15.09.2023
Accepted: 24.10.2023

DOI: 10.31857/S2686954323601525