RUS  ENG
Full version
JOURNALS // Informatics and Automation // Archive

Informatics and Automation, 2024 Issue 23, volume 1, Pages 39–64 (Mi trspy1280)

Artificial Intelligence, Knowledge and Data Engineering

Sentiment analysis framework for telugu text based on novel contrived passive aggressive with fuzzy weighting classifier (CPSC-FWC)

G. Naidu, M. Seshashayee

Gandhi Institute of Technology and Management GITAM (Deemed to be University)

Abstract: Natural language processing (NLP) is a subset of artificial intelligence demonstrating how algorithms can interact with individuals in their unique languages. In addition, sentiment analysis in NLP is better in numerous programs, including evaluating sentiment in Telugu. Several unsupervised machine-learning algorithms, such as k-means clustering with cuckoo search, are used to detect Telugu text. However, these techniques struggle to cluster data with variable cluster sizes and densities, slow search speeds, and poor convergence accuracy. This study developed a unique ML-based sentiment analysis system for Telugu text to address the shortcomings. Initially, in the pre-processing stage, the proposed Linear Pursuit Algorithm (LPA) removes words in white spaces, punctuation, and stops. Then, for POS tagging, this research proposed a Conditional Random Field with Lexicon weighting; following that, a Contrived Passive Aggressive with Fuzzy Weighting Classifier (CPSC-FWC) is proposed to classify the sentiments in Telugu text. Consequently, the method we propose produces efficient outcomes in terms of accuracy, precision, recall, and f1-score.

Keywords: machine learning, natural language processing, polarity, sentiment analysis, Telugu.

UDC: 004.7

Received: 31.07.2023

Language: English

DOI: 10.15622/ia.23.1.2



© Steklov Math. Inst. of RAS, 2024