Unleashing the Power of Mathematics in Natural Language Processing: Sentiment Analysis Perspectives

Authors

  • Mritunjay Kumar Ranjan Assistant Professor, School of Computer Science and Engineering, Sandip University, Nashik, Maharashtra, India
  • Sanjay Kumar Tiwari Assistant Professor, PG Department of Mathematics, Magadh University, Bodhgaya, Gaya, Bihar, India
  • Arif Md. Sattar Assistant Professor, Department of Computer Science & Information Technology, Anugrah Memorial College, Gaya, Bihar, India

DOI:

https://doi.org/10.37591/rtpl.v10i2.581

Keywords:

CNN, Machine Learning, Deep Learning, Sentiment Analysis (SA) Email, NLP, Twitter, Dataset, Confusion Matrix, DSA, ABSA, Mathematical Opinion Mining

Abstract

Sentiment analysis (SA), which utilizes natural language processing (NLP), computational linguistics, text analysis, image processing, and video processing to extract and analyze subjective information from the internet, social media, and other sources, is becoming increasingly popular in both the business world and the scientific community. It is even possible to model it so that it focuses on polarity, sentiments and emotions, urgency, and even goals. It is able to distinguish between positive, negative, and neutral data. Document-level sentiment analysis (DLSA), aspect-based sentiment analysis (ABSA), and data-driven sentiment analysis (DSA) are the three levels that are included in sentiment analysis. It is frequently utilized for the classification of text, the translation of speech, the recognition of faces, objects, patterns, and voices, as well as the identification of spam in electronic communications. In recent years, deep learning strategies have become the method of choice for applications that require computer vision and pattern recognition. The convolutional neural network (CNN) modeling technique is one of the deep learning approaches that is utilized most frequently for image processing. Opinion clustering, opinion idea or entity extraction, opinion summarization, and opinion categorization are just some of the actions and processes that are involved in text mining utilizing deep learning. Text categorization is one of the most critical processes in the text mining process that uses machine learning, while working on a project involving machine learning, it is essential to have a solid understanding of the mathematical model and to predefine the feature extraction model before utilizing the choice classification technique. With the help of NLP, X(formally known as twitter)'s representation, and deep learning-based data mining techniques, this study provides an approach called opinion mining that is based on deep learning to analyze Twitter opinion data. The accuracy and reliability of detection systems can be evaluated with the help of confusion matrices by comparing the aggregated opinions of Twitter users through the application of numerical techniques, as well as by building and labeling separate camps of Twitter opinion. The predicted class instances are shown in the form of a list, whereas the actual class instances are presented in the form of a table with rows and columns. In our study, we design CNN-based opinion classification approach and mathematical opinion mining classification model to find sentiment.

References

Pogorilyy SD, Kramov AA. Method of the coherence evaluation of Ukrainian text. arXiv preprint arXiv:2011.00310. October 31, 2020. Available at https://arxiv.org/abs/2011.00310

Alghalibi M, Al-Azzawi A, Lawonn K. Deep Tweets analyzer model for Twitter mood visualization and prediction based deep learning approach. Int J Computer Commun Eng. 2019; 8: 1–7.

Hutchins J. The history of machine translation in a nutshell. [Online]. Revised January 2014. Available at https://aclanthology.org/www.mt-archive.info/10/Hutchins-2014.pdf.

Asveld PR. Fuzzy context-free languages—part 2: recognition and parsing algorithms. Theoret Computer Sci. 2005; 347 (1–2): 191–213.

Guthier B, Ho K, El Saddik A. Language-independent data set annotation for machine learning-based sentiment analysis. In: 2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Banff, Alberta, Canada, October 5–8, 2017. pp. 2105–2110.

Draskovic D, Zecevic D, Nikolic B. Development of a multilingual model for machine sentiment analysis in the Serbian language. Mathematics. 2022; 10 (18): 3236.

Wu Z, Palmer M. Verb semantics and lexical selection. In: 32nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, NM, USA, June 1994. pp. 133–138.

Patil R, Chen Z, Shi Y. Database keyword search: a perspective from optimization. In: 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, Macau, China, December 4, 2012. pp. 30–33.

Bolukbasi T, Chang KW, Zou JY, Saligrama V, Kalai AT. Man is to computer programmer as woman is to homemaker? Debiasing word embeddings. In: 30th Conference on Advances in Neural Information Processing Systems, Barcelona, Spain, 2016. pp. 1–99.

Patil R, Chen Z. Struct: incorporating contextual information for English query search on relational databases. In: Proceedings of the Third International Workshop on Keyword Search on Structured Data, May 20, 2012. pp. 11–22. doi: 10.1145/2254736.2254742.

Chiu B, Crichton G, Korhonen A, Pyysalo S. How to train good word embeddings for biomedical NLP. In: Proceedings of the 15th Workshop on Biomedical Natural Language Processing, Scottsdale, AZ, USA, August, 2016. pp. 166–174.

Ruder S. The state of multilingual AI. [Online]. ruder.io 2022. Available at https://www.ruder.io/state-of-multilingual-ai/

Open Data Science. Top recent NLP research – ODSC – Open Data Science – Medium. [Online]. 2021. Available at https://odsc.medium.com/top-recent-nlp-research-906e8d603eb7

Gazdar G. Generalized Phrase Structure Grammar. Boston, MA, USA: Harvard University Press; 1985.

Culnane C, Rubinstein BI, Teague V. Health data in an open world. arXiv preprint arXiv:1712.05627. December 15, 2017. Available at https://arxiv.org/abs/1712.05627

Sharman RA, Jelinek F, Mercer RL. Generating a grammar for statistical training. In: Speech and Natural Language: Proceedings of a Workshop Held at Hidden Valley, PA, USA, June 24–27, 1990..

Wang F. A fuzzy grammar and possibility theory-based natural language user interface for spatial queries. Fuzzy Sets Syst. 2000; 113 (1): 147–159.

Patil R, Boit S, Bowman N. SQL ChatBot – using context free grammar. In: 2022 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS), Toronto, Ontario, Canada, June 1–4, 2022. pp. 1–7.

Kozlowski AC, Taddy M, Evans JA. The geometry of culture: analyzing the meanings of class through word embeddings. Am Sociol Rev. 2019; 84 (5): 905–949.

Iftene M, Liu Q, Wang Y. Very high resolution images classification by fine tuning deep convolutional neural networks. In: Eighth International Conference on Digital Image Processing (ICDIP 2016), Chengu, China, August 29, 2016. pp. 464–468.

neo-project. Neo Documentation. [Online]. Neo.org. 2017. Available at https://docs.neo.org/v2/docs/en-us/basic/whitepaper.html

Published

2023-08-17