№2, 2023

COMPARATIVE ANALYSIS OF MACHINE LEARNING ALGORITHMS FOR SENTIMENT ANALYSIS OF TEXTS IN AZERBAIJANI AND ENGLISH

Mehdi Rasul

The prediction of sentiment of the text within different business spheres has been one of the challenging problems for a variety of linguistics. In this paper, the sentiment analysis of the texts is carried out using different machine learning (ML) techniques. Various feature extraction techniques and supervised learning algorithms are employed on the movie review dataset sourced from the Internet Movie Database and translated into Azerbaijani. Specifically, the techniques utilized encompass Suppo­rt Vector Machine, Logistic Regression, Decision Trees, Random Forest, AdaBoost, XGBoost, and Naïve Bayes. The proposed models depict the importance of language corpus that Azerbaijani language lacks by comparing the results obtained from both Azerbaijani and English versions of the dataset (pp.3-11).

Keywords: Sentiment analysis, Logistic regression, Support vector machine, Naïve Bayes, Feature extraction, Text preprocessing
DOI : 10.25045/jpit.v14.i2.01
References
  • Baid, P. & Gupta, A. & Chaplot, N. (2017). “Sentiment analysis of movie reviews using machine learning techniques”. International Journal of Computer Applications. 179. 45-49. https://doi.org/10.5120/ijca2017916005
  • Chandra, Y. & Jana, A. “Sentiment analysis using machine learning and deep learning”. (2020). Proceedings of the 2020 7th International Conference on Computing for Sustainable Global Development, New Delhi, India, pp.1-4. https://doi.org/10.23919/INDIACom49435.2020.9083703
  • Hasanli, H. & Rustamov, S. (2019). “Sentiment analysis of Azerbaijani twits using logistic regression, Naïve Bayes and SVM”. Proceedings of the 2019 IEEE 13th International Conference on Application of Information and Communication Technologies, Baku, Azerbaijan, pp.1-7. https://doi.org/10.1109/AICT47866.2019.8981793
  • Heredia, B., Prusa, J.D., & Khoshgoftaar, T.M. (2018). “Location-based twitter sentiment analysis for predicting the U.S. 2016 Presidential Election”. Proceedings of the 31st International Florida Artificial Intelligence Research Society Conference, vol. 2009, pp.265–270.
  • Mammadli, S., Huseynov, S., Alkaramov, H., Jafarli, U., Suleymanov, U., & Rustamov, S. (2019). “Sentiment polarity detection in Azerbaijani social news articles”. Proceedings of the International Conference on Recent Advances in Natural Language Processing., Varna, Bulgaria, pp.703-710. https://doi.org/10.26615/978-954-452-056-4_082
  • Neethu, M.S. & Rajasree, R. (2013). “Sentiment analysis in twitter using machine learning techniques”. Proceedings of the 2013 Fourth International Conference on Computing, Communications and Networking Technologies, 2013, pp.1-5. https://doi.org/10.1109/ICCCNT.2013.6726818
  • Suleymanov, U. & Kalejahi, B.K. & Amrahov, E. & Badirkhanli, R. (2019). “Text classification for Azerbaijani language using machine learning and embedding”. arXiv:1912.13362 https://doi.org/10.48550/arXiv.1912.13362
  • Wenando, F.A., Hayami, R., Bakaruddin & Novermahakim, A.Y. (2020). “Tweet sentiment analysis for 2019 Indonesia presidential election results using various classification algorithms”. Proceedings of the 2020 1st International Conference on Information Technology, Advanced Mechanical and Electrical Engineering, Yogyakarta, Indonesia, pp.279-282.
  • https://doi.org/10.1109/ICITAMEE50454.2020.9398513