skip to main content
research-article

Subword Attentive Model for Arabic Sentiment Analysis: A Deep Learning Approach

Published: 13 February 2020 Publication History

Abstract

Social media data is unstructured data where these big data are exponentially increasing day to day in many different disciplines. Analysis and understanding the semantics of these data are a big challenge due to its variety and huge volume. To address this gap, unstructured Arabic texts have been studied in this work owing to their abundant appearance in social media Web sites. This work addresses the difficulty of handling unstructured social media texts, particularly when the data at hand is very limited. This intelligent data augmentation technique that handles the problem of less availability of data are used. This article has proposed a novel architecture for hand Arabic words classification and understands based on convolutional neural networks (CNNs) and recurrent neural networks. Moreover, the CNN technique is the most powerful for the analysis of Arabic tweets and social network analysis. The main technique used in this work is character-level CNN and a recurrent neural network stacked on top of one another as the classification architecture. These two techniques give 95% accuracy in the Arabic texts dataset.

References

[1]
[n.d]. Twitter sentiment analysis. Retrieved June 22, 2019 from https://github.com/topics/twitter-sentiment-analysis.
[2]
Abu Bakr Soliman, Kareem Eissa, and Samhaa R. El-Beltagy. 2017. Aravec: A set of arabic word embedding models for use in arabic nlp. Proc. Comput. Sci. 117 (2017), 256--265.
[3]
Anwar Alnawas and Nursal Arici. 2019. Sentiment analysis of Iraqi Arabic Dialect on Facebook based on distributed representations of documents. ACM Trans. Asian Low-Resour. Lang. Inf. Process. 18, 3, Article 20 (January 2019), 17 pages.
[4]
T. Andrija, J. Predrag, and K. Vlado. 2006. n-Gram-based classification and unsupervised hierarchical clustering of genome sequences. Computer Methods and Programs in Biomedicine 81 (2006), 37--153.
[5]
Albert Bifet and Eibe Frank. 2010. Sentiment knowledge discovery in twitter streaming data. In Proceedings of the 13th International Conference on Discovery Science (DS'10). Springer-Verlag, Berlin, Heidelberg, 1--15.
[6]
Changhua Yang, Kevin Hsin-Yih Lin, and Hsin-Hsi Chen. 2007. Emotion classification using Web blog corpora. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI'07). IEEE Computer Society, USA, 275--278.
[7]
F. Chollet. 2015. Keras. https://github.com/fchollet/keras.
[8]
Dmitry Davidov, Oren Tsur, and Ari Rappoport. 2010. Enhanced sentiment learning using Twitter hashtags and smileys. In Proceedings of the 23rd International Conference on Computational Linguistics: Posters (COLING'10). Association for Computational Linguistics, USA, 241--249.
[9]
Changliang Li, Bo Xu, Gaowei Wu, Saike He, Guanhua Tian, and Hongwei Hao. 2014. Recursive deep learning for sentiment analysis over social data. In Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) - Vol. 02 (WI-IAT'14). IEEE Computer Society, USA, 180--185.
[10]
Junyoung Chung, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2015. Gated feedback recurrent neural networks. In Proceedings of the 32nd International Conference on International Conference on Machine Learning - Volume 37 (ICML'15). JMLR.org, 2067--2075.
[11]
Yoon Kim. 2014. Convolutional neural networks for sentence classification. Eprint Arxiv, 1746--1751.
[12]
Kun-Lin Liu, Wu-Jun Li, and Minyi Guo. 2012. Emoticon smoothed language models for twitter sentiment analysis. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI'12). AAAI Press, 1678--1684.
[13]
Loai Alnemer, Bayan Alammouri, Jamal Alsakran, and Omar El Ariss. 2019. Enhanced classification of sentiment analysis of Arabic reviews. EIDWT 2019, LNDECT 29 (2019), 210--220.
[14]
K. P. Murphy. 2012. Machine Learning: A Probabilistic Perspective. The MIT Press.
[15]
Naaima Boudad, Rdouan Faizi, Rachid Oulad Haj Thami, and Raddouane Chiheb. 2017. Sentiment analysis in arabic: A review of the literature. Ain Shams Engineering Journal 9, 4 (2017), 2479--2490.
[16]
Alexander Pak and Patrick Paroubek. 2010. Twitter as a Corpus for Sentiment Analysis and Opinion Mining. In Proceedings of the Language Resources and Evaluation Confrence (LREC'10). 1320--1326.
[17]
J. R. Quinlan. 1986. Induction of decision trees. Machine Learning 1 (1986), 81--106.
[18]
J. Ross Quinlan. 1993. C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
[19]
E. Refaee and V. Rieser. 2014. An Arabic Twitter corpus for subjectivity and sentiment analysis. In Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14). European Language Resources Association, 2268--2273.
[20]
Giovanni Seni and John Elder. 2010. Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions. Morgan and Claypool Publishers.
[21]
S. A. Rupal Bhargava and Y. Sharma. 2018. Neural network based architecture for sentiment analysis in Indian languages. Journal of Intelligent Systems 7 (2018), 313--318.
[22]
Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural Comput. 9, 8 (November 1997), 1735--1780.
[23]
Shubhi Mittal, Ashna Goel, and Rachna Jain. 2016. Sentiment analysis of ecommerce and social networking sites. In 3rd International Conference on Computing for Sustainable Global Development (INDIACom'16). 2300--2305.
[24]
G. Sidorov. 2013. Syntactic dependency-based n-grams in rule based automatic English as second language grammar correction. International Journal of Computational Linguistics and Applications 4, 2 (2013), 169--188.
[25]
Theresa Wilson, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing (HLT'05). Association for Computational Linguistics, USA, 347--354.
[26]
Rinalds Vīksna and Gints Jēkabsons. 2018. Sentiment analysis in Latvian and Russian: A survey. Applied Computer Systems 23, 1 (2018), 45--51.
[27]
Xiang Zhang, Junbo Zhao, and Yann LeCun. 2015. Character-level convolutional networks for text classification. In Proceedings of the 28th International Conference on Neural Information Processing Systems - Vol. 1 (NIPS'15). MIT Press, Cambridge, MA, USA, 649--657.
[28]
Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, and Yi Yang. 2019. CamStyle: A novel data augmentation method for person re-identification. IEEE Transactions on Image Processing 28, 3 (2019), 1176--1190.
[29]
Mourad Gridach and Hatem Haddad. 2017. Arabic named entity recognition: A bidirectional GRU-CRF approach. In Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing. 264--275.
[30]
Majdi Beseiso. 2019. Word and character information aware neural model for emotional analysis. Recent Patents on Computer Science 12, 2, 142.

Cited By

View all
  • (2024)Research on Recognition Method of Social Robot Based on T-A-GCNIIT in the MetaverseACM Transactions on Asian and Low-Resource Language Information Processing10.1145/362401423:6(1-0)Online publication date: 22-Jun-2024
  • (2024)A combined AraBERT and Voting Ensemble classifier model for Arabic sentiment analysisNatural Language Processing Journal10.1016/j.nlp.2024.100100(100100)Online publication date: Aug-2024
  • (2024)A review of sentiment analysis: tasks, applications, and deep learning techniquesInternational Journal of Data Science and Analytics10.1007/s41060-024-00594-xOnline publication date: 1-Jul-2024
  • Show More Cited By

Index Terms

  1. Subword Attentive Model for Arabic Sentiment Analysis: A Deep Learning Approach

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Transactions on Asian and Low-Resource Language Information Processing
    ACM Transactions on Asian and Low-Resource Language Information Processing  Volume 19, Issue 2
    March 2020
    301 pages
    ISSN:2375-4699
    EISSN:2375-4702
    DOI:10.1145/3358605
    Issue’s Table of Contents
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 February 2020
    Accepted: 01 August 2019
    Revised: 01 June 2019
    Received: 01 April 2019
    Published in TALLIP Volume 19, Issue 2

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Arabic
    2. convolutional neural network
    3. data augmentation
    4. gated recurrent unit
    5. sentiment evaluation
    6. unstructured texts

    Qualifiers

    • Research-article
    • Research
    • Refereed

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)12
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 04 Sep 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Research on Recognition Method of Social Robot Based on T-A-GCNIIT in the MetaverseACM Transactions on Asian and Low-Resource Language Information Processing10.1145/362401423:6(1-0)Online publication date: 22-Jun-2024
    • (2024)A combined AraBERT and Voting Ensemble classifier model for Arabic sentiment analysisNatural Language Processing Journal10.1016/j.nlp.2024.100100(100100)Online publication date: Aug-2024
    • (2024)A review of sentiment analysis: tasks, applications, and deep learning techniquesInternational Journal of Data Science and Analytics10.1007/s41060-024-00594-xOnline publication date: 1-Jul-2024
    • (2024)Annotation and evaluation of a dialectal Arabic sentiment corpus against benchmark datasets using transformersLanguage Resources and Evaluation10.1007/s10579-024-09750-yOnline publication date: 18-Aug-2024
    • (2023)Arabic Sentiment Analysis for Twitter Data: A Systematic Literature ReviewEngineering, Technology & Applied Science Research10.48084/etasr.566213:2(10292-10300)Online publication date: 2-Apr-2023
    • (2023)Convolutional Deep Belief Network Based Short Text Classification on Arabic CorpusComputer Systems Science and Engineering10.32604/csse.2023.03394545:3(3097-3113)Online publication date: 2023
    • (2023)Stock Price Trends Prediction Based on the Classical Models with Key Information Fusion of OntologiesACM Transactions on Asian and Low-Resource Language Information Processing10.1145/359259922:5(1-22)Online publication date: 9-May-2023
    • (2023)User-based Hierarchical Network of Sina Weibo Emotion AnalysisACM Transactions on Asian and Low-Resource Language Information Processing10.1145/357904822:5(1-22)Online publication date: 9-May-2023
    • (2023)Mutual Supervised Fusion & Transfer Learning with Interpretable Linguistic Meaning for Social Data AnalyticsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/356867522:5(1-20)Online publication date: 9-May-2023
    • (2023)WH2D2N2: Distributed AI-enabled OK-ASN Service for Web of ThingsACM Transactions on Asian and Low-Resource Language Information Processing10.1145/356424222:5(1-16)Online publication date: 9-May-2023
    • Show More Cited By

    View Options

    Get Access

    Login options

    Full Access

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media