Skip to main content
User Image

Nora S. AlTwairesh

Assistant Professor

Head, Information Technology Department

علوم الحاسب والمعلومات
KSU Female Campus - Building 6 T121
publication
Conference Paper
2016

AraSenTi: Large-Scale Twitter-Specific Arabic Sentiment Lexicons

Al-Salman, Nora Al-Twairesh, Hend Al-Khalifa , AbdulMalik . 2016

Sentiment Analysis (SA) is an active research area nowadays due to the tremendous interest in aggregating and evaluating opinions being disseminated by users on the Web. SA of English has been thoroughly researched; however research on SA of Arabic has just flourished. Twitter is considered a powerful tool for disseminating information and a rich resource for opinionated text containing views on many different topics. In this paper we attempt to bridge a gap in Arabic SA of Twitter which is the lack of sentiment lexicons that are tailored for the informal language of Twitter. We generate two lexicons extracted from a large dataset of tweets using two approaches and evaluate their use in a simple lexicon based method. The evaluation is performed on internal and external datasets. The performance of these automatically generated lexicons was very promising, albeit the simple method used for classification. The best F-score obtained was 89.58% on the internal dataset and 63.1-64.7% on the external datasets.

Conference Location
Berlin, Germany
Conference Name
The 54th annual meeting of the Association for Computational Linguistics (ACL)
Sponsoring Organization
ACL
more of publication
publications

The field of natural language processing (NLP) has witnessed a boom in language representation models with the introduction of pretrained language models that are trained on massive textual data…

publications

Approaches for developing Dialogue Systems (DSs) are typically categorized into rule-based and data-driven. Data-driven DSs require a massive quantity of training data, while rule-based DSs rely…

2021
publications

Over the past few years, Twitter has experienced massive growth and the volume of its online content has increased rapidly. This content has been a rich source for several studies that focused on…

2021