Skip to main content
User Image

شريفة أحمد محمد الغامدي

Lecturer

محاضرة بقسم تقنية المعلومات

علوم الحاسب والمعلومات
6T23

Named Entity Recognition Using Word-Embedding Techniques for ArabicWeb16: An Empirical Study

Al-Salman, Sharefah Al-Ghamdi; Mashael Al-Duwais; Hend Al-Khalifa; and Abdulmalik . 2018

The 3rd Workshop on Open-Source Arabic Corpora and Processing Tools introduces ArabicWeb16 Data Challenge track. The challenge is about experimenting with ArabicWeb16 dataset, the largest Arabic Web dataset publicly available with about 150M Arabic Web pages. In this paper, we explore the ArabicWeb16 dataset and experiment with it to build word-embedding models for Named Entity Recognition (NER) task. Word-embedding models are powerful for building many Natural Language Processing (NLP) tasks including NER. We tried two word-embedding models: Google Word2Vec model and Stanford GloVe model. The two models were used to recognize similar words for each named entity type. The ArabicWeb16 dataset was somehow hard to pre-process, however, the final results showed promising outputs.

Conference Location
Riyadh, Saudi Arabia
Conference Name
" 21st Saudi Computer Society National Computer Conference (NCC’2018)
Sponsoring Organization
IEEE
more of publication
publications

The 3rd Workshop on Open-Source Arabic Corpora and Processing Tools introduces ArabicWeb16 Data Challenge track. The challenge is about experimenting with ArabicWeb16 dataset, the largest Arabic…

by Sharefah Al-Ghamdi; Mashael Al-Duwais; Hend Al-Khalifa; and Abdulmalik Al-Salman
2018
publications

Data mining techniques have been shown its success in analyzing data to assist factors and make decisions in many different applications. In this work, we analyzed universities' faculty members…

by Sharefah Al-Ghamdi; Sumayah Al-Rabiaah
2018
publications

Natural language processing (NLP) is the branch of Artificial Intelligence that is concerned with enabling computers understand human languages. Implementing new NLP tools that effectively and…

2017