Filters








729 Hits in 5.7 sec

CLEF 2017 MC2 Search and Time Line tasks Overview

Lorraine Goeuriot, Philippe Mulhem, Eric SanJuan
2017 Conference and Labs of the Evaluation Forum  
The topics were in four languages: Arabic, English, French and Spanish, and results were expected in any language.  ...  The goal of the timeline illustration track is to study approaches that better retrieve microblogs issued during a cultural event, in order to get a glimpse of the attendees' perception.  ...  This resulted in long queries that were long to process, especially in the case of Focus retrieval. For Arabic, a stop word list was applied which improved efficiency.  ... 
dblp:conf/clef/GoeuriotMS17 fatcat:c77e3nqs7bdwzeq2pbtyanyyou

Microblog Search Task at CLEF 2017: Query Generation using IR and LDA Topic Modeling Combination

Malek Hajjem, Chiraz Latiri
2017 Conference and Labs of the Evaluation Forum  
The microblogs search task at CLEF 2017 consists of developing a system to search the most relevant microblogs for cultural query in a collection about festivals in all languages.  ...  This latter is based on Information Retrieval (IR) process to generate a query-specific set of similar tweets. The result then represent the input of a basic LDA topic modeling process.  ...  The firt corpus is a comparable tweet corpus about Arab spring collected through Twitter's API 3 in Arabic and French languages.  ... 
dblp:conf/clef/HajjemL17 fatcat:6mslj2fw75cthpsr42aqlyp6ry

Mining Trending Hash Tags for Arabic Sentiment Analysis

Yahya AlMurtadha
2018 International Journal of Advanced Computer Science and Applications  
Various sentiment methods were developed in many languages, such as English and Arabic with much more studies in the first one.  ...  People text millions of posts everyday on microblogging social networking especially Twitter which make microblogs a rich source for public opinions, customer's comments and reviews.  ...  Write retrieved tweets to a file for later processing.  ... 
doi:10.14569/ijacsa.2018.090227 fatcat:7kks4zz3vrccrcy7eqo2zn7qju

Cross-Lingual Relevance Transfer for Document Retrieval [article]

Peng Shi, Jimmy Lin
2019 arXiv   pre-print
, without any special processing, both for (non-English) mono-lingual retrieval as well as cross-lingual retrieval.  ...  Recent work has shown the surprising ability of multi-lingual BERT to serve as a zero-shot cross-lingual transfer model for a number of language processing tasks.  ...  We see different degrees of effectiveness gains across languages: for some languages (e.g., Chinese), we observe a large gain; for others (e.g., Arabic and French), the gains are more modest.  ... 
arXiv:1911.02989v1 fatcat:z5q67febnfcbzckxhwuwqn5hme

Taqreer: A System for Spatio-temporal Analysis on Microblogs

Amr Magdy, Mashaal Musleh, Kareem Tarek, Louai Alarabi, Saif Al-Harthi, Hicham G. Elmongui, Thanaa M. Ghanem, Sohaib Ghani, Mohamed F. Mokbel
2015 IEEE Data Engineering Bulletin  
Taqreer is composed of two main modules: The Taghreed query engine, which is a scalable and efficient query processing engine for spatio-temporal keyword queries on microblogs and a Report Generation Tool  ...  system for auto-generation of spatio-temporal analysis reports on microblogs.  ...  Figure 5 gives an example of analyzing tweets languages in Arab Gulf countries. The figure gives a pie chart for each sub-region/city.  ... 
dblp:journals/debu/0001MTAAEGGM15 fatcat:zhpay4yr6fa6blk26s6vvjle3m

Microblogs as Parallel Corpora

Wang Ling, Guang Xiang, Chris Dyer, Alan W. Black, Isabel Trancoso
2013 Annual Meeting of the Association for Computational Linguistics  
We present an efficient method for detecting these messages and extracting parallel segments from them.  ...  In the ever-expanding sea of microblog data, there is a surprising amount of naturally occurring parallel text: some users create post multilingual messages targeting international audiences while others  ...  We are also extremely grateful to Brendan O'Connor for providing the Twitter data and to Philipp Koehn and Barry Haddow for providing the Project Syndicate data.  ... 
dblp:conf/acl/LingXDBT13 fatcat:idc2cvcwnfgtjb4ltoq7m2eqou

Time-Sensitive Weighting for Microblog Retrieval

Hao Wu, Hui Fang
2011 Text Retrieval Conference  
We report our system and experiments for the realtime Adhoc task in the 2011 MicroBlog track. Our goal is to develop effective technique to retrieve relevant tweets that have been posted recently.  ...  Query expansion technique is also used to further improve the retrieval performance.  ...  However in our opinion, microBlog search is a dynamic rather than a static process. Thus term weight is determined by not only the background language model (IDF), but also tweet post time.  ... 
dblp:conf/trec/WuF11a fatcat:fl4mwxv4ungrrpvvhxhzajogp4

CLEF MC2 2018 Lab Technical Overview of Cross Language Microblog Search and Argumentative Mining

Jean-Valère Cossu, Julio Gonzalo, Malek Hajjem, Olivier Hamon, Chiraz Latiri, Eric SanJuan
2018 Conference and Labs of the Evaluation Forum  
The challenge was to find related microblogs in four different languages in a large archive.  ...  The idea was to perform a search process on a massive microblog collection that focuses on claims about a given festival.  ...  The objective of the task is for a given movie or microcritic language among French, English, Spanish, Portuguese and Arabic to provide a summary of the related microblogs.  ... 
dblp:conf/clef/CossuGHHLS18 fatcat:ciukylozhfbhxiej5prm6qmbvi

A Survey of Query Expansion Methods to Improve Relevant Search Engine Results

Nuhu Yusuf, Mohd Amin Mohd Yunus, Norfaradilla Wahid, Aida Mustapha, Mohd Najib Mohd Salleh
2021 International Journal on Advanced Science, Engineering and Information Technology  
Due to large volumes of documents available for retrieval in a search database, an intelligent method is required to retrieve relevant search results.  ...  A query expansion deals with expanding the query by adding additional information to the query for effective retrieving relevant results.  ...  In Arabic text retrieval, two sub-fields were found within our reviews, which are Quran and hadith. Moawad, Alromima, and Elgohary [7] present a query expansion for Arabic retrieval.  ... 
doi:10.18517/ijaseit.11.4.8868 fatcat:uljsx37ckrfjjljnxft27lqfsq

Arabic Information Retrieval

Kareem Darwish
2014 Foundations and Trends in Information Retrieval  
ACM Special Interest Group on Information Retrieval Forum, 40 (1). [56] Kareem Darwish, Walid Magdy, and Ahmed Mourad. 2012. Language processing for Arabic microblog retrieval.  ...  The survey covers: 1) general properties of the Arabic language; 2) some of the aspects of Arabic that affect retrieval; 3) Arabic processing necessary for effective Arabic retrieval; 4) Arabic retrieval  ... 
doi:10.1561/1500000031 fatcat:2nxjdu43erhdvbs35ykavrk76a

A Tweets Classifier based on Cosine Similarity

Carolina Fócil Arias, Jorge Zúñiga, Grigori Sidorov, Ildar Z. Batyrshin, Alexander F. Gelbukh
2017 Conference and Labs of the Evaluation Forum  
The 2017 Microblog Cultural Contextualization task consists in three challenges: (1) Content Analysis, (2) Microblog search, and (3) TimeLine illustration.  ...  This research used two approaches: (1) word2vec and (2) Bag-of-Words (BoW) for extracting all relevant tweets to each event related to the four festivals: Charrues, Transmusicales, Avignon and Edinburgh  ...  Sun, S., Luo, C., Chen, J.: A review of natural language processing techniques for opinion mining systems. Fig. 2 : 2 Fig. 2: An example of Bag-of-Words approach 1.  ... 
dblp:conf/clef/AriasZSBG17 fatcat:m6ixfneegjgidmcu67ispmbxyi

Preprocessing Arabic text on social media

Mohamed Osman Hegazi, Yasser Al-Dossari, Abdullah Al-Yahy, Abdulaziz Al-Sumari, Anwer Hilal
2021 Heliyon  
It provides an integrated solution for the challenges in preprocessing Arabic text on social media in four stages: data collection, cleaning, enrichment, and availability.  ...  Millions of people use social media for different purposes.  ...  Additional information No additional information is available for this paper.  ... 
doi:10.1016/j.heliyon.2021.e06191 pmid:33644469 pmcid:PMC7895730 fatcat:4yf4ziitnnb43imsnwl32w5asy

EveTAR: Building a Large-Scale Multi-Task Test Collection over Arabic Tweets [article]

Maram Hasanain, Reem Suwaileh, Tamer Elsayed, Mucahid Kutlu, Hind Almerekhi
2017 arXiv   pre-print
This article introduces a new language-independent approach for creating a large-scale high-quality test collection of tweets that supports multiple information retrieval (IR) tasks without running a shared-task  ...  Applying our methodology on Arabic tweets resulted in EveTAR , the first freely-available tweet test collection for multiple IR tasks.  ...  We would like to thank the crowd workers and in-house annotators for their valuable efforts in producing high-quality judgments of tweets.  ... 
arXiv:1708.05517v2 fatcat:bl75l7fx6bgh5gyqjge35upjgi

Taghreed

Amr Magdy, Louai Alarabi, Saif Al-Harthi, Mashaal Musleh, Thanaa M. Ghanem, Sohaib Ghani, Mohamed F. Mokbel
2014 Proceedings of the 22nd ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems - SIGSPATIAL '14  
This paper presents Taghreed; a full-fledged system for efficient and scalable querying, analyzing, and visualizing geotagged microblogs, e.g., tweets.  ...  Taghreed is the first system that addresses all these challenges collectively for microblogs data. In the paper, each system component is described in detail.  ...  Figure 5 shows another interface that employs query 7 to provide an analysis for language usage in Arab Gulf area using Twitter data.  ... 
doi:10.1145/2666310.2666397 dblp:conf/gis/0001AAMGGM14 fatcat:2rfszaz2y5hvzcjapghtysjo6e

Natural Language Processing for Dialectical Arabic: A Survey

Abdulhadi Shoufan, Sumaya Alameri
2015 Proceedings of the Second Workshop on Arabic Natural Language Processing  
This paper presents a wide literature review of natural language processing for dialectical Arabic. Four main research areas were identified and the dialect coverage in research work was outlined.  ...  The paper can be used as a quick reference to identify relevant contributions that address a specific NLP aspect for a specific dialect.  ...  Introduction The last ten years have experienced a growing interest in natural language processing for dialectical Arabic.  ... 
doi:10.18653/v1/w15-3205 dblp:conf/wanlp/ShoufanA15 fatcat:i5l3kkcdsjhi5arrmjlu5ibv34
« Previous Showing results 1 — 15 out of 729 results