Filters








2,098 Hits in 4.0 sec

Multilingual and Multi-Aspect Hate Speech Analysis

Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-Yan Yeung
2019 Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)  
In this paper, we present a new multilingual multi-aspect hate speech analysis dataset and use it to test the current state-of-the-art multilingual multitask learning approaches.  ...  Current research on hate speech analysis is typically oriented towards monolingual and single classification tasks.  ...  department of the Hong Kong University of Science and Technology.  ... 
doi:10.18653/v1/d19-1474 dblp:conf/emnlp/OusidhoumLZSY19 fatcat:molbpoy2fffplie233bzlfktg4

Multilingual and Multi-Aspect Hate Speech Analysis [article]

Nedjma Ousidhoum, Zizheng Lin, Hongming Zhang, Yangqiu Song, Dit-Yan Yeung
2019 arXiv   pre-print
In this paper, we present a new multilingual multi-aspect hate speech analysis dataset and use it to test the current state-of-the-art multilingual multitask learning approaches.  ...  Current research on hate speech analysis is typically oriented towards monolingual and single classification tasks.  ...  department of the Hong Kong University of Science and Technology.  ... 
arXiv:1908.11049v1 fatcat:6b2evdtujze45kmp56c6e2xyby

Deep Learning Models for Multilingual Hate Speech Detection [article]

Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee
2020 arXiv   pre-print
In this paper, we conduct a large scale analysis of multilingual hate speech in 9 languages from 16 different sources.  ...  These models could also act as good baselines for future multilingual hate speech detection tasks.  ...  Research into the multilingual aspect of hate speech is relatively new.  ... 
arXiv:2004.06465v3 fatcat:t2ds5n3tqjc4te3aj2tvxubl7a

Detection of Racist Language in French Tweets

Natalia Vanetik, Elisheva Mimoun
2022 Information  
Unfortunately, there are fewer datasets annotated for racist speech than for general hate speech available, especially for French.  ...  In France, there has been a significant increase in hate speech against migrant and Muslim communities following events such as Great Britain's exit from the EU, the Charlie Hebdo attacks, and the Bataclan  ...  The MLMA dataset is a multilingual multi-aspect hate speech analysis dataset containing Twitter posts in several languages.  ... 
doi:10.3390/info13070318 fatcat:c35tk5r4gzfkthh6evmii6jzz4

HateDetectors at HASOC 2020: Hate Speech Detection using Classical Machine learning and Transfer learning based approaches

Varsha Reddy, Surendra Telidevara
2020 Forum for Information Retrieval Evaluation  
We have highlighted the importance of a monolingual model over a multi lingual BERT based model for hate speech detection.  ...  We also highlight the importance of having a large, balanced training dataset on model performance for hate speech detection.  ...  We hope our experimentation and analysis helps in tackling the problem of hate speech detection in some way.  ... 
dblp:conf/fire/ReddyT20 fatcat:o34z5yu2xze4badtp6m7w6wavi

A Large-scale Dataset for Hate Speech Detection on Vietnamese Social Media Texts [article]

Son T. Luu, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen
2021 arXiv   pre-print
On social medias, hate speech has become a critical problem for social network users.  ...  To solve this problem, we introduce the ViHSD - a human-annotated dataset for automatically detecting hate speech on the social network.  ...  Moreover, current researches about hate speech detection do not focus on analyzing about the sentiment aspect of Vietnamese hate speech language.  ... 
arXiv:2103.11528v3 fatcat:owb3rzo2araydmlll3vcwyfem4

ComMA@FIRE 2020: Exploring Multilingual Joint Training across different Classification Tasks

Ritesh Kumar, Bornini Lahiri, Atul Kr. Ojha, Akanksha Bansal
2020 Forum for Information Retrieval Evaluation  
In this paper, we give a description of the systems submitted to the three tracks of FIRE 2020 -Hate Speech and Offensive Content Identification in Indo-European Languages (HASOC), Sentiment Analysis of  ...  While the first two tasks were binary and multi-class text classification problems, EDNIL was a sequence classification problem.  ...  In sub-task A, the data was annotated as HOF (Hate Speech and Offensive Language) and NOT (Not Offensive).  ... 
dblp:conf/fire/KumarLOB20 fatcat:suwylxin7fhpbmajwk3bxvbapq

Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings [article]

Arkadipta De, Venkatesh E, Kaushal Kumar Maurya, Maunendra Sankar Desarkar
2021 arXiv   pre-print
Our best performing neural classifier model includes One-vs-the-Rest approach where we obtained 92.60%, 81.14%,69.59%, 75.29% and 73.01% F1 scores for hostile, fake, hate, offensive, and defamation labels  ...  We view this hostility detection as a multi-label multi-class classification problem. We propose an effective neural network-based technique for hostility detection in Hindi posts.  ...  in traffic by hate and offensive speech promoters against the Asian community and a 900% increase in similar contents towards Chinese people.  ... 
arXiv:2101.04998v1 fatcat:z4bdmqg7mzdlbggjh7am472o2q

Towards multidomain and multilingual abusive language detection: a survey

Endang Wahyu Pamungkas, Valerio Basile, Viviana Patti
2021 Personal and Ubiquitous Computing  
and cross-lingual settings.  ...  This study also outlines several challenges and open problems of this area, providing insights and a useful roadmap for future work.  ...  [103] presented a more comprehensive study on resources and benchmarks available for hate speech detection tasks based on several aspects.  ... 
doi:10.1007/s00779-021-01609-1 fatcat:ufiyagb6grel7ojjkyhb2vjtrm

A systematic review of Hate Speech automatic detection using Natural Language Processing [article]

Md Saroar Jahan, Mourad Oussalah
2021 arXiv   pre-print
With the multiplication of social media platforms, which offer anonymity, easy access and online community formation, and online debate, the issue of hate speech detection and tracking becomes a growing  ...  challenge to society, individual, policy-makers and researchers.  ...  MLMA hate speech Link, dataset and soruce code available Multilingual multi-aspect hate speech analysis dataset Ousidhoum et al. [103], 2019b - LR 35 3 9.  ... 
arXiv:2106.00742v1 fatcat:qwxjwgma4zaynemge57cu7xqlm

Cross-Lingual Few-Shot Hate Speech and Offensive Language Detection using Meta Learning

Marzieh Mozafari, Reza Farahbakhsh, Noel Crespi
2022 IEEE Access  
15 datasets across 8 languages for hate speech and 6 datasets across 6 languages for offensive language.  ...  We propose a meta learning-based approach to study the problem of few-shot hate speech and offensive language detection in low-resource languages that will allow hateful or offensive content to be predicted  ...  [46] presented the first multilingual multi-aspect hate speech analysis dataset in English, French, and Arabic tweets and evaluated several multilingual multi-task learning approaches for the identification  ... 
doi:10.1109/access.2022.3147588 fatcat:aljv6fkhsvcspkrf7bojgkquwe

IRLab@IITV@Dravidian-CodeMix-FIRE2020: Sentiment Analysis on Multilingual Code Mixing Text Using BERT-BASE

Anita Saroj, Sukomal Pal
2020 Forum for Information Retrieval Evaluation  
This paper discusses our participation in the "Sentiment Analysis in Dravidian-CodeMix", Dravidian-CodeMix and "Hate Speech and Offensive Content Identification in Indo-European Languages"-FIRE 2020 tasks  ...  Several techniques are applied for sentiment analysis including the recent word embeddings-based methods.  ...  spread of hate speech and rude behavior [6] .  ... 
dblp:conf/fire/SarojP20 fatcat:wx5nyd7d55d3heyun3kkolfxu4

DLRG@HASOC 2020: A Hybrid Approach for Hate and Offensive Content Identification in Multilingual Tweets

Yashwanth Reddy B., Ratnavel Rajalakshmi
2020 Forum for Information Retrieval Evaluation  
Hate speech and posting offensive contents has become a major issue nowadays.  ...  In the proposed approach, Multi-class imbalance-based feature selection method is combined with an SVM classifier to classify the tweet as a hate speech or not.  ...  Also, the authors thank the Science and Engineering Research Board, Govt. of India for their financial support (ECR/2016/000484).  ... 
dblp:conf/fire/BR20 fatcat:nfdvwrresnaddkpepvglzioh3q

Large-Scale Hate Speech Detection with Cross-Domain Transfer [article]

Cagri Toraman, Furkan Şahinuç, Eyup Halit Yilmaz
2022 arXiv   pre-print
Existing datasets are mostly prepared with a limited number of instances or hate domains that define hate topics. This hinders large-scale analysis and transfer learning with respect to hate domains.  ...  large-scale hate speech detection.  ...  In Section 5, we provide discussions on error analysis and scalability.  ... 
arXiv:2203.01111v2 fatcat:zxxkhd2dnrheleckt3xgoe4rca

CONAN - COunter NArratives through Nichesourcing: a Multilingual Dataset of Responses to Fight Online Hate Speech

Yi-Ling Chung, Elizaveta Kuzmenko, Serra Sinem Tekiroglu, Marco Guerini
2019 Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics  
In this paper, we describe the creation of the first large-scale, multilingual, expert-based dataset of hate speech/counter-narrative pairs.  ...  Tackling hate speech in the standard way of content deletion or user suspension may be charged with censorship and overblocking.  ...  We are grateful to the following NGOs and all annotators for their help: Stop Hate UK, Collectif Contre l'Islamophobie en France, Amnesty International (Italian Section -Task force hate speech).  ... 
doi:10.18653/v1/p19-1271 dblp:conf/acl/ChungKTG19 fatcat:oegcmkbspve47l7l2lmvr7hawu
« Previous Showing results 1 — 15 out of 2,098 results