Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Browse
New in RUL
About RUL
In numbers
Help
Sign in
To BAN or not to BAN : Bayesian attention networks for reliable hate speech detection
ID
Miok, Kristian
(
Author
),
ID
Škrlj, Blaž
(
Author
),
ID
Zaharie, Daniela
(
Author
),
ID
Robnik Šikonja, Marko
(
Author
)
PDF - Presentation file,
Download
(3,06 MB)
MD5: 10D28EFDE9A05A29C78D6640B6BADA48
URL - Source URL, Visit
https://link.springer.com/article/10.1007/s12559-021-09826-9
Image galllery
Abstract
Hate speech is an important problem in the management of user-generated content. To remove offensive content or ban misbehaving users, content moderators need reliable hate speech detectors. Recently, deep neural networks based on the transformer architecture, such as the (multilingual) BERT model, have achieved superior performance in many natural language classification tasks, including hate speech detection. So far, these methods have not been able to quantify their output in terms of reliability. We propose a Bayesian method using Monte Carlo dropout within the attention layers of the transformer models to provide well-calibrated reliability estimates. We evaluate and visualize the results of the proposed approach on hate speech detection problems in several languages. Additionally, we test whether affective dimensions can enhance the information extracted by the BERT model in hate speech classification. Our experiments show that Monte Carlo dropout provides a viable mechanism for reliability estimation in transformer networks. Used within the BERT model, it offers state-of-the-art classification performance and can detect less trusted predictions.
Language:
English
Keywords:
natural language processing
,
machine learning
,
transformer neural networks
,
Bayesian neural networks
,
BERT models
,
prediction uncertainty
,
reliability estimation
,
Monte Carlo dropout
,
Bayesian BERT
,
sentic computing
,
model calibration
Work type:
Article
Typology:
1.01 - Original Scientific Article
Organization:
FRI - Faculty of Computer and Information Science
Publication status:
Published
Publication version:
Version of Record
Year:
2022
Number of pages:
Str. 353-371
Numbering:
Vol. 14, iss. 1
PID:
20.500.12556/RUL-144813
UDC:
004.85:81'322.2
ISSN on article:
1866-9956
DOI:
10.1007/s12559-021-09826-9
COBISS.SI-ID:
80879363
Publication date in RUL:
14.03.2023
Views:
802
Downloads:
82
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
Copy citation
Share:
Record is a part of a journal
Title:
Cognitive computation
Shortened title:
Cogn. comput.
Publisher:
Springer Nature
ISSN:
1866-9956
COBISS.SI-ID:
80861443
Licences
License:
CC BY 4.0, Creative Commons Attribution 4.0 International
Link:
http://creativecommons.org/licenses/by/4.0/
Description:
This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Secondary language
Language:
Slovenian
Keywords:
obdelava naravnega jezika
,
strojno učenje
,
nevronske mreže transformer
,
bayesovske nevronske mreže
,
modeli BERT
Projects
Funder:
ARRS - Slovenian Research Agency
Project number:
P6-0411
Name:
Jezikovni viri in tehnologije za slovenski jezik
Funder:
EC - European Commission
Funding programme:
H2020
Project number:
825153
Name:
Cross-Lingual Embeddings for Less-Represented Languages in European News Media
Acronym:
EMBEDDIA
Similar documents
Similar works from RUL:
Similar works from other Slovenian collections:
Back