Detecting Aggressiveness in Tweets: A Hybrid Model for Detecting Cyberbullying in the Spanish Language

Files
Statistics
Metrics and citations
Share
Metadata
Show full item recordDate
2021-11Department
MatemáticasSource
Applied Sciences (Switzerland), Vol. 11, Núm. 22Abstract
In recent years, the use of social networks has increased exponentially, which has led to a
significant increase in cyberbullying. Currently, in the field of Computer Science, research has been
made on how to detect aggressiveness in texts, which is a prelude to detecting cyberbullying. In this
field, the main work has been done for English language texts, mainly using Machine Learning (ML)
approaches, Lexicon approaches to a lesser extent, and very few works using hybrid approaches.
In these, Lexicons and Machine Learning algorithms are used, such as counting the number of bad
words in a sentence using a Lexicon of bad words, which serves as an input feature for classification
algorithms. This research aims at contributing towards detecting aggressiveness in Spanish language
texts by creating different models that combine the Lexicons and ML approach. Twenty-two models
that combine techniques and algorithms from both approaches are proposed, and for their application,
certain hyperparameters are adjusted in the training datasets of the corpora, to obtain the best results
in the test datasets. Three Spanish language corpora are used in the evaluation: Chilean, Mexican,
and Chilean-Mexican corpora. The results indicate that hybrid models obtain the best results in the
3 corpora, over implemented models that do not use Lexicons. This shows that by mixing approaches,
aggressiveness detection improves. Finally, a web application is developed that gives applicability
to each model by classifying tweets, allowing evaluating the performance of models with external
corpus and receiving feedback on the prediction of each one for future research. In addition, an API
is available that can be integrated into technological tools for parental control, online plugins for
writing analysis in social networks, and educational tools, among others.
Subjects
cyberbullying detect; emotions analysis in Spanish; hybrid approachCollections
- Artículos Científicos [4849]
- Articulos Científicos Matemáticas [162]