Would Your Tweet Invoke Hate on the Fly? Forecasting Hate Intensity of Reply Threads on Twitter

Snehil Dahiya, Dhruv Sahnan, Vasu Goel, Emilie Chouzenoux, Victor Elvira, Angshul Majumdar, Anil Bandhakavi, Tanmoy Chakraborty

Research output: Contribution to conferencePaperpeer-review

Abstract / Description of output

Curbing hate speech is undoubtedly a major challenge for online
microblogging platforms like Twitter. While there have been studies
around hate speech detection, it is not clear how hate speech finds
its way into an online discussion. It is important for a content
moderator to not only identify which tweet is hateful, but also
to predict which tweet will be responsible for accumulating hate
speech. This would help in prioritizing tweets that need constant
monitoring. Our analysis reveals that for hate speech to manifest
in an ongoing discussion, the source tweet may not necessarily be
hateful; rather, there are plenty of such non-hateful tweets which
gradually invoke hateful replies, resulting in the entire reply threads
becoming provocative.
In this paper,we define a novel problem – given a source tweet and
a few of its initial replies, the task is to forecast the hate intensity of
upcoming replies. To this end, we curate a novel dataset constituting
∼ 4.5𝑘 contemporary tweets and their entire reply threads. Our preliminary
analysis confirms that the evolution patterns along time
of hate intensity among reply threads have highly diverse patterns,
and there is no significant correlation between the hate intensity of
the source tweets and that of their reply threads. We employ seven
state-of-the-art dynamic models (either statistical signal processing
or deep learning based) and show that they fail badly to forecast the
hate intensity. We then propose DESSERT, a novel deep state-space
model that leverages the function approximation capability of deep
neural networks with the capacity to quantify the uncertainty of
statistical signal processing models. Exhaustive experiments and
ablation study show that DESSERT outperforms all the baselines
substantially. Further, its deployment in an advanced AI platform
designed to monitor real-world problematic hateful content has improved
the aggregated insights extracted for countering the spread
of online harms.
Original languageEnglish
Pages2732-2742
Number of pages11
DOIs
Publication statusPublished - 14 Aug 2021
EventProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021 -
Duration: 14 Aug 202114 Aug 2021

Conference

ConferenceProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2021
Period14/08/2114/08/21

Fingerprint

Dive into the research topics of 'Would Your Tweet Invoke Hate on the Fly? Forecasting Hate Intensity of Reply Threads on Twitter'. Together they form a unique fingerprint.

Cite this