Why We Built Uli's Redaction of Slurs Feature

Published on Mon Nov 14 2022Tarunima Prabhakar - Uli

One of the features of Uli is the redaction of slurs. Why did we build this? Not all abuse is explicit but the use of slurs, coded language and dog-whistling to target gender minorities is common on social media. And we have become normalised to it.

First, there are explicit slurs. Most slurs, even if undirected, are sexualized. Many are used to target members of the dominant group by describing their relation to women (mother or sister).

Dog-whistling is the use of phrases/terms, possibly understood only by an ‘inner group’ to target a community. It is a form of coded language. But coded language can also include misspellings or the use of words that sound similar to a slur.

This is especially common in mixed-language communities — one can use English words that sound similar to a Hindi or Tamil slur.

Such language is so pervasive on social media that one doesn’t even register when one encounters them. One is also likely to say things online that one would not say to someone in person. The slur redaction feature interrupts the unconscious scrolling to call attention to the presence of such language.

The slur list crowdsourced for Uli can be found here. A search on Twitter with some of these terms will reveal just how abusive Twitter can be to individuals of marginalised genders.

Coded language evolves rapidly. Someone comes up with a novel way to evade automated moderation everyday. And now we’re looking at a world where the new owner of Twitter wants to pull even further away from detecting and acting on such content.

So it remains a continuous effort to keep these lists updated. If you have slurs you think we need to include, feel free to ping us on Twitter or email.

Related Posts

Designing a Tool to Chisel Away at Online Abuse
Is it possible to imbue the creation of a #machinelearning tool with principles? That was our intent with Uli.
Moving Beyond the Dominant Metaphors of AI
How the moodboard for Uli can help us think about generative AI
Finding Similar Videos Efficiently
Data Science blog on finding similar videos in Tattle's archive by Feature Selection of anchor frames
Analysing the Katna library for video key frame extraction
Today we see a rampant proliferation of video content via various social media channels all over the world. We explore the Katna library to extract key frames from a video to feed into our duplicate image search engine
Text and illustrations on the website is licensed under Creative Commons 4.0 License. The code is licensed under GPL. For data, please look at respective licenses.