Utopia AI determines amounts of hate speech on different social platforms
[PRESS RELEASE] – Utopia AI determines amounts of hate speech on different social platforms 22 April 2021
Online hate speech shows up most often on discussion forums, according to a Utopia Analytics study for the Ministry of Justice in Finland. The report found that discussion forums are home to 97% of identified hate speech messages. The next largest platform types are Twitter messages at 2.5% and Instagram messages at 0.2%. Blogs, news comments and public Facebook messages make up less than 0.02% of all identified hate speech. The data set didn’t include private discussions, for example Facebook groups or accounts.
The project, part of the Facts Against Hate program by Ministry of Justice, tested the ability of artificial intelligence to recognize hate speech in online environments. The approach combined human evaluation with machine learning. A key goal was to find the main channels of hate speech, and to identify differences in hate speech from platform to platform.
The definition of hate speech was based on academic research in the social sciences. The definition was done by producing hate speech categories, and then used to manually identify examples of hate speech in a data set of online messages. These annotations were then used as training data for Utopia AI Moderator, a language-independent tool that utilizes text analytics and machine learning. The data set was 12 million Finnish comments and posts from September to October 2020.
The results show that about 150 000 messages that contain hate speech appear on publicly available Finnish social media platforms every month. That’s about 1.8% of all messages.
Among the public international social media platforms, Twitter seems the most prominent, with 7 450 messages identified as hate speech, or 0.14% of all tweets. Retweets play a significant role in circulating these messages: 39% of all hate-speech tweets are duplicates.
“While the data set consisted of mostly Finnish messages,” says Utopia’s CEO Dr. Mari-Sanna Paukkeri, “the results would be very similar in other languages . For example, the major platform for Finnish hate speech, Ylilauta, is a peer to the commonly known 4chan. Moreover, we can build a similar AI model to identify hate speech in any language in only two weeks. We only need a skilled individual to say how hate speech should be defined in your culture and language and we need the data to analyse.”
About Utopia Analytics
Utopia Analytics is a Finnish AI company, which specialises in analysing text in any language or dialect. The Helsinki-based company was founded in 2014.
Utopia AI Moderator and Utopia AI Claim Handler are the flagship products of the company. Both products can automatically process user-generated content in any language using Artificial Intelligence and machine learning.
They are used successfully, for example, on the children’s Facebook-equivalent Momio, gaming platforms like StarStable and on popular online marketplaces such as Swiss Tutti.ch, Finnish Tori.fi as well as Hungarian Jófogás. Brazillian frontline news channel UOL’s and Austrian Kronen Zeitung‘s news comments are also moderated by Utopia AI.
Director, Marketing and Communications
+358 50 444 0941
We use web analytics tools that use cookie information to monitor traffic on our website and to identify network visitors and to develop our service in order to improve client experience as well as to target marketing and remarketing. Please let us know if you agree, or you can customise your preference.
We use web analytic tools utilizing cookie information for monitoring traffic in our website and for identifying network visitors and for developing our service in order to improve client experience as well as to target marketing and remarketing. The information we track and collect includes information regarding who visits our website, what is previous website the visitor has accessed when visiting our website, what information is read and how long visitors stay on our website.
Necessary cookies are absolutely essential for the website to function properly. This category only includes cookies that ensures basic functionalities and security features of the website. These cookies do not store any personal information.
Any cookies that may not be particularly necessary for the website to function and is used specifically to collect user personal data via analytics, ads, other embedded contents are termed as non-necessary cookies. It is mandatory to procure user consent prior to running these cookies on your website.