Semi-Supervised Learning for Detecting of Abusive Sentence on Twitter using Deep Neural Network with Fuzzy Category Representation

Da-Sol Park; Jeong-Won Cha

Semi-Supervised Learning for Detecting of Abusive Sentence on Twitter using Deep Neural Network with Fuzzy Category Representation

Da-Sol Park

Jeong-Won Cha

Vol. 45, No. 11, pp. 1185-1192, Nov. 2018

10.5626/JOK.2018.45.11.1185

hate-speech

fuzzy category representation

Semi-supervised learning

Natural Language Processing

Machine Learning

PDF

Abstract

The number of people embracing damage caused by hate speech on the SNS(Social Network Service) is increasing rapidly. In this paper, we propose a detection method using Semi-supervised learning and Deep Neural Network from a large file to determine whether implied meaning of sentence beyond hate speech detection through comparison with a simple dictionary in twitter sentence is abusive or not. Most of the methods judge the hate speech sentence by comparing with a blacklist comprising of hate speech words. However, the reported methods have a disadvantage that skillful and subtle expression of hate speech cannot be identified. So, we created a corpus with a label on whether or not to hate speech on Korean twitter sentence. The training corpus in twitter comprised of 44,000 sentences and the test corpus comprised of 13,082 sentences. The system performance about the explicit abusive sentences of the F1 score was 86.13% on the model using 1-layer syllable CNN and sequence vector. And the system performance about the implicit abusive sentences of the F1 score 25.53% on the model using 1-layer syllable CNN and 2-layer syllable CNN and sequence vector. The proposed method can be used as a method for detecting cyber-bullying.

Statistics

Cumulative Counts from November, 2022
Multiple requests among the same browser session are counted as one view. If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

D. Park and J. Cha, "Semi-Supervised Learning for Detecting of Abusive Sentence on Twitter using Deep Neural Network with Fuzzy Category Representation," Journal of KIISE, JOK, vol. 45, no. 11, pp. 1185-1192, 2018. DOI: 10.5626/JOK.2018.45.11.1185.

[ACM Style]

Da-Sol Park and Jeong-Won Cha. 2018. Semi-Supervised Learning for Detecting of Abusive Sentence on Twitter using Deep Neural Network with Fuzzy Category Representation. Journal of KIISE, JOK, 45, 11, (2018), 1185-1192. DOI: 10.5626/JOK.2018.45.11.1185.

[KCI Style]

박다솔, 차정원, "퍼지 범주 표현과 준지도 심층 신경망을 이용한 트위터 혐오 발언 문장 탐지," 한국정보과학회 논문지, 제45권, 제11호, 1185~1192쪽, 2018. DOI: 10.5626/JOK.2018.45.11.1185.

[Endnote/Zotero/Mendeley (RIS)] Download

[BibTeX] Download

Search

Journal of KIISE

ISSN : 2383-630X(Print)
ISSN : 2383-6296(Electronic)
KCI Accredited Journal

Editorial Office

Tel. +82-2-588-9240
Fax. +82-2-521-1352
E-mail. chwoo@kiise.or.kr