Search : [ keyword: sentiment analysis ] (10)

Prompt Tuning For Korean Aspect-Based Sentiment Analysis

Bong-Su Kim, Seung-Ho Choi, Si-hyun Park, Jun-Ho Wang, Ji-Yoon Kim, Hyun-Kyu Jeon, Jung-Hoon Jang

http://doi.org/10.5626/JOK.2024.51.12.1043

Aspect-based sentiment analysis examines how emotions in text relate to specific aspects, such as product characteristics or service features. This paper presents a comprehensive methodology for applying prompt tuning techniques to multi-task token labeling challenges using aspect-based sentiment analysis data. The methodology includes a pipeline for identifying emotion expression domains, which generalizes the token labeling problem into a sequence labeling problem. It also suggests selecting templates to classify separated sequences based on aspects and emotions, and expanding label words to align with the dataset’s characteristics, thus optimizing the model's performance. Finally, the paper provides several experimental results and analyses for the aspect-based sentiment analysis task in a few-shot setting. The constructed data and baseline model are available on AIHUB. (www.aihub.or.kr).

A Model for Topic Classification and Extraction of Sentimental Expression using a Lexical Semantic Network

JiEun Park, JuSang Lee, JoonChoul Shin, ChoelYoung Ock

http://doi.org/10.5626/JOK.2023.50.8.700

The majority of the previous sentiment analysis studies classified a single sentence or document into only a single sentiment. However, more than one sentiment can exist in one sentence. In this paper, we propose a method that extracts sentimental expression for word units. The structure of the proposed model is a UBERT model that uses morphologically analyzed sentences as input and adds layers to predict topic classification and sentimental expression. The proposed model uses topic feature of a sentence predicted by topic dictionary. The topic dictionary is built at the beginning of machine learning. The learning module collects topic words from a training corpus and expands them using the lexical semantic network. The evaluation is performed with the word unit F1-Score. The proposed model achieves an F1-Score of 58.19%, an improvement of 0.97% point over the baseline.

Method for the Automatic Generation of Training Sets for Word Embedding Reflecting Sentiment Information

Dahee Lee, Won-Min Lee, Byung-Won On

http://doi.org/10.5626/JOK.2022.49.1.42

Word embedding is a method of expressing a word as a vector. However, since existing word embedding methods predict words that appear together, they are expressed as similar vectors even if they have different emotions. When building a sentiment analysis model using this, sentences with similar patterns may be classified into the same polarity, which is one of the factors that degrade the performance of the emotional analysis model. In this paper, to address the problem, we proposed the automatic generation of a training set for word embedding reflecting sentiment information using morpheme analysis, dependence parsing, and a sentiment dictionary. Using sentiment-specific word embedding vectors generated by the proposed model, we showed that the proposed sentiment-specific word embedding model outperformed the existing word embedding models including CBOW, Skip-Gram, FastText, ELMo, and BERT.

SMERT: Single-stream Multimodal BERT for Sentiment Analysis and Emotion Detection

Kyeonghun Kim, Jinuk Park, Jieun Lee, Sanghyun Park

http://doi.org/10.5626/JOK.2021.48.10.1122

Sentiment Analysis is defined as a task that analyzes subjective opinion or propensity and, Emotion Detection is the task that finds emotions such as ‘happy’ or ‘sad’ from text data. Multimodal data refers to the appearance of image and voice data in addition to text data. In prior research, RNN or cross-transformer models were used, however, RNN models have long-term dependency problems. Also, since cross-transformer models could not capture the attribute of modalities, they got worse results. To solve those problems, we propose SMERT based on a single-stream transformer ran on a single network. SMERT can get joint representation for Sentiment Analysis and Emotion Detection. Besides, we use BERT tasks which are improved to utilize for multimodal data. To present the proposed model, we verify the superiority of SMERT through a comparative experiment on the combination of modalities using the CMU-MOSEI dataset and various evaluation metrics.

Combining Sentiment-Combined Model with Pre-Trained BERT Models for Sentiment Analysis

Sangah Lee, Hyopil Shin

http://doi.org/10.5626/JOK.2021.48.7.815

It is known that BERT can capture various linguistic knowledge from raw text via language modeling without using any additional hand-crafted features. However, some studies have shown that BERT-based models with an additional use of specific language knowledge have higher performance for natural language processing problems associated with that knowledge. Based on such finding, we trained a sentiment-combined model by adding sentiment features to the BERT structure. We constructed sentiment feature embeddings using sentiment polarity and intensity values annotated in a Korean sentiment lexicon and proposed two methods (external fusing and knowledge distillation) to combine sentiment-combined model with a general-purpose BERT pre-trained model. The external fusing method resulted in higher performances in Korean sentiment analysis tasks with movie reviews and hate speech datasets than baselines from other pre-trained models not fused with sentiment-combined models. We also observed that adding sentiment features to the BERT structure improved the model’s language modeling and sentiment analysis performance. Furthermore, when implementing sentiment-combined models, training time and cost could be decreased by using a small-scale BERT model with a small number of layers, dimensions, and steps.

Dimensional Sentiment Analysis of Korean Text using Data Balancing

Taehee Jeon, Changhwan Kim

http://doi.org/10.5626/JOK.2021.48.7.790

Compared with most studies on categorical sentiment analysis which aims to represent emotional states as a small set of emotion categories, there have been fewer studies on dimensional sentiment analysis which treats sentiment analysis as a regression problem because of the shortage of data. Recently, the National Information Society Agency (NIA) released open data, Multimodal Video Data, through their web site, AI Hub. Using this data, we experimented with dimensional sentiment analysis of Korean text. For this purpose, we used CNN which is one of the conventional deep learning models in NLP. We also verified that data balancing could improve the performance of models. The results show that the model trained on Multimodal Video Data performs well enough to show that the data should be useful for dimensional sentiment analysis of Korean text and that with data balancing the model can perform better in spite of their fewer training data.

Korean Movie Review Sentiment Analysis using Self-Attention and Contextualized Embedding

Cheoneum Park, Dongheon Lee, Kihoon Kim, Changki Lee, Hyunki Kim

http://doi.org/10.5626/JOK.2019.46.9.901

Sentiment analysis is the processing task that involves collecting and classifying opinions about a specific object. However, it is difficult to grasp the subjectivity of a person using natural language, so the existing sentimental word dictionaries or probabilistic models cannot solve such a task, but the development of deep learning made it possible to solve the task. Self-attention is a method of modeling a given input sequence by calculating the attention weight of the input sequence itself and constructing a context vector with a weighted sum. In the context, a high weight is calculated between words with similar meanings. In this paper, we propose a method using a modeling network with self-attention and pre-trained contextualized embedding to solve the sentiment analysis task. The experimental result shows an accuracy of 89.82%.

An Automatic Method of Generating a Large-Scale Train Set for Bi-LSTM based Sentiment Analysis

Min-Seong Choi, Byung-Won On

http://doi.org/10.5626/JOK.2019.46.8.800

Sentiment analysis using deep learning requires a large-scale train set labeled sentiment. However, direct labeling of sentiment by humans is time and cost-constrained, and it is not easy to collect the required data for sentiment analysis from many data. In the present work, to solve the existing problems, the existing sentiment lexicon was used to assign sentiment score, and when there was sentiment transformation element, the sentiment score was reset through dependency parsing and morphological analysis for automatic generation of large-scale train set labeled with the sentiment. The Top-k data with high sentiment score was extracted. Sentiment transformation elements include sentiment reversal, sentiment activation, and sentiment deactivation. Our experimental results reveal the generation of a large-scale train set in a shorter time than manual labeling and improvement in the performance of deep learning with an increase in the amount of train set. The accuracy of the model using only sentiment lexicon was 80.17% and the accuracy of the proposed model, which includes natural language processing technology was 89.17%. Overall, a 9% improvement was observed.

Korean Movie-review Sentiment Analysis Using Parallel Stacked Bidirectional LSTM Model

Yeongtaek Oh, Mintae Kim, Wooju Kim

http://doi.org/10.5626/JOK.2019.46.1.45

The sentiment analysis is a field of document classification that classifies the sensitivity of text documents. The sentiment analysis methodology that employs the use of deep learning can be divided into a process of tokenizing a document, obtaining a sentence vector through embedding and classifying a vectorized document. We reviewed the methods of various existing studies and found out the appropriate methodology focusing on embedding methods and deep learning models for the Korean documents through comparative experiments. The document pre-processing method compares documents to words, syllables and phonemes. Additionally, a comparative experiment was conducted on the Naver movie review data set nsmc (naver sentiment movie corpus) from the CNN to the LSTM, bi-LSTM, stacked bi-LSTM and the newly proposed Parallel Stacked Bidirectional LSTM model. The results showed that the performance of the proposed model was higher than that of the existing simple deep learning model. Moreover, itachieved the best classification performance of approximately 88.95% through the ensemble among the models learned through other pre-processing.

Opinion Classification in Professional Sports Fan Sites using Topic Keyword-Based Sentiment Analysis

Hyungho Byun, Sihyun Jeong, Chong-kwon Kim

http://doi.org/10.5626/JOK.2018.45.4.390

In this study, we propose the classification method using topic keyword-based sentiment analysis through the posts of professional sports fan sites in Korea. We studied ways to take into account the use of special communication methods or vocabulary in the community and defined keywords based on the characteristics of the topic or frequency of the community"s words. In addition, we presented a new sentiment analysis approach that utilizes the use of keyword pools and the proximity relation to keywords. Through three years of actual community dataset, sentiment analysis based on the topic keyword is more effective than the existing method and reflects the community environment.


Search




Journal of KIISE

  • ISSN : 2383-630X(Print)
  • ISSN : 2383-6296(Electronic)
  • KCI Accredited Journal

Editorial Office

  • Tel. +82-2-588-9240
  • Fax. +82-2-521-1352
  • E-mail. chwoo@kiise.or.kr