Digital Library[ Search Result ]
Community Detection Using Link Attribute-Based Classification
Jeongseon Kim, Soohwan Jeong, Sungsu Lim
http://doi.org/10.5626/JOK.2021.48.8.959
Attempts to discover knowledge through data are becoming gradually diversified to understand a fast and complex world. Graph data analysis, which models and analyzes correlated data as graphs, is drawing much attention as it is combined with the latest machine learning techniques. In this work, we propose a novel methodology for discovering graph community structures. We analyze similarity, curvature-based attributes to allow links existing inside and outside the community to have different attribute values, and exploit them to design and analyze algorithms that eliminate links that affect the community structure less to find better community structures on sparse graphs.
Opinion Classification in Professional Sports Fan Sites using Topic Keyword-Based Sentiment Analysis
Hyungho Byun, Sihyun Jeong, Chong-kwon Kim
http://doi.org/10.5626/JOK.2018.45.4.390
In this study, we propose the classification method using topic keyword-based sentiment analysis through the posts of professional sports fan sites in Korea. We studied ways to take into account the use of special communication methods or vocabulary in the community and defined keywords based on the characteristics of the topic or frequency of the community"s words. In addition, we presented a new sentiment analysis approach that utilizes the use of keyword pools and the proximity relation to keywords. Through three years of actual community dataset, sentiment analysis based on the topic keyword is more effective than the existing method and reflects the community environment.
A Method for Identifying Nicknames of a User based on User Behavior Patterns in an Online Community
http://doi.org/10.5626/JOK.2018.45.2.165
An online community is a virtual group whose members share their interests and hobbies anonymously with nicknames unlike Social Network Services. However, there are malicious user problems such as users who write offensive contents and there may exist data fragmentation problems in which the data of the same user exists in different nicknames. In addition, nicknames are frequently changed in the online community, so it is difficult to identify them. Therefore, in this paper, to remedy these problems we propose a behavior pattern feature vectors for users considering online community characteristics, propose a new implicit behavior pattern called relationship pattern, and identify the nickname of the same user based on Random Forest classifier. Also, Experimental results with the collected real world online community data demonstrate that the proposed behavior pattern and classifier can identify the same users at a meaningful level.
Fast Influence Maximization in Social Networks
Yun-Yong Ko, Kyung-Jae Cho, Sang-Wook Kim
http://doi.org/10.5626/JOK.2017.44.10.1105
Influence maximization (IM) is the problem of finding a seed set composed of k nodes that maximizes the influence spread in social networks. However, one of the biggest problems of existing solutions for IM is that it takes too much time to select a k-seed set. This performance issue occurs at the micro and macro levels. In this paper, we propose a fast hybrid method that addresses two issues at micro and macro levels. Furthermore, we propose a path-based community detection method that helps to select a good seed set. The results of our experiment with four real-world datasets show that the proposed method resolves the two issues at the micro and macro levels and selects a good k-seed set.
A Korean Community-based Question Answering System Using Multiple Machine Learning Methods
Sunjae Kwon, Juae Kim, Sangwoo Kang, Jungyun Seo
Community-based Question Answering system is a system which provides answers for each question from the documents uploaded on web communities. In order to enhance the capacity of question analysis, former methods have developed specific rules suitable for a target region or have applied machine learning to partial processes. However, these methods incur an excessive cost for expanding fields or lead to cases in which system is overfitted for a specific field. This paper proposes a multiple machine learning method which automates the overall process by adapting appropriate machine learning in each procedure for efficient processing of community-based Question Answering system. This system can be divided into question analysis part and answer selection part. The question analysis part consists of the question focus extractor, which analyzes the focused phrases in questions and uses conditional random fields, and the question type classifier, which classifies topics of questions and uses support vector machine. In the answer selection part, the we trains weights that are used by the similarity estimation models through an artificial neural network. Also these are a number of cases in which the results of morphological analysis are not reliable for the data uploaded on web communities. Therefore, we suggest a method that minimizes the impact of morphological analysis by using character features in the stage of question analysis. The proposed system outperforms the former system by showing a Mean Average Precision criteria of 0.765 and R-Precision criteria of 0.872.
Query Expansion based on Word Sense Community
Chang-Uk Kwak, Hee-Geun Yoon, Seong-Bae Park
In order to assist user’s who are in the process of executing a search, a query expansion method suggests keywords that are related to an input query. Recently, several studies have suggested keywords that are identified by finding domains using a clustering method over the documents that are retrieved. However, the clustering method is not relevant when presenting various domains because the number of clusters should be fixed. This paper proposes a method that suggests keywords by finding various domains related to the input queries by using a community detection algorithm. The proposed method extracts words from the top-30 documents of those that are retrieved and builds communities according to the word graph. Then, keywords representing each community are derived, and the represented keywords are used for the query expansion method. In order to evaluate the proposed method, we compared our results to those of two baseline searches performed by the Google search engine and keyword recommendation using TF-IDF in the search results. The results of the evaluation indicate that the proposed method outperforms the baseline with respect to diversity.
Search

Journal of KIISE
- ISSN : 2383-630X(Print)
- ISSN : 2383-6296(Electronic)
- KCI Accredited Journal
Editorial Office
- Tel. +82-2-588-9240
- Fax. +82-2-521-1352
- E-mail. chwoo@kiise.or.kr