Digital Library[ Search Result ]
Metadata Extraction based on Deep Learning from Academic Paper in PDF
Seon-Wu Kim, Seon-Yeong Ji, Hee-Seok Jeong, Hwa-Mook Yoon, Sung-Pil Choi
http://doi.org/10.5626/JOK.2019.46.7.644
Recently, with a rapid increase in the number of academic documents, there has arisen a need for an academic database service to obtain information about the latest research trends. Although automated metadata extraction service for academic database construction has been studied, most of the academic texts are composed of PDF, which makes it difficult to automatically extract information. In this paper, we propose an automatic metadata extraction method for PDF documents. First, after transforming the PDF into XML format, the coordinates, size, width, and text feature in the XML markup token are extracted and constructed as a vector form. Extracted feature information is analyzed using Bidirectional GRU-CRF, which is an deep learning model specialized for sequence labeling, and finally, metadata are extracted. In this study, 10 kinds of journals among various domestic journals were selected and a training set for metadata extraction was constructed and experimented using the proposed methodology. As a result of extraction experiment on 9 kinds of metadata, 88.27% accuracy and 84.39% F1 performance was obtained.
User-Weighted Viewpoint/Lighting Control for Multi-Object Scene
http://doi.org/10.5626/JOK.2018.45.9.888
In computer graphics, viewpoint selection for objects in a scene has been performed by evaluating the goodness of sampled viewpoints. Since the definition of a good viewpoint varies according to the user’s purpose, various measurements such as entropy and mesh saliency have been used. In this paper, we propose a method of selecting the best viewpoint and lighting for a multi-object scene, based on the user-assigned importance of each object. After sampling a viewpoint and lighting from the surrounding sphere of the scene, we render the images by combining the sampled viewpoint and lighting. We then select the best result that coincides with user-assigned importance by quantifying the saliency of each object in the rendered image. While this technique has the disadvantage of high computation cost due to the need to render combinations of viewpoints and lighting, it obtains the viewpoint and lighting most suitable for the user`s needs. In order to minimize the computation cost, an object-by-object pixel classification technique on GPU is also proposed in this paper.
The Design of Object-of-Interest Extraction System Utilizing Metadata Filtering from Moving Object
Taewoo Kim, Hyungheon Kim, Pyeongkang Kim
The number of CCTV units is rapidly increasing annually, and the demand for intelligent video-analytics system is also increasing continuously for the effective monitoring of them. The existing analytics engines, however, require considerable computing resources and cannot provide a sufficient detection accuracy. For this paper, a light analytics engine was employed to analyze video and we collected metadata, such as an object’s location and size, and the dwell time from the engine. A further data analysis was then performed to filter out the target of interest; as a result, it was possible to verify that a light engine and the heavy data analytics of the metadata from that engine can reject an enormous amount of environmental noise to extract the target of interest effectively. The result of this research is expected to contribute to the development of active intelligent-monitoring systems for the future.
A Design of Metadata Registry Database based on Object-Relational Transformation Methodology
Sooyoung Cha, Sukhoon Lee, Dongwon Jeong, Doo-Kwon Baik
The ISO/IEC 11179 Metadata registry (MDR) is an international standard that was developed to register and share metadata. ISO/IEC 11179 represents an MDR as a metamodel that is an object model. However, it is difficult to develop an MDR based on ISO/IEC 11179 because the standard has no clear criteria to transform the metamodel into a database. In this paper, we suggest the design of an MDR data model that is based on object-relational transformation methodology (ORTM) for the MDR implementation. Hence, we classify the transformation methods of ORTM according to the corresponding relationships. After classification, we propose modeling rules by defining the standard use of the transformation. This paper builds the relational database tables as an implementation result of an MDR data model. Through experiments and evaluation, we verify the proposed modeling rules and evaluate the suitability of the created table structures. As the result, the proposed method shows that the table structures preserve classes and relationships of the standard metamodel well.
Design of a Video Metadata Schema and Implementation of an Authoring Tool for User Edited Contents Creation
In this paper, we design new video metadata schema for searching video segments to create UEC (User Edited Contents). The proposed video metadata schema employs hierarchically structured units of ‘Title-Event-Place(Scene)-Shot’, and defines the fields of the semantic information as structured form in each segment unit. Since this video metadata schema is defined by analyzing the structure of existing UECs and by experimenting the tagging and searching the video segment units for creating the UECs, it helps the users to search useful video segments for UEC easily than MPEG-7 MDS (Multimedia Description Scheme) which is a general purpose international standard for video metadata schema.
Search

Journal of KIISE
- ISSN : 2383-630X(Print)
- ISSN : 2383-6296(Electronic)
- KCI Accredited Journal
Editorial Office
- Tel. +82-2-588-9240
- Fax. +82-2-521-1352
- E-mail. chwoo@kiise.or.kr