정보관리학회지, 한국정보관리학회

권한신청
P-ISSN1013-0799
E-ISSN2586-2073
KCI

검색어: Bibliographic features, 검색결과: 4

김혜진(연세대학교) ; 송민(연세대학교) 2017, Vol.34, No.1, pp.177-195 https://doi.org/10.3743/KOSIM.2017.34.1.177

초록보기

초록

연구전선이란 연구논문들 간에 인용이 빈번하게 발생하며, 지속적으로 발전이 이루어지고 있는 연구영역을 의미한다. 연구행위가 집중되는 핵심 연구분야로 발전 가능성이 높은 연구전선을 조기에 예측해내는 것은 학계와 산업계, 정부기관, 나아가 국가의 과학기술 발전에 큰 유익을 가져다 줄 수 있는 유용한 사회적 자원이 된다. 본 연구는 복합자질을 활용하여 연구전선을 추론하는 모델을 제시하고자 시도하였다. 연구전선 추론은 핵심 연구영역으로 발전할 가능성이 높은 문헌들이 포함될 수 있도록 문헌을 복합자질로 표현하고, 그 자질들을 심층학습하여 새로 발행된 문헌들이 연구전선에 포함될 수 있는지 그 가능성을 예측하였다. 서지 자질, 네트워크 자질, 내용 자질 등 복합자질 세트를 사용하여 문헌을 표현하고 피인용을 많이 받을 가능성이 있는 문헌을 추론하기 위해서 확률기반 팩터그래프 모델을 적용하였다. 추출된 자질들은 팩터그래프의 변수로 표현되어 합-곱 알고리즘과 접합 트리 알고리즘을 적용하여 연구전선 추론이 이루어졌다. 팩터그래프 확률모델을 적용하여 연구전선을 추론․구축한 결과, 서지결합도 4 이상으로 구축된 베이스라인 연구전선과 큰 차이를 보였다. 팩터그래프 기반 연구전선그룹이 서지결합 기반 연구전선그룹보다 문헌 간의 직접 연결정도가 강하며 연결 관계에 있지 않은 두 개의 문헌을 연결시키는 매개정도 또한 강한 집단으로 나타났다.

Abstract

This study attempts to infer research fronts using factor graph model based on heterogeneous features. The model suggested by this study infers research fronts having documents with the potential to be cited multiple times in the future. To this end, the documents are represented by bibliographic, network, and content features. Bibliographic features contain bibliographic information such as the number of authors, the number of institutions to which the authors belong, proceedings, the number of keywords the authors provide, funds, the number of references, the number of pages, and the journal impact factor. Network features include degree centrality, betweenness, and closeness among the document network. Content features include keywords from the title and abstract using keyphrase extraction techniques. The model learns these features of a publication and infers whether the document would be an RF using sum-product algorithm and junction tree algorithm on a factor graph. We experimentally demonstrate that when predicting RFs, the FG predicted more densely connected documents than those predicted by RFs constructed using a traditional bibliometric approach. Our results also indicate that FG-predicted documents exhibit stronger degrees of centrality and betweenness among RFs.

OWL을 이용한 온톨로지 기반의 목록시스템 설계 연구

이현실(원광대학교) ; 한성국(원광대학교) 2004, Vol.21, No.2, pp.249-267 https://doi.org/10.3743/KOSIM.2004.21.2.249

초록보기

초록

MARC는 목록 데이터를 상세하게 정의할 수 있는 장점이 있지만, 개념요소가 구조화 되어 있지 않고 표현체계가 복잡하기 때문에 단순 계층구조의 의미 어휘 체계를 지원하는 XML DTD나 RDF/S로는 그 구조를 모델화하기가 어렵다. 본 연구에서는 MARC의 데이터 요소를 추상화하여 목록 데이터의 개념 구조를 표현하는 서지 온톨로지를 구축하였으며, 개념간의 논리 관계와 프로퍼티의 카디널리티 및 프로퍼티 값에 대한 논리적 제한을 부가할 수 있는 OWL을 이용하여 MRAC 필드의 복합 구조를 모델링하여 구축한 목록 온톨로지를 구현하였다. 온톨로지 언어를 이용한 MARC 데이터를 기술 방법은 목록 데이터에 대한 메타데이터 구성과 목록의 호환성 문제를 해결할 수 있는 기초적 방안이 되며, 시맨틱 웹 서비스를 기반으로 하는 차세대 문헌 정보서비스 시스템 구현의 토대가 될 것이다.

Abstract

Although MARC can define the detail cataloguing data, it has complex structures and frameworks to represent bibliographic information. On account of these idiosyncratic features of MARC, XML DTD or RDF/S that supports simple hierarchy of conceptual vocabularies cannot capture MARC formalism effectively. This study implements bibliographic ontology by means of abstracting conceptual relationships between bibliographic vocabularies of MARC. The bibliographic ontology is formalized with OWL that can represent the logical relations between conceptual elements and specify cardinality and property value restrictions. The bibliographic ontology in this study will provide metadata for cataloguing data and resolve compatibility problems between cataloguing systems. And it can also contribute the development of next generation bibliographic information system using semantic Web services.

한글 저자명 중의성 해소를 위한 기계학습기법의 적용

강인수(경성대학교) 2008, Vol.25, No.3, pp.27-39 https://doi.org/10.3743/KOSIM.2008.25.3.027

초록보기

초록

동일한 인명을 갖는 서로 다른 실세계 사람들이 존재하는 현실은 인터넷 세계에서 인명으로 표현된 개체의 신원을 식별해야 하는 문제를 발생시킨다. 상기의 문제가 학술정보 내의 저자명 개체로 제한된 경우를 저자식별이라 부른다. 저자식별은 식별 대상이 되는 저자명 개체 사이의 유사도 즉 저자유사도를 계산하는 단계와 이후 저자명 개체들을 군집화하는 단계로 이루어진다. 저자유사도는 공저자, 논문제목, 게재지정보 등의 저자식별자질들의 자질유사도로부터 계산되는데, 이를 위해 기존에 교사방법과 비교사방법들이 사용되었다. 저자식별된 학습샘플을 사용하는 교사방법은 비교사방법에 비해 다양한 저자식별자질들을 결합하는 최적의 저자유사도함수를 자동학습할 수 있다는 장점이 있다. 그러나, 기존 교사방법 연구에서는 SVM, MEM 등의 일부 기계학습기법만이 시도되었다. 이 논문은 다양한 기계학습기법들이 저자식별에 미치는 성능, 오류, 효율성을 비교하고, 공저자와 논문제목 자질에 대해 자질값 추출 및 자질 유사도 계산을 위한 여러 기법들의 비교분석을 제공한다.

Abstract

In bibliographic data, the use of personal names to indicate authors makes it difficult to specify a particular author since there are numerous authors whose personal names are the same. Resolving same-name author instances into different individuals is called author resolution, which consists of two steps: calculating author similarities and then clustering same-name author instances into different person groups. Author similarities are computed from similarities of author-related bibliographic features such as coauthors, titles of papers, publication information, using supervised or unsupervised methods. Supervised approaches employ machine learning techniques to automatically learn the author similarity function from author-resolved training samples. So far, however, a few machine learning methods have been investigated for author resolution. This paper provides a comparative evaluation of a variety of recent high-performing machine learning techniques on author disambiguation, and compares several methods of processing author disambiguation features such as coauthors and titles of papers.

사회과학, 자연과학기술 및 융복합 분야의 약물중독 연구에 대한 계량서지학적 비교 분석 연구

남동인(연세대학교 문헌정보학과 석사과정) ; 박지홍(연세대학교 문헌정보학과) 2022, Vol.39, No.2, pp.203-232 https://doi.org/10.3743/KOSIM.2022.39.2.203

초록보기

초록

약물중독 혹은 약물사용장애(substance use disorder)는 세계적으로 그 위험성과 유행성이 지속적으로 관측 되고 있다. 이러한 배경에서 수많은 관련 연구들이 진행이 되어왔지만, 이와 관련한 계량서지학적 분석은 미진한 상황이다. 특히, 약물중독과 관련된 다양한 특성들을 종합적으로 반영한 거시적 차원의 계량서지학적 접근법을 활용한 연구는 찾아보기가 힘든 상황이다. 이 연구에서는 이러한 약물중독의 다차원적 특성을 반영하기 위해 사회과학, 자연과학기술, 융복합 분야에서의 약물중독 연구 동향을 비교 분석하였다. 이 연구는 2002년부터 2021년까지의 약물중독 연구 논문을 Web of Science로부터 검색 후 수집하였으며, SCI(E) 및 SSCI 정보를 토대로 학문 분야를 분류하였다. 저자 키워드 동시출현 분석을 수행한 결과, 자연과학기술은 신경정신약물과 보상시스템에 관한 연구가 주를 이루었고, 사회과학 분야에서는 이보다는 인구학적 특성이 반영된 약물중독 연구가 수행되어 왔음을 알 수 있었고, 융복합 분야에서는 이러한 동향을 모두 아우르고 있는 것을 확인할 수 있었다. 저자 동시인용 분석도 수행을 하였는데, 이를 통해 자연과학기술 분야는 슈퍼 저자들이 관측된 반면, 사회과학 분야에서는 개인 저자뿐 아니라 기관 저자까지도 인용이 많이 되는 것으로 확인이 되었다.

Abstract

Drug addiction or substance use disorder is continuously observed worldwide for its risks and prevalence. In this context, numerous studies have been conducted regarding this issue. However, bibliometric analysis related to drug addiction is insufficient. In particular, it is difficult to find research that utilizes a macro-level bibliographic approach that comprehensively reflects various characteristics related to drug addiction. In this study, to reflect the multidimensional features of drug addiction, research trends in drug addiction in social science, natural science, and multidisciplinary studies were compared and analyzed. This study collected drug addiction research articles from 2002 to 2021 by searching from the Web of Science, and classified academic disciplines based on SCI(E) and SSCI information. Author keyword co-occurrence analysis was also conducted, which provided confirmation that natural science mainly studied psychoactive substances and the reward system in the brain, while drug addiction studies reflecting demographic characteristics were conducted in the domain of social science. In the multidisciplinary field, all of the above topics were covered. Author co-citation analysis was also employed, which showed that there are superstars (i.e., authors who receive a rigorous amount of citation) in the field of natural science, while in the social science domain, authors were highly cited not only at the individual level but also at the institutional level.

바로가기메뉴

초록

Abstract

초록

Abstract

초록

Abstract

초록

Abstract

정보관리학회지