UMLS(Unified Medical Language System)와 한글의학용어 통합 과정의 소개
김승희1), 한승빈1), 최진욱*1)
1)서울대학교 의과대학 의공학교실
Abstract : Genome sequence data of organisms increase many times bigger annually. However, most of existing compression algorithms are just able to compress DNA data sequences by the ratio of 1.6 bits per base. In order to improve this status, this paper presents a new compression method for human DNA data based on the whole genome sequence. It just finds some regions in the whole genome sequence matching well with a given sequence to be compressed and then records the regions and some differences between them. The method achieves average compression ratio of about 0.2 bits per base and can be applied to any DNA data for organisms whose whole genome sequences are already setup. keyword : whole genome sequence, human, DNA, data compression
투고내용을 삭제하시려면 "삭제"버튼을 누르시고 다시 접수하여 주시기 바랍니다.
Copyright (c) 2002 by The Korean Society of Medical Informatics