Development of a Health Information Service System for Depression Management in Koreans

Bae Jeong-Yee (배정이)*1), Lee So-Woo (이소우)2), Yun Sook-Hee (윤숙희)3), Ahn Kyung-Ae (안경애)4)

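1)Department of Nursing, College of Medicine, Inje University
2)College of Nursing, Seoul National University
3)Department of Nursing, College of Medicine, Inje University
4)College of Nursing, Seoul National University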

Abstract : The volume of genome sequence data grows severalfold every year. However, most existing compression algorithms can compress DNA sequences only to about 1.6 bits per base. To improve on this, this paper presents a new compression method for human DNA data that is based on the whole genome sequence. The method finds regions of the whole genome sequence that closely match the sequence to be compressed, then records those regions together with the differences between them and the given sequence. It achieves an average compression ratio of about 0.2 bits per base and can be applied to DNA data from any organism whose whole genome sequence is already available.
Keywords : whole genome sequence, human, DNA, data compression
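
The match-and-record idea described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the seed length K, the function names, the greedy longest-match search, and the record format are all assumptions made for illustration. The abstract does not say how matching regions are found or how differences are encoded, and this sketch makes no attempt to reach the reported bits-per-base figures.

```python
from collections import defaultdict

K = 4  # hypothetical seed length; a real genome would need a much larger value


def build_index(reference, k=K):
    """Map every k-mer of the reference to the positions where it starts."""
    index = defaultdict(list)
    for pos in range(len(reference) - k + 1):
        index[reference[pos:pos + k]].append(pos)
    return index


def compress(target, reference, k=K):
    """Encode target as ("match", ref_pos, length) and ("literal", bases) records."""
    index = build_index(reference, k)
    records = []
    literal = []  # bases that matched no region of the reference
    i = 0
    while i < len(target):
        best_pos, best_len = -1, 0
        # Try every reference position sharing the k-mer seed; keep the longest match.
        for pos in index.get(target[i:i + k], ()):
            length = 0
            while (i + length < len(target)
                   and pos + length < len(reference)
                   and target[i + length] == reference[pos + length]):
                length += 1
            if length > best_len:
                best_pos, best_len = pos, length
        if best_len >= k:
            if literal:
                records.append(("literal", "".join(literal)))
                literal = []
            records.append(("match", best_pos, best_len))
            i += best_len
        else:
            literal.append(target[i])
            i += 1
    if literal:
        records.append(("literal", "".join(literal)))
    return records


def decompress(records, reference):
    """Rebuild the target from the records and the same reference sequence."""
    parts = []
    for record in records:
        if record[0] == "match":
            _, pos, length = record
            parts.append(reference[pos:pos + length])
        else:
            parts.append(record[1])
    return "".join(parts)


# Toy data (made up): the target is reference[3:20] with one substituted base.
reference = "ACGTACGTTTGACCAGTAGGCA"
target = "TACGTTTGACCTGTAGG"
records = compress(target, reference)
assert decompress(records, reference) == target
```

In a scheme of this kind the saving comes from replacing long runs of bases with a position and a length, so only the differences from the reference are stored base by base. The decompressor needs access to the same reference sequence.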

Attached file

Bae2(paper).hwp

The Korean Society of Medical Informatics
Copyright (c) 2002 by The Korean Society of Medical Informatics