Construction of a Cancer Information Voice Service Using Speech Recognition and Speech Synthesis Technology

Min-Kyung Lim1), Haeng-In Noh1), Jung-Mi Park1), Keon-Wook Kang1), Young-Ho Cho*1)

1) Cancer Information Research Branch, Research Institute, National Cancer Center

Abstract : The volume of genome sequence data grows severalfold every year. However, most existing compression algorithms can compress DNA sequence data only to about 1.6 bits per base. To improve on this, this paper presents a new compression method for human DNA data that uses the whole genome sequence as a reference. Given a sequence to be compressed, the method finds regions of the whole genome sequence that match it well, then records those regions together with the differences between them and the given sequence. The method achieves an average compression ratio of about 0.2 bits per base and can be applied to DNA data from any organism whose whole genome sequence is already available.
Keywords : whole genome sequence, human, DNA, data compression
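
The idea in the abstract (record matching regions of a reference genome plus the differences) can be illustrated with a minimal sketch. This is not the authors' algorithm: the fixed block size, the brute-force search, the substitution-only differences, and the names compress, decompress and Block are all assumptions made for illustration.

```python
from typing import List, Tuple

# One encoded block (hypothetical layout, not from the paper):
# (start position in the reference or -1 for a literal block,
#  block length, list of (offset, base) substitutions)
Block = Tuple[int, int, List[Tuple[int, str]]]


def compress(reference: str, target: str, block: int = 16) -> List[Block]:
    """Encode target as reference regions plus substitutions (toy brute-force search)."""
    blocks: List[Block] = []
    for i in range(0, len(target), block):
        chunk = target[i:i + block]
        best_pos = -1
        # Literal fallback: every base of the chunk is stored explicitly.
        best_diffs = list(enumerate(chunk))
        # Try every alignment of the chunk against the reference.
        for pos in range(len(reference) - len(chunk) + 1):
            diffs = [(k, c) for k, c in enumerate(chunk) if reference[pos + k] != c]
            if len(diffs) < len(best_diffs):
                best_pos, best_diffs = pos, diffs
        blocks.append((best_pos, len(chunk), best_diffs))
    return blocks


def decompress(reference: str, blocks: List[Block]) -> str:
    """Rebuild the target from the reference and the recorded blocks."""
    out: List[str] = []
    for pos, length, diffs in blocks:
        # Start from the matched reference region, or from empty slots for a literal block.
        bases = list(reference[pos:pos + length]) if pos >= 0 else [""] * length
        for k, c in diffs:
            bases[k] = c
        out.append("".join(bases))
    return "".join(out)
```

When a block matches the reference closely, only a position, a length and a few substitutions are stored, which is why this kind of scheme can go far below the 2 bits per base of a plain encoding. A practical implementation would presumably replace the brute-force scan with an index over the reference, handle insertions and deletions, and entropy-code the output.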

Korean Society of Medical Informatics
Copyright (c) 2002 by The Korean Society of Medical Informatics