Variant Maps to Identify Coding and Non-coding DNA Sequences of Genomes Selected from Multiple Species

Yuquan Liu and Jeffrey Zhen

Abstract

DNA sequences comprise complex genetic information, their specific characteristics are contained in both coding and non-coding sequences. Major gene components in higher levels of organisms are composed of non-coding sequences. In ENCODE project, there are evidence that 98% human Genomes are non-coding forms and 80% of them with functions. This paper provides a measurement model and a set of experiment results on genomic sequences using variant maps to distinguish differences on coding and non-coding sequences in visual representations. This model applies probability measurements on DNA sequences to coding and non-coding regions respectively to identify distinguished patterns from different sequences of multiple species.

Relevant Publications in Biological Systems: Open Access