Codon and amino-acid distribution in DNA

J. K. Kim, S. I. Yang, Y. H. Kwon, E. I. Lee

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

According to the Zipf's law, the distribution of rank-ordered frequency of words in the natural language can be modelled on the power law. In this paper, we examine the frequency distribution of 64 codons over the coding and non-coding regions of 88 DNA from EMBL and GenBank database, using exponential fitting. Also, we regard 20 amino-acids as vocabulary, perform the same frequency analysis to the same database and show that amino-acids can be used as biological meaningful words for Zipf's approach. Our analysis suggests that a natural language structure may exist not only in the coding region of DNA but in the non-coding one of DNA.

Original languageEnglish
Pages (from-to)1795-1807
Number of pages13
JournalChaos, Solitons and Fractals
Volume23
Issue number5
DOIs
Publication statusPublished - 2005 Mar

    Fingerprint

ASJC Scopus subject areas

  • Statistical and Nonlinear Physics
  • Mathematics(all)
  • Physics and Astronomy(all)
  • Applied Mathematics

Cite this