PDF (letter size)
PDF (legal size)

Objects in DNA analysis

Nasser M. Abbasi

sometime in 1997   Compiled on May 23, 2020 at 5:15am

This is a description of objects used in DNA/Genomics software development. I need to first come up with such a list, and the relationship between them to base any further software development of a sw library.

Each object is described and listing of what other objects it contains and what attributes the objects has.

Each object will be represented as a class. The object attributes will help to define the access methods of the object. 1. Chromosome.

contains: contig objects. one DNA Molecule


  1. Enuchromatice size, in kb.
  2. number of contigs it contains.
  3. number of bases.
  4. Name or ID of chromosom (1, 2, ... Y, X).
2. contig.

contains: physical position object, cytogenetic position object.


  1. Name or ID. such as NT_00039
  2. size (in kb in bases)
3. physical position of contig in the chromosom.

Attributes (these are estimates from RH map positions):

  1. start position. number
  2. end position. number
4. Cytogenetic position of contig in the chromosom.

Attributes (based on annotation from GenBank entry)

  1. start id. such as 1p34.3
  2. end id. such as p36.13
5. DNA molecule (or DNA sequence)

contains number of Gene objects, contains one sequence of DNA base pairs objects.

  1. number of genes.
  2. number of base pairs.
6. Gene

contains number of exons and number of introns.

  1. number of exons.
  2. number of introns.
7. exon. a region in a gene that is a coding region for protein. Contains number of codons, and a DNA sequence.
  1. size in bases.
  2. number of codons.
  3. start position in gene
  4. end position in gene.
8. intron. a region in a gene that is a not a coding region for protein. Contains a DNA sequence
  1. size in bases.
  2. start position in gene
  3. end position in gene.