GramAlign

Description

GramAlign is a time-efficient progressive multiple sequence alignment (MSA) algorithm. Its novelty lies in the sequence distance estimation step, in which distances between sequences are estimated from the natural grammar present in nucleotide and amino acid sequences.
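
To make the idea of a grammar-based distance concrete, here is a minimal sketch; it is not GramAlign's actual implementation. It assumes that the "grammar" of a sequence can be approximated by a Lempel-Ziv-style dictionary parse (LZ78 is used here for simplicity) and that the distance is a normalized count of how many new phrases one sequence adds when appended to the other. The function names `lz78_phrase_count` and `grammar_distance`, the choice of LZ78, and the normalization are all illustrative assumptions; the publication listed under Related Publications describes the method GramAlign itself uses.

```python
def lz78_phrase_count(seq: str) -> int:
    """Count the phrases in an LZ78-style parse of seq.

    Each phrase is the shortest prefix of the remaining input that has
    not yet been seen as a phrase. Illustrative stand-in for a grammar.
    """
    seen = set()
    phrase = ""
    count = 0
    for ch in seq:
        phrase += ch
        if phrase not in seen:
            seen.add(phrase)
            count += 1
            phrase = ""
    if phrase:
        # Trailing phrase that already exists in the dictionary.
        count += 1
    return count


def grammar_distance(a: str, b: str) -> float:
    """Hypothetical normalized distance based on phrase counts.

    Measures how much the phrase count grows when one sequence is
    appended to the other, relative to the larger individual count.
    """
    ca = lz78_phrase_count(a)
    cb = lz78_phrase_count(b)
    if max(ca, cb) == 0:
        return 0.0  # both sequences are empty
    cab = lz78_phrase_count(a + b)
    cba = lz78_phrase_count(b + a)
    return max(cab - ca, cba - cb) / max(ca, cb)


if __name__ == "__main__":
    print(grammar_distance("ACGTACGTACGT", "ACGTACGTTCGT"))
    print(grammar_distance("ACGTACGTACGT", "GGGGCCCCAAAA"))
```

The intuition is that when two sequences share many substrings, appending one to the other introduces few new phrases, so the distance tends to be small. No alignment is computed at this stage, which is what makes this kind of distance estimate inexpensive.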

The program's README is available online.

Run GramAlign

To try GramAlign, use the online version of the program.

Related Publications

Grammar-based distance in progressive multiple sequence alignment
D. J. Russell, H. H. Otu, and K. Sayood
BMC Bioinformatics 2008, 9:306

Downloads

  • GramAlign Version 3.0 Source (.zip) - Updated release version 9/25/12.
  • Large Protein Data Set (.zip) - The seven large protein data sets we used to compare GramAlign against other MSA programs in our paper. The archive contains both aligned and unaligned FASTA files generated using Rose 1.3.
  • Large DNA Data Set (.zip) - The seven large DNA data sets we used to compare GramAlign against other MSA programs in our paper. The archive contains both aligned and unaligned FASTA files generated using Rose 1.3.
  • Sample Nucleotide Data Set (.zip) - This archive contains a simple set of 10 mitochondrial DNA sequences used during development.
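
The data sets above are distributed as FASTA files. As a convenience, the following minimal sketch shows one way to load such a file into memory for inspection. The function name `read_fasta` and the file name in the usage comment are hypothetical, and the sketch assumes plain FASTA text in which header lines start with `>`; gap characters in the aligned files are kept as-is.

```python
from typing import Dict, Iterable, List


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into a mapping of header -> sequence.

    Header lines start with '>'. Sequence lines that follow a header
    are concatenated. Blank lines are ignored.
    """
    chunks: Dict[str, List[str]] = {}
    name = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].strip()
            chunks[name] = []
        elif name is not None:
            chunks[name].append(line)
    return {header: "".join(parts) for header, parts in chunks.items()}


# Usage (hypothetical file name):
# with open("sample.fasta") as handle:
#     records = read_fasta(handle)
#     print(len(records), "sequences loaded")
```

Note that this sketch keeps only the last record if two headers are identical, which is acceptable for a quick look at the data but may not suit every workflow.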