getsize <sequence file> [options]
The program getsize prints statistics about sequences read from a FASTA file.
When counting letters, alias symbols are first converted to their core symbol.
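The alias handling described above can be illustrated with a minimal Python sketch. The alias table `DNA_ALIASES` and the function `count_core_letters` are hypothetical names chosen for illustration; the actual alias-to-core mapping depends on the alphabet in use and is not specified here.

```python
from collections import Counter

# Hypothetical alias table for a DNA alphabet: each alias maps to its core symbol.
# This mapping is an assumption for illustration, not the tool's real table.
DNA_ALIASES = {"U": "T", "a": "A", "c": "C", "g": "G", "t": "T", "u": "T"}

def count_core_letters(sequence, aliases=DNA_ALIASES):
    """Count letters after replacing each alias symbol with its core symbol."""
    return Counter(aliases.get(ch, ch) for ch in sequence)

# "U" and the lowercase letters are folded into A, C, G and T before counting.
counts = count_core_letters("ACGUacgt")
print(dict(counts))
```

Converting before counting means that each core symbol receives a single total, rather than separate totals for the symbol and each of its aliases.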
<sequence file>: the name of a file of sequences in FASTA format. If the filename is given as "-", the program reads from standard input.
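The "-" convention for standard input can be sketched in Python as follows. This is an illustrative sketch only, not the program's own code; `open_sequence_file` and `read_fasta` are hypothetical helper names, and the parser assumes plain FASTA with `>` header lines.

```python
import io
import sys

def open_sequence_file(name):
    """Return a text stream for `name`, treating "-" as standard input."""
    return sys.stdin if name == "-" else open(name, "r")

def read_fasta(stream):
    """Yield (header, sequence) pairs from a FASTA-formatted text stream."""
    header, chunks = None, []
    for line in stream:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)  # sequence data may span several lines
    if header is not None:
        yield header, "".join(chunks)

# Demonstration on an in-memory stream instead of a real file.
records = list(read_fasta(io.StringIO(">seq1\nACGT\nAC\n>seq2\nGG\n")))
print(records)
```

With helpers like these, the same parsing code serves both a named file and piped input, since both are read as text streams.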
The program writes statistics about the sequences to standard output.
| Option | Parameter | Description | Default Behavior |
|---|---|---|---|
| General Options | | | |
| -f | | Print the letter frequencies as a C array. | |
| -ft | | Print the letter frequencies as a LaTeX table. | |
| -l | | Print only the length of each sequence. | |
| -nd | | Do not print warnings about duplicate sequences. | |
| -x | | Translate DNA in 6 frames (use with -f or -ft) and print the protein letter frequencies in one C array for each frame. | |
| -codons | | As for -x, and also print the frame 0 codon usage as a C array. | |
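To make the 6-frame translation used by -x more concrete, here is a minimal Python sketch. It assumes the standard genetic code, translates unrecognized codons to "X", and orders the frames as three forward offsets followed by three reverse-complement offsets. These choices, and the names `six_frames` and `frequencies`, are assumptions for illustration; the program's own frame numbering and output layout may differ.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def translate(dna):
    """Translate complete codons; unknown codons become 'X' (an assumption)."""
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))

def six_frames(dna):
    """Return translations for 3 forward and 3 reverse-complement frames."""
    dna = dna.upper()
    rev = dna.translate(COMPLEMENT)[::-1]
    return [translate(strand[offset:]) for strand in (dna, rev) for offset in range(3)]

def frequencies(protein):
    """Return the relative frequency of each letter in a translated frame."""
    counts = Counter(protein)
    total = sum(counts.values())
    return {aa: n / total for aa, n in sorted(counts.items())}

frames = six_frames("ATGGCCTAA")
for number, frame in enumerate(frames):
    print(number, frame, frequencies(frame))
```

Each frame gets its own frequency table here, mirroring the one-array-per-frame output that the option description mentions.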