Page tree

Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.


– The Unicode Standard. 2015. Version 8.0. Edited by Julie D. Allen et al. Unicode Consortium, Mountain View, CA.

2. In phylogenetics

In phylogenetics, a character refers to an individual feature that is used to characterise the taxa (see taxon). For instance, if the taxa are different species of butterflies, the wing span could be a feature, and the colour of the wings another, and each of them would be encoded as a character. A character can take on a number of different states. In the case of the wing colour, the states could be different colours: light blue, dark blue, green, and so on. If the taxa are represented as DNA sequences, the characters correspond to different positions, or loci (singular locus), in the DNA sequences, and the character states are A,T,G,C. A character matrix refers to a table containing a number of character sequences where each row corresponds to a taxon and each column corresponds to a character. A character may also be missing which is often encoded as '?'.

There are various data formats for representing character data, a popular one being the nexus format.