What is 'k' in sequencing?

Question

When a DNA sequence is sequenced, I've only ever dealt with A,T,C,G and N which indicates un-identifiable bases. However, I came across a 'k' recently and I had asked another researcher who gave me an answer for what 'k' represents but I don't quite recall. It can't just be an anomaly in that one sequence file. Any ideas?

K is also the lysine amino acid (and k-mers are something else of course) — Chris_Rands, Dec 29 '19 at 21:28

score 16 · Accepted Answer · edited Dec 28 '19 at 15:57

16

See IUPAC codes:

So, as you can see above, K means "Either G or T".

edited Dec 28 '19 at 15:57

terdon

10,071
5
22
48

answered Dec 28 '19 at 01:52

user6690

196
1
3

M__ · Answer 2 · 2019-12-28T15:04:40.540

It is recommended you learn the degenerate nucleotide code. In sequencing it can signify poor quality sequence data, but in primer design it is useful. R (mutation within purines) and Y (mutation within pyrimidines) are common. K, a purine to pyrimadine or pyrimadine to purine mutation is, in my opinion, rare. I would treat a K mutation with caution and consider the triplet codon around it, i.e. if it is part of a protein gene.

Most phylogeneitcs programs will work with the degenerate nucleotide code, so in theory you can still obtain useful information with it.

What is 'k' in sequencing?

2 Answers2