Effect of Keyword Length on the Letter Frequency in Encrypted Messages
This Demonstration shows how keyword length affects the letter-frequency distribution of an encrypted message.
Letter frequency analysis is a common tool for breaking ciphers. Ordered from most to least frequent, the letters in typical English text are approximately: e, t, a, o, i, n, s, r, h, l, d, c, u, m, f, p, g, w, y, b, v, k, x, j, q, z (the exact order varies with the sample). The effectiveness of a cipher against this attack can be rated by how well it disguises this "normal" order.
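Monoalphabetic ciphers (like the Caesar cipher) merely relabel the letters: each plaintext letter always maps to the same ciphertext letter, so the frequencies move to different letters but the shape of the frequency distribution does not change.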
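As a rough illustration of this point, the following Python sketch (not the Demonstration's own Wolfram Language code) encrypts a sample sentence with a Caesar shift and compares the sorted letter counts before and after. The names `caesar_encrypt` and `frequency_profile`, the sample text, and the offset of 3 are assumptions chosen for the example.

```python
from collections import Counter
import string

ALPHABET = string.ascii_lowercase

def caesar_encrypt(text, offset):
    """Shift each letter by `offset` positions; non-letters are dropped."""
    return "".join(
        ALPHABET[(ALPHABET.index(ch) + offset) % 26]
        for ch in text.lower()
        if ch in ALPHABET
    )

def frequency_profile(text):
    """Letter counts sorted from most to least common.

    The result ignores WHICH letter has each count, so it captures only
    the shape of the distribution.
    """
    return sorted(Counter(text).values(), reverse=True)

# Hypothetical sample text, used only for illustration.
plaintext = "letter frequency analysis is a common tool used for decryption"
letters_only = "".join(ch for ch in plaintext.lower() if ch in ALPHABET)
ciphertext = caesar_encrypt(plaintext, 3)

# The letters differ, but the sorted counts are identical.
print(letters_only)
print(ciphertext)
print(frequency_profile(letters_only) == frequency_profile(ciphertext))
```

Sorting the counts discards the letter labels, which is why the comparison holds for any offset: a fixed shift is a one-to-one relabeling of the alphabet.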
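Polyalphabetic ciphers (like the Vigenère cipher or the Autokey cipher) use a keyword, so the same plaintext letter can be encrypted to different ciphertext letters depending on its position. The longer the keyword, the flatter the frequency distribution and the harder the message is to break by frequency analysis.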
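The flattening can be made measurable with the index of coincidence, the probability that two letters drawn at random from a text are the same. The Demonstration itself shows bar charts rather than this statistic, so the Python sketch below is only an assumed stand-in: the function names, the sample text, and the keywords are all hypothetical. Typical English text scores about 0.067 and a perfectly flat distribution about 0.038 (1/26), although short samples such as this one give noisy values.

```python
from collections import Counter
import string

ALPHABET = string.ascii_lowercase

def letters(text):
    """Lowercase letters of `text` as a list; everything else is dropped."""
    return [ch for ch in text.lower() if ch in ALPHABET]

def vigenere_encrypt(text, keyword):
    """Vigenere cipher: the keyword repeats to cover the whole message."""
    key = [ALPHABET.index(k) for k in letters(keyword)]
    return "".join(
        ALPHABET[(ALPHABET.index(ch) + key[i % len(key)]) % 26]
        for i, ch in enumerate(letters(text))
    )

def autokey_encrypt(text, keyword):
    """Autokey cipher: the key stream is the keyword followed by the plaintext."""
    plain = letters(text)
    stream = (letters(keyword) + plain)[:len(plain)]
    return "".join(
        ALPHABET[(ALPHABET.index(p) + ALPHABET.index(k)) % 26]
        for p, k in zip(plain, stream)
    )

def index_of_coincidence(text):
    """Probability that two letters picked at random from `text` match."""
    counts = Counter(text)
    n = len(text)
    return sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))

# Hypothetical sample text and keywords, used only for illustration.
sample = (
    "letter frequency analysis is a common tool used for decryption "
    "the longer the keyword the flatter the frequency distribution "
    "and the harder the message is to break"
)
for keyword in ("b", "key", "keyword", ALPHABET):
    v = index_of_coincidence(vigenere_encrypt(sample, keyword))
    a = index_of_coincidence(autokey_encrypt(sample, keyword))
    print(f"keyword length {len(keyword):2d}: Vigenere {v:.3f}, Autokey {a:.3f}")
```

A one-letter keyword reduces the Vigenère cipher to a Caesar shift, so its index of coincidence equals that of the plaintext; longer keywords should generally push the value down toward the flat-distribution figure.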
Contributed by: Erik Mahieu (January 2012)
Open content licensed under CC BY-NC-SA
Snapshot 1: change the offset for the Caesar cipher and see how the letters shift but the distribution remains unaffected
Snapshot 2: a short keyword with Vigenère encryption slightly flattens the frequency distribution
Snapshot 3: a 26-letter keyword with Autokey encryption gives a nearly flat frequency distribution