Cryptology - I: Vigenere-Based Systems

Cryptology-I: § 2.3: Vigenere-Based Systems.

Instructors: R.E. Newman-Wolfe and M.S. Schmalz

The production of ciphertext by the one-time pad and other such manual devices, while intuitively attractive and efficient in the field, does not lend itself to mechanization with the technologies that were available shortly after World War I. For example, consider what one could construct with components such as relays, electromagnets, primitive switching equipment, and sophisticated gearing and mechanical transmission devices (similar to miniature automobile transmissions). To mechanize the production of ciphertext, a family of devices called rotor machines was invented, which implement Vigenere ciphers with long periods. Two of the best-known instances of rotor machines are the Hagelin Machine, a commercial device, and the series of rotor machines generically called the Enigma Machine, which were employed by the German military in World War II. It is interesting to note that a similar predecessor of Enigma was invented in Germany by Arthur Scherbius and Arvid Damm in the 1920s, then later patented in the United States in 1928 [-].

The cracking of the Enigma code is has been said to be the most important historical contribution of cryptanalysis [-]. It is well known that the efforts of the Bletchley Park cryptanalysis team (also called "crippies", who were led in part by Alan Turing), directly resulted in the saving of at least tens of thousands of lives and the shortening of World War II by perhaps several years. An excellent review of this period in cryptology is given in Reference [-], with supplemental material in References [-], [-], and [-].

2.3.1. General Concepts of Rotor Machines.

Definition. A generalized rotor machine (GRM, as conceived by Scherbius) is an electromechanical device that has a keyboard, a series of rotors, and a set of lamps that are used as display devices. Given an alphabet F, each rotor is a wheel having |F| possible positions that implements a bijection T_j : P × F -> F, where P denotes a set of rotor positions and j = 1..n . Let a denote a plaintext message having m symbols. If p_i : P -> P denotes a position function of the i-th rotor, then the rotor machine's output is given by

T_R(a(i)) = T_n(p_n-1,T_n-1(p_n-2,T_n-2(... T₂(p₁,T₁(p₀,a(i))) ... ))) , idomain(a) , (I)
where a(i) denotes a symbol input on the keyboard, p₀ denotes the initial position of one or more rotors, and T_R(a(i)) is displayed by the lamp that illuminates a given character on the displayed alphabet F, as shown in Figure 1.

Figure 1. Schematic rotor machine with n = 3 rotors and alphabet F = {A,B,C,D}.
Here, the keyboard emits an "A", which is transformed by the rotors into a "C" (Rotor #1), then a "B" (Rotor #2), then a "C" (Rotor #3). Because the output of the rotors is a "C", the corresponding lamp is activated to display the letter "C". At the next character of the message, Rotor #3 will move ahead one character, which may (or may not) trigger the movement of Rotor #2.
Remark. In the WWII era, rotors were typically constructed of Bakelite or similar dielectric material, with wires embedded inside the dielectric. A wire implemented a given map between two symbols, where the rotor map was a bijection.
Observation. If the rotors of a GRM rotate at the same speed and maintain constant angular offset between adjacent rotors, then the GRM implements a Caesar cipher. This can be proven by noting that
Remark. In order for the rotor machine to implement a long-period Vigenere cipher, the rotors must have different rotational speeds, such that an n-rotor machine has a maximal period |F|ⁿ. That is, the position of the i-th rotor depends on the (i+1)-th rotor's position. This is the customary serial (odometer-like) gearing of most Enigma machines. Since there are no fixed rules for the way a rotor machine must be geared, many variations are possible, as discussed in the following section.

2.3.2. The ENIGMA Machine.

The German Enigma apparently began as a more-or-less standard rotor machine [-] with three rotors. However, requirements of increased security brought on by early phases of the war in Europe (1939-1942) dictated an increased number of rotors. In order to increase the effective number of rotors without drastically increasing weight and power consumption (important considerations for field operations), the developers of Enigma added a reflector, which routed the rotor machine's output back through the rotors, but by a different path than that shown in Figure 1. When the rotor gearing was chosen properly, this effected a doubling of the number of rotors and a squaring of the size of the search space associated with cryptanalysis. That is, instead of a maximal period |F|ⁿ, it was possible in certain circumstances to achieve a maximal effective period of |F|²ⁿ⁺¹.

A further addition was the Steckerboard, a manual plugboard not unlike a small telephone switchboard of the time. The Steckerboard first implemented a substitution, which Enigma's developers thought would render Enigma secure. Near the end of the war, there was an attempt to implement a transpostion using the Steckerboard, which was a difficult goal due to the requirement of buffer memory (then available using only relays or mercury delay lines). The Enigma machine developers thought this would render the machine resistant to all cryptanalytic attacks. In the more usual Enigma machine configuration, with the reflector in place, not only were the number of rotors effectively doubled, but the Steckerboard transposition was inverted at the end of the encryption sequence. An Enigma-like rotor machine is shown in Figure 2.

Algorithm. Using the notation developed in the preceding section (i.e., T for the rotor transform and p for the rotor position with p_x for the reflector position), and adding the reflector substitution R : F -> F, we can express Enigma's encryption function as
e_E(a(i)) = V₁(p₂,V₂(p₃,V₃(... V_n(p_x,R(p_n,T_n(p_n-1,T_n-1(p_n-2,T_n-2(... T₂(p₁,T₁(p₀,a(i)) ... ) ,
where idomain(a) and V_j inverts T_j for a given rotor position, with j = 1..m. Note that the reflector must not map any symbol to itself (e.g., "A" |-> "A"), since that would cause retracing of the encryption path, which would result in no change to the symbol that was encrypted along the forward path.
Remark. Adding the Steckerboard substitution causes e_E to be perturbed as
T_E(a(i)) = S^-1(e_E(S(a))) ,
where X = domain(a) and S : F -> F denotes the Steckerboard substitution. If the Steckerboard was to implement a transposition of form S : X -> X, then the preceding equation would become
T_E(a(i)) = e_E(a(S(i)))[S^-1(i)] ,
and encryption/decryption would be applied to blocks of |X| or fewer symbols.

Figure 2. Schematic Enigma machine with n = 3 rotors and alphabet F = {A,B,C,D},
where dotted lines denote the return path following reflection, and the Steckerboard implements a substitution that includes "B" |-> "A" and "C" |-> "D".
Observation. Enigma's keyspace was parameterized by
However, the Steckerboard that was implemented as a substitution had minimal effect, since the inverse Steckerboard was applied to the rotors' output. Thus, it was the period of the cipher as generated by the rotors that caused the cryptanalytic search space complexity to be high.
Remark. If the rotors do not move in relation to each other as the wheels of an odometer (e.g., rotor n moves once per input character, rotor n-1 moves once per rotation of rotor n-1, etc.), then the effective period of the Vigenere cipher implemented in the rotor machine can be less than the theoretical maximum. In such cases, the pattern of application of the Caesar cipher implemented in a given rotor may be less regular. This is particularly true when the gear ratios between adjacent rotors are comprised of prime numbers. Such facts are important in cryptanalysis, as follows.

2.3.3. Cryptanalysis of Rotor Machines.

The preceding discussion could lead one to surmise that cryptanalysis of the GRM or Enigma machine may not be as difficult as the mechanical complexity of the machine may indicate. In order to understand the associated techniques, let us recall some concepts from group theory.

Definition. Let A be a set with n members, and let S = { | : A -> A } denote a set of permutations on A.
Lemma. G = (S,o) is a group, where o denotes functional composition.
Question. Is G an Abelian group? Answer: No, because composition is not commutative.
Remark. The fact that G is not an Abelian group is important in practice, since this means that different rotors cannot be interchanged while preserving a given encryption transform. Additionally, the rotor initial position and current position become nontrivial considerations.
Observation. Feasible techniques for cryptanalysis of rotor machines include (a) brute-force methods, (b) the Kasiski attack, and (c) maximum-likelihood estimation (MLE) of the rotor configuration.
- Brute-force attacks utilize multiple rotor machines that are connected to effect parallel decryption of ciphertext using different rotor settings per machine. The Polish cryptanalysts who successfully attacked the early Enigma machines used this method in a configuration called a Bombe, because the clicking of the rotors sounded like a ticking time bomb. The output of each candidate decryption is scanned for well-known words or for groups of symbols that are expected to be in the plaintext (as determined from traffic analysis, semantic analysis, or n-gram based statistical analysis). For example, one of Hitler's officers would start his daily code transmission with a standard political greeting. Additionally, by comparing results from each day's candidate decryptions, key changes and rotor initial positions could be predicted a priori.
- The Kasiski attack is useful for determining the period of rotor configurations whose method of interaction (e.g., gearing) is unknown. This technique is not required for odometer-like (hierarchically-geared) rotor drives, since the period of such machines is |F|ⁿ. However, the Kasiski attack is not useful when the cipher period is long in relation to the plaintext (input) size. In such cases, the n-grams used as markers for the Kasiski test do not repeat sufficiently to furnish useful information.
- Semi-automatic cryptanalysis via MLE techniques is based on the following three steps:
The goal of this process is to produce rotor machine adjacency matrices that describe the transform which the rotor machine implements. The following theory is illustrative.
Assumption. Let the structure of a rotor over an alphabet F = {A,B,C,D,E} be as shown in Figure 3, below. The rotor transform T_r: F -> F can be expressed in terms of the graph
G(T_r) = {(A,C),(B,A),(C,E),(D,D),(E,B)}.

Figure 3. Schematic diagram of a rotor.
Definition. Given an alphabet F indexed by the function h : F -> , the rotor transform T_r: F -> F has an adjacency matrix representation M denoted by
M_G(T_r) = {((i,j),M(i,j)) : M(i,j) = 1 if (h^-1(i),h^-1(j)) p₂(G)
and zero otherwise, where i,j }.
Example. The adjacency matrix M_G of the rotor transform illustrated schematically in Figure 3 is shown in tabular form in Figure 4.

Figure 4. Adjacency matrix of the rotor in Figure 3.
Observation. If M_G(T_r) is converted to a real-valued matrix, then we have a basis for an optimization of M_G to yield a Boolean matrix similar to that shown in Figure 4. We begin by starting with the assumption of equiprobable outcomes, then perturb the associated numerical representation by small random values to seed the optimization process. In the preceding example, the matrix M_G would have weights of value 1/|F| = 1/5 = 0.2, perturbed by a small random value. For example, if single precision arithmetic is employed, then the random value would be in the range [10^-4,10^-6].
Algorithm. Given the preceding theory and observation, we are now able to address the problem of semi-automatically determining the rotor machine's adjacency matrices, and thereby guessing the rotor configuration. The following steps pertain.
Be aware that the MLE process usually does not produce a perfect decryption, due to quantization error, computational errors, and erroneous initial assumptions. However, with practice (starting with a one-rotor machine over a very small alphabet), you will be able to obtain reasonably efficient guesses at rotor machine configurations.