Mathematical and Software Engineering, vol. 4, no. 2, 18–23, 2018. Varεpsilon Ltd, http://varepsilon.com/index.php/mse A note on reducing the computation time for minimum distance and equivalence check of binary linear codes Nikolay Yankov1 and Krassimir Enev2 1Faculty of Mathematics and Informatics Konstantin Preslavski University of Shumen, 9712 Shumen, Bulgaria, e-mail: jankov_niki@yahoo.com, ORCID 0000-0003-3703-5867 2College Dobrich, Konstantin Preslavski University of Shumen, 9712 Shumen, Bulgaria, e-mail: kr.enev@shu.bg Abstract In this paper we show the usability of the Gray code with constant weight words for computing linear combinations of codewords. This can lead to a big improvement of the computation time for finding the minimum distance of a code. We have also considered the usefulness of combinatorial 2-(t, k, 1) designs when there are memory limitations to the number of objects (linear codes in particular) that can be tested for equivalence. Keywords: Classification, Combinatorial design, Linear code 1 Introduction Binary linear codes and self-dual codes in particular are extensively studied for the plethora of connections to communication, cryptography, combinatorial designs, among many. When computing self-dual codes one should be aware that with the increase of the code length the number of codes also rises exponentially. The classification of binary self-dual codes begun in 1972 with [11] wherein all codes of lengths n ≤ 20 are classified. Later Pless, Conway and Sloane classify all codes for n ≤ 30 [7]. Next lengths: 32 is due to Bilous and Van Rees [2], 34 by Bilous [1], 36 by Harada and Munemasa in [9]. Latest development in this area are for length 38 in [5] and for n = 40 due to Bouyukliev, Dzumalieva-Stoeva and Monev in [4]. 18 http://varepsilon.com/index.php/mse jankov_niki@yahoo.com https://orcid.org/0000-0003-3703-5867 kr.enev@shu.bg As length of the code gets bigger the number of codewords rises exponentially and one need efficient algorithms for computing the minimum distance of a linear code, and also efficient ways to check codes for equivalence when there are memory limitations. This paper is organized as follows: In Section 2 we outline an introduction to linear codes, self-dual codes, combinatorial designs and Gray codes. Next, in Section 3, we discuss how a reduction in computation time for minimum distance of linear code with constant-weight Gray code can be achieved. In Section 4 we explain a method for reducing the computation time for code equivalence by the use of combinatorial 2-designs. We conclude in Section 5 with a few final notes. 2 Definitions and preliminaries Let Fq be the finite field of q elements, for a prime power q. A linear [n,k]q code C is a k-dimensional subspace of Fnq . The elements of C are called codewords, and the (Hamming) weight of a codeword v ∈ C is the number of the non-zero coordinates of v. We use wt(v) to denote the weight of a codeword. The minimum weight d of C is the minimum nonzero weight of any codeword in C and the code is called an [n,k,d]q code. A matrix whose rows form a basis of C is called a generator matrix of this code. Let (u,v) ∈ Fq for u,v ∈ Fnq be an inner product in Fnq . The dual code of an [n,k]q code C is C⊥ = {u ∈ Fnq | (u,v) = 0 for all v ∈ C} and C⊥ is a linear [n,n − k]q code. In the binary case the inner product is the standard one, namely, (u,v) = ∑n i=1 uivi. If C ⊆ C⊥, C is termed self-orthogonal, and if C = C⊥, C is self-dual. We say that two binary linear codes C and C′ are equivalent if there is a permutation of coordinates which sends C to C′. In the above definition the code equivalence is an equivalence relation is a binary relation that is reflexive, symmetric and transitive. Denote by Eq(a,b) some function that checks for equivalence all pairs of elements in both sets of linear codes a and b. For more information on codes we encourage the reader to [10]. When working with linear codes it is often needed for certain algorithm to pass trough all (or part) of binary vectors of given length. One way to make the generation efficient is to ensure that successive elements are generated such that they differ in a small, pre-specified way. One of the earliest examples of such a process is the Gray code generation. Introduced in a pulse code communication system in 1953 [8], Gray codes now have applications in diverse areas: analogue-to-digital conversion, coding theory, switching networks, and more. For the past 70 years Gray codes have been extensively studied and currently there are many different types of Gray code. A binary Gray code of order n is a list of all 2n vectors of length n such that exactly one bit changes from one string to the next. A t-(v,k,λ) design D is a set X of v points together with a collection of k-subsets of X (named blocks) such that every t-subset of X is contained exactly in λ blocks. The block intersection numbers of D are the cardinalities of the intersections of any two distinct blocks. 19 3 Reducing computation time for minimum distance of linear code with constant-weight Gray code Assume we have a linear binary [n,k] code C and we need to find its minimum distance d. Denote by G the generator matrix of the code C with rows r1, . . . ,rk. The obvious and direct approach is to compute all codewords of C and find their weight. This means that all 2k linear combinations of t (1 ≤ t ≤ k) of the rows of G must be computed using Algorithm 1. Algorithm 1: The direct approach for (i1 = 1; i1 <= k-t+1; i1++) { for (i2 = i1+1; i2 <= k-t+2; i2++) { for (i3 = i2+1; i3 <= k-t+3; i3++) { ... for (it = itm1+1; it <= k; it++) {body}... }} Then for each of the ( k t ) combination we need to compute t cycles and essentially t operations. Furthermore, in the body of this algorithm we need to find the codeword c ∈C which is a linear combination of those rows of the generator matrix G that are chosen for the current combination, i.e. c = t∑ s=1 ris, which will be represented by t “exclusive or” (xor) operations c = ri1 ⊕ ri2 ⊕ . . .⊕ rit . Our approach is to use Gray code for generating combinations in such a way that each successive combination is generated by the previous one with only two xor operations. Two xor operations are the absolute minimum since, if we have to switch from one combination of t elements to another, one xor will add or remove a position making a t + 1 or a t − 1 combination. In [12] it was proved that the set of ( k t ) -vectors of weight t, when chained according to the ordering on the Gray code Gk, has a Hamming distance of exactly two between every pair of adjacent code vectors. Also in [12] an algorithm for generating the constant-weight code vectors on a Gray code was given. Later in [3] a more efficient recursive algorithm was introduced (Algorithm 2). What we want to do is to find in Gray code Gk those k-tuples that have the same weight t, for example when k = 4 for t = 1 we have: 0000→0001→0011→0010→0110→0111 →0101→0100→1100→1101→1111→1110→1010→1011→1001→1000 and similarly, for t = 2 we have: 0000→0001→0011→0010→0110→0111→0101→0100→1100→1101→ 1111→1110→1010 →1011→ 1001→1000. Note that Algorithm 2 starts with the word 1t0k−t and finishes with 0k−t1t. Example 1: If we need to find all triples in G6 we have a total of 20 triples. We start with 000111 and from Gray code we have the following sequence of positions to change [2, 4], [1, 2], [1, 3], [2, 5], [1, 2], [2, 3], [1, 4], [1, 2], [1, 3], [2, 6], [1, 2], [2, 3], [3, 4], [1, 5], [1, 2], [2, 3], [1, 4], [1, 2], [1, 3]. So the sequence of triples is as follows {1, 2, 3},{1, 3, 4},{2, 3, 4},{1, 2, 4},{1, 4, 5},{2, 4, 5},{3, 4, 5},{1, 3, 5},{2, 3, 5},{1, 2, 5}, {1, 5, 6},{2, 5, 6},{3, 5, 6},{4, 5, 6},{1, 4, 6},{2, 4, 6},{3, 4, 6},{1, 3, 6},{2, 3, 6},{1, 2, 6}. 20 Algorithm 2: Constant t-weight (0 < t ≤ k) Gray code Gt [3] for j = 1 to t do { gj = 1 τj = j + 1 for j = t + 1 to k + 1 do { gj = 0 τj = j + 1 s = k τ1 = k + 1 i = 0 while i