Optimal erasure codes Erasure code




1 optimal erasure codes

1.1 parity check
1.2 polynomial oversampling

1.2.1 example: err-mail (k = 2)
1.2.2 general case
1.2.3 real world implementation







optimal erasure codes

optimal erasure codes have property k out of n code word symbols sufficient recover original message (i.e., have optimal reception efficiency). optimal erasure codes maximum distance separable codes (mds codes).


optimal codes costly (in terms of memory usage, cpu time, or both) when n large. except simple schemes, practical solutions have quadratic encoding , decoding complexity. in 2014, lin et al. gave approach



o
(
n
log

n
)


{\displaystyle o(n\log n)}

operations.


parity check

parity check special case n = k + 1. set of k values



{

v

i



}

1

i

k




{\displaystyle \{v_{i}\}_{1\leq i\leq k}}

, checksum computed , appended k source values:








v

k
+
1


=




i
=
1


k



v

i


.


{\displaystyle v_{k+1}=-\sum _{i=1}^{k}v_{i}.}



the set of k + 1 values



{

v

i



}

1

i

k
+
1




{\displaystyle \{v_{i}\}_{1\leq i\leq k+1}}

consistent regard checksum. if 1 of these values,




v

e




{\displaystyle v_{e}}

, erased, can recovered summing remaining variables:








v

e


=




i
=
1
,
i

e


k
+
1



v

i


.


{\displaystyle v_{e}=-\sum _{i=1,i\neq e}^{k+1}v_{i}.}



polynomial oversampling
example: err-mail (k = 2)

in simple case k = 2, redundancy symbols may created sampling different points along line between 2 original symbols. pictured simple example, called err-mail:


alice wants send telephone number (555629) bob using err-mail. err-mail works e-mail, except



instead of asking bob acknowledge messages sends, alice devises following scheme.





bob knows form of f(k)



f
(
i
)
=
a
+
(
b

a
)
(
i

1
)


{\displaystyle f(i)=a+(b-a)(i-1)}

, , b 2 parts of telephone number. suppose bob receives d=777 , e=851 .



bob can reconstruct alice s phone number computing values of , b values (f(4) , f(5)) has received. bob can perform procedure using 2 err-mails, erasure code in example has rate of 40%.


note alice cannot encode telephone number in 1 err-mail, because contains 6 characters, , maximum length of 1 err-mail message 5 characters. if sent phone number in pieces, asking bob acknowledge receipt of each piece, @ least 4 messages have sent anyway (two alice, , 2 acknowledgments bob). erasure code in example, requires 5 messages, quite economical.


this example little bit contrived. generic erasure codes work on data set, need other f(i) given.


general case

the linear construction above can generalized polynomial interpolation. additionally, points computed on finite field.


first choose finite field f order of @ least n, power of 2. sender numbers data symbols 0 k − 1 , sends them. constructs (lagrange) polynomial p(x) of order k such p(i) equal data symbol i. sends p(k), ..., p(n − 1). receiver can use polynomial interpolation recover lost packets, provided receives k symbols successfully. if order of f less 2, b number of bits in symbol, multiple polynomials can used.


the sender can construct symbols k n − 1 on fly , i.e., distribute workload evenly between transmission of symbols. if receiver wants calculations on fly , can construct new polynomial q, such q(i) = p(i) if symbol < k received , q(i) = 0 when symbol < k not received. let r(i) = p(i) − q(i). firstly know r(i) = 0 if symbol < k has been received successfully. secondly, if symbol ≥k has been received successfully, r(i) = p(i) − q(i) can calculated. have enough data points construct r , evaluate find lost packets. both sender , receiver require o(n (n − k)) operations , o(n − k) space operating on fly .


real world implementation

this process implemented reed–solomon codes, code words constructed on finite field using vandermonde matrix.








Comments

Popular posts from this blog

Fuji List of motion picture film stocks

The Missionaries and the Congo Congo Free State propaganda war

Discography Tommy Denander