AP04_2web.vp


1 Introduction
In packet speech transmission over a network, part of the

information is lost. If we are to preserve an acceptable quality
of speech, the permissible percentage of losses is limited. The
authors of [1] and other papers admit 1–3 % percentage loss.
As a result, the average network load is limited. The authors
of [2], [3] propose the use of a variable speech packet-en-
coding rate to enable smoothing of the effect of network
overloads on the received speech quality. Paper [4] proposes
the classification of speech segments in accordance with their
structure. Packets belonging to different classes are assigned
different priorities of delivery. When network overloads ap-
pear, packets with lower priority are discarded first. At the
receiving end, regeneration of lost packets is performed.

The articles mentioned above deal mainly with in-
formation transfer based on the encoding of speech signal pa-
rameters. This paper deals with transfer based on waveform
encoding using pulse-code modulation.

Section 2 considers the main principle of loss recovery in
waveform encoding. Section 3 analyses first-order interpola-
tion methods. Section 4 analyses the possibilities of sec-
ond-order interpolation. Section 5 discusses some experi-
mental results on the recovery of phrases and suggest some
future investigations.

2 Basic interpolation principle
For channels with error bursts, the sequence is often sub-

jected to an alternation prior to transmission and is recovered
at the receiving end. In this case the errors are distributed
in a more uniform manner [5]. We will use this principle
for packet speech transmission. The source sequence of the
samples of a signal segment is memorized, and some permu-
tation of the samples is made, which is followed by separation
into packets, and then transfer. At the receiving end a reverse
procedure is performed. If a packet is lost during transmis-
sion, the lost samples are separated by one or several samples
of the source signal. Such a procedure enables the recovery of
losses as a result of correlation.

Interpolation of samples was applied in [6]. A signal is di-
vided into two groups of even and odd samples. Each group is
shaped into a packet, following reverse permutation, and in
the case of the loss of one packet, one source sample appears
between the missing samples. This allows the applicability

extrapolation and first-order interpolation. The “Odd-even”
alternation allows only the correlation of the neighbouring
samples to be used for the recovery. Using the information on
a greater number of samples, the value of the lost samples
can be recovered more accurately. To do this, the samples
of the source signal have to be interchanged on a segment
containing more than two packets. For example, using a block
encoder, the sequence of samples is written into an n × m
matrix column-wise and is read row-wise. Having defined the
length of row n as being equal to the packet length, after the
reading we will obtain m packets. If one of the m packets is
lost, then after the reverse permutation the lost samples will
be separated by m�l samples of the source signal. In this case
interpolation procedures ranging from the zero order and up
to the (m�l)-th order may be applied for the recovery.

3 First-order interpolation
We will assume that not more than one packet is randomly

lost from the sequence of signal samples consisting of packets.
The sequence Xi (i � 1, 2, ...) of the centered signal X(t) is
transmitted. The reception and the reverse permutation fol-
lows. In the case of the loss of one packet, the received signal
Xi differs from Xi only in the points ti � tj, where the samples
are missing. In this case the probability is P(ti � tj) � 1/m. The
transmission error is �i i iX X� � . The mean square error of
the transmission of a sequence consisting of m packets is

� � � �E X X P t t E X X mi i i i j
i

i i( ) ( )�
2 2 2� � � � ��

��
	

�� (1)

If the estimate Xi of the values of the missing samples is
equated to zero, then:

E E X m R mi XX( ) ( ) ( )�0
2 2 0� � , (2)

where RXX(0) is the value of the correlation function for a zero
shift.

In the recovery of the values of Xi on the basis of the
first-order prediction X Xi i�  �� 1. It is usually assumed that
coefficient � � 1. Then, in accordance with (1), the mean
square error will be:

� �� � � �E E X X m R R mi i XX XX( ) ( ) ( )�P12 1 2 2 0 1� � � �� , (3)
where RXX(1) is the value of the correlation function of signal
X(t) with the shift equal to � t.

©  Czech Technical University Publishing House http://ctn.cvut.cz/ap/ 59

Acta Polytechnica Vol. 44  No. 2/2004

Speech Signal Recovery in
Communication Networks
V. Zagursky, A. Riekstinsh

Interpolation approaches to the shape recovery of a speech signal in transmission over packet switched communications networks are
proposed. The samples of signal fragments are mixed and transmitted in correspondence with standard procedure for packet-switched
transmission. After reception a reverse permutation is made. In the case of packet losses missing samples are separated by several samples of
the source signal. Correlation properties of the signal are used for the recovery samples due to first- and second-order non-adaptive and
adaptive interpolation. For the loss of 25 % packets and second order adaptive interpolation a 2–4 % error distribution range has been
achieved.


For the first-order interpolation:
X a X a Xi i i�  � � � � �1 1 1 1 . (4)

For the first-order interpolation it is usually assumed that
a a a� �� �1 1 . For the most commonly used procedure a � 0.5
(linear interpolation). This procedure was referred to in [6]
as non-adaptive, as well as the prediction with � � 1. For
X X Xi i i� �� �0 5 1 1. ( ) the mean square error will be:

� �E R R R mi XX XX XX( ) . ( ) ( ) . ( )� 1
2 1 5 0 2 1 0 5 2� � � . (5)

Let us compare expressions (3) and (5):

� �E E R R mr i XX XX( ) ( ) ( ) ( )� �1
2

1
2 0 2� � � . (6)

Expression (6) yields that interpolation provides a smaller
recovery error, as compared with prediction, since for any
random signals RXX(2) < RXX(0).

The error can be minimized by applying the adaptive ap-
proach, as was demonstrated in [6], by calculating the error
for interpolation with the use of (4) and by determining the
minimal error depending on the coefficients a

�1 and a+1, we
will obtain:

� �a a a r rXX XXopt � � �  �� �
�

1 1
11 1 2( ) ( ) , (7)

where r n R n RXX XX XX( ) ( ) ( )� 0 is the correlation coefficient.
The figure 1 shows the charts of the errors of recovering a

signal which corresponds to the sounds a, d, c, sh. The vertical

axis are laid off, in percent, the value of the normalized mean
square errors � �� E RXX( ) ( )

2 0 � .
The horizontal axis displays the ordinal numbers of the

appropriate signal segment with length n � 128 samples. A
real speech signal was used in the experiments.

A situation was simulated in which every fourth packet is
lost (m � 4). When no recovery is applied, the error at the re-
ceiving end equals 25 %, since from (2) we have � � 1/m. It fol-
lows from the figure that for practically all signal segments
�1.2 < �1.1.

4 Second-order interpolation
The estimate of the values of the lost samples Xi using the

known samples with numbers i � 1, i � 2 will be made using
the expression:

X b X X b X Xi i i i i� � � �� � � �1 1 1 2 2 2( ) ( ) (8)

The normalized mean-square error of the second-order
interpolation will be determined in a way similar to (3).

��2 12 22 2 2
1
2

2
2

1 2 2 4 1 1

2 2 2

� � � � � �

�

( ) ( ) ( )

( ) ( )

b b b b r

b b r

XX

XX �� �4 3 2 41 2 22b b r b r mXX XX( ) ( ) .
(9)

60 ©  Czech Technical University Publishing House http://ctn.cvut.cz/ap/

Acta Polytechnica Vol. 44  No. 2/2004

Fig. 1: Errors of recovering a signal which correspods to the sounds a, d, c, sh
1 – linear interpolation (�1.1)
2 – 1-st order adaptive interpolation (�1.2)
3 – 2-nd order Chebyshev’s interpolation (�2.1)
4 – 2-nd order adaptive interpolation (�2.2)


Like for the first-order interpolation, the procedure
for determining, coefficients b1, b2 may be non-adaptive or
adaptive. For a non-adaptive procedure some known fam-
ily of polynomials can be used, for example, the family
of second-order Chebyshev polynomials. It can be shown that
b1 � 0.667 and b2 � � 0.167. Then (9) is converted to the
following form:

��2 1 1 94 3 11 1 1 56 2
0 44 3 0 056

. . . ( ) . ( )

. ( ) .

� � � �

�

r r

r r
XX XX

XX X �X m( ) .4
(10)

Let us compare (10) and (5) for the normalized error
� �11 1

2 0. ( ) ( )� E Ri XX :

� � � �� �� �11 2 1 1 0 44 2 0 44 3. . ( ) . ( ) . ( )� � � � �r r r mXX XX XX (11)

Analysing (11), it is easy to see that the efficiency of the in-
terpolation is determined by the type of correlation function
of the signal. Specially, �2.1 < �1.1 for the signals for which
the correlation between the adjacent samples is high, and
afterwards it rapidly decreases. This is true for many speech
sounds [7]. However, for hushing sounds and fricatives, the
value of rXX(1) may not be high. Then it is possible that
�1.2 < �1.1. In selecting coefficients in (8) the second-order
adaptive interpolation allows us to take account of the effect
of all values of the correlation function. The formulas for the
optimum values of coefficients b1 and b2 will be obtained by
equating the partial derivatives � � �2 1b and � � �2 2b to zero
in (9). This is illustrated by Fig. 1, which shows the experi-
mental results of signal recovery for 25 % losses for the first-
and second-order non-adaptive and adaptive interpolation.
For the sounds “a” and “d”, the second-order interpolation
provides better results than the first-order interpolation. For
the sounds “c” and “sh” this is true for adaptive interpolation,
while with the use of the second-order Chebyshev interpola-
tion the recovery error increases.

5 Experimental results and
conclusions
Experiments have been made on the recovery of losses in

fused speech. Signal samples are divided into packets, 128
readings each. The packets are combined into groups of
4 packets. Within one group permutations are made so as to
ensure that all 4 packets are interrelated. Each fourth packet is
discarded, following which a reverse permutation and recov-
ery are performed. Use was made of first- and second-order
interpolation – non-adaptive and adaptive. The table shows
the integral error estimates for all recovery procedures for dif-
ferent speech phrases up to 5 sec. in length.

Our investigations testify to the efficiency of approaches
that invo1ve waveform recovery. The first-order adaptive
interpolation yielded results, acceptable in terms of sound

quality, for the loss of 25 % packets. The second-order adap-
tive interpolation yields better results in terms of both sound
quality and root-mean-square error.

The adaptive procedure requires additional processing of
the signal at the receiving end in order to calculate the corre-
lation instants and coefficients, followed by transmission of
the calculated information in the packet. The authors intend
to investigate other approaches – determining the relation
between the interpolation coefficients and the sign character-
istics of the signal, which are easier to determine than the
correlation ones, as well as calculating the characteristics at
the receiving end directly from the signal with losses.

References
[1] Chlamtac I.: “An Ethernet sompatible protocol for real-

-time voice/data integration.” Computer Networks and
ISDN Syst., Vol. 10 (1985), No. 2, p.81–96.

[2] Bially T., Crold B., Seneff S.: “A Technique for Adaptive
Voice Flow Control in Integrated Packet Networks.”
IEEE Trans. on Communic., Vol. COM-28 (March
1980), No. 3, p. 325–333.

[3] Forat V. S., Friedman E. M., Minden G. I.: “Multi-
rate voice coding for load control on CSMA/CD local
computer networks.“ Computer Networks and ISDN Syst.,
Vol. 11 (1986), p. 99–110.

[4] Petr P. W., DaSilva I. A., Forat V. S.: “Priority Discarding
of Speech in Integrated Packet Networks.” IEEE Journal
on Selected Areas in Communic., Vol. 7 (June 1989),
No.5, p. 644–656.

[5] Clark G. C., Cain J. B.: “Error-Correction Coding for
Digital Communications.” New York: Plenum Press,
1982, p. 352.

[6] van den Bos A.: “Complex Electron Wave Reconstruc-
tion Using Parametr Estimation.” IEEE Transact. on
Instrum. and Measurement, Vol. 46, No. 4, p. 826–830.

[7] Rabiner L. R., Schafer R. W.: “Digital processing of
speech signals.” New Jersey 07632: Prentice-Hall, 1978,
p. 436.

V. Zagursky
phone: +371 755 8448
fax: +371 755 5337
e-mail: zagursky@edi.lv, aigars@egle.cs.rtu.lv

Institute of Electronics and Computer Science
of Latvian University

A. Riekstinsh

Institute of Automation and Computer Engineering
of Riga Technical University

14 Dzerbenes str., Riga, LV-1006, Latvia

©  Czech Technical University Publishing House http://ctn.cvut.cz/ap/ 61

Acta Polytechnica Vol. 44  No. 2/2004

Type of interpolation Error distribution range

Linear interpolation 5–12 %

l-st order adaptive interpolation 3–5 %

2-nd order Chebyshev interpolation 5–15 %

2-nd order adaptive interpolation 2–4 %

Table 1: Fused speech recovery error


	Table of Contents
	Biological Systems Thinking for Control Engineering Design 3
	D. J. Murray-Smith

	Computational Fluid Dynamic Simulation (CFD) and Experimental Study on Wing-external Store Aerodynamic Interference of a Subsonic Fighter Aircraft 9
	Tholudin Mat Lazim, Shabudin Mat, Huong Yu Saint

	Dynamics of Micro-Air-Vehicle with Flapping Wings 15
	K. Sibilski
	The Role of CAD in Enterprise Integration Process 22
	M. Ota, I. Jelínek


	Development of a Technique and Method of Testing Aircraft Models with Turboprop Engine Simulators in a Small-scale Wind Tunnel – Results of Tests 27
	A. V. Petrov, Y. G. Stepanov, M. V. Shmakov

	Developing a Conceptual Design Engineering Toolbox and its Tools 32
	R. W. Vroom, E. J. J. van Breemen, W. F. van der Vegte
	Knowledge Support of Simulation Model Reuse 39
	M. Valášek, P. Steinbauer, Z. Šika, Z. Zdráhal

	The Effect of Pedestrian Traffic on the Dynamic Behavior of Footbridges 47
	M. Studnièková


	Control of Systems of Reservoirs with the Use of Risk Analysis 52
	P. Fošumpaur, L. Satrapa

	A coding and On-Line Transmitting System 56
	V. Zagursky, I. Zarumba, A. Riekstinsh
	Speech Signal Recovery in Communication Networks 59
	V. Zagursky, A. Riekstinsh


	Simulation of Scoliosis Treatment Using a Brace 62
	J. Èulík

	Image Analysis of Eccentric Photorefraction 68
	J. Dušek, M. Dostálek

	A Novel Approach to Power Circuit Breaker Design for Replacement of SF6 72
	D. J. Telfer, J. W. Spencer, G. R. Jones, J. E. Humphries
	Numerical Analysis of the Temperature Field in Luminaires 77
	J. Murín, M. Kropáè, R. Fric

	Computer Aided Design of Transformer Station Grounding System Using CDEGS Software 83
	S. Nikolovski, T. Bariæ


	Recycling and Networking 90
	T. Bányai
	Response of a Light Aircraft Under Gust Loads 97
	P. Chudý

	Preliminary Determination of Propeller Aerodynamic Characteristics for Small Aeroplanes 103
	S. Slavík