When the impulse response from the sound source to the ear is
defined as h(t), and the part caused by echoes
(removed the effect of the direct sound from the the impulse response)
is he(t).
The echoes of a sound signal s(t)
is given as
se(t) = he(t) * s(t).
Although the impulse response here is unknow, we can give an generalized
approximation which reflects the delay and decay features in the impulse response
ge = k e -(t-t0)/tau.
Here, the delay time t0 and decay factor tau are
chosen to match the most general cases in an ordinary environment.
(It is done by the learning effect in the human auditory systems.)
Thus, the echoes of a subband signal a(f,t)
(envelop of narrow-subband signal or amplitude of short-term Fourier transform)
can be estimated as
ae(f,t) = Max{a(f,t-t') ge(t')}
for 0 < t' < inf.
By using the exponential decay feature of the generalized impulse response,
the echo estimation algorithm can be implemented by a feed-back manner,
where ts is the sampling time interval
and d = e -ts/tou.
The following figure shows an example of echo estimation.
Localization results in an anechoic chamber
Localization results in a normal room