Path: utzoo!attcan!uunet!ginosko!gem.mps.ohio-state.edu!apple!malcolm
From: malcolm@Apple.COM (Malcolm Slaney)
Newsgroups: comp.dsp
Subject: Re: Wavelet Transform for pitch-shifting, etc.
Keywords: wavelet transform dsp
Message-ID: <35148@apple.Apple.COM>
Date: 30 Sep 89 18:15:34 GMT
References: <6028@jpl-devvax.JPL.NASA.GOV> <89255.105143P85025@BARILVM.BITNET>  <7767@microsoft.UUCP> <89264.171306P85025@BARILVM.BITNET> <9520001@hpsad.HP.COM>  <7813@microsoft.UUCP> <22313@sequent
Organization: Apple Computer Inc, Cupertino, CA
Lines: 23

In article <22313@sequent.UUCP> rjk@sequent.UUCP (Robert Kelley) writes:
>I came across an article:
>Kronland-Martinet, Richard. 1988. "The Wavelet Transform for Analysis, 
> Synthesis, and Processing of Speech and Music Sounds" Computer Music 
> Journal 12(4):11-20.
>It describes a method of signal analysis and synthesis which seems ideally 
>suited to the pitch-shifting or CD speed-altering task mentioned here 
>recently.

I think the Wavelet Transform is a partially developed rehash of two ideas
that have been around for a long time.  If you are interested in recognition
and detection than you should look at the Scale Space theory first proposed
by Andy Whitkin somewhere around 1984 in AI literature.  The other idea
is often called the Wigner distribution and was proposed in the 50's.  The
latest reference to it I have seen is an MIT thesis by Riley called something
like "Time Frequency Representation of Speech."  The thesis was published as
a book last year.

Drop me a note if you need better references.

						Malcolm Slaney
						Speech and Hearing Project
						malcolm@apple.com