Path: utzoo!attcan!uunet!ginosko!gem.mps.ohio-state.edu!apple!malcolm From: malcolm@Apple.COM (Malcolm Slaney) Newsgroups: comp.dsp Subject: Re: Wavelet Transform for pitch-shifting, etc. Keywords: wavelet transform dsp Message-ID: <35148@apple.Apple.COM> Date: 30 Sep 89 18:15:34 GMT References: <6028@jpl-devvax.JPL.NASA.GOV> <89255.105143P85025@BARILVM.BITNET><7767@microsoft.UUCP> <89264.171306P85025@BARILVM.BITNET> <9520001@hpsad.HP.COM> <7813@microsoft.UUCP> <22313@sequent Organization: Apple Computer Inc, Cupertino, CA Lines: 23 In article <22313@sequent.UUCP> rjk@sequent.UUCP (Robert Kelley) writes: >I came across an article: >Kronland-Martinet, Richard. 1988. "The Wavelet Transform for Analysis, > Synthesis, and Processing of Speech and Music Sounds" Computer Music > Journal 12(4):11-20. >It describes a method of signal analysis and synthesis which seems ideally >suited to the pitch-shifting or CD speed-altering task mentioned here >recently. I think the Wavelet Transform is a partially developed rehash of two ideas that have been around for a long time. If you are interested in recognition and detection than you should look at the Scale Space theory first proposed by Andy Whitkin somewhere around 1984 in AI literature. The other idea is often called the Wigner distribution and was proposed in the 50's. The latest reference to it I have seen is an MIT thesis by Riley called something like "Time Frequency Representation of Speech." The thesis was published as a book last year. Drop me a note if you need better references. Malcolm Slaney Speech and Hearing Project malcolm@apple.com