Path: utzoo!attcan!utgpu!jarvis.csri.toronto.edu!mailrus!cs.utexas.edu!uunet!image.soe.clarkson.edu!dean
From: dean@image.soe.clarkson.edu (Dean Swan)
Newsgroups: comp.dsp
Subject: Re: Wavelet Transform for pitch-shifting, etc.
Message-ID: <1989Oct2.182202.22666@sun.soe.clarkson.edu>
Date: 2 Oct 89 18:22:02 GMT
References: <35148@apple.Apple.COM>
Sender: dean@sun.soe.clarkson.edu (Dean Swan)
Organization: Clarkson University, Potsdam, NY
Lines: 25

From article <35148@apple.Apple.COM>, by malcolm@Apple.COM (Malcolm Slaney):
> In article <22313@sequent.UUCP> rjk@sequent.UUCP (Robert Kelley) writes:
>>I came across an article:
>>Kronland-Martinet, Richard. 1988. "The Wavelet Transform for Analysis, 
>> Synthesis, and Processing of Speech and Music Sounds" Computer Music 
>> Journal 12(4):11-20.
>>It describes a method of signal analysis and synthesis which seems ideally 
>>suited to the pitch-shifting or CD speed-altering task mentioned here 
>>recently.
> 
> I think the Wavelet Transform is a partially developed rehash of two ideas
> that have been around for a long time.  If you are interested in recognition
> and detection than you should look at the Scale Space theory first proposed
> by Andy Whitkin somewhere around 1984 in AI literature.  The other idea
> is often called the Wigner distribution and was proposed in the 50's.  The
> latest reference to it I have seen is an MIT thesis by Riley called something
> like "Time Frequency Representation of Speech."  The thesis was published as
> a book last year.
> 
> Drop me a note if you need better references.
> 
> 						Malcolm Slaney
> 						Speech and Hearing Project
> 						malcolm@apple.com