Path: utzoo!attcan!utgpu!jarvis.csri.toronto.edu!mailrus!cs.utexas.edu!uunet!image.soe.clarkson.edu!dean From: dean@image.soe.clarkson.edu (Dean Swan) Newsgroups: comp.dsp Subject: Re: Wavelet Transform for pitch-shifting, etc. Message-ID: <1989Oct2.182202.22666@sun.soe.clarkson.edu> Date: 2 Oct 89 18:22:02 GMT References: <35148@apple.Apple.COM> Sender: dean@sun.soe.clarkson.edu (Dean Swan) Organization: Clarkson University, Potsdam, NY Lines: 25 From article <35148@apple.Apple.COM>, by malcolm@Apple.COM (Malcolm Slaney): > In article <22313@sequent.UUCP> rjk@sequent.UUCP (Robert Kelley) writes: >>I came across an article: >>Kronland-Martinet, Richard. 1988. "The Wavelet Transform for Analysis, >> Synthesis, and Processing of Speech and Music Sounds" Computer Music >> Journal 12(4):11-20. >>It describes a method of signal analysis and synthesis which seems ideally >>suited to the pitch-shifting or CD speed-altering task mentioned here >>recently. > > I think the Wavelet Transform is a partially developed rehash of two ideas > that have been around for a long time. If you are interested in recognition > and detection than you should look at the Scale Space theory first proposed > by Andy Whitkin somewhere around 1984 in AI literature. The other idea > is often called the Wigner distribution and was proposed in the 50's. The > latest reference to it I have seen is an MIT thesis by Riley called something > like "Time Frequency Representation of Speech." The thesis was published as > a book last year. > > Drop me a note if you need better references. > > Malcolm Slaney > Speech and Hearing Project > malcolm@apple.com