04-18-2007 01:41 PM
Still buried in regular work, no time to tinker. I just want to be sure I understand *exactly* what is meant by your phrase, "the sub-segments have to be centered on the pitch peaks."
Here's what I *think* it means.
1. A given chunk of 512 samples (perhaps with a little +/-) is characterized with a given dominant frequency. This would be considered THE pitch for that whole chunk.
2. That pitch / frequency will further determine the lengh of the sub-segments. The sub-segments will initially be defined such that neighboring sub-segments overlap one another by 50%. At this point, the sound data hasn't necessarily been changed. The idea of overlap is handled by defining the range of indices for each of the subsegments, and those ranges overlap.
3. Subsegments are either deleted or duplicated. Then those subsegments are placed such that they span the 512-sample range. The amount of "overlap" may be anything from 0 to 75%.
4. Now the sound data will be "faded" across the subsegments. A Hanning window or other appropriate shaping function will be applied to each subsegment.
5. A natural result of this Hanning window (or a Triangle window) would be that the sound is attenuated to 0 at the edges of the subsegment, but is passed through unchanged in the middle. Thus the amplitude peak can be expected to occur in middle of the subsegment.
So it seems that the requirement that "the sub-segments have to be centered on the pitch peaks" will happen kinda automatically during the fade / merge. Is this right?
-Kevin P.
04-18-2007 01:47 PM - edited 04-18-2007 01:47 PM
Message Edited by Milqman on 04-18-2007 01:50 PM
04-18-2007 01:51 PM
04-18-2007 02:15 PM
04-18-2007 04:49 PM