Skip to main content

Hey ya'll! Long time no see. For the past year or two I haven't been doing much recording or spending time online, but the past few weeks have brought major changes to my life, so I hope to be around more. I'm living in Canada again, and I'm hoping to stay here for a while.

Anyways...

I guess because I'm rusty I failed to make sure that all my files in this project were properly converted into the correct sample and bit rates when I first set out. After spending a dozen hours or so working on it I noticed that the notes didn't quite sound right when I pitch corrected them. That's when I realized that my sample rates were all messed up. Now I'm trying to sort this all out and learn from my mistakes.

So my question is this: converting from 44.1khz to 48khz and from 16 bit to 24 bit seems to have raised the pitch 1.3 tones, as best as my ears can tell. But is there a way I can calculate exactly how much the pitch was changed?

Comments

Boswell Fri, 02/12/2016 - 15:31

It's purely ratiometric, Clark.

The ratio 48/44.1 is 1.0884, or 8.84% higher. A semitone is an interval of about 6% (this is by how much the 12th root of 2 is greater than 1), so replaying a 44.1KHz file at 48KHz without any sample rate conversion will raise the pitch by a bit less than 1.5 semitones. The playing time will reduce by the same ratio.

The wordlength does not play a part in this.

ClarkJaman Fri, 02/12/2016 - 18:40

You are indeed a smart cookie Boswell. I understand everything except this:

Boswell, post: 436171, member: 29034 wrote: (this is by how much the 12th root of 2 is greater than 1)

How can I calculate exactly how many semitones I need to lower the pitch? Right now I have it lowered 1.3 semitones and it sounds close but not quite perfect.

Boswell Sat, 02/13/2016 - 04:28

ClarkJaman, post: 436175, member: 42802 wrote: How can I calculate exactly how many semitones I need to lower the pitch? Right now I have it lowered 1.3 semitones and it sounds close but not quite perfect.

Sorry, I thought I mentioned it in my previous post: 1.5 semitones (1.487 if you want it more exactly).

Calculation is really the only useful technique for this type of problem, as you won't get the necessary precision out of any of the standard spectrum analysers. A guitar tuner that reads out in cents would give you a better experimental result, but even then you would have to correct the reading once you had subtracted the whole semitone from it. The required reading is a semitone plus 27.3 cents of the next semitone, not 48.7 cents.

dvdhawk Sat, 02/13/2016 - 10:41

With all of that genius applied to give a definitive answer to your actual perfectly good question, may I ask a few dumb questions of my own as a follow up? ( hopefully you don't mind, because here they are )

What's the end goal of the project? (CD, video, mp3)
What program / process are you using to convert them?
What DAW are you using?

As much as I enjoyed learning the answer, your original post made me wonder if it was worth all the trouble - which depends on what you want to do with it. CD quality is 16-bit 44.1kHz. Often conversion(s) are only going to degrade it, even if it's up-converting to theoretically higher rates. It can't make data where there was none, so the algorithm has to calculate a best guess. There are worse things than working in 16-bit 44.1kHz (CD quality).

And I'm currently using StudioOne (and before that an LE version of ProTools), both could import standard CD files into a session with a higher bit-rate and sample-rate without affecting the pitch. It still has to do the up-conversion during the import process, but it also factors in the exact sample-accurate length of the file ( in terms of minutes:seconds:samples) so it doesn't alter the pitch. So your difficulties might be the difference between opening the files in a different format on the DAW vs. importing the files into a different format session.

Boswell has brilliantly covered the 'how', I'm left wondering about the 'why'.

ClarkJaman Sat, 02/13/2016 - 11:41

dvdhawk, post: 436206, member: 36047 wrote: With all of that genius applied to give a definitive answer to your actual perfectly good question, may I ask a few dumb questions of my own as a follow up? ( hopefully you don't mind, because here they are )

What's the end goal of the project? (CD, video, mp3)
What program / process are you using to convert them?
What DAW are you using?

As much as I enjoyed learning the answer, your original post made me wonder if it was worth all the trouble - which depends on what you want to do with it. CD quality is 16-bit 44.1kHz. Often conversion(s) are only going to degrade it, even if it's up-converting to theoretically higher rates. It can't make data where the was none, so the algorithm has to calculate a best guess. There are worse things than working in 16-bit 44.1kHz (CD quality).

And I'm currently using StudioOne (and before that an LE version of ProTools), both could import standard CD files into a session with a higher bit-rate and sample-rate without affecting the pitch. It still has to do the up-conversion during the import process, but it also factors in the exact sample-accurate length of the file ( in terms of minutes:seconds:samples) so it doesn't alter the pitch. So your difficulties might be the difference between opening the files in a different format on the DAW vs. importing the files into a different format session.

Boswell has brilliantly covered the 'how', I'm left wondering about the 'why'.

Yeah, I totally understand what you mean. I am not actually going to use the converted files in the final product. The final product is a youtube video, which I understand is best done in 24bit with 48khz (correct me if I'm wrong, or if this has changed? ) But the real answer to the question of WHY is that I was also experimenting with getting around the copyright detection system on youtube and just trying to learn by doing. :P

dvdhawk Sat, 02/13/2016 - 11:50

"trying to learn by doing." I can certainly appreciate that! (y)(y)

You're right 48kHz is the preferred sample-rate for video editing. So you're working in a video editor rather than a DAW? Again from first hand experience, Premiere, Final Cut, or even iMovie will both import CD files into a 48kHz project. I'm sure most, if not all, of the other NLE programs will too.

If your video is for educational purposes you should be OK under the fair-use provisions of the law.

Good luck!

ClarkJaman Sat, 02/13/2016 - 14:49

dvdhawk, post: 436213, member: 36047 wrote:
You're right 48kHz is the preferred sample-rate for video editing. So you're working in a video editor rather than a DAW? Again from first hand experience, Premiere, Final Cut, or even iMovie will both import CD files into a 48kHz project. I'm sure most, if not all, of the other NLE programs will too.

Ok I'm glad that hasn't changed. :P I'm using Cubase 8.5pro for the audio, and once the audio is finished I upload it to FCPX.

kmetal Sat, 02/13/2016 - 18:58

dvdhawk, post: 436206, member: 36047 wrote: With all of that genius applied to give a definitive answer to your actual perfectly good question, may I ask a few dumb questions of my own as a follow up? ( hopefully you don't mind, because here they are )

What's the end goal of the project? (CD, video, mp3)
What program / process are you using to convert them?
What DAW are you using?

As much as I enjoyed learning the answer, your original post made me wonder if it was worth all the trouble - which depends on what you want to do with it. CD quality is 16-bit 44.1kHz. Often conversion(s) are only going to degrade it, even if it's up-converting to theoretically higher rates. It can't make data where the was none, so the algorithm has to calculate a best guess. There are worse things than working in 16-bit 44.1kHz (CD quality).

And I'm currently using StudioOne (and before that an LE version of ProTools), both could import standard CD files into a session with a higher bit-rate and sample-rate without affecting the pitch. It still has to do the up-conversion during the import process, but it also factors in the exact sample-accurate length of the file ( in terms of minutes:seconds:samples) so it doesn't alter the pitch. So your difficulties might be the difference between opening the files in a different format on the DAW vs. importing the files into a different format session.

Boswell has brilliantly covered the 'how', I'm left wondering about the 'why'.

That was my first thought. Why not just work w what's there. Any processing, effects, conversion is always at a price analog or digital, it's always a matter of is it worth it, and diminishing returns.

ClarkJaman, post: 436211, member: 42802 wrote: Yeah, I totally understand what you mean. I am not actually going to use the converted files in the final product. The final product is a youtube video, which I understand is best done in 24bit with 48khz (correct me if I'm wrong, or if this has changed? ) But the real answer to the question of WHY is that I was also experimenting with getting around the copyright detection system on youtube and just trying to learn by doing. :p

Last time a checked sound quality didn't matter to YouTube viewers. ;)

John Santos Wed, 03/09/2016 - 05:18

Boswell, post: 436171, member: 29034 wrote: It's purely ratiometric, Clark.

The ratio 48/44.1 is 1.0884, or 8.84% higher. A semitone is an interval of about 6% (this is by how much the 12th root of 2 is greater than 1), so replaying a 44.1KHz file at 48KHz without any sample rate conversion will raise the pitch by a bit less than 1.5 semitones. The playing time will reduce by the same ratio.

The wordlength does not play a part in this.

Wow. That's an awesome engineer-y way to explain this.

A bit difficult to understand for me though. Haha :cry:

paulears Thu, 03/10/2016 - 15:33

Just a thought - you've changed the pitch AND the time, so running the thing through a pitch shifter will still sound a bit wrong because the speed is wrong too. Why don't you just re-sample them. Bring them in at their recorded sample rate, the output it at the new one. Pitches and tempo will then be fine!

paulears Thu, 05/19/2016 - 01:14

Just thinking about what you said about pitch and Youtube's copyright system. I doubt that will work. You tube is pretty good at detecting well known songs even in different keys and with different tempo settings. I have no idea how it does it, but it's flagged a couple of ours recently and while nice to think it's close enough to be detected, still a bit of a pain. It even spotted a few seconds of a popular song being played right in the corner of the frame on a screen in the room with playback through a tiny speaker. So loads of people talking in a busy room, and the TV in the background was playing a well known song, that you could just hear - and it found that! Not bad programming by somebody.

ClarkJaman Thu, 05/19/2016 - 10:53

paulears, post: 438555, member: 47782 wrote: Just thinking about what you said about pitch and Youtube's copyright system. I doubt that will work. You tube is pretty good at detecting well known songs even in different keys and with different tempo settings. I have no idea how it does it, but it's flagged a couple of ours recently and while nice to think it's close enough to be detected, still a bit of a pain. It even spotted a few seconds of a popular song being played right in the corner of the frame on a screen in the room with playback through a tiny speaker. So loads of people talking in a busy room, and the TV in the background was playing a well known song, that you could just hear - and it found that! Not bad programming by somebody.

Actually, it is quite easy to fool Youtube's copyright detection. Just shift the pitch and time a little bit. Those examples you gave are probably not the copyright detection software, but rather a human being who flagged the video as a violation of copyright. Did the video get flagged as soon as it uploaded, or was it later on, after it got some views?

ClarkJaman Thu, 05/19/2016 - 12:48

That is strange. Perhaps the programming is able to identify the copyrighted content that is underlayed behind other audio, but not content that has been shifted in terms of pitch or timing. Perhaps the 24 hour delay indicates that the computer program was unsure whether or not the audio was copyrighted and it had to be double checked by a human being.