damnthefilibuster, damnthefilibuster@lemmy.world

Instance: lemmy.world
Joined: 3 years ago
Posts: 2
Comments: 179


Posts and Comments by damnthefilibuster, damnthefilibuster@lemmy.world



Yes, but…

The reason iOS asks you to read specific things out loud is a thing called alignment - basically, it needs to map exact sounds to exact phonemes. So if it’s making you say “apple”, it’s recording how you say “ae”, “puh”, and “l”.

When you expect it to take any random file, it needs to know specific things: what the text is, what the time frames are, and when the speaker spoke in a normal voice vs. a higher-pitched or faster one. Working out which stretch of audio matches which piece of text is called “forced alignment”, and that’s a well-studied field, but it’s not implemented on mobile phones as yet.

An alternative for you, if you want, say, Snoop Dogg to be your Siri voice, is to use an AI-generated Snoop Dogg voice to say all the things that the Personal Voice feature expects and just play them out loud from your computer when setting up Personal Voice. This assumes you use one of the many paid Snoop Dogg AI voice companies out there.

Another way to get the voice you want is to take a recording of someone, write out the text of what they said, and then feed both to a tool like the Gentle Forced Aligner - https://github.com/strob/gentle

This will give you enough material to take to AWS or Microsoft’s AI voice creator tools to create an AI voice. Then you can use that AI voice to say the same things that iOS Personal Voice wants you to say and, again, play the matching recording for each prompt when setting up Personal Voice.
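For the Gentle step, here’s a rough Python sketch of pulling word timings out of the aligner’s JSON output. Treat it as a sketch under assumptions, not the official way to do it: I’m assuming the result has a top-level `words` list whose entries carry `word`, `case`, `start` and `end` fields, with `case` set to `success` for words it managed to align. The sample data and the `aligned_words` helper are made up for illustration, so check them against what your Gentle install actually returns.

```python
import json


def aligned_words(gentle_result):
    """Return (word, start_sec, end_sec) for each word marked as aligned.

    Assumption: `gentle_result` is the parsed JSON from Gentle, shaped like
    {"words": [{"word": ..., "case": ..., "start": ..., "end": ...}, ...]}.
    """
    timings = []
    for entry in gentle_result.get("words", []):
        # Assumption: words that could not be aligned have a "case" other
        # than "success" and may lack "start"/"end", so skip them.
        if entry.get("case") == "success":
            timings.append((entry["word"], entry["start"], entry["end"]))
    return timings


# Hypothetical sample output, hand-written for this example.
sample = json.loads("""
{
  "transcript": "apple pie",
  "words": [
    {"word": "apple", "case": "success", "start": 0.5, "end": 0.92},
    {"word": "pie", "case": "not-found-in-audio"}
  ]
}
""")

for word, start, end in aligned_words(sample):
    print(f"{word}: {start:.2f}s to {end:.2f}s")
```

Once you have the timings, you can cut the recording into per-word or per-sentence clips, which is the kind of labelled material the voice creator tools want.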

All of this assumes that’s your end goal - Snoop Dogg as your Siri. If it’s anything else, please let me know and I’ll help you with what you’re trying to achieve.


Yes that’s the one! And the way it starts is just so funny!

Oh you run EuroGraphic Novels? I just found that today! Thank you for Tintin Tuesdays. Just such a star, that fella!




Who is the bitch in your commentary, OP?




Compounding fines please! And per instance sold. If the fine doesn’t touch 1 billion CAD then it won’t change things.




Reminds me of that game Townscaper.



I was gonna say… OP must be rich to afford Photoshop!


YouTube costs, what, $25 per month to not show ads?

I reckon Facebook and Instagram would be in the same range.




It’d be so funny if the next scene is all these baddies running away because it’s Ip Man and Bruce Lee who were coming in.


