<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
> <channel><title>Comments on: Synthetic Speech in Flash</title> <atom:link href="http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/feed/" rel="self" type="application/rss+xml" /><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/</link> <description>Friend to analog and digital alike</description> <lastBuildDate>Mon, 17 Oct 2011 01:32:57 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.3.1</generator> <item><title>By: slegectecaula</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7797</link> <dc:creator>slegectecaula</dc:creator> <pubDate>Fri, 11 Dec 2009 21:05:49 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7797</guid> <description>Lots of of folks talk about this matter but you wrote down really true words!</description> <content:encoded><![CDATA[<p>Lots of of folks talk about this matter but you wrote down really true words!</p> ]]></content:encoded> </item> <item><title>By: Zach</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7297</link> <dc:creator>Zach</dc:creator> <pubDate>Fri, 09 Oct 2009 20:24:15 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7297</guid> <description>Thanks Dave! As I understand it, LPC was designed with speech compression in mind. The playback works like this: you generate either a repeating pulse (for pitched frames such as vowel sounds) or noise (for sibilance (&quot;S&quot;, &quot;Z&quot;, etc)), and you run this signal through a set of very short delays (echoes). The output is fed back to the input, so it self-oscillates a bit.
Different vowel sounds can be produced by changing the amplitudes of the individual delays. The real magic is obtaining those values -- my early experiments were plagued with howling feedback, and situations where the DC offset increased exponentially ;)
Ultimately, I looked at the source of an app called rt_lpc (http://soundlab.cs.princeton.edu/software/rt_lpc/) and this was very useful. I have a vague idea of how the encoding works now (their code could stand more comments IMHO ;), the decoding &amp; playback is pretty straight-forward.
Currently I&#039;m not using any blending or transitions between phonemes (tho I am synthesizing the sound at a 10khz sample rate and using anti-aliasing during playback, because I like how it sounds ;) When the feedback amplitudes change, the playback buffer retains its old state, and the result is a smooth-sounding transition between frames. Pretty neat.
Really, I&#039;m amazed that LPC works AT ALL. It&#039;s very clever.
I&#039;d really like to share the code (the dictionary and speech-to-phoneme aspects are interesting, too). Hopefully I&#039;ll get the go-ahead from the team soon... I&#039;ll write another post when that&#039;s ready.</description> <content:encoded><![CDATA[<p>Thanks Dave! As I understand it, LPC was designed with speech compression in mind. The playback works like this: you generate either a repeating pulse (for pitched frames such as vowel sounds) or noise (for sibilance (&#8220;S&#8221;, &#8220;Z&#8221;, etc)), and you run this signal through a set of very short delays (echoes). The output is fed back to the input, so it self-oscillates a bit.</p><p>Different vowel sounds can be produced by changing the amplitudes of the individual delays. The real magic is obtaining those values &#8212; my early experiments were plagued with howling feedback, and situations where the DC offset increased exponentially ;)</p><p>Ultimately, I looked at the source of an app called rt_lpc (<a
href="http://soundlab.cs.princeton.edu/software/rt_lpc/" rel="nofollow">http://soundlab.cs.princeton.edu/software/rt_lpc/</a>) and this was very useful. I have a vague idea of how the encoding works now (their code could stand more comments IMHO ;), the decoding &#038; playback is pretty straight-forward.</p><p>Currently I&#8217;m not using any blending or transitions between phonemes (tho I am synthesizing the sound at a 10khz sample rate and using anti-aliasing during playback, because I like how it sounds ;) When the feedback amplitudes change, the playback buffer retains its old state, and the result is a smooth-sounding transition between frames. Pretty neat.</p><p>Really, I&#8217;m amazed that LPC works AT ALL. It&#8217;s very clever.</p><p>I&#8217;d really like to share the code (the dictionary and speech-to-phoneme aspects are interesting, too). Hopefully I&#8217;ll get the go-ahead from the team soon&#8230; I&#8217;ll write another post when that&#8217;s ready.</p> ]]></content:encoded> </item> <item><title>By: Dave</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7296</link> <dc:creator>Dave</dc:creator> <pubDate>Fri, 09 Oct 2009 14:46:28 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7296</guid> <description>Great work :) and I think it&#039;s a 1st for AS3, so congratulations!
So, basically LPC is a way of compressing an audio sample, and a decompression (predictive?) algorithm for playback?
I&#039;m interested in this type of thing a lot for AS3...is any blending used while transitioning between phonemes?
If you could share any of the papers or knowledge used while making this i&#039;d really appreciate it.
Cheers!</description> <content:encoded><![CDATA[<p>Great work :) and I think it&#8217;s a 1st for AS3, so congratulations!<br
/> So, basically LPC is a way of compressing an audio sample, and a decompression (predictive?) algorithm for playback?<br
/> I&#8217;m interested in this type of thing a lot for AS3&#8230;is any blending used while transitioning between phonemes?<br
/> If you could share any of the papers or knowledge used while making this i&#8217;d really appreciate it.<br
/> Cheers!</p> ]]></content:encoded> </item> <item><title>By: Zach</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7189</link> <dc:creator>Zach</dc:creator> <pubDate>Sat, 26 Sep 2009 08:56:54 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7189</guid> <description>@cmoore: I&#039;d like to release the code! However, I have to clear this with the team... We plan on refining the speech a bit, and using it in a special project. Watch this space...</description> <content:encoded><![CDATA[<p>@cmoore: I&#8217;d like to release the code! However, I have to clear this with the team&#8230; We plan on refining the speech a bit, and using it in a special project. Watch this space&#8230;</p> ]]></content:encoded> </item> <item><title>By: cmoore</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7172</link> <dc:creator>cmoore</dc:creator> <pubDate>Fri, 25 Sep 2009 01:34:27 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7172</guid> <description>man this is cool .. any source avail? really awesome stuff!</description> <content:encoded><![CDATA[<p>man this is cool .. any source avail? really awesome stuff!</p> ]]></content:encoded> </item> <item><title>By: Zach</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-6789</link> <dc:creator>Zach</dc:creator> <pubDate>Thu, 27 Aug 2009 20:56:32 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-6789</guid> <description>Og2t: Thank you!! You jogged my memory -- I recall using &quot;Say It Sam&quot; on the Apple ][, which was another scratchy, low-fidelity voice. But it was impressive for the time. I have no idea how that software worked. I&#039;ll investigate and report back...</description> <content:encoded><![CDATA[<p>Og2t: Thank you!! You jogged my memory &#8212; I recall using &#8220;Say It Sam&#8221; on the Apple ][, which was another scratchy, low-fidelity voice. But it was impressive for the time. I have no idea how that software worked. I&#8217;ll investigate and report back&#8230;</p> ]]></content:encoded> </item> <item><title>By: Og2t</title><link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-6784</link> <dc:creator>Og2t</dc:creator> <pubDate>Thu, 27 Aug 2009 14:18:04 +0000</pubDate> <guid
isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-6784</guid> <description>This is incredible Zach! I remember speech synth done on C64 (caller SAM or something) but never seen one in Flash. I like the noise option, sounds like whispering.</description> <content:encoded><![CDATA[<p>This is incredible Zach! I remember speech synth done on C64 (caller SAM or something) but never seen one in Flash. I like the noise option, sounds like whispering.</p> ]]></content:encoded> </item> </channel> </rss>
