<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Synthetic Speech in Flash</title>
	<atom:link href="http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/</link>
	<description>This blog is GO!</description>
	<lastBuildDate>Wed, 07 Jul 2010 05:59:50 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
	<item>
		<title>By: slegectecaula</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7797</link>
		<dc:creator>slegectecaula</dc:creator>
		<pubDate>Fri, 11 Dec 2009 21:05:49 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7797</guid>
		<description>Lots of of folks talk about this matter but you wrote down really true words!</description>
		<content:encoded><![CDATA[<p>Lots of of folks talk about this matter but you wrote down really true words!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zach</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7297</link>
		<dc:creator>Zach</dc:creator>
		<pubDate>Fri, 09 Oct 2009 20:24:15 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7297</guid>
		<description>Thanks Dave! As I understand it, LPC was designed with speech compression in mind. The playback works like this: you generate either a repeating pulse (for pitched frames such as vowel sounds) or noise (for sibilance (&quot;S&quot;, &quot;Z&quot;, etc)), and you run this signal through a set of very short delays (echoes). The output is fed back to the input, so it self-oscillates a bit.

Different vowel sounds can be produced by changing the amplitudes of the individual delays. The real magic is obtaining those values -- my early experiments were plagued with howling feedback, and situations where the DC offset increased exponentially ;)

Ultimately, I looked at the source of an app called rt_lpc (http://soundlab.cs.princeton.edu/software/rt_lpc/) and this was very useful. I have a vague idea of how the encoding works now (their code could stand more comments IMHO ;), the decoding &amp; playback is pretty straight-forward.

Currently I&#039;m not using any blending or transitions between phonemes (tho I am synthesizing the sound at a 10khz sample rate and using anti-aliasing during playback, because I like how it sounds ;) When the feedback amplitudes change, the playback buffer retains its old state, and the result is a smooth-sounding transition between frames. Pretty neat.

Really, I&#039;m amazed that LPC works AT ALL. It&#039;s very clever.

I&#039;d really like to share the code (the dictionary and speech-to-phoneme aspects are interesting, too). Hopefully I&#039;ll get the go-ahead from the team soon... I&#039;ll write another post when that&#039;s ready.</description>
		<content:encoded><![CDATA[<p>Thanks Dave! As I understand it, LPC was designed with speech compression in mind. The playback works like this: you generate either a repeating pulse (for pitched frames such as vowel sounds) or noise (for sibilance (&#8220;S&#8221;, &#8220;Z&#8221;, etc)), and you run this signal through a set of very short delays (echoes). The output is fed back to the input, so it self-oscillates a bit.</p>
<p>Different vowel sounds can be produced by changing the amplitudes of the individual delays. The real magic is obtaining those values &#8212; my early experiments were plagued with howling feedback, and situations where the DC offset increased exponentially ;)</p>
<p>Ultimately, I looked at the source of an app called rt_lpc (<a href="http://soundlab.cs.princeton.edu/software/rt_lpc/" rel="nofollow">http://soundlab.cs.princeton.edu/software/rt_lpc/</a>) and this was very useful. I have a vague idea of how the encoding works now (their code could stand more comments IMHO ;), the decoding &#038; playback is pretty straight-forward.</p>
<p>Currently I&#8217;m not using any blending or transitions between phonemes (tho I am synthesizing the sound at a 10khz sample rate and using anti-aliasing during playback, because I like how it sounds ;) When the feedback amplitudes change, the playback buffer retains its old state, and the result is a smooth-sounding transition between frames. Pretty neat.</p>
<p>Really, I&#8217;m amazed that LPC works AT ALL. It&#8217;s very clever.</p>
<p>I&#8217;d really like to share the code (the dictionary and speech-to-phoneme aspects are interesting, too). Hopefully I&#8217;ll get the go-ahead from the team soon&#8230; I&#8217;ll write another post when that&#8217;s ready.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7296</link>
		<dc:creator>Dave</dc:creator>
		<pubDate>Fri, 09 Oct 2009 14:46:28 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7296</guid>
		<description>Great work :) and I think it&#039;s a 1st for AS3, so congratulations!
So, basically LPC is a way of compressing an audio sample, and a decompression (predictive?) algorithm for playback?
I&#039;m interested in this type of thing a lot for AS3...is any blending used while transitioning between phonemes?
If you could share any of the papers or knowledge used while making this i&#039;d really appreciate it.
Cheers!</description>
		<content:encoded><![CDATA[<p>Great work :) and I think it&#8217;s a 1st for AS3, so congratulations!<br />
So, basically LPC is a way of compressing an audio sample, and a decompression (predictive?) algorithm for playback?<br />
I&#8217;m interested in this type of thing a lot for AS3&#8230;is any blending used while transitioning between phonemes?<br />
If you could share any of the papers or knowledge used while making this i&#8217;d really appreciate it.<br />
Cheers!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zach</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7189</link>
		<dc:creator>Zach</dc:creator>
		<pubDate>Sat, 26 Sep 2009 08:56:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7189</guid>
		<description>@cmoore: I&#039;d like to release the code! However, I have to clear this with the team... We plan on refining the speech a bit, and using it in a special project. Watch this space...</description>
		<content:encoded><![CDATA[<p>@cmoore: I&#8217;d like to release the code! However, I have to clear this with the team&#8230; We plan on refining the speech a bit, and using it in a special project. Watch this space&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: cmoore</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-7172</link>
		<dc:creator>cmoore</dc:creator>
		<pubDate>Fri, 25 Sep 2009 01:34:27 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-7172</guid>
		<description>man this is cool .. any source avail? really awesome stuff!</description>
		<content:encoded><![CDATA[<p>man this is cool .. any source avail? really awesome stuff!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Zach</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-6789</link>
		<dc:creator>Zach</dc:creator>
		<pubDate>Thu, 27 Aug 2009 20:56:32 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-6789</guid>
		<description>Og2t: Thank you!! You jogged my memory -- I recall using &quot;Say It Sam&quot; on the Apple ][, which was another scratchy, low-fidelity voice. But it was impressive for the time. I have no idea how that software worked. I&#039;ll investigate and report back...</description>
		<content:encoded><![CDATA[<p>Og2t: Thank you!! You jogged my memory &#8212; I recall using &#8220;Say It Sam&#8221; on the Apple ][, which was another scratchy, low-fidelity voice. But it was impressive for the time. I have no idea how that software worked. I&#8217;ll investigate and report back&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Og2t</title>
		<link>http://blog.zacharcher.com/2009/08/27/synthetic-speech-in-flash/comment-page-1/#comment-6784</link>
		<dc:creator>Og2t</dc:creator>
		<pubDate>Thu, 27 Aug 2009 14:18:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.zacharcher.com/?p=161#comment-6784</guid>
		<description>This is incredible Zach! I remember speech synth done on C64 (caller SAM or something) but never seen one in Flash. I like the noise option, sounds like whispering.</description>
		<content:encoded><![CDATA[<p>This is incredible Zach! I remember speech synth done on C64 (caller SAM or something) but never seen one in Flash. I like the noise option, sounds like whispering.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
