<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Naïve Bayes in Hadoop</title>
	<atom:link href="http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/</link>
	<description>Just another WordPress weblog</description>
	<pubDate>Thu, 29 Jul 2010 12:51:05 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Nick</title>
		<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/comment-page-1/#comment-1481</link>
		<dc:creator>Nick</dc:creator>
		<pubDate>Tue, 17 Nov 2009 22:47:04 +0000</pubDate>
		<guid isPermaLink="false">http://nickjenkin.com/blog/?p=85#comment-1481</guid>
		<description>@Cliff
Thanks, however you may want to check out Knuths std dev function as it more accurate

http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance

On-line algorithm</description>
		<content:encoded><![CDATA[<p>@Cliff<br />
Thanks, however you may want to check out Knuths std dev function as it more accurate</p>
<p><a href="http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance" rel="nofollow">http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance</a></p>
<p>On-line algorithm</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick</title>
		<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/comment-page-1/#comment-1480</link>
		<dc:creator>Nick</dc:creator>
		<pubDate>Tue, 17 Nov 2009 22:45:20 +0000</pubDate>
		<guid isPermaLink="false">http://nickjenkin.com/blog/?p=85#comment-1480</guid>
		<description>Also note, while this looks like python it is pseudo code.</description>
		<content:encoded><![CDATA[<p>Also note, while this looks like python it is pseudo code.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick</title>
		<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/comment-page-1/#comment-1479</link>
		<dc:creator>Nick</dc:creator>
		<pubDate>Tue, 17 Nov 2009 22:44:46 +0000</pubDate>
		<guid isPermaLink="false">http://nickjenkin.com/blog/?p=85#comment-1479</guid>
		<description>Hi
The collect function is provided by Hadoop
-Nick</description>
		<content:encoded><![CDATA[<p>Hi<br />
The collect function is provided by Hadoop<br />
-Nick</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Petro</title>
		<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/comment-page-1/#comment-1478</link>
		<dc:creator>Petro</dc:creator>
		<pubDate>Tue, 17 Nov 2009 18:52:26 +0000</pubDate>
		<guid isPermaLink="false">http://nickjenkin.com/blog/?p=85#comment-1478</guid>
		<description>Hi, 

where did the 'collect' function come from? 

Thanks,
Petro</description>
		<content:encoded><![CDATA[<p>Hi, </p>
<p>where did the &#8216;collect&#8217; function come from? </p>
<p>Thanks,<br />
Petro</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cliff Moon</title>
		<link>http://nickjenkin.com/blog/2009/04/naive-bayes-in-hadoop/comment-page-1/#comment-1476</link>
		<dc:creator>Cliff Moon</dc:creator>
		<pubDate>Sun, 15 Nov 2009 00:34:29 +0000</pubDate>
		<guid isPermaLink="false">http://nickjenkin.com/blog/?p=85#comment-1476</guid>
		<description>Just to let you know your standard deviation calculation is incorrect.  It should be:

  sqrt(abs(sumSq - mean * mean) / count)</description>
		<content:encoded><![CDATA[<p>Just to let you know your standard deviation calculation is incorrect.  It should be:</p>
<p>  sqrt(abs(sumSq - mean * mean) / count)</p>
]]></content:encoded>
	</item>
</channel>
</rss>
