<?xml version="1.0" encoding="UTF-8"?><!-- generator="wordpress/2.0.3" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/">
<channel>
	<title>Comments on: word frequency count</title>
	<link>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/</link>
	<description>geeky stuff</description>
	<pubDate>Sat, 19 May 2012 19:55:42 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.3</generator>

	<item>
		<title>by: äxl</title>
		<link>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9927</link>
		<pubDate>Sat, 11 Feb 2012 16:47:49 +0000</pubDate>
		<guid>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9927</guid>
					<description>Here from pravin27:
http://www.unix.com/shell-programming-scripting/156334-word-frequency-sort-printfriendly.html</description>
		<content:encoded><![CDATA[<p>Here from pravin27:<br />
http://www.unix.com/shell-programming-scripting/156334-word-frequency-sort-printfriendly.html
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: äxl</title>
		<link>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9926</link>
		<pubDate>Sat, 11 Feb 2012 16:46:35 +0000</pubDate>
		<guid>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9926</guid>
					<description>This seems to work:

&lt;code&gt;awk '{gsub(/[^[:alnum:]_[:blank:]]/, &quot;&quot;, $0);for (i = 1; i </description>
		<content:encoded><![CDATA[<p>This seems to work:</p>
<p><code>awk '{gsub(/[^[:alnum:]_[:blank:]]/, "", $0);for (i = 1; i
</p>
]]></content:encoded>
				</item>
	<item>
		<title>by: äxl</title>
		<link>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9925</link>
		<pubDate>Sat, 11 Feb 2012 16:42:01 +0000</pubDate>
		<guid>http://geekgeekgeek.antithetical.org/2011/12/word-frequency-count/#comment-9925</guid>
					<description>&lt;code&gt;cat {file} &amp;#124; tr [:upper:] [:lower:]\ &amp;#124; sed s/[^[:alpha:]][^[:alpha:]]*/\ /g &amp;#124; sort &amp;#124; uniq -c &amp;#124; sort -r &amp;#124; more&lt;code&gt;

Gotten rid of the typographic apostrophes. Use typewriter apostrophes instead. The space before /g is important to separate words.
But still it doesn't work correctly since it only counts unique lines not words ...</description>
		<content:encoded><![CDATA[<p><code>cat {file} | tr [:upper:] [:lower:]\ | sed s/[^[:alpha:]][^[:alpha:]]*/\ /g | sort | uniq -c | sort -r | more<code></p>
<p>Gotten rid of the typographic apostrophes. Use typewriter apostrophes instead. The space before /g is important to separate words.<br />
But still it doesn't work correctly since it only counts unique lines not words ...
</p>
]]></content:encoded>
				</item>
</channel>
</rss>

