<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Ruby 1.9 made me remember how I hate the concept of encodings</title>
	<atom:link href="http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/</link>
	<description>Because programming should be fun</description>
	<lastBuildDate>Wed, 04 Jan 2012 05:17:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: mark</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-1452</link>
		<dc:creator>mark</dc:creator>
		<pubDate>Fri, 11 Nov 2011 20:23:10 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-1452</guid>
		<description>You are absolutely right Frank.

Ruby 1.9.x forces everyone to deal with it.

The Encoding shit is still the number one reason why I can&#039;t upgrade. I solved all the other problems with syntax change.

Ruby now forces me to deal with Encodings. I was able to avoid that.

Now I have to read up shit about Encoding and I KNOW THAT I WILL NEVER NEED THEM BECAUSE IN THE TEN YEARS SO FAR I NEVER HAD TO CATER TOWARDS ENCODINGS I DID NOT CARE ABOUT.

Now this choice has been removed from Ruby users.</description>
		<content:encoded><![CDATA[<p>You are absolutely right Frank.</p>
<p>Ruby 1.9.x forces everyone to deal with it.</p>
<p>The Encoding shit is still the number one reason why I can&#8217;t upgrade. I solved all the other problems with syntax change.</p>
<p>Ruby now forces me to deal with Encodings. I was able to avoid that.</p>
<p>Now I have to read up shit about Encoding and I KNOW THAT I WILL NEVER NEED THEM BECAUSE IN THE TEN YEARS SO FAR I NEVER HAD TO CATER TOWARDS ENCODINGS I DID NOT CARE ABOUT.</p>
<p>Now this choice has been removed from Ruby users.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frank</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-1015</link>
		<dc:creator>Frank</dc:creator>
		<pubDate>Wed, 30 Jun 2010 13:26:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-1015</guid>
		<description>Comments are now moderated. Please respect this public space and be nice in your comments if you want them to be published. Thank you!</description>
		<content:encoded><![CDATA[<p>Comments are now moderated. Please respect this public space and be nice in your comments if you want them to be published. Thank you!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: May 14: Moving To Beta 3 &#171; Rails Test Prescriptions Blog</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-987</link>
		<dc:creator>May 14: Moving To Beta 3 &#171; Rails Test Prescriptions Blog</dc:creator>
		<pubDate>Fri, 14 May 2010 15:09:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-987</guid>
		<description>[...] A brief rant on Ruby 1.9 and encodings. [...]</description>
		<content:encoded><![CDATA[<p>[...] A brief rant on Ruby 1.9 and encodings. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frank</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-986</link>
		<dc:creator>Frank</dc:creator>
		<pubDate>Fri, 14 May 2010 13:57:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-986</guid>
		<description>@Yehuda,

Thanks for the nice explanation. I&#039;m not throwing the blame at anyone in fact. I know Unicode is supposed to be the &quot;ultimate encoding table&quot; that I am talking about but it just isn&#039;t the case yet.

In a perfect world we would not have to worry about character encoding at all. Machines should do what has to be done behind the scene to make it all work without our intervention. I&#039;m sure one day we&#039;ll get there.</description>
		<content:encoded><![CDATA[<p>@Yehuda,</p>
<p>Thanks for the nice explanation. I&#8217;m not throwing the blame at anyone in fact. I know Unicode is supposed to be the &#8220;ultimate encoding table&#8221; that I am talking about but it just isn&#8217;t the case yet.</p>
<p>In a perfect world we would not have to worry about character encoding at all. Machines should do what has to be done behind the scene to make it all work without our intervention. I&#8217;m sure one day we&#8217;ll get there.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frank</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-985</link>
		<dc:creator>Frank</dc:creator>
		<pubDate>Fri, 14 May 2010 13:32:45 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-985</guid>
		<description>@Amadan, but what kind of disaster could happen if the website in question doesn&#039;t need internationalization? These &quot;local&quot; websites/applications are not waiting for a disaster to happen... they would have continue to work for eternity. Now they don&#039;t and I think it is stretching a bit too far to put all the blame on the developer.

Call me crazy but I prefer the silent &quot;failing&quot; of 1.8 over the unforgiving exception that is triggered with 1.9.1</description>
		<content:encoded><![CDATA[<p>@Amadan, but what kind of disaster could happen if the website in question doesn&#8217;t need internationalization? These &#8220;local&#8221; websites/applications are not waiting for a disaster to happen&#8230; they would have continue to work for eternity. Now they don&#8217;t and I think it is stretching a bit too far to put all the blame on the developer.</p>
<p>Call me crazy but I prefer the silent &#8220;failing&#8221; of 1.8 over the unforgiving exception that is triggered with 1.9.1</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Amadan</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-984</link>
		<dc:creator>Amadan</dc:creator>
		<pubDate>Fri, 14 May 2010 08:34:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-984</guid>
		<description>@Frank: Well, if the Rails app is presenting a UTF-8 web page and using a Latin-1 table for storage, it is a broken app - conceptually if not in practice. It&#039;s a disaster that was just waiting to happen. Well, it happened - not Ruby&#039;s fault people don&#039;t configure their DBs properly.</description>
		<content:encoded><![CDATA[<p>@Frank: Well, if the Rails app is presenting a UTF-8 web page and using a Latin-1 table for storage, it is a broken app &#8211; conceptually if not in practice. It&#8217;s a disaster that was just waiting to happen. Well, it happened &#8211; not Ruby&#8217;s fault people don&#8217;t configure their DBs properly.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Frank</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-983</link>
		<dc:creator>Frank</dc:creator>
		<pubDate>Fri, 14 May 2010 00:47:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-983</guid>
		<description>Jorg, that&#039;s the most exaggerated comment I saw so far. With your standards every rails applications backed by mysql databases encoded in Latin-1 (often the default encoding with mysql) are broken. That&#039;s a whole lot of broken apps!</description>
		<content:encoded><![CDATA[<p>Jorg, that&#8217;s the most exaggerated comment I saw so far. With your standards every rails applications backed by mysql databases encoded in Latin-1 (often the default encoding with mysql) are broken. That&#8217;s a whole lot of broken apps!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jörg W Mittag</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-982</link>
		<dc:creator>Jörg W Mittag</dc:creator>
		<pubDate>Fri, 14 May 2010 00:20:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-982</guid>
		<description>&gt; but it breaks working applications

Actually, it doesn&#039;t. It makes *already broken* applications raise an exception instead of silently corrupting data.</description>
		<content:encoded><![CDATA[<p>&gt; but it breaks working applications</p>
<p>Actually, it doesn&#8217;t. It makes *already broken* applications raise an exception instead of silently corrupting data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: nona</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-981</link>
		<dc:creator>nona</dc:creator>
		<pubDate>Thu, 13 May 2010 17:59:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-981</guid>
		<description>Every string has an implicit encoding, even if it&#039;s just ASCII (be happy it&#039;s not EBCDIC). Ruby 1.9 just makes this explicit, and allows you to convert strings to specific encodings, or not (remember, some encoding conversions are lossy).</description>
		<content:encoded><![CDATA[<p>Every string has an implicit encoding, even if it&#8217;s just ASCII (be happy it&#8217;s not EBCDIC). Ruby 1.9 just makes this explicit, and allows you to convert strings to specific encodings, or not (remember, some encoding conversions are lossy).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yehuda Katz</title>
		<link>http://www.rubyfleebie.com/ruby-1-9-made-me-remember-how-i-hate-the-concept-of-encodings/comment-page-1/#comment-980</link>
		<dc:creator>Yehuda Katz</dc:creator>
		<pubDate>Thu, 13 May 2010 16:04:38 +0000</pubDate>
		<guid isPermaLink="false">http://www.rubyfleebie.com/?p=137#comment-980</guid>
		<description>&gt; Why on earth do we have all those different encodings in 2010? 
&gt; Why not making a huge encoding table UTF-16384 containing 
&gt; every single character in the universe so we can forget about this 
&gt; crazy concept of different encodings and pretend that it never existed?

We do have that. It&#039;s called Unicode. Unicode has a list of every character the unicode committee has gotten to with a number. UTF-8, UTF-16, and UTF-32 are simply different ways to encode those numbers in a way that can be read.

Unfortunately, what you propose is an inherently human activity, and a hard one at that. They&#039;ve been at it for quite some time, and there are still encodings that cannot be losslessly converted to Unicode and back. This is the same for UTF-8, UTF-16, and UTF-32, because UTF-8 can already encode the entire Unicode space.

One issue is that in multiple encodings, there might be characters that represent the same unicode character. For instance, we don&#039;t need a separate &quot;A&quot; for the UTF-8 variant and the Latin-1 variant (Latin-1 is a common encoding that has the ASCII characters plus a bunch of stuff for latin countries). They attempted to do this with the myriad encodings in Japan, in a process called &quot;Han Unification&quot;. Unfortunately, the Japanese weren&#039;t always happy with the results.

This process could sometimes lead to it being impossible to represent your name in Unicode. Imagine if your name was José, and the Unicode committee decided that é and e were the same character, so a program which read in José spit back out Jose. You would not be happy with it.

In short, it&#039;s possible that this problem will eventually be solved, but it&#039;s complicated and messy, and probably has many years to go before it&#039;s completed.</description>
		<content:encoded><![CDATA[<p>&gt; Why on earth do we have all those different encodings in 2010?<br />
&gt; Why not making a huge encoding table UTF-16384 containing<br />
&gt; every single character in the universe so we can forget about this<br />
&gt; crazy concept of different encodings and pretend that it never existed?</p>
<p>We do have that. It&#8217;s called Unicode. Unicode has a list of every character the unicode committee has gotten to with a number. UTF-8, UTF-16, and UTF-32 are simply different ways to encode those numbers in a way that can be read.</p>
<p>Unfortunately, what you propose is an inherently human activity, and a hard one at that. They&#8217;ve been at it for quite some time, and there are still encodings that cannot be losslessly converted to Unicode and back. This is the same for UTF-8, UTF-16, and UTF-32, because UTF-8 can already encode the entire Unicode space.</p>
<p>One issue is that in multiple encodings, there might be characters that represent the same unicode character. For instance, we don&#8217;t need a separate &#8220;A&#8221; for the UTF-8 variant and the Latin-1 variant (Latin-1 is a common encoding that has the ASCII characters plus a bunch of stuff for latin countries). They attempted to do this with the myriad encodings in Japan, in a process called &#8220;Han Unification&#8221;. Unfortunately, the Japanese weren&#8217;t always happy with the results.</p>
<p>This process could sometimes lead to it being impossible to represent your name in Unicode. Imagine if your name was José, and the Unicode committee decided that é and e were the same character, so a program which read in José spit back out Jose. You would not be happy with it.</p>
<p>In short, it&#8217;s possible that this problem will eventually be solved, but it&#8217;s complicated and messy, and probably has many years to go before it&#8217;s completed.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

