<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Ye olde English is dead</title>
	<atom:link href="http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/</link>
	<description>Experiments and thoughts in PHP and javascript</description>
	<lastBuildDate>Mon, 30 Jan 2012 20:33:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Giorgio Sironi</title>
		<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/comment-page-1/#comment-447</link>
		<dc:creator>Giorgio Sironi</dc:creator>
		<pubDate>Tue, 17 Mar 2009 17:41:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.dotvoid.com/?p=361#comment-447</guid>
		<description>I only go as far as removing diacritics marks (&#039;è&#039; becomes &#039;e&#039;):
http://ossigeno.svn.sourceforge.net/viewvc/ossigeno/trunk/core/library/Otk/Filter/Diacritics.php?revision=520&amp;view=markup</description>
		<content:encoded><![CDATA[<p>I only go as far as removing diacritics marks (&#8216;è&#8217; becomes &#8216;e&#8217;):<br />
<a href="http://ossigeno.svn.sourceforge.net/viewvc/ossigeno/trunk/core/library/Otk/Filter/Diacritics.php?revision=520&amp;view=markup" rel="nofollow">http://ossigeno.svn.sourceforge.net/viewvc/ossigeno/trunk/core/library/Otk/Filter/Diacritics.php?revision=520&amp;view=markup</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Danne</title>
		<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/comment-page-1/#comment-444</link>
		<dc:creator>Danne</dc:creator>
		<pubDate>Tue, 17 Mar 2009 15:16:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.dotvoid.com/?p=361#comment-444</guid>
		<description>I have tested translit with with good results.
&lt;code&gt;echo transliterate(&#039;þorn&#039;, array(&#039;normalize_ligature&#039;), &#039;utf8&#039;, &#039;iso-8859-1&#039;);&lt;/code&gt;

The above code will output &lt;em&gt;thorn&lt;/em&gt; which is more correct than &lt;em&gt;Y&lt;em&gt;.</description>
		<content:encoded><![CDATA[<p>I have tested translit with with good results.<br />
<code>echo transliterate('þorn', array('normalize_ligature'), 'utf8', 'iso-8859-1');</code></p>
<p>The above code will output <em>thorn</em> which is more correct than <em>Y</em><em>.</em></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Philippe Gamache</title>
		<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/comment-page-1/#comment-443</link>
		<dc:creator>Philippe Gamache</dc:creator>
		<pubDate>Tue, 17 Mar 2009 14:52:44 +0000</pubDate>
		<guid isPermaLink="false">http://www.dotvoid.com/?p=361#comment-443</guid>
		<description>You can look at the Translit PECL pack :
http://pecl.php.net/package/translit

This extension allows you to transliterate text in non-latin characters (such as Chinese, Cyrillic, Greek etc) to latin characters. Besides the
transliteration the extension also contains filters to upper- and lowercase latin, cyrillic and greek, and perform special forms of transliteration such as converting ligatures such as the Norwegian &quot;Ã¦&quot; to &quot;ae&quot; and normalizing punctuation and spacing.</description>
		<content:encoded><![CDATA[<p>You can look at the Translit PECL pack :<br />
<a href="http://pecl.php.net/package/translit" rel="nofollow">http://pecl.php.net/package/translit</a></p>
<p>This extension allows you to transliterate text in non-latin characters (such as Chinese, Cyrillic, Greek etc) to latin characters. Besides the<br />
transliteration the extension also contains filters to upper- and lowercase latin, cyrillic and greek, and perform special forms of transliteration such as converting ligatures such as the Norwegian &#8220;Ã¦&#8221; to &#8220;ae&#8221; and normalizing punctuation and spacing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Danne</title>
		<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/comment-page-1/#comment-442</link>
		<dc:creator>Danne</dc:creator>
		<pubDate>Tue, 17 Mar 2009 14:39:25 +0000</pubDate>
		<guid isPermaLink="false">http://www.dotvoid.com/?p=361#comment-442</guid>
		<description>Well, Icelandic people probably use in daily life when writing Icelandic texts...

It&#039;s not about sanitizing, it&#039;s about creating search engine/user friendly ascii urls that are as close as possible to the original title but excludes national characters to be a little bit more international viable.

And to clarify, the above code example is shortened. For the sake of an example it is not necessary to include all western characters that needs to be supported.</description>
		<content:encoded><![CDATA[<p>Well, Icelandic people probably use in daily life when writing Icelandic texts&#8230;</p>
<p>It&#8217;s not about sanitizing, it&#8217;s about creating search engine/user friendly ascii urls that are as close as possible to the original title but excludes national characters to be a little bit more international viable.</p>
<p>And to clarify, the above code example is shortened. For the sake of an example it is not necessary to include all western characters that needs to be supported.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rian</title>
		<link>http://www.dotvoid.com/2009/03/ye-olde-english-is-dead/comment-page-1/#comment-441</link>
		<dc:creator>Rian</dc:creator>
		<pubDate>Tue, 17 Mar 2009 14:31:36 +0000</pubDate>
		<guid isPermaLink="false">http://www.dotvoid.com/?p=361#comment-441</guid>
		<description>but who actually still uses \xFE?

One could argue that it&#039;s not really worthwhile to write a sanitizer for non-alphabetic and numeric characters as no-one tends to use them in their titles anyhow.. If you do write it up, I imagine there&#039;s a few more characters that need replacing.</description>
		<content:encoded><![CDATA[<p>but who actually still uses \xFE?</p>
<p>One could argue that it&#8217;s not really worthwhile to write a sanitizer for non-alphabetic and numeric characters as no-one tends to use them in their titles anyhow.. If you do write it up, I imagine there&#8217;s a few more characters that need replacing.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

