<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>IAintaBlonde.com &#187; Linux/Unix/GNU</title>
	<atom:link href="http://www.iaintablonde.com/category/technology-internet/linuxunixgnu/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.iaintablonde.com</link>
	<description>Now that u know, lets get serious..............</description>
	<lastBuildDate>Thu, 04 Feb 2010 21:17:49 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>UTF-8 and Unicode Standards :: What is UTF-8?</title>
		<link>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/</link>
		<comments>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/#comments</comments>
		<pubDate>Sat, 03 Mar 2007 17:36:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		<br />
<b>Warning</b>:  Invalid argument supplied for foreach() in <b>/home/iaab123/public_html/wp-content/plugins/autometa/autometa.php</b> on line <b>301</b><br />
		<category><![CDATA[I Love Programming!]]></category>
		<category><![CDATA[I Love Tech]]></category>
		<category><![CDATA[Linux/Unix/GNU]]></category>

	<!-- AutoMeta Start -->
	<!-- AutoMeta End -->
	
		<guid isPermaLink="false">http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/</guid>
		<description><![CDATA[UTF-8 stands for Unicode
Transformation
Format-8.  It is an octet (8-bit)
lossless encoding of Unicode characters.
UTF-8 encodes each Unicode character as a variable number of 1 to 4
octets, where the number of octets depends on the integer value assigned
to the Unicode character.  It is an efficient encoding of Unicode
documents that use mostly US-ASCII characters because it [...]]]></description>
			<content:encoded><![CDATA[<p>UTF-8 stands for <strong>U</strong>nicode<br />
<strong>T</strong>ransformation<br />
<strong>F</strong>ormat-<strong>8</strong>.  It is an octet (8-bit)<br />
lossless encoding of Unicode characters.</p>
<p>UTF-8 encodes each Unicode character as a variable number of 1 to 4<br />
octets, where the number of octets depends on the integer value assigned<br />
to the Unicode character.  It is an efficient encoding of Unicode<br />
documents that use mostly US-ASCII characters because it represents each<br />
character in the range U+0000 through U+007F as a single octet.  UTF-8<br />
is the default encoding for <a href="http://www.w3.org/TR/REC-xml">XML</a>.</p>
<h2>Standards</h2>
<dl>
<dt><a href="http://www.ietf.org/rfc/rfc3629.txt">RFC<br />
    3629</a>: UTF-8, a transformation format of ISO 10646.  November 2003.</dt>
<dt><a href="http://www.unicode.org/standard/standard.html">The<br />
    Unicode Standard 5.0</a>, November 2006. [<a title="The Unicode Standard, Version 5.0" href="http://www.amazon.com/o/ASIN/0321480910/ref=nosim/hebcal-20">purchase<br />
from Amazon.com</a>]</dt>
<dd>In particular, see the <a title="2.5 Encoding Forms: UTF-8" href="http://www.unicode.org/versions/Unicode4.0.0/ch02.pdf#G11165">informal<br />
    description</a> of UTF-8 in sections 2.5 and 2.6, pages 30-32, and a<br />
    much more <a title="3.9 Unicode Encoding Forms: UTF-8" href="http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf#G31703">formal<br />
    definition</a> in sections 3.9 and 3.10, pages 77-81.</dd>
</dl>
<p><a title="Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard" href="http://www.amazon.com/o/ASIN/0201700522/hebcal-20"><img src="http://www.utf-8.com/0201700522.01.MZZZZZZZ.jpg" alt="Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard" align="right" border="0" height="160" hspace="4" vspace="4" width="128" /></a></p>
<h2>Articles and background reading</h2>
<dl>
<dt><a href="http://www.cl.cam.ac.uk/%7Emgk25/unicode.html">UTF-8 and<br />
    Unicode FAQ for Unix/Linux</a> by Markus Kuhn</dt>
<dt><a href="http://icu.sourceforge.net/docs/papers/forms_of_unicode/">Forms<br />
    of Unicode</a>, an excellent overview by Mark Davis</dt>
<dt><a href="http://en.wikipedia.org/wiki/UTF-8">Wikipedia UTF-8</a><br />
    contains a good discussion of why five- and six-octet sequences are<br />
    now illegal UTF-8</dt>
<dt><a href="http://www.czyborra.com/utf/">Unicode<br />
Transformation Formats</a> [czyborra.com]</dt>
<dt><a href="http://www.unicode.org/unicode/faq/utf_bom.html#UTF8">Unicode<br />
    UTF-8 FAQ</a></dt>
<dt><a href="http://www.unicode.org/unicode/reports/tr20/">Unicode in<br />
    XML and other Markup Languages</a>: Unicode Technical Report #20</dt>
<dt><a href="http://www.joelonsoftware.com/articles/Unicode.html">The<br />
    Absolute Minimum Every Software Developer Absolutely, Positively<br />
    Must Know About Unicode and Character Sets (No Excuses!)</a>, an<br />
    amusing and informative article by Joel Spolsky</dt>
</dl>
<h2>Character Sets</h2>
<p>The MIME character set attribute for UTF-8 is <code>UTF-8</code>.<br />
Character sets are case-insensitive, so <code>utf-8</code> is equally<br />
valid. [<a href="http://www.iana.org/assignments/character-sets">IANA Character<br />
Sets</a>].</p>
<p>In an HTML file, place this tag inside <code>&lt;head&gt;</code> &#8230;<br />
<code>&lt;/head&gt;</code>:</p>
<pre>&lt;meta http-equiv="Content-Type" content="text/html;charset=UTF-8"&gt;</pre>
<p>In an XML prolog, the encoding is typically specified as an<br />
attribute:</p>
<pre>&lt;?xml version="1.0" encoding="UTF-8" ?&gt;</pre>
<p>In Apache server config or .htaccess, this will cause the HTTP header<br />
to be generated for <code>text/html</code> and <code>text/plain</code><br />
content:</p>
<pre><a href="http://httpd.apache.org/docs/1.3/mod/core.html#adddefaultcharset">AddDefaultCharset</a> UTF-8</pre>
<p class="poweredbyperformancing">powered by <a href="http://performancing.com/firefox">performancing firefox</a></p>
<p class="facebook"><a href="http://www.facebook.com/share.php?u=http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/" target="_blank"><img src="http://www.iaintablonde.com/wp-content/plugins/add-to-facebook-plugin/facebook_share_icon.gif" alt="Share on Facebook" title="Share on Facebook" /></a><a href="http://www.facebook.com/share.php?u=http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/" target="_blank" title="Share on Facebook">Share on Facebook</a></p>]]></content:encoded>
			<wfw:commentRss>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
