<?xml version="1.0" encoding="UTF-8"?>
<!-- generator="wordpress/2.0.4" -->
<rss version="2.0" 
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	>

<channel>
	<title>IAintaBlonde.com</title>
	<link>http://www.iaintablonde.com</link>
	<description>Now that u know, lets get serious..............</description>
	<pubDate>Tue, 07 Oct 2008 17:37:51 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.0.4</generator>
	<language>en</language>
			<item>
		<title>UTF-8 and Unicode Standards :: What is UTF-8?</title>
		<link>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/</link>
		<comments>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/#comments</comments>
		<pubDate>Sat, 03 Mar 2007 17:36:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
		
	<dc:subject>I Love Tech</dc:subject>
	<dc:subject>I Love Programming!</dc:subject>
	<dc:subject>Linux/Unix/GNU</dc:subject><br />
<b>Warning</b>:  Invalid argument supplied for foreach() in <b>/home/iaab123/public_html/wp-content/plugins/autometa/autometa.php</b> on line <b>301</b><br />

	<!-- AutoMeta Start -->
	<!-- AutoMeta End -->
	
		<guid isPermaLink="false">http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/</guid>
		<description><![CDATA[UTF-8 stands for Unicode
Transformation
Format-8.  It is an octet (8-bit)
lossless encoding of Unicode characters.
UTF-8 encodes each Unicode character as a variable number of 1 to 4
octets, where the number of octets depends on the integer value assigned
to the Unicode character.  It is an efficient encoding of Unicode
documents that use mostly US-ASCII characters because it [...]]]></description>
			<content:encoded><![CDATA[<p>UTF-8 stands for <strong>U</strong>nicode<br />
<strong>T</strong>ransformation<br />
<strong>F</strong>ormat-<strong>8</strong>.  It is an octet (8-bit)<br />
lossless encoding of Unicode characters.</p>
<p>UTF-8 encodes each Unicode character as a variable number of 1 to 4<br />
octets, where the number of octets depends on the integer value assigned<br />
to the Unicode character.  It is an efficient encoding of Unicode<br />
documents that use mostly US-ASCII characters because it represents each<br />
character in the range U+0000 through U+007F as a single octet.  UTF-8<br />
is the default encoding for <a href="http://www.w3.org/TR/REC-xml">XML</a>.</p>
<h2>Standards</h2>
<dl>
<dt><a href="http://www.ietf.org/rfc/rfc3629.txt">RFC<br />
    3629</a>: UTF-8, a transformation format of ISO 10646.  November 2003.</dt>
<dt><a href="http://www.unicode.org/standard/standard.html">The<br />
    Unicode Standard 5.0</a>, November 2006. [<a title="The Unicode Standard, Version 5.0" href="http://www.amazon.com/o/ASIN/0321480910/ref=nosim/hebcal-20">purchase<br />
from Amazon.com</a>]</dt>
<dd>In particular, see the <a title="2.5 Encoding Forms: UTF-8" href="http://www.unicode.org/versions/Unicode4.0.0/ch02.pdf#G11165">informal<br />
    description</a> of UTF-8 in sections 2.5 and 2.6, pages 30-32, and a<br />
    much more <a title="3.9 Unicode Encoding Forms: UTF-8" href="http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf#G31703">formal<br />
    definition</a> in sections 3.9 and 3.10, pages 77-81.</dd>
</dl>
<p><a title="Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard" href="http://www.amazon.com/o/ASIN/0201700522/hebcal-20"><img src="http://www.utf-8.com/0201700522.01.MZZZZZZZ.jpg" alt="Unicode Demystified: A Practical Programmer's Guide to the Encoding Standard" align="right" border="0" height="160" hspace="4" vspace="4" width="128" /></a></p>
<h2>Articles and background reading</h2>
<dl>
<dt><a href="http://www.cl.cam.ac.uk/%7Emgk25/unicode.html">UTF-8 and<br />
    Unicode FAQ for Unix/Linux</a> by Markus Kuhn</dt>
<dt><a href="http://icu.sourceforge.net/docs/papers/forms_of_unicode/">Forms<br />
    of Unicode</a>, an excellent overview by Mark Davis</dt>
<dt><a href="http://en.wikipedia.org/wiki/UTF-8">Wikipedia UTF-8</a><br />
    contains a good discussion of why five- and six-octet sequences are<br />
    now illegal UTF-8</dt>
<dt><a href="http://www.czyborra.com/utf/">Unicode<br />
Transformation Formats</a> [czyborra.com]</dt>
<dt><a href="http://www.unicode.org/unicode/faq/utf_bom.html#UTF8">Unicode<br />
    UTF-8 FAQ</a></dt>
<dt><a href="http://www.unicode.org/unicode/reports/tr20/">Unicode in<br />
    XML and other Markup Languages</a>: Unicode Technical Report #20</dt>
<dt><a href="http://www.joelonsoftware.com/articles/Unicode.html">The<br />
    Absolute Minimum Every Software Developer Absolutely, Positively<br />
    Must Know About Unicode and Character Sets (No Excuses!)</a>, an<br />
    amusing and informative article by Joel Spolsky</dt>
</dl>
<h2>Character Sets</h2>
<p>The MIME character set attribute for UTF-8 is <code>UTF-8</code>.<br />
Character sets are case-insensitive, so <code>utf-8</code> is equally<br />
valid. [<a href="http://www.iana.org/assignments/character-sets">IANA Character<br />
Sets</a>].</p>
<p>In an HTML file, place this tag inside <code>&lt;head&gt;</code> &#8230;<br />
<code>&lt;/head&gt;</code>:</p>
<pre>&lt;meta http-equiv="Content-Type" content="text/html;charset=UTF-8"&gt;</pre>
<p>In an XML prolog, the encoding is typically specified as an<br />
attribute:</p>
<pre>&lt;?xml version="1.0" encoding="UTF-8" ?&gt;</pre>
<p>In Apache server config or .htaccess, this will cause the HTTP header<br />
to be generated for <code>text/html</code> and <code>text/plain</code><br />
content:</p>
<pre><a href="http://httpd.apache.org/docs/1.3/mod/core.html#adddefaultcharset">AddDefaultCharset</a> UTF-8</pre>
<p class="poweredbyperformancing">powered by <a href="http://performancing.com/firefox">performancing firefox</a></p>
:) IAintABlonde.com<p class="akst_link"><a href="http://www.iaintablonde.com/?p=78&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_78" class="akst_share_link" rel="nofollow">Share This</a>
</p><p>---<br />Related Articles at IAintaBlonde.com:<li>No Related Posts</li></p><br />]]></content:encoded>
			<wfw:commentRSS>http://www.iaintablonde.com/2007/03/03/utf-8-and-unicode-standards-what-is-utf-8/feed/</wfw:commentRSS>
		</item>
	</channel>
</rss>
