{"id":961,"date":"2010-04-23T11:31:37","date_gmt":"2010-04-23T10:31:37","guid":{"rendered":"http:\/\/blog.capdata.fr\/?p=961"},"modified":"2018-01-15T09:49:06","modified_gmt":"2018-01-15T08:49:06","slug":"jeux-de-caracteres-unicode-et-base-de-donnees","status":"publish","type":"post","link":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/","title":{"rendered":"Character sets, Unicode and databases"},"content":{"rendered":"<div><strong>Introduction<\/strong><\/div>\n<div>A character is a single symbol or sign belonging to an alphabet.<\/div>\n<div>A character set is a collection of characters to be represented.<\/div>\n<div>Several character sets exist, covering the different alphabets of the world.<\/div>\n<p><strong>The ASCII standard<\/strong><\/p>\n<p><span style=\"font-weight: normal;\">To catalogue all these characters, a standard was put in place.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The first standard to appear was ASCII.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">To represent characters in a text file, a number (code) is assigned to each letter, digit or symbol.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The best-known ASCII encoding is US-ASCII.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">A =&gt; 65 =&gt; 01000001<\/span><\/p>\n<p><span style=\"font-weight: normal;\">a =&gt; 97 =&gt; 01100001<\/span><\/p>\n<p><span style=\"font-weight: normal;\">Each character is encoded on 7 bits of a byte, which gives 128 possible symbols. About thirty of them are control characters; the rest are the standard English alphabet, digits and punctuation marks.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The best-known extension is the ISO-8859 family, which can represent accented characters. It uses a full byte per character, giving 256 possible characters. 
In ISO-8859-1, codes 160 to 255 are assigned to accented Latin letters and additional symbols.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">ISO-8859-1 to 15 standards:<\/span><\/p>\n<p><span style=\"font-weight: normal;\"> ISO-8859-1, then ISO-8859-15 (adds the euro sign): Western Europe<\/span><\/p>\n<p><span style=\"font-weight: normal;\"> ISO-8859-2: Central or Eastern Europe<\/span><\/p>\n<p><span style=\"font-weight: normal;\"> ISO-8859-5: Cyrillic<\/span><\/p>\n<p><span style=\"font-weight: normal;\"> ISO-8859-7: Greek<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><strong> <span style=\"font-weight: normal;\">The main problems with this standard are the following:<\/span><\/strong><\/p>\n<p>&#8211; Several languages cannot be used within a single instance.<\/p>\n<p><span style=\"font-weight: normal;\">&#8211; Two notable omissions from ISO Latin-1 are the ligature \u0153, essential to correct French typography, and the euro sign \u20ac.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; Computer vendors also developed their own encodings that do not conform to the international standards (IBM EBCDIC, Windows-1252, and others).<\/span><\/p>\n<p><strong>Unicode<\/strong><\/p>\n<p><span style=\"font-weight: normal;\">Globalization means bringing all the world&#8217;s known characters together in a single table.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The Unicode standard is a universal character encoding mechanism. It defines a consistent way of encoding multilingual text and makes exchanging text data easier. 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">It is a requirement of most recent Internet protocols and is implemented in all modern operating systems and programming languages.<\/span><\/p>\n<p><span style=\"font-weight: normal;\"><span style=\"text-decoration: underline;\">Encoding forms<\/span><\/span><\/p>\n<p><span style=\"font-weight: normal;\">The Universal Character Set has several character encoding forms:<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; UCS-2<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; UCS-4<\/span><\/p>\n<p>However, UCS-2 and UCS-4 are hard to use in the many applications and protocols that work with characters encoded on 8 or even 7 bits. Even some more recent systems that can work with 16-bit characters cannot process data encoded in UCS-4. This situation led to the development of the UTF transformation formats (UCS Transformation Formats), each with its own characteristics.<\/p>\n<p><span style=\"text-decoration: underline;\">UTF-8 encoding<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-8 was designed by Kenneth Thompson in 1992 to be compatible with software originally written to handle single-byte characters. 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">Characters numbered 0 to 127 are encoded on a single byte whose most significant bit is always zero.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">Characters numbered above 127 are encoded on several bytes (two to four).<\/span><\/p>\n<p><span style=\"text-decoration: underline;\">UTF-16 encoding<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-16 is a method for representing the part of the UCS-4 table that lies outside the 16-bit range as a pair of 16-bit values taken from a reserved range (a surrogate pair).<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-16 comes in two byte orders:<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-16BE (big-endian)<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-16LE (little-endian)<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-16BE is one of the encodings recommended by the Unicode standard. It assigns a single 16-bit code (2 bytes) to each character of the Basic Multilingual Plane; any other character takes two 16-bit codes.<\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"text-decoration: underline;\">UTF-32 encoding<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-32 encodes every character on 32 bits, that is, 4 bytes per character. 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">It also comes in two byte orders:<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-32BE (big-endian)<\/span><\/p>\n<p><span style=\"font-weight: normal;\">UTF-32LE (little-endian)<\/span><\/p>\n<p><span style=\"font-weight: normal;\">It consumes a relatively large amount of space.<\/span><\/p>\n<p><strong>DBMS support<\/strong><\/p>\n<p style=\"text-align: center;\"><a href=\"http:\/\/blog.capdata.fr\/wp-content\/uploads\/2010\/04\/unicode_sgbd.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-962 aligncenter\" src=\"http:\/\/blog.capdata.fr\/wp-content\/uploads\/2010\/04\/unicode_sgbd-300x161.png\" alt=\"\" width=\"500\" height=\"261\" \/><\/a><\/p>\n<p><strong>Oracle<\/strong><\/p>\n<p><span style=\"font-weight: normal;\">Two settings provide Unicode support:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: normal;\">Database Character Set<\/span><\/li>\n<li><span style=\"font-weight: normal;\">National Character Set<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: normal;\">Both are set when the database is created.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The CREATE DATABASE statement has a CHARACTER SET clause and a NATIONAL CHARACTER SET clause, which declare the database character set and the national character set respectively. 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">If the NATIONAL CHARACTER SET clause is omitted, AL16UTF16 is used by default.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The Database Character Set cannot be set to UTF-16.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">Oracle handles Unicode at column level through the National Character Set and the following data types:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: normal;\">NCHAR<\/span><\/li>\n<li><span style=\"font-weight: normal;\">NVARCHAR2<\/span><\/li>\n<li><span style=\"font-weight: normal;\">NCLOB<\/span><\/li>\n<\/ul>\n<p><strong>Sybase<\/strong><\/p>\n<p><span style=\"font-weight: normal;\">At installation time you must set the default character set parameter. By default it follows the character set used on your server (Windows: 1252, Solaris: ISO 8859-1). 
Sybase recommends setting this parameter to UTF-8 if you use Unicode.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">Sybase supports Unicode at column level through the following three data types:<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">unichar<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">univarchar<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">unitext<\/span><\/p>\n<p><span style=\"font-weight: normal;\">These three data types are encoded in UTF-16.<\/span><\/p>\n<p><strong>SQL Server<\/strong><\/p>\n<p><span style=\"font-weight: normal;\">This is where the notion of collation comes in.<\/span><\/p>\n<p><span style=\"font-weight: normal;\"> A collation is a set of rules, specific to a character set, for comparing and sorting its characters.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">A character set can have several collations, usually one per language (for example, latin1 French, latin1 Swedish). 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">This setting is chosen at installation. Its default value depends on the language of the OS on which the installation is done (for Windows collations):<\/span><\/p>\n<p><span style=\"font-weight: normal;\">French system: French_CI_AS<\/span><\/p>\n<p><span style=\"font-weight: normal;\">English system: Latin1_General_CI_AS<\/span><\/p>\n<p><span style=\"font-weight: normal;\">It determines the code page used for non-Unicode columns (varchar, for example), the sort order of character strings, and the sensitivity to accents, case, and so on.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The configured collation applies to the system databases (master, msdb, tempdb and model) and, by default, to newly created databases. It can nevertheless be overridden when a new database is created or when a text column is added, using the COLLATE keyword. 
<\/span><\/p>\n<p>SQL Server supports Unicode through the following column types:<\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">nchar<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">nvarchar<\/span><\/p>\n<p><span style=\"font-weight: normal;\">&#8211; <\/span><span style=\"font-weight: normal;\">ntext<\/span><\/p>\n<p><strong>MySQL<\/strong><\/p>\n<p>MySQL determines the character set and collation from the configuration options in effect when the server starts, or from configuration values set at run time.<\/p>\n<p><span style=\"font-weight: normal;\">You can pass --default-character-set=character_set_name to choose the character set and, at the same time, --default-collation=collation_name to choose the collation. By default MySQL behaves as if started with --default-character-set=latin1 --default-collation=latin1_swedish_ci, because latin1_swedish_ci is the default collation of latin1.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">The collation can also be set at database or column level with the COLLATE clause.<\/span><\/p>\n<p><span style=\"font-weight: normal;\">In MySQL 4.x and earlier, NCHAR and CHAR were synonyms. 
<\/span><\/p>\n<p><span style=\"font-weight: normal;\">MySQL uses utf8 as the predefined character set for NCHAR and NVARCHAR columns.<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introduction Un caract\u00e8re est un symbole ou un signe unique appartenant \u00e0 un alphabet. Le jeu de caract\u00e8res est un ensemble de caract\u00e8res \u00e0 repr\u00e9senter. Il existe plusieurs jeux de caract\u00e8res, permettant de repr\u00e9senter diff\u00e9rents alphabets du monde. 
Le standard&hellip; <a href=\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\" class=\"more-link\">Continuer la lecture <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":14,"featured_media":963,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[143,4,3,5,6],"tags":[139,140,141,142,138],"class_list":["post-961","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-general","category-mysql","category-oracle","category-sqlserver","category-sybase","tag-characterset","tag-jeux-de-caracteres","tag-nchar","tag-nvarchar","tag-unicode"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v20.8 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\" \/>\n<meta property=\"og:locale\" content=\"fr_FR\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG\" \/>\n<meta property=\"og:description\" content=\"Introduction Un caract\u00e8re est un symbole ou un signe unique appartenant \u00e0 un alphabet. Le jeu de caract\u00e8res est un ensemble de caract\u00e8res \u00e0 repr\u00e9senter. Il existe plusieurs jeux de caract\u00e8res, permettant de repr\u00e9senter diff\u00e9rents alphabets du monde. 
Le standard&hellip; Continuer la lecture &rarr;\" \/>\n<meta property=\"og:url\" content=\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\" \/>\n<meta property=\"og:site_name\" content=\"Capdata TECH BLOG\" \/>\n<meta property=\"article:published_time\" content=\"2010-04-23T10:31:37+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2018-01-15T08:49:06+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2010\/04\/unicode.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"125\" \/>\n\t<meta property=\"og:image:height\" content=\"136\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Benjamin VESAN\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u00c9crit par\" \/>\n\t<meta name=\"twitter:data1\" content=\"Benjamin VESAN\" \/>\n\t<meta name=\"twitter:label2\" content=\"Dur\u00e9e de lecture estim\u00e9e\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\"},\"author\":{\"name\":\"Benjamin VESAN\",\"@id\":\"https:\/\/blog.capdata.fr\/#\/schema\/person\/ae56d1d3d5680d95ccc2c927e44bdc3e\"},\"headline\":\"Jeux de caract\u00e8res, Unicode et Base de 
donn\u00e9es\",\"datePublished\":\"2010-04-23T10:31:37+00:00\",\"dateModified\":\"2018-01-15T08:49:06+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\"},\"wordCount\":1200,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/blog.capdata.fr\/#organization\"},\"keywords\":[\"characterset\",\"jeux de caract\u00e8res\",\"nchar\",\"nvarchar\",\"unicode\"],\"articleSection\":[\"G\u00e9n\u00e9ral\",\"MySQL\",\"Oracle\",\"SQL Server\",\"Sybase\"],\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\",\"url\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\",\"name\":\"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG\",\"isPartOf\":{\"@id\":\"https:\/\/blog.capdata.fr\/#website\"},\"datePublished\":\"2010-04-23T10:31:37+00:00\",\"dateModified\":\"2018-01-15T08:49:06+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#breadcrumb\"},\"inLanguage\":\"fr-FR\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\/\/blog.capdata.fr\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Jeux de caract\u00e8res, Unicode et Base de 
donn\u00e9es\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/blog.capdata.fr\/#website\",\"url\":\"https:\/\/blog.capdata.fr\/\",\"name\":\"Capdata TECH BLOG\",\"description\":\"Le blog technique sur les bases de donn\u00e9es de CAP DATA Consulting\",\"publisher\":{\"@id\":\"https:\/\/blog.capdata.fr\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/blog.capdata.fr\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"fr-FR\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/blog.capdata.fr\/#organization\",\"name\":\"Capdata TECH BLOG\",\"url\":\"https:\/\/blog.capdata.fr\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"fr-FR\",\"@id\":\"https:\/\/blog.capdata.fr\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2023\/01\/logo_capdata.webp\",\"contentUrl\":\"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2023\/01\/logo_capdata.webp\",\"width\":800,\"height\":254,\"caption\":\"Capdata TECH BLOG\"},\"image\":{\"@id\":\"https:\/\/blog.capdata.fr\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.linkedin.com\/company\/cap-data-consulting\/mycompany\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/blog.capdata.fr\/#\/schema\/person\/ae56d1d3d5680d95ccc2c927e44bdc3e\",\"name\":\"Benjamin VESAN\",\"url\":\"https:\/\/blog.capdata.fr\/index.php\/author\/bvesan\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/","og_locale":"fr_FR","og_type":"article","og_title":"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG","og_description":"Introduction Un caract\u00e8re est un symbole ou un signe unique appartenant \u00e0 un alphabet. Le jeu de caract\u00e8res est un ensemble de caract\u00e8res \u00e0 repr\u00e9senter. Il existe plusieurs jeux de caract\u00e8res, permettant de repr\u00e9senter diff\u00e9rents alphabets du monde. Le standard&hellip; Continuer la lecture &rarr;","og_url":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/","og_site_name":"Capdata TECH BLOG","article_published_time":"2010-04-23T10:31:37+00:00","article_modified_time":"2018-01-15T08:49:06+00:00","og_image":[{"width":125,"height":136,"url":"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2010\/04\/unicode.jpg","type":"image\/jpeg"}],"author":"Benjamin VESAN","twitter_card":"summary_large_image","twitter_misc":{"\u00c9crit par":"Benjamin VESAN","Dur\u00e9e de lecture estim\u00e9e":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#article","isPartOf":{"@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/"},"author":{"name":"Benjamin VESAN","@id":"https:\/\/blog.capdata.fr\/#\/schema\/person\/ae56d1d3d5680d95ccc2c927e44bdc3e"},"headline":"Jeux de caract\u00e8res, Unicode et Base de 
donn\u00e9es","datePublished":"2010-04-23T10:31:37+00:00","dateModified":"2018-01-15T08:49:06+00:00","mainEntityOfPage":{"@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/"},"wordCount":1200,"commentCount":0,"publisher":{"@id":"https:\/\/blog.capdata.fr\/#organization"},"keywords":["characterset","jeux de caract\u00e8res","nchar","nvarchar","unicode"],"articleSection":["G\u00e9n\u00e9ral","MySQL","Oracle","SQL Server","Sybase"],"inLanguage":"fr-FR","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/","url":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/","name":"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es - Capdata TECH BLOG","isPartOf":{"@id":"https:\/\/blog.capdata.fr\/#website"},"datePublished":"2010-04-23T10:31:37+00:00","dateModified":"2018-01-15T08:49:06+00:00","breadcrumb":{"@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#breadcrumb"},"inLanguage":"fr-FR","potentialAction":[{"@type":"ReadAction","target":["https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/blog.capdata.fr\/index.php\/jeux-de-caracteres-unicode-et-base-de-donnees\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/blog.capdata.fr\/"},{"@type":"ListItem","position":2,"name":"Jeux de caract\u00e8res, Unicode et Base de donn\u00e9es"}]},{"@type":"WebSite","@id":"https:\/\/blog.capdata.fr\/#website","url":"https:\/\/blog.capdata.fr\/","name":"Capdata TECH BLOG","description":"Le blog technique sur les bases de donn\u00e9es de CAP DATA 
Consulting","publisher":{"@id":"https:\/\/blog.capdata.fr\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/blog.capdata.fr\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"fr-FR"},{"@type":"Organization","@id":"https:\/\/blog.capdata.fr\/#organization","name":"Capdata TECH BLOG","url":"https:\/\/blog.capdata.fr\/","logo":{"@type":"ImageObject","inLanguage":"fr-FR","@id":"https:\/\/blog.capdata.fr\/#\/schema\/logo\/image\/","url":"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2023\/01\/logo_capdata.webp","contentUrl":"https:\/\/blog.capdata.fr\/wp-content\/uploads\/2023\/01\/logo_capdata.webp","width":800,"height":254,"caption":"Capdata TECH BLOG"},"image":{"@id":"https:\/\/blog.capdata.fr\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.linkedin.com\/company\/cap-data-consulting\/mycompany\/"]},{"@type":"Person","@id":"https:\/\/blog.capdata.fr\/#\/schema\/person\/ae56d1d3d5680d95ccc2c927e44bdc3e","name":"Benjamin 
VESAN","url":"https:\/\/blog.capdata.fr\/index.php\/author\/bvesan\/"}]}},"_links":{"self":[{"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/posts\/961","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/comments?post=961"}],"version-history":[{"count":26,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/posts\/961\/revisions"}],"predecessor-version":[{"id":6466,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/posts\/961\/revisions\/6466"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/media\/963"}],"wp:attachment":[{"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/media?parent=961"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/categories?post=961"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.capdata.fr\/index.php\/wp-json\/wp\/v2\/tags?post=961"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}