Unicode is an industry standard whose goal is to provide the means by which text of all forms and languages can be encoded for use by computers through a single character set. |
Caighdeán tionscail is ea Unicode agus an sprioc atá aige ná bealach a sholáthar chun téacs den uile shórt agus i ngach uile theanga a ionchódú i bhfoireann carachtar amháin lena úsáid ar an ríomhaire. |
Originally, text-characters were represented in computers using byte-wide data: each printable character (and many non-printing, or "control" characters) were implemented using a single byte each, which allowed for 256 characters total. |
I dtús báire, léiríodh carachtair téacs ar an ríomhaire mar shonraí beart-leithid: is é sin le rá, léiríodh gach carachtar inphriontáilte (agus go leor carachtar neamh-inphriontáilte, nó "carachtair rialacháin") in aon bheart amháin, scéim a cheadaigh 256 carachtar san iomlán. |
However, globalization has created a need for computers to be able to accommodate many different alphabets (and other writing systems) from around the world in an interchangeable way. |
Ach, mar gheall ar an domhandú, ní mór do ríomhairí déileáil le go leor aibítrí éagsúla (agus córais scríofa eile) ar fud an domhain ar bhealach inmhalartaithe. |
The old encodings in use included ASCII or EBCDIC, but it was apparent that they were not capable of handling all the different characters and alphabets from around the world. |
I measc na sean-ionchóduithe a bhí in úsáid, bhí ASCII agus EBCDIC ann, ach ba ríléir nach raibh siad in ann na carachtair agus na haibítrí go léir ar fud an domhain a láimhseáil. |
The solution to this problem was to create a set of "wide" 16-bit characters that would theoretically be able to accommodate most international language characters. |
An réiteach a bhí ar an bhfadhb seo ná foireann nua de charachtair “leathana” 16-giotán a chruthú, foireann a bheadh in ann déileáil leis an gcuid is mó de na carachtair idirnáisiúnta, go teoiriciúil. |
This new charset was first known as the Universal Character Set (UCS), and later standardized as Unicode. |
Tugadh “Universal Character Set” (UCS) ar an bhfoireann carachtar seo ar dtús, rud a caighdeánaíodh mar Unicode níos déanaí. |
However, after the first versions of the Unicode standard it became clear that 65,535 (216) characters would still not be enough to represent every character from all scripts in existence, so the standard was amended to add sixteen supplementary planes of 65,536 characters each, thus bringing the total number of representable code points to 1,114,112. |
Ach, tar éis na chéad leaganacha den chaighdeán Unicode, ba léir nár leor 65,536 (2^16) carachtar chun gach carachtar i ngach script atá ann a léiriú. Dá bharr sin, leasaíodh an caighdeán chun sé phlána dhéag fhorlíontacha a bhfuil 65,536 carachtar i ngach ceann acu a chur leis. Mar thoradh air seo, tá 1,114,112 pointe cóid inléirithe ann anois. |
To this date, less than 10% of that space is in use. |
Sa lá atá inniu ann, tá níos lú ná 10% den spás sin in úsáid. |