fontU-download1_600.jpg (26111 bytes)
 
Gurmukhi via Unicode
By Amrinder Singh

Ever since the invention of the computers, English has been the predominant language. All software programs and Operating Systems such as Microsoft Windows, Mac OS, and Unix/Linux are in English. Punjabi writers could not take advantage of the Computer because of fonts. Since the 1990s, the boom in sales Personal Computers allowed the average person to start and learn how to use a computer. By that time, Punjabis had already begun to experiment with Gurmukhi font development. Gurmukhi fonts such Anandpursahib, Sukhmani, and fonts by Dr. Kulbir Singh Thind started to appear. All these fonts till now have been using the American Standard Code for Information Interchange (ASCII) which is a standard adopted by International Organization for Standards (ISO).

The standards at that time adopted encoding from various languages. But due to the limitations of computer memory and storage and 7 bit (later 8 bit) structure, support for many languages was not included. So fonts at that time had to be developed using the ASCII standard. Since there was no standard for Gurmukhi script, font developers created their fonts without any standard keyboard layout. Another problem with using fonts that are without a standard is that users needed to download and install the fonts. Font conversion was very time consuming when done by hand. To solve the limitations of 255 characters in ASCII and add support for all languages, Unicode was developed.

Unicode was developed to specify a code for every single character needed by every human language. Instead of 8 bit scheme which only allowed for (28 - 1) 255 characters, Unicode allows for 1114112 (216 + 220) code points. 65,535 of the code points are reserved for characters and 1,048,576 code points are reserved for Unicode Transformation Format (UTF). This allows for applications that are not designed for Unicode to allow users to use Unicode. Gurmukhi and English characters now have separate code, where as in ASCII, Gurmukhi and English characters shared the code points. Unicode allows for every single character to have its own unique code. This gives huge advantages to other languages and allows the users to use those languages just about every where they can use English. For this reasons Unicode has been becoming very popular in the recent years.

Unicode allows users to communicate in writing without having to worry about fonts just like in English. For example, when I was writing this article, I did not worry about choosing which font to write in. I knew that other users will be able to read this article without any problems. But if I was writing an article in Gurmukhi using ASCII fonts, I would have to decide which font the readers would have installed on their machines. But when using a Unicode Gurmukhi font, users would not have to decide which to font to use.

With these advantages, users can write e-mails directly in Gurmukhi. With ASCII, a user would have to type the message in a word processor such as Microsoft Word and then attach it to the e-mail message. But with Unicode, a user can type Gurmukhi in the e-mail message just like English with the push of two buttons on the keyboard. (Alt and Shift by default). There is no need for attachments now. Users can now even name files and folders on their computers in Gurmukhi and other languages. Also, scanning software used to convert printed documents into a word processor comes in hundreds of languages but it is very disappointing to know that it does not include Gurmukhi support because we did not have a standard for Gurmukhi font. But now there is hope that software companies that make Optical Character Recognition software will support Gurmukhi Unicode fonts are script based so they do not break up when used on a website like traditional Gurmukhi ASCII fonts. In Unicode, characters that take up very little space, are given a space holder (Note the dotted circle). Web developers now don't have to worry about the visitors to the website having fonts instilled. As being a web developer myself, I know this can be an issue even when using font embedding due to Internet browser compatibly.

If we look in the Fonts folder in our computers, we have hundreds of English fonts. No matter which English font a user chooses, they can type the letter "A" using the same keyboard key. Now with the help of Unicode, keyboard layouts for Gurmukhi are standardized by the Unicode Consortium. Font developers can now create virtually unlimited number of fonts without any keyboard layout issues as long as they comply the Unicode standard for Gurmukhi. Click on this link for the standard

http://www.unicode.org/charts/PDF/U0A00.pdf

Now with Unicode fonts, only the look of the characters in the font will change, not the keyboard layout.

Now let's discuss about the characters in the Unicode standard. I have been reading letters posted by viewers on this website. Some of the users have asked questions about the characters and the keyboard layout.

Unicode Consortium has given code points to Gurmukhi script. Some of the characters that are used to type Sri Guru Granth Sahib Ji are not assigned. We can request the Unicode Consortium to assign these characters and add those to the standard chart.

Another concern for most users is typing in a Unicode font. Since the characters use a new keyboard layout, it is difficult to type in Unicode. It is difficult to master that layout because the keys are everywhere. Not only do you have to look at the Shift state but also Control Alt state. But again this problem is very easy to solve. Microsoft allows a user to create their own keyboard layout without altering the actual code. It just acts as an interpreter between the actual code and the user defined code.


Font Test: New Punjabi Pages

mwnv cyqnwieiqhwsigAwn/ivigAwn,   Awm jwxkwrI, kivqw, khwxI


darya1.jpg (3265 bytes)Back to Home Page

(c)copyright 1999-2004 5abi Limited