In
multilingual computer systems, different logical characters may have identical or very similar appearances. For example,
Unicode character U+0430,
Cyrillic small letter a ("?"), can look identical to Unicode character U+0061,
Latin small letter a, ("a") which is the lowercase "a" used in English. Technically, characters that look alike in this way are known as
homographs (strictly,
homoglyphs).
Spoofing attacks based on these similarities are known as
homograph spoofing attacks.
The problem arises from the different treatment of the characters in the users mind and the computer's programming. From the viewpoint of the user, a Cyrillic "?" within a Latin string is a Latin "a"; there is literally no difference in the glyphs for these characters in most fonts. However, the computer treats them differently when processing the character string as an identifier. Thus, the user's assumption of a one-to-one correspondence between the visual appearance of a name, and the named entity, breaks down.
In a typical example of a hypothetical attack, someone could register a domain name that appears identical to an existing domain but goes somewhere else. For example, the spoofed domain "p?ypal.com" contains a Cyrillic a, not a Latin a. In many ways, this is not a new thing. For example, even staying within the old character set of A-Z, 0-9 and hyphen, G00GLE.COM looks much GOOGLE.COM in some fonts; or, using a mix of uppercase and lowercase characters, googIe.com (capital I, not small ell) looks much like google.com in some fonts. Or, displaying characters in lowercase alone, rnicrosoft.com (" RNICROSOFT.COM") looks very much like microsoft.com in many fonts. What is new was that the expansion by the internationalized domain name system of the character repertoire from a few dozen characters in a single alphabet to many thousands of characters in many scripts greatly increased the scope for homograph attacks.
more..--
h.o.s.a.m.r.e.d
Be OpeN Mind
--
http://hosamred.blogspot.com