Hello David, Were pairs like 1 and l , 0 and o in latin script analyzed? (it might give better perception of what it going on and if the confusion is limited to IDNs or is it a general issue) Sincerely Yours, Maxim Alzoba Special projects manager, International Relations Department, FAITID m. +7 916 6761580(+whatsapp) skype oldfrogger Current UTC offset: -5.00 (Panama)
On 27 Jun 2018, at 10:56, David Conrad <david.conrad@icann.org> wrote:
I suspect that might be triggered by:
https://info.farsightsecurity.com/farsight-idn-research-report?utm_source=ne... <https://info.farsightsecurity.com/farsight-idn-research-report?utm_source=ne...>
From the executive summary:
Among the key findings:
100M total IDN resolutions observed; 27M unique fully qualified domain names (FQDNs)
8,000 IDN homographs representing or containing a top global brand name
Unicode “confusables” make up a significant percentage of the characters found in IDNs; 91% of all characters observed in IDN homographs are considered “confusable” -- a “confusable” is a Unicode code point that is often easily confused with other characters, ligatures, and/or digraphs.
Brands in banking and other related sectors are frequently imitated using IDN homographs with ~750 unique FQDNs observed per month
91% of IDN homographs offered some sort of webpage
We found clear violations of the ICANN Guidelines for the Implementation of Internationalized Domain Names
66% of all IDN homograph IP addresses were found to be geolocated in the United States
93% of IDN homograph FQDNs had IPv4-based address records
Regards, -drc
On Jun 27, 2018, at 10:27 AM, Andrew Sullivan <ajs@anvilwalrusden.com <mailto:ajs@anvilwalrusden.com>> wrote:
I see, via Slashdot, that the BBC is once again promoting this problem:
https://it.slashdot.org/story/18/06/26/2031212/scammers-abuse-multilingual-d... <https://it.slashdot.org/story/18/06/26/2031212/scammers-abuse-multilingual-d...>
A -- Andrew Sullivan ajs@anvilwalrusden.com <mailto:ajs@anvilwalrusden.com>