Jiankang,
Thanks for your offer, that’d be extremely helpful. Let me know if we need to document the “bug” in a different way before submitting to Baidu. I’ll be happy to collaborate with you.
On a related note, I received second hand information on how Baidu crawls and indexes websites and how IDNs are treated different, which results in the behavior we just described below. Do you happen to know
any of Baidu’s web-crawling practices?
Thanks again,
Dennis
From: Jiankang Yao [mailto:yaojk@cnnic.cn]
Sent: Tuesday, November 24, 2015 12:41 AM
To: Tan Tanaka, Dennis; ua-discuss
Cc: ua-international@icann.org
Subject: Re: [UA-International] IDN-as-punycode-encoded-label in Baidu search engine results
I can help to talk to baidu and forward your message to them.
Jiankang Yao
From: Tan
Tanaka, Dennis
Date: 2015-11-24 05:45
Subject: [UA-International] IDN-as-punycode-encoded-label
in Baidu search engine results
Often times I hear that IDNs are not indexed by certain search engines. While I know this is not true, the example below doesn’t help my case either (at least not 100%). Here is an example where the IDN I’m looking
for is showing up in the first 5 search results on Baidu (see picture below). However, the string is displayed as the punycode-encoded label instead of the corresponding Chinese IDN (i.e. xn--ebr05n.com) .
Google and Yandex appear to work as expected. Bing didn’t display the domain name in the results (first two pages).
Is there someone interested (and with the language skills) in taking the action item to reach out to Baidu? This might be in the form of opening a bug ticket to explain the problem (IDN is displayed as punycode-encoded
label. Example: xn--ebr05.com) and what the expected result should have been (IDN displayed as Chinese domain nam. Example:
墨刀.com).

|
|
|
|
Dennis Tan
Naming Services
|
|