Thank you. That is just the sort of input needed for Computer Science/ICT/IT Curricula internationalization. The unfortunate reality is that currently the vast majority of students graduate knowing only ASCII text processing/storage/transformation/transmission because that is all they are taught. So, no chance of graduates understanding Unicode or normalization forms. André Schappo
On 29 Jun 2017, at 15:23, Andrew Sullivan <ajs@anvilwalrusden.com> wrote:
On Thu, Jun 29, 2017 at 02:06:31PM +0000, Andre Schappo wrote:
Which internationalization topics would you like covered in Computer Science/ICT/IT Curricula?
I think it would be really good if graduates had a clear idea of how Unicode worked, what the differences are between (e.g.) a character and a code point or sequence of them, what the properties are, how to access them, normalization, and so on. Even in this group we sometimes struggle because people forget that the Unicode properties are what determine a given code point, and have stumbled over normalization forms. It's amazing to me, for instance, that we have to keep telling people to normalize user-generated text input before storage. (A later-year student would get a failing grade if s/he didn't check input before blindly handing it to the database, and yet we don't have the same reaction when NF* isn't immediately used on the same input.)
Best regards,
A
-- Andrew Sullivan ajs@anvilwalrusden.com