Alois Treindl said:
I have analyzed to what extent TZ database represents the time zone information correctly, for these data. I think the results are of some interest for the TZ community. The overall sums are:
tz_count   24'674'767  data records, 100%
tz_irange  23'675'636  in time range and region which TZ covers correctly, 96%
tz_good       736'769  in time range and region where TZ is unreliable, but correct, 3%
tz_bad        262'362  in time range and region where TZ database gives false result, 1%

Can you please explain how you've done this analysis. In particular, how do you know that TZ is "unreliable, but correct" or gives a bad result?

As I said, we have other sources which we use besides TZ data.
These sources are used for the pre-1970 cases where we know that
TZ data is incomplete.
I give you an example:
This incorrectness of the TZ database for Germany applies only to those few months in 1945. But we say 'West Germany before 1946, do not use TZ database'.
the results for Germany are:
GER 1202363 95.4 4.4 0.2 Germany
this means: we have 1'202'363 cases.
95.4% of these fall either after 1 Jan 1946, or are in area
described correctly by Europe/Berlin. For these we use TZ
database.
4.6% of cases do not fall into this category, and we do not use the
TZ database for them.
If we did use the TZ database for these 4.6% of cases, 4.4% would
come out correct anyway, but 0.2% would come out bad.
These are 0.2% of those data records which our users have entered
into our database using our 'automatic time zone' setting. They
could have used 'manual time zone', and I have ignored those
records for the statistics.
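
To make the three categories concrete, here is a rough sketch of how such a tally could be done. It is not our actual code: the names TRUSTED_FROM, classify and percentages are invented for illustration, offsets are assumed to be plain minutes east of UTC, and the rule is simplified to a single cutoff date per region (the real rule also looks at the area within the country).

```python
from collections import Counter
from datetime import date

# Hypothetical rule table: region code -> first date from which the
# TZ database is trusted. Regions not listed are assumed trusted always.
TRUSTED_FROM = {"GER": date(1946, 1, 1)}


def classify(region, day, tz_offset, reference_offset):
    """Return 'irange', 'good' or 'bad' for a single data record.

    tz_offset is what the TZ database gives, reference_offset is what
    the other source gives (both in minutes, an assumption of this sketch).
    """
    cutoff = TRUSTED_FROM.get(region)
    if cutoff is None or day >= cutoff:
        return "irange"  # TZ is used directly
    # TZ is not used here; we only check what it would have given.
    return "good" if tz_offset == reference_offset else "bad"


def percentages(counts):
    """Convert category counts into percentages with one decimal."""
    total = sum(counts.values())
    return {key: round(100.0 * n / total, 1) for key, n in counts.items()}


# Invented example records, not real data.
records = [
    ("GER", date(1950, 1, 1), 60, 60),
    ("GER", date(1945, 6, 1), 120, 120),
    ("GER", date(1945, 6, 1), 120, 180),
]
tally = Counter(classify(*rec) for rec in records)
```

The percentages in the GER line are then just each category count divided by the total number of cases for that country.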