The Good, The Bad and The Ugly

Statistics for an Excellent Dataset

Statistics for a Pretty Good Dataset

Statistics for Pretty Bad Dataset

Methods to Calculate Data Quality Statistics

R linear = SUM ( ABS(I - I(mean)) ) / SUM (I)

R square = SUM ( (I - I(mean)) ** 2 ) / SUM (I ** 2)

Chi**2 = SUM ( (I - I(mean)) ** 2 ) / (Error ** 2 * N / (N-1) ) )

R linear is the same as Rmerge when the data is reduced in P1.
Otherwise R linear is the same as Rsym.
For reflections with only a single measurement (N=1), these statistics are not computed.

According to the HKL2000 Manual: Chi**2 is the weighted ratio of the difference between the observed and average value of I and I(mean), squared, divided by the square of the error model times a factor correcting for the correlation between I and I(mean). It depends on an explicit declaration of the expected error in the measurement. So the user of the program is part of the Bayesian reasoning process behind the error estimation.

Loren's comment: But the equation given does not indicate that there is a weight. And introducing user bias into your error analysis is nothing to brag about.

Starting on page 89 of the HKL2000 is a discussion of these statistics.

HKL2000 Manual (pdf file)

After looking at the statistics you may be left wondering what "nonunf" means. Who knows? It is not mentioned in the HKL2000 Manual. If you do know what it means, please tell Loren.