Here are some papers that I read this week, in the CS and Stat categories, plus random stuff that was mentioned on IRC or Hacker News.
I used Bland-Altman plots a lot when I was working on diagnostic tests and psychometric measurement. Interestingly, few medics really understood how to interpret this kind of graphical display, or how it is used in method comparison studies. I suspect they also dislike this approach because it does not provide formal hypothesis tests. Here we go, with three tests for accuracy, precision and agreement.
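For readers who have never built one, here is a minimal sketch of the quantities behind a Bland-Altman plot: the pairwise means (x-axis), the pairwise differences (y-axis), the bias, and the usual 95% limits of agreement. It does not implement the three tests from the paper. The function name `bland_altman` and the sample readings are made up for illustration, and the 1.96 multiplier assumes the differences are roughly normal.

```python
import numpy as np

def bland_altman(a, b):
    """Return the quantities needed to draw a Bland-Altman plot.

    a, b: paired measurements of the same subjects by two methods.
    """
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    mean = (a + b) / 2          # x-axis: average of the two methods
    diff = a - b                # y-axis: difference between the methods
    bias = diff.mean()          # systematic difference
    sd = diff.std(ddof=1)       # sample SD of the differences
    # 95% limits of agreement, assuming approximately normal differences
    loa = (bias - 1.96 * sd, bias + 1.96 * sd)
    return mean, diff, bias, loa

# Hypothetical paired readings from two measurement methods
method_a = [10.2, 11.5, 9.8, 12.1, 10.9]
method_b = [10.0, 11.9, 9.5, 12.4, 10.6]
mean, diff, bias, loa = bland_altman(method_a, method_b)
print(f"bias = {bias:.3f}, limits of agreement = ({loa[0]:.3f}, {loa[1]:.3f})")
```

Plotting `diff` against `mean` with horizontal lines at `bias` and at the two limits gives the familiar display. Reading it is a matter of judging whether those limits are clinically acceptable, which is precisely the part that no formal test settles.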
Fond memories of Jan de Leeuw’s own work (and of the Gifi nom de plume).
A simple yet efficient graphical method for assessing the significance of p-values under the Benjamini-Hochberg approach.
The FDR is a property of a set of p-values, not of any individual value.
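To make that last point concrete, here is a minimal sketch of the standard Benjamini-Hochberg step-up rule (not the graphical method from the paper). The function name `benjamini_hochberg` and the toy p-values are mine. The rule sorts the p-values, compares the i-th smallest to q·i/m, and rejects everything up to the largest index that passes.

```python
import numpy as np

def benjamini_hochberg(pvalues, q=0.05):
    """Boolean mask of hypotheses rejected by the BH step-up rule at level q."""
    p = np.asarray(pvalues, dtype=float)
    m = p.size
    order = np.argsort(p)                      # indices of p-values, ascending
    thresholds = q * np.arange(1, m + 1) / m   # q * i / m for i = 1..m
    below = p[order] <= thresholds
    rejected = np.zeros(m, dtype=bool)
    if below.any():
        k = np.nonzero(below)[0].max()         # largest rank passing its threshold
        rejected[order[:k + 1]] = True         # reject all hypotheses up to that rank
    return rejected

# Toy example: 0.02 exceeds its own threshold (0.05 * 1/3) but is still
# rejected, because 0.024 passes the threshold for rank 2 (0.05 * 2/3).
print(benjamini_hochberg([0.02, 0.024, 0.5], q=0.05))
```

Whether a given p-value ends up rejected depends on the others in the set, which is why it makes little sense to speak of the FDR of a single p-value.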
This article deals with Fréchet regression on network data, using the corresponding space of graph Laplacians (with no restriction on their rank). It has two applications (NY taxi trips after COVID-19 and dynamic networks in aging brains) which I found quite interesting. Such an approach might prove superior to classical data mining on spatio-temporal data structures since we can capture both the structure and the weight information of a network given relevant covariates, and assess the quality of the fit for projected trends.
Not of much interest to me, except that it deals with potassium reference levels, a subject I have been familiar with (for personal reasons, unfortunately) for several years now. This remains, however, an interesting simulation study for mixed-effects models and big data aficionados.
This paper introduces a Stata command for comparing empirical CDFs. Contrary to the Kolmogorov-Smirnov approach, distcomp tests the equality of the distribution functions point by point, using appropriate FWER control, and displays the ranges of values in which the difference between the distributions is statistically significant.