After attending months of Twitter discussion about what could be the best software–R or Python–for data science several months ago, this is now the time of the R vs. Stata debate, here and there. Arguably, Stata is a paid software and does not offer the same scripting facilities than R for some tasks, mainly non-statistical tasks. However, what’s the point? Did anyone ever mentioned the fact that Stata has a GUI which completely mimics the command-line operations, so that people afraid of typing commands or just interested in running a logistic regression on a well-formed dataset can just do it in under a minute? It is slow with some estimators or optimization approaches (e.g., gglamm), and we had to wait a bit long to get full support for unicode and XLS, better graphical rendering, etc. But the versioning system allows to repoduce any result prior to the current version of Stata. And it does interact very well with Stan and R, too. The question is not which software is better, the real question is who’s the end user? #rstats #stata