Here is the latest bag of tweets^{*}, which covers **July 2014**.
(*) These are interesting news that I found on Twitter and that I archive periodically.
* stefanjudis: By just watching what @getify does in his repo `You-Dont-Know-JS` on Github you can learn a lot about #javascript. (27 Jul)
* catherinedevlin: My #ipython ddlgenerator lightning talk notebook: (26 Jul)
* stephensenn: Thanks to @learnfromerror "Responder Despondency: Myths of personalised medicine" Added bonus: zombie statistics! (26 Jul)
* moorejh: RT @sethdobson: Model Validation: Interpreting Residual Plots in R. #datascience #rstats (26 Jul)
* maxrzepka: Awesome visual intro to Clojure core.async by @ericnormand (26 Jul)
* albertocairo: Nice small multiples by @HuffPostGraphic's @jadiehm #infographics #dataviz (26 Jul)
* b__k: My paper, "A Useful Algebraic System of Statistical Models", is now an official research report from @uscensusbureau. (26 Jul)
* kwbroman: Interactive diagnostic chart for linkage between pairs of genetic markers #D3js #rstats (26 Jul)
* ian_soboroff: Awesome blog on hacking machine learning code at . (26 Jul)
* twiecki: Slides of my #ep14 talk on Probabilistic Programming in Python: (with link to video). @quantopian (25 Jul)
* newsycombinator: Mastering Emacs in one year (25 Jul)
* kdnuggets: MLlib: Apache Spark component for machine learning (25 Jul)
* alexhanna: .@randyzwitch at @BadHessian: ggplot v #rstats v plot.ly v seaborn v gadfly v matplotlib. The winner may surprise you (25 Jul)
* juliabloggers: Julia for Data Science #julialang (25 Jul)
* takluyver: @emble64 IPython does R now too ;-). A bit awkwardly at present, but it will be smoother in the next release. (24 Jul)
* heavysixer: New release of d4 (0.8.6), which adds mouse tracking to line series charts (24 Jul)
* YhatHQ: Quantitative Economics + Python (pdf) | (24 Jul)
* YhatHQ: New Yhat Blog Post! | Fuzzy Matching with Yhat | @theglamp (24 Jul)
* YhatHQ: Random forests | The Shape of Data | (24 Jul)
* albertocairo: Oh, I forgot the most successful post ever: 2012- Recommended resources for my infographics and visualization courses (24 Jul)
* newsycombinator: Why I'm Productive in Clojure (24 Jul)
* d3visualization: Forio Contour: Interactive charts lib made with #d3js (24 Jul)
* HNTweets: Hemingway App now available: Comments: (24 Jul)
* YhatHQ: Geospacial Data with Python - SciPy (videos) | (24 Jul)
* dzerbino: @pathogenomenick ? (24 Jul)
* wellcometrust: Discovering data - new models proposed for effective data sharing in epidemiology: #data (24 Jul)
* treycausey: Confidence Intervals for Random Forests: The Jackknife & the Infinitesimal Jackknife (Wager, Hastie, Efron 2014 PDF) (23 Jul)
* RevoJoe: #rstats A case study on meta-analysis, reproducible research and having the right tools for literate programming: (23 Jul)
* rstudio: Announcing Packrat v0.4 (now with RStudio IDE integration) #rstats (23 Jul)
* juliabloggers: Cobbling together parallel random number generation in Julia #julialang (23 Jul)
* mja: tidyr looks to be a more readable alternative to reshape and melt/cast in #rstats (23 Jul)
* RevoJoe: #rstats Looks like a good bit of R content at the upcoming ASA Biopharmaceutical FDA Industry Statistics workshop (22 Jul)
* rstudiotips: Type the same commands a lot? Press Ctrl + Up (OSX: Cmd + Up) at the console prompt to see and reuse recent commands. (22 Jul)
* johnmyleswhite: Here's a quick post in which I implore people to stop using arithmetic on data.frame objects in #rstats: (22 Jul)
* kwbroman: Blog: UseR 2014, days 3-4 (20 Jul)
* mattmight: Understand and implement laziness with examples in Scala, JavaScript, Swift and Racket (20 Jul)
* juliabloggers: Julia syntax features #julialang (20 Jul)
* nonrecursive: finally cleaned up my .emacs.d and put it online: . used most of @technomancy's - thanks! (20 Jul)
* mattmayo13: Fantastic data generator: (via @diou) #CS #DataScience #DataMining #Data #Databases #DB #GradSchoolSolutions #SWEng (20 Jul)
* gd047: Is difference in proportion appropriate measure to compare performance of a drug over another one? #rstats (20 Jul)
* aficionado: A Lisper's first impression of Julia (20 Jul)
* treycausey: When people stop being polite and start getting real (in GitHub issues). (20 Jul)
* inesgn: 'If I had one piece of advice (for beginner data scientists)' - take more matrix algebra.' Interview with @treycausey (20 Jul)
* mfenner: How Frequently Do the Results from Completed US Clinical Trials Enter the Public Domain? - A Statistical Analysis of… (20 Jul)
* debasishg: Very good roadmap by @bitemyapp if u want to learn Haskell. I would just add @sigfpe's blog in the advanced parts .. (20 Jul)
* AndrewMarritt: A very good intro to survival analysis: (19 Jul)
* YhatHQ: A Practical Introduction to Information Entropy by Kevin Davenport | (blog post) (IPynb) (19 Jul)
* zevross: For loops get such bad press in #rstats that I actually feel guilty when I use one. See for a good discussion. (19 Jul)
* target_learning: "Entering the Era of Data Science: Targeted Learning & the Integration of Statistics & Computational Data Analysis" (18 Jul)
* askdrstats: Full pdfs available on the CRAN mosaic package page #rstats distributed under a @creativecommons license @AmeliaMN (18 Jul)
* TeXtip: A little macro that defines \given to be the conditional probability symbol: \def\given{\,|\,} (18 Jul)
* YhatHQ: R Wikibooks - a very excellent R resource. Checkout the long list of topics in the right sidebar! | (17 Jul)
* datassist: More awesome truth and beauty #dataviz from Mortiz Stefaner in the Visual Complexity companion Book of Trees (17 Jul)
* ogrisel: OpenBLAS 0.2.1 is out! with many optimized kernel fixes: (17 Jul)
* kaz_yos: page 50: Standardization, IPW, stratification/restriction, matching are approaches to estimate average causal effects (17 Jul)
* twiecki: George -- a new Gaussian Process regression library in Python by @exoplaneteer (17 Jul)
* johnb30: If you’re doing text stuff in Python make sure to check out gensim (). Topic Modeling for Humans by @RadimRehurek. (16 Jul)
* Stata: The program for the 2014 UK #Stata Users Group meeting is now posted at . We hope to see you in London in September! (15 Jul)
* RDataMining: A nice introduction into Classification and regression trees (15 Jul)
* drewconway: What is deep learning, and why should you care? (15 Jul)
* ErgoEmacs: List of Emacs Starter Kits (14 Jul)
* __DataTau__: Feature Scaling and Normalization and the effect for machine learning algorithms: (14 Jul)
* __DataTau__: Using Python's sci-packages to prepare data for Machine Learning: (14 Jul)
* dloss: Lessons learned from 16yrs of teaching lab skills for scientific computing (Unix, version ctrl, progr, SQL) #mustsee (14 Jul)
* eagereyes: Perhaps now I’ll finally be able to figure this thing out. (14 Jul)
* patrickDurusau: RP Medical Vocabulary #vocabulary #medical #JohnDCook (14 Jul)
* deleeuw_jan: Flattering. But I'll take it. (14 Jul)
* jakevdp: MT @pkgw: Hot fire from the Arxiv: "It is Time to Stop Teaching Frequentism to Non-statisticians" (14 Jul)
* CompSciFact: Learn Lisp the Hard Way // Not by Zed Shaw but it uses his template. (14 Jul)
* DiegoKuonen: Statistical Scientist Meets Philosopher of #Science: David Cox <-> Deborah Mayo (@learnfromerror) PDF > #Statistics (14 Jul)
* rasbt: 8 great data blogs to follow (14 Jul)
* shiffman: Corrected link to follow progress and report issues for Learning Processing 2nd Edition. (14 Jul)
* johnmyleswhite: Need to try out @RickWicklin's trick of "log"-transforming potentially negative values: (14 Jul)
* zentree: @grserrano_ @inesgn On a similar vein "R as a second language" & "R is a language" (14 Jul)
* ptaoussanis: Grimoire, a new community-driven #Clojure docs/examples site: Seems cool! Rationale here: (14 Jul)
* ucsantabarbara: @JuliaLanguage, made by alum Stefan Karpinski & Viral Shah, gives a glimpse into potential programming of the future (14 Jul)
* victoriastodden: I've been collaboratively building a library of references for reproducibility in @zotero - please use and/or add! (13 Jul)
* ErgoEmacs: why you shouldn't use dotimes and cons. Guy Steele on Parallel Programing: Get rid of cons! #lisp (13 Jul)
* __DataTau__: Flambo - A Clojure DSL for Apache Spark: (13 Jul)
* zentree: "Sometimes I feel (some) need for speed [in #Rstats]" in Quantum Forest. New and improved with Rcpp goodness! (13 Jul)
* twiecki: Porting Kruschke's "Doing Bayesian Data Analysis" to PyMC3: (12 Jul)
* StephenPiment: Outstanding area for data science @SardelichM For those intetest in #ML and #cancer research, an excellent review: … (12 Jul)
* kjhealy: Cosma Shalizi conveys A Statement from the Editorial Board of the Journal of Evidence-Based Haruspicy. (12 Jul)
* chapmanb: Notes Bioinformatics Open Source Conference #BOSC2014 day 1 afternoon Visualization, project updates, lightning talks (12 Jul)
* octonion: Introduction to Julia - Part 1 - (11 Jul)
* UnixToolTip: You can drag a file and drop it on an Emacs dired window to copy the file into the dired directory. (11 Jul)
* zentree: "A random walk through Mathematica 10" by @walkingrandomly (11 Jul)
* mja: Mendelian randomization study shows that moderate drinking does **not** lower risk of coronary heart disease. (11 Jul)
* visualisingdata: Cross-posting from @Seeing_Data: 8 Articles Discussing Visual and Visualisation Literacy (11 Jul)
* analyticbridge: Item Response Theory and Item Information Exploration: (This article was first published on Econometrics ... (11 Jul)
* StatMLPapers: Learning Probabilistic Programs. (arXiv:1407.2646v1 [cs.AI]) (11 Jul)
* github: Introducing a simpler, faster GitHub for Mac (11 Jul)
* newsycombinator: Learn Lisp The Hard Way (11 Jul)
* lighttable: 0.6.7 is out today with some important bug fixes. Thanks to everyone who contributed! Read the release post here: (10 Jul)
* revodavid: .@revojoe reviews John Chambers’ keynote address on history of R at #user2014 #rstats (10 Jul)
* DrBunsen: Testing the M+ font today. Unsure it can usurp the venerable Letter Gothic, but I like it: (10 Jul)
* xieyihui: R 3.1.1 is released. Code name "Hello Radford Neal" #rstats (10 Jul)
* thosjleeper: @hadleywickham It appears your call has been answered. (10 Jul)
* arnicas: Great post. RT @evalu8r: Not happy I had to write this new blog post: An Incomplete List of Females in #dataviz : (10 Jul)
* benhamner: 10 R packages to help you win @kaggle competitions - nice presentation from Xavier Conort (10 Jul)
* moorejh: The looming challenges of genetic counseling #genomics #genetics (10 Jul)
* JohnDCook: Journals could have two sections: 1) Papers published so the authors can say they've published. 2) Papers people should read. (9 Jul)
* jonathanstray: The open-source document mining software my team has been working on now reads pretty much every file type. (9 Jul)
* StatFact: Collaborative Interactive ggplot2 and R plots (9 Jul)
* patrickDurusau: RP EuroClojure 2014 (notes) #Clojure #EuroClojure (9 Jul)
* onethingwell: Screenmailer - Record screen and voice. Share video via private link. (9 Jul)
* jmsidhu: Visualizing MBTA Data > Nice assembly of #dataviz elements, story and interactivity. FYI @MTLawrence @noahi (9 Jul)
* d_spiegel: wow, a report on the future of statistical science! (9 Jul)
* getpy: "Using Docker as a Python Development Environment": (9 Jul)
* IanDworkin: @IanDworkin link to the data and scripts on github for the paper: (9 Jul)
* floss4science: This Github repository contains lists of world countries in JSON, CSV and XML (9 Jul)
* dataJujitsu: Tool to download all meta-analysis data from the @CochraneLibrary of Systematic reviews (need institutional access) (9 Jul)
* stephensenn: @zentree @david_colquhoun Be careful! Here's why even Bayesians should not (naively) use Bayes theorem for screening (9 Jul)
* enjalot: i'm pretty excited about these interactive d3.geo slides for tomorrows #d3js @maptimeSF session (click the numbers!) (9 Jul)
* vsbuffalo: Forgot to announce: R chapter of my book is available on early release. I love R and said too much — tell me parts that are too in depth! (9 Jul)
* ethanwhite: Some of IPython to become @ProjectJupyter to reflect that it supports #rstats, @JuliaLanguage, and more! (9 Jul)
* TeXtip: “There is only one large computer program I have used in which there are to a decent approximation 0 bugs: Don Knuth's TeX.” -- Jaap Weel (9 Jul)
* YhatHQ: Dynamic Time-Series Modeling | post by Kevin Davenport | @KevinLDavenport (8 Jul)
* albertocairo: Reminder of recent tools: @plotlygraphs @datavisualinfo @blockspring (8 Jul)
* sethdobson: Nice slide presentation on Data Munging in R -> #datascience #rstats (8 Jul)
* kjhealy: Thing is, Harvard Psych has many excellent people who do strong in-lab pre-pub replications of their own work. (8 Jul)
* IanMulvany: some nice tutorials for using plot.ly (8 Jul)
* albertocairo: Folks, don't forget to add to your RSS f. Updated by @eagereyes #dataviz #infographics #visualization #dataJournalism (8 Jul)
* efazati: awesome-python: A curated list of awesome #Python frameworks, libraries and software (8 Jul)
* fonnesbeck: @Drgloo and then there's this: (8 Jul)
* johnmyleswhite: No comment necessary: (7 Jul)
* newscientist: We can learn from Paris to make cities more chilled as the climate warms. (7 Jul)
* dartdog: looks like an interesting Web site builder (runs on Google App-engine) (7 Jul)
* InterestingLit: Henry Miller's writing schedule: (7 Jul)
* sgsfak: @hadleywickham How about ? (7 Jul)
* stuartcantrill: Awesome referee comment of the day: "Please eliminate 'We believe'. This is science. Nobody cares what you believe." (7 Jul)
* inesgn: @jleek @bcaffo @rdpeng See what the layman in R can do (App) (slides)-> (7 Jul)
* gnuhealth: #GNUHealth 2.6.0 is out ! #Tryton (7 Jul)
* wjrl59: @paulblaser @chlalanne @moorejh Novel network vis, but still drawing nodes as points! How about lines? (6 Jul)
* rasbt: I am not great at #regex, so I started to collect them copy&paste-ready an IPython notebook: (6 Jul)
* paulblaser: RT @moorejh: Novel approaches to network #visualization: PDF #dataviz #bioinformatics #datascience... (6 Jul)
* ErgoEmacs: Intro to Mathematica Pattern Matching for Lisp Programers (5 Jul)
* RevolutionR: RT @kdnuggets: plyrmr, package for making R work seamlessly with the #Hadoop system #rstats (4 Jul)
* hmCuesta: Mining Big Data using Weka 3 (4 Jul)
* teddy777: The Quantification of Behavioral Analytics in Psychiatry (4 Jul)
* strataconf: #Strataconf + #HadoopWorld #DataScience track features @jeroenhjanssens @hadleywickham @fperez_org @hmason @cdubhland (4 Jul)
* jmsidhu: On the origins of the scatter plot (4 Jul)
* SteffLocke: If you're interested in #R and don't use data.table - check out this in-depth data.table presentation (4 Jul)
* DiegoKuonen: MT @msgbi: Becoming a Data Scientist : A RoadMap #DataScience #Statistics (4 Jul)
* burnsstat: simple random functions in #rstats to draw pictures (4 Jul)
* modernscientist: Friendly reminder to cite open source software in manuscripts. Here's the info for the Python scientific packages: GH Pages demo: (3 Jul)
* erlichya: .@IgorCarron Link to our genetic privacy paper: (3 Jul)
* DiegoKuonen: MT @kdnuggets: For advanced Data Scientists: Tutorial in Gradient boosting machines > (3 Jul)
* peterdfields: Have Procrustes rotations as in trying to get the PC1-PC2 coordinates onto the scale of the lat-long points #Rstats (3 Jul)
* sgrifter: Using random forests in #rstats?@JohnEhrlinger has a nice application of ggplot2 #useR2014 (3 Jul)
* tslumley: AmStat white paper on Stats and Computer Science and Data and stuff is out: (3 Jul)
* ledell: Presenting my new R package for ensemble learning, subsemble, at #user2014 in 1 hr. Now available on CRAN: #rstats (3 Jul)
* agramfort: Gabriel Peyre started to convert his great matlab numerical tours to #python Give him a hand and send him PRs (3 Jul)
* TrestleJeff: Couldn't make it to #user2014? Here's a preview of the slides from my rmarkdown talk tomorrow morning. #rstats (3 Jul)
* revodavid: Slides from my #user2014 talk, “R and reproducibility - a proposal” #rstats (3 Jul)
* MMaechler: slides of my talk at #useR2014... Yes, I should find a more modern publication venue! (3 Jul)
* _inundata: This is cool → statsTeachR open-access online repo of modular lesson plans for teaching statistics using R #useR2014 (3 Jul)
* ledell: Adaptive resampling to shortcut (tuning param) grid search, by Max Kuhn. Now part of caret R package. #user2014 (2 Jul)
* ledell: "The Airline dataset is the Iris dataset for big data." -- @RevoJoe 120M rows, available here: #user2014 (2 Jul)
* __DataTau__: Books for Machine Learning with R: (2 Jul)
* leonawicz: Collaborative coding for R (2 Jul)
* kwbroman: @_inundata Beamer slides that don’t suck: (2 Jul)
* RevolutionR: Review of Applied Predictive Modeling by Kuhn and Johnson (2 Jul)
* seandavis12: #diigo: useR 2014: Fostering the next generation of Open Science with R (2 Jul)
* thattommyhall: Been toying for ages with the idea of doing a Clojure 'SICP distilled' course in London, would anyone be interested? (2 Jul)
* DiegoKuonen: #Statistics & #Science - 'Future of Statistical Sciences' report by @ASTATWORLD at (2 Jul)
* yokofakun: pushed a standalone version of verticalize an everyday linux command to verticalize tab-delimited files. (2 Jul)
* benjaminlmoore: Circlize #rstats package: for some reason the pretty examples are buried in zipped suppl. (2 Jul)
* bgoncalves: Inferring the origin of an epidemic with a dynamic message-passing algorithm (2 Jul)
* walkingrandomly: A 'data engine' is powered by large unstructured data sources and converts them to actionable knowledge. (2 Jul)
* REAS: GNU Jokes: Hello World! @dcuartielles (2 Jul)
* Soc_Net_Intel: MT @JoeMarr4: Want to read big data files into R? Check this out: Handling big data in R #rstats (2 Jul)
* OpenCPU: @biotimylated sort of. The API is actually interoperable, not necessarily R. But current implementation is R. (1 Jul)
* rxin: Blog post explaining our (@databricks) position in Shark, Spark SQL, Hive on Spark, and the future of SQL on Spark (1 Jul)
* ogrisel: scikit-learn 0.15.0b2 is out on PyPI with wheels for OSX and Win32. Grab it with "pip install scikit-learn==0.15.0b2"! (1 Jul)
* ramnath_vaidya: Interactive slides from my Interactive Visualization presentation at #user2014. #rstats (1 Jul)
* YhatHQ: Principles of good data analysis by Greg Reda | @gjreda (1 Jul)
* revodavid: Getting a demo of example apps built in OpenCPU from @JeroenOoms . Try them here: #rstats #user2014 (1 Jul)
* hspter: RCloud looks awesome! #user2014 (1 Jul)
* DataJunkie: Some great examples of rcharts visualizations: #Rstats #useR2014 (1 Jul)
* winston_chang: Slides and code from my talk on #ggvis at #useR2014: #rstats (1 Jul)
* jedisct1: RT @YhatHQ: DAWG - Read-only dictionary-like objects in Python using 200x less memory than standard dicts | (1 Jul)
* lpfeed: JuliaCon Presentation Slides: The Julia language’s first conference has recently wound up and the PDFs of the... (1 Jul)
* ConcejeroPedro: R-igraph tutorial in R markdown (.Rmd), based on enron dataset (enron.RData) available here (1 Jul)
* moorejh: Is #ArtificialIntelligence the future of medicine? - JAMA #informatics #health #healthcare #machinelearning (1 Jul)
* DiegoKuonen: RT @freakonometrics: "De Statisticien à Data Scientist" ht @a_bh (1 Jul)
* DiegoKuonen: RT @freakonometrics: "Statistique etBig Data AnalyticsVolumétrie – L’Attaque desClones" ht @a_bh (1 Jul)
* smashingmag: A detailed introduction to Functional Reactive Programming (FRP). written by @andrestaltz (1 Jul)
* yokofakun: In case you missed it yesterday night: I pushed a C version of my tool visualizing the dependencies in a Makefile (1 Jul)