  • Calculating similarity of two users on Twitter http://bit.ly/dY6nai (h/t metaoptimizeqa, 27 Mar)
  • 6 Free E-Books on Learning to Program with Python http://j.mp/g6GiQE #python (h/t tdhopper, 27 Mar)
  • Network medicine: a network-based approach to human disease. http://ff.im/zMFDD (h/t kshameer, 27 Mar)
  • Great article about graph processing concepts and Google Pregel: http://horicky.blogspot.com/2010/07/google-pregel-graph-processing.html (h/t sbtourist, 27 Mar)
  • New post: Easily embedding R inside a Qt application with a full example of the 'density slider'. http://goo.gl/zWjeR (h/t eddelbuettel, 25 Mar)
  • #Biostar codebase on github: https://github.com/ialbert/biostar-central #python (h/t yokofakun, 25 Mar)
  • #stata 's #ice add-on is really powerful: Multiple imputation using chained equations: Issues & guidance for practice DOI: 10.1002/sim.4067 (h/t berndweiss, 24 Mar)
  • The Many Uses of Q-Q Plots: My last four posts have dealt with boxplots and some… http://goo.gl/fb/wn7UE #rstats (h/t Rbloggers, 24 Mar)
  • Free E-Book on "Text Algorithms" (by M. Crochemore/W. Rytter. OUP, 1994) http://bit.ly/gDcHJ1 (h/t mxlearn, 24 Mar)
  • Creating my perfect citation system using LaTeX http://bit.ly/giHkuw #greader (h/t neilfws, 24 Mar)
  • The data science tool kit: OCR, geocoding, text processing, etc. all on an open source VM! http://www.datasciencetoolkit.org/ #rstats #hadoop (h/t cmastication, 23 Mar)
  • blogged about Gene–Environment Interactions in Human Disease http://is.gd/RkLrne #GXE (h/t moorejh, 23 Mar)
  • Applied #rstats for the quantitative social scientist [PDF] http://bit.ly/hNUlWw (h/t drewconway, 23 Mar)
  • impressed by #Choosel, an #opensource framework and tool for #dataviz by @lgrammel http://j.mp/bvhcHb (h/t JanWillemTulp, 23 Mar)
  • There is also a good discussion about where to start with category theory (also rec. CWM and CTCS) at MathOverflow: http://bit.ly/8jxtow (h/t mdreid, 23 Mar)
  • Using genome-wide pathway analysis to unravel the etiology of complex diseases http://ff.im/zxAyk (h/t kshameer, 22 Mar)
  • What happens if you look at all Wikipedia articles with a historic reference and visualize them by year on the world map? This: http://bit.ly/fLxAgP (h/t al3xandr3, 22 Mar)
  • LLM3D: a log-linear modeling-based method to predict functional gene regulatory interactions from gen... http://bit.ly/fhK9Sp #citeulike (h/t neilfws, 22 Mar)
  • There's still time: Read and comment on the revised Standards for Educ and Psych Testing http://teststandards.org/index.htm @psychometrics (h/t psychometrix, 21 Mar)
  • waffles: machine learning command line tools: http://waffles.sourceforge.net/ (h/t mikedewar, 21 Mar)
  • some interesting #stackoverflow tag clouds if you are into that sort of thing http://goo.gl/e6ZkV (h/t codinghorror, 21 Mar)
  • Great Machine Learning exercises with R. http://al3xandr3.github.com/ #rstats (h/t i_314, 21 Mar)
  • An introduction to probability and statistics using Python http://ow.ly/4igsW via @boris_gorelik (h/t SciPyTip, 20 Mar)
  • The Joy of #Clojure MEAP is now complete and ready for download: http://t.co/XDGGOF (h/t liebke, 19 Mar)
  • P. Donnelly: Quantifying the Underestimation of Relative Risks from Genome-Wide Association Studies http://goo.gl/fNPKD #GWAS (h/t genetics_blog, 18 Mar)
  • New GenABEL Website, and more *ABEL software #rstats #GWAS http://goo.gl/fb/7LUK3 (h/t genetics_blog, 18 Mar)
  • Interesting idea, and good start: basic ggplot2 network graphs http://bit.ly/flrYDh #rstats #sna (h/t drewconway, 18 Mar)
  • Data Analysis and Manifold Learning. Course notes. Recommended http://bit.ly/e7c8s2 (h/t gappy3000, 18 Mar)
  • Interactome Networks and Human Disease http://ff.im/zub1u (h/t kshameer, 18 Mar)
  • RT @visualisingdata New on visualisingdata.com | Part 1: The essential collection of visualisation resources http://bit.ly/fgkey5 (h/t Biff_Bruise, 17 Mar)
  • New blog post with revised statistical analyses of #canabalt scores using #rstats and #jags http://bit.ly/eR5uhb (h/t johnmyleswhite, 17 Mar)
  • Great read for anyone interested in Foundations of Statistics: http://arxiv.org/abs/1006.3868 Profs. Gelman & Shalizi do a great job. #stats (h/t suncoolsu, 17 Mar)
  • Good stuff for Data Mining and Cancer. http://1.usa.gov/hfwtxh https://www.oncomine.org/ #datamining #cancer (h/t i_314, 17 Mar)
  • @vsbuffalo Here's one: http://odin.mdacc.tmc.edu/~kdo/geneclust/ (h/t JohnDCook, 17 Mar)
  • Milk: (Yet Another) Machine Learning Toolkit for Python http://bit.ly/fmL4KP (h/t mxlearn, 16 Mar)
  • 2 more #Protovis tutorials by @jcukier More fun with arrays: http://j.mp/gECJ8R, analysis of the Map projections example: http://j.mp/hRQQiY (h/t JanWillemTulp, 16 Mar)
  • RT @SignMagazine Making information beautiful and clear - a Significance toolkit on data visualisation - read for free http://ow.ly/4fwPA (h/t Biff_Bruise, 16 Mar)
  • Variation across the allele frequency spectrum http://ff.im/zsx49 (h/t kshameer, 16 Mar)
  • Infectious diseases not immune to genome-wide association http://ff.im/zsx48 (h/t kshameer, 16 Mar)
  • Hints of hidden heritability in GWAS http://ff.im/zsx46 (h/t kshameer, 16 Mar)
  • Think you can't create web apps in #rstats? @jeffreyhorner replicated Google ngrams using Rack and ggplot2 (h/t nyhackr, 16 Mar)
  • two extra pieces of my #protovis tutorial on data http://bit.ly/fk7RR1 http://bit.ly/h8l9jd (h/t jcukier, 15 Mar)
  • RT @metacode: Network Analysis Basics (and applications to online networks) http://ur1.ca/3ixi5 27 slides (@RessiveNetworks) #SNA (h/t RessiveNetworks, 15 Mar)
  • Where the UNIX philosophy breaks down http://bit.ly/92p5Zv (h/t CompSciFact, 15 Mar)
  • Ooh, a comp bio for beginners tutorial - ADMIXTURE, R, PLINK - by @razibkhan. (Featuring... New Kids on the Block!): http://bit.ly/g6wZ8p (h/t mary_carmichael, 15 Mar)
  • GHC 7.0 status update http://post.ly/1kccq (h/t irr, 15 Mar)
  • This is useful (if you make maps of the world): A gallery of map projections: http://spatial.ly/fB8tS9 (h/t spatialanalysis, 15 Mar)
  • Unifying Gene Expression Measures from Multiple Platforms Using Factor Analysis http://bit.ly/hAEZ7Y #citeulike (h/t neilfws, 15 Mar)
  • French project #Datalift aims to develop a platform for publishing & interlinking heterogeneous data on the Web of Data: http://datalift.org (h/t nicolastorzec, 15 Mar)
  • Beyond clinical phenotype: The biologic integratome http://1.usa.gov/hurOiL #readcast (h/t kshameer, 14 Mar)
  • I just entered my @visualizingorg #dataviz submission http://j.mp/fmLNu3 http://yfrog.com/h2qpep It's built in #D3 http://j.mp/fB0uJ6 (h/t JanWillemTulp, 14 Mar)
  • A Review of Phase 2-3 Clinical Trial Designs http://bit.ly/hvDZeA (h/t StatFact, 14 Mar)
  • The Data Structures of Python http://bit.ly/gBuIRo "We read Knuth so you don't have to." (h/t vsbuffalo, 14 Mar)
  • Critical Assessment of Massive Data Analysis (CAMDA) http://www.camda.info/ #bioinformatics #bigdata #datamining #genomics (h/t kshameer, 14 Mar)
  • Bioscala https://github.com/bioscala/bioscala/ #scala #bioinformatics (h/t yokofakun, 13 Mar)
  • PyCon 2011: Introduction to Parallel Computing on an NVIDIA GPU using PyCUDA http://goo.gl/fb/cYU9R (h/t ThePSF, 13 Mar)
  • People ask why I like Common Lisp. I think this short page does a very good job explaining why: http://bit.ly/h1JhMZ (h/t vsbuffalo, 13 Mar)
  • Motion charts in R. http://code.google.com/p/google-motion-charts-with-r/ (h/t inverseofverse, 12 Mar)
  • Ensemble Learning for Variable selection; an easy read too! http://bit.ly/i31OF1 (h/t mxlearn, 12 Mar)
  • Bolt Online Learning Toolbox in Python - Very cool. http://is.gd/dClHGz (h/t ChrisDiehl, 12 Mar)
  • Features of Common Lisp http://post.ly/1jf5g (h/t irr, 12 Mar)
  • What to demand from a scientific computing language http://ow.ly/4cDfn Presentation by Peter Norvig (h/t SciPyTip, 11 Mar)
  • Here are the slides of my #pycon talk on statistical machine learning for text classification with @scikit_learn http://slidesha.re/i9dIZz (h/t ogrisel, 12 Mar)
  • RT @kbradnam: A few years ago I gave a talk on 'Trust and mistrust in bioinfo'. I think it holds up very well today: http://t.co/skQDVYr (h/t vsbuffalo, 12 Mar)
  • agamemnon 0.1.1: A graph database built on top of cassandra http://bit.ly/h4rhbK (h/t pipy, 10 Mar)
  • 10 papers every programmer should read (at least twice) http://bit.ly/jOjxv (h/t CompSciFact, 10 Mar)
  • Science magazine special on data (requires free registration): http://bit.ly/ea2jMy (h/t algoriffic, 10 Mar)
  • Comparison of Collaborative Filtering Algorithms by Cacheda et al. ACM TWEB. Vol.5(1) http://bit.ly/ffGMjb (PDF) HT @lemire (h/t chengweiwei, 10 Mar)
  • #ESS complement #rautoyas, provides automatically created yasnippets for #Rstats functions. http://bit.ly/hiaACf, http://bit.ly/govY3J (h/t suncoolsu, 10 Mar)
  • "Feature Selection Using Principal Feature Analysis" http://bit.ly/dH26Hk (h/t vsbuffalo, 10 Mar)
  • Great Heatmaps from Microarray Data with Python and R. Tutorial and Code. http://bit.ly/fylawX #rstats #rpy #bioconductor (h/t i_314, 10 Mar)
  • Dalliance: interactive genome viewing on the web http://ff.im/zo1on (h/t kshameer, 10 Mar)
  • Clojure or Scala for bioinformatics/biostatistics/medical research http://goo.gl/fb/SvHPt #clojure #SO (h/t planetclojure, 9 Mar)
  • Stata Blog: Understanding matrices intuitively, part 2, eigenvalues and eigenvectors http://bit.ly/gEvknE (h/t Stata, 9 Mar)
  • Release candidate of LyX 2 published http://ow.ly/4arKB (h/t TeXtip, 9 Mar)
  • http://jdk7.java.net/preview/ Java™ Platform, Standard Edition 7 Developer Preview Release (h/t yokofakun, 8 Mar)
  • RT @TopologyFact: What are good books for computational geometry? http://bit.ly/hfBnGC (h/t CompSciFact, 8 Mar)
  • my notes on #ngs variant-calling, realignment methods here: http://github.com/brentp/bio-playground/tree/master/ngs-notes #bioinformatics (h/t brent_p, 8 Mar)
  • This is a great intuitive example to explain how a factor model works http://bit.ly/geOyrP (h/t gappy3000, 8 Mar)
  • A brief introduction to "apply" in R: http://t.co/Zi (h/t RforBusiness, 8 Mar)
  • If you are into #SemanticWeb and #Logic, have a look at "Semantic Web Technologies: From Theory To Practice" by @AxelPolleres: bit.ly/h0tGV5 (h/t nicolastorzec, 8 Mar)
  • Computing on the Language in R http://t.co/D0bUFzK (h/t gappy3000, 7 Mar)
  • UniSNP: uniquely mapped SNPs from dbSNP (build 129) and HapMap (release 27) http://1.usa.gov/gE3Ou0 #genomics #bioinformatics (h/t kshameer, 7 Mar)
  • What are the 'hot topics' in MachineLearning right now? http://bit.ly/gRWxbz (h/t mxlearn, 7 Mar)
  • Great list of machine learning tutorials http://bit.ly/fNnl3Q (h/t drewconway, 7 Mar)
  • A philosophy of clean data, by @hadleywickham http://bit.ly/hnnqIO (h/t drewconway, 7 Mar)
  • Programmatic representation of #INCEPTION : tribute to Nolan in "C" Language and a bit of assembly (x86): http://bit.ly/fszU8U #geek (h/t onertipaday, 7 Mar)
  • Best use of #Protovis I've ever seen - http://bit.ly/iaZPMa - can anyone point to anything better? #visualization (h/t MetaThis, 7 Mar)
  • Ruby, Python, and Science http://bit.ly/g5QdKM (h/t SciPyTip, 6 Mar)
  • J programming language source is now available under GPL version 3 http://www.jsoftware.com/source.htm (h/t hakankj, 6 Mar)
  • Incredible! reMap - a visual semantic browser of the Visual Complexity database - http://bit.ly/eHhOwv (h/t MetaThis, 6 Mar)
  • New blog post Creating a pdf of your favorite tweets with Apache FOP http://goo.gl/JOfzc (h/t yokofakun, 6 Mar)
  • Mapping the nation's well-being: results from Gallup's quality of life survey http://t.co/ys (h/t nytgraphics, 5 Mar)
  • @janwillemtulp it's in my openprocessing sketches http://bit.ly/hPt1QI it's quite simple (h/t jcukier, 5 Mar)
  • #scala + #processing = va.lent.in/blog/2011/03/0… How to use the power of Scala in visual art. (h/t valyard, 4 Mar)
  • "Genetic Facebook: how genes influence social networks" http://bit.ly/gDfxGk So visible behavioral phenotypes cause social groups. Who knew? (h/t vsbuffalo, 3 Mar)
  • #GeoSPARQL, a proposed spatial extension to #SPARQL for dealing with geographic information: http://bit.ly/gEhrqt. #OGC #SemTech2011 (h/t nicolastorzec, 3 Mar)
  • #git add some color to git diff, etc., with git config --global color.ui auto http://bit.ly/iaSTpf (h/t onelinetips, 3 Mar)
  • Benchmark for several Python machine learning packages: http://fseoane.net/ml-benchmarks/ (h/t fpedregosa, 3 Mar)
  • Natural Language Processing (almost) from Scratch. (arXiv:1103.0398v1 [cs.LG]) http://bit.ly/h17Puw part of the exciting Bottou conjecture (h/t sclopit, 3 Mar)
  • RT @ahier: A vision for a patient-centered health information system (pdf) http://bit.ly/eg5dNO (h/t kshameer, 3 Mar)
  • Brendan W. McAdams walks us through MapReduce with MongoDB 1.8 and Java. Visit http://bit.ly/ex4L45 for more info. #nosql #mongodb (h/t nosqldatabases, 2 Mar)
  • If you don't already know about #Python and #Unicode, then have a look at this nice summary by @kumar303: http://bit.ly/eF4sqD. Via @radar (h/t nicolastorzec, 2 Mar)
  • Tip of the Week: DAnCER for disease-annotated epigenetics data http://bit.ly/hXkGdS #bioinformatics #genomics (h/t OpenHelix, 2 Mar)
  • Check out #Eurographics Digital Library for many inspirational research papers on #infovis and related subjects: http://j.mp/fNHRqt (h/t JanWillemTulp, 2 Mar)
  • Register for EuroVis 2011 in Bergen, Norway! http://tinyurl.com/4vmu3ce Important: secure your accommodation in Bergen soon! See you! :-) (h/t HelwigHauser, 2 Mar)
  • ABCL Dev: Bootstrapping Quicklisp on ABCL http://tinyurl.com/4fjme37 (h/t planet_lisp, 2 Mar)
  • First Steps at Building a Classifier with Mahout http://bit.ly/gRskcs (h/t mxlearn, 1 Mar)
  • RT @chengweiwei @mathupdate Solving #Sudoku: A pencil-and-paper algorithm by J. Crook http://sns.ly/RJcey2 (h/t moorejh, 1 Mar)
  • Lots of health data released via Health Indicators Warehouse http://datafl.ws/18h (h/t flowingdata, 1 Mar)

