Posts
Test Python Syntax Hightlight
intro to Julia's Debugger
ideas for software engineering lessons
string formatting with files in bash
differences between subset assignment in R
heuristics for task scheduling algorithms
understanding R S4
customizing and extending R code
polyglot pipelines
automated query optimization in R
representing semantics for data analysis code
selecting rows using different R packages
using global variables with multicore
lazy joins
lessons on making R code fast
names and performance in R
converting cov2cor from R to julia
parallel overview
count columns in a big text file
where to place commas
value of a data scientist
are apply functions faster than for loops?
3 billion rows with R
hive udaf with R
missing arguments in parsed R expressions
column selection at read
search and replace in a directory
comparing parquet with txt.gz
a lesson in R profiling
pushing R into swap space
vectors to matrices and back
an experiment with Matloff's software alchemy
variable scoping in R
variable assignments
example statistical computation on GPU
first impressions of julia
gaussian process
Changing Factor Contrasts In R
base memory usage of data analysis processes
becoming a better problem solver
data as random variables
scipy.stats for the win
a treasure of Tukey
why phd
random boolean in numpy
what computer skills do you need
advanced python
python mock objects
so many blog platforms
Data Analysis Techniques
Linux Utilities
data virtualization
May the Source be with you
Ipython Notebook for Productivity
Convenience versus Generality
The Best Thing on The Internet
Blog Strategy
subscribe via RSS