## Posts

### representing semantics for data analysis code

### selecting rows using different R packages

### using global variables with multicore

### lazy joins

### lessons on making R code fast

### names and performance in R

### converting cov2cor from R to julia

### parallel overview

### count columns in a big text file

### where to place commas

### value of a data scientist

### are apply functions faster than for loops?

### 3 billion rows with R

### hive udaf with R

### missing arguments in parsed R expressions

### column selection at read

### search and replace in a directory

### comparing parquet with txt.gz

### a lesson in R profiling

### pushing R into swap space

### vectors to matrices and back

### an experiment with Matloff's software alchemy

### variable scoping in R

### variable assignments

### example statistical computation on GPU

### first impressions of julia

### gaussian process

### Changing Factor Contrasts In R

### base memory usage of data analysis processes

### becoming a better problem solver

### data as random variables

### scipy.stats for the win

### a treasure of Tukey

### why phd

### random boolean in numpy

### what computer skills do you need

### advanced python

### python mock objects

### so many blog platforms

### Data Analysis Techniques

### Linux Utilities

### data virtualization

### May the Source be with you

### Ipython Notebook for Productivity

### Convenience versus Generality

### The Best Thing on The Internet

### Blog Strategy

subscribe via RSS