Posts

value of a data scientist

are apply functions faster than for loops?

3 billion rows with R

hive udaf with R

missing arguments in parsed R expressions

column selection at read

search and replace in a directory

comparing parquet with txt.gz

a lesson in R profiling

pushing R into swap space

vectors to matrices and back

an experiment with Matloff's software alchemy

variable scoping in R

variable assignments

example statistical computation on GPU

first impressions of julia

gaussian process

Changing Factor Contrasts In R

base memory usage of data analysis processes

becoming a better problem solver

data as random variables

scipy.stats for the win

a treasure of Tukey

why phd

random boolean in numpy

what computer skills do you need

advanced python

python mock objects

so many blog platforms

Data Analysis Techniques

Linux Utilities

data virtualization

May the Source be with you

Ipython Notebook for Productivity

Convenience versus Generality

The Best Thing on The Internet

Blog Strategy
subscribe via RSS