data science big data data engineering machine learning hadoop recommendation engine r cloudera data pipelines spark scaling apache hadoop data visualization mongodb software development data analytics hiring new york times measuring data data mining devops tumblr go linkedin apache analytics sensor data wearable devices message queue culture software engineering logging data structures entity resolution diversity interviewing spark sql spotify financial modeling blackrock ned chris wiggins google feature extraction microsoft datadog satori platform engineering rocana kafka apache kafka genomics ibis annoy mechanical turk computer programming apache flume logstash rsyslog elasticsearch sematext solr python nix command line yplan command line hbase pinterest nosql search square nyc test driven development spotify virtualenv deployment tools debian cloud computing joyent mantra thoughtworks code clojure algorithms sensu paperless post one class classification outlier selection stochastic outlier selection node.js database influxdb time series agency agent specialk protunity embedded dsl pi-calculus a scala library kvdb apachezookeeper renttherunway camille fournier avery rosen wee data lisp lambda calculus calculus digital signal processing code school better programmer ember.js web application matrix factorization sgd stochastic gradient descent data analysis open source jquery javascript libraries r-bloggers dendextend date web app grid system bootstrap building systems erlang hortonworks mapreduce yarn lil-brother shutterstock ntf rickshaw google analytics aditya mukerjee 10gen crowdsourcing hack data statistical inference idd etl morphlines apache pig netflix