About Objective: Engineer systems & algorithms to help users get to the content they need.
Hands-on experience with distributed systems for both online and offline data processing.
Designed and implemented low-latency high-throughput online retrieval systems from scratch, doing micro and millisecond latencies for few hundred QPS per node (without caching).
Designed and implemented simple & extensible data-infrastructure for offline data processing pipelines on hadoop. These range from simple search-index building pipelines, to non-trivial pipelines to do machine learning algorithms. Using tools like plain java map/reduce, pig, hive, spark, scalding and so forth (ordered by familiari...