Personal Information
Organization / Workplace
Bengaluru Area, India India
Occupation
Staff Engineer ( Global Data platforms ) @WalmartLabs India
Industry
Technology / Software / Internet
Website
http://verisigninc.com/
About
Experienced BigData Solution Architect, Developer and Apache committer.
Proficient at Big Data Technologies and Solution architecture for large scale data processing.
Vast experience in research and development of products leveraging distributed computing platforms.
Successfully designed cloud and on premise data architecture for PetaByte scale volume.
Experienced in tuning and managing petabyte scale big data processing ecosystem involving open source technologies such as Hadoop, Yarn, Spark, Kafka, Spark Streaming, Flink, HBase, Geode, Flume & Apex.
Successfully setup Lambada architecture pipeline for large scale AdTech data processing for reporting, analytics & machine learning....
Tags
big data
analytics
hadoop
prestosql
presto
streaming
apacheapex
apex
alluxio
sql
nosql
gcp
data architecture
streaminganalytics
bigdata
geode
bigdata hadoop streaming distributed computing
See more
Presentations
(6)Likes
(24)Distributed Systems: scalability and high availability
Renato Lucindo
•
13 years ago
Scalability, Availability & Stability Patterns
Jonas Bonér
•
13 years ago
A Beginners Guide to noSQL
Mike Crabb
•
8 years ago
Agility Requires Safety
Yevgeniy Brikman
•
8 years ago
Hadoop 3.0 - Revolution or evolution?
Uwe Printz
•
6 years ago
Drizzle—Low Latency Execution for Apache Spark: Spark Summit East talk by Shivaram Venkataraman
Spark Summit
•
7 years ago
#GeodeSummit - Apex & Geode: In-memory streaming, storage & analytics
PivotalOpenSourceHub
•
8 years ago
Apache Phoenix and Apache HBase: An Enterprise Grade Data Warehouse
Josh Elser
•
7 years ago
Real Time Analytics: Algorithms and Systems
Arun Kejariwal
•
8 years ago
Introduction to Apache Apex
Chinmay Kolhatkar
•
8 years ago
Apache Apex & Apace Geode In-Memory Computation, Storage & Analysis
Apache Apex
•
8 years ago
Startups are Hard. Like, Really Hard. @luketucker
Empowered Presentations
•
8 years ago
From Mainframe to Microservice: An Introduction to Distributed Systems
Tyler Treat
•
9 years ago
Comparison of MPP Data Warehouse Platforms
David Portnoy
•
11 years ago
Being Ready for Apache Kafka - Apache: Big Data Europe 2015
Michael Noll
•
8 years ago
Analysing data analytics use cases to understand big data platform
dataeaze systems
•
8 years ago
9 Ways to Be More Productive - Backed by Science
D B
•
8 years ago
November 2014 HUG: Lessons from Hadoop 2+Java8 migration at LinkedIn
Yahoo Developer Network
•
9 years ago
Data science and_analytics_for_ordinary_people_ebook
Jeffrey Strickland, Ph.D., CMSP
•
8 years ago
Three Ways Benchmarking Data Can Save the Day for Publishers (Infographic)
PubMatic
•
8 years ago
Mapreduce Algorithms
Amund Tveit
•
11 years ago
Introduction to YARN and MapReduce 2
Cloudera, Inc.
•
10 years ago
Large scale ETL with Hadoop
OReillyStrata
•
11 years ago
Apache Kafka 0.8 basic training - Verisign
Michael Noll
•
9 years ago
Personal Information
Organization / Workplace
Bengaluru Area, India India
Occupation
Staff Engineer ( Global Data platforms ) @WalmartLabs India
Industry
Technology / Software / Internet
Website
http://verisigninc.com/
About
Experienced BigData Solution Architect, Developer and Apache committer.
Proficient at Big Data Technologies and Solution architecture for large scale data processing.
Vast experience in research and development of products leveraging distributed computing platforms.
Successfully designed cloud and on premise data architecture for PetaByte scale volume.
Experienced in tuning and managing petabyte scale big data processing ecosystem involving open source technologies such as Hadoop, Yarn, Spark, Kafka, Spark Streaming, Flink, HBase, Geode, Flume & Apex.
Successfully setup Lambada architecture pipeline for large scale AdTech data processing for reporting, analytics & machine learning....
Tags
big data
analytics
hadoop
prestosql
presto
streaming
apacheapex
apex
alluxio
sql
nosql
gcp
data architecture
streaminganalytics
bigdata
geode
bigdata hadoop streaming distributed computing
See more