Personal Information
Organization / Workplace
San Francisco, CA United States
Occupation
Senior Data Engineer at Workday
Industry
Technology / Software / Internet
Website
github.com/erenavsarogullari
About
Eren is a highly motivated senior software developer and an enthusiast of JVM-based technologies.
His areas of interest are Scala, Akka, Apache Spark, Apache Hadoop, Big Data, Distributed & Parallel Computing, High Availability & Scalability.
He holds a B.Sc. degree in Electrical & Electronics Engineering and an M.Sc. degree in Control & Automation Engineering.
Technical Articles: https://dzone.com/users/938353/eren_avsarogullari.html
GitHub: https://github.com/erenavsarogullari
Tags
apache spark
batch processing
spark on yarn
springone
spring
spring integration
multi tenancy
spark sql metrics
apache spark upgrade
etl
sql on hadoop
distributed computing engine
sql
distributed sql engine
gc policy
storage level
job scheduling
data locality
data skew
serialization
checkpointing
event sourcing
partitioning
persistency
data structures
best practices
apache pulsar
stream processing
streaming
data processing patterns
data pipelines
rdd persistency
catalyst optimizer
tungsten
spark job lifecycle
spark ecosystem
spark internals
dataset
dataframe
rdd
hazelcast
Presentations (6)