Performing analysis and prediction in big scale (aka Big Data) is very popular in IT world now. But how to deal with this amount of data in real time? How to get answers on questions asked by business runners in seconds rather than in hours? How to forget about MapReduce jobs and still deal with BigData?
In this talk I will try to answer these questions and introduce you to real time "recipies" and algoritms. Talk covers also basics of Apache Storm Project which is my preferred tool for doing such type of analysis. Presentation also contains facts and lessons learned from designing and developing real time topologies at INTERIA.PL.
Marcin Stanislawski - Marcin Stanislawski works as software architect/engineer at INTERIA.PL, 4th biggest portal of the Polish internet. Currently he is working on a system that analyzes and predicts user's behaviour on the portal websites. Moreover he is a huge fan of Open Source, Continous Delivery, TDD, BDD and functional programming languages like Scala. On a daily basis husband, father and urbanomics enthusiast.
Clipping is a handy way to collect important slides you want to go back to later.