Introduction...to our short course on NoSQL in theCloud...
Housekeeping● Emergency Exits● Toilets● Lunch/coffee ○ Some NovaUCD folks may be present
Who is attending thiscourse?● Good response to call ○ Did not do much publicization - linkedin, twitter● Objective was to determine interest in this topic ○ Evil agenda is to run longer, more hands-on courses ○ Hence very modest price point● Attendees from big companies ○ AOL, Symantec, HP, D&B● ...and small companies ○ Boxever, eSpatial, Ezora● Capped attendance at 50 poeple
Communications● Hashtag ○ #nosqlucd● Wifi ○ Available - see instructions provided● Twitter ○ @seanrmurphy ○ @amine_aouad ○ @NovaUCD
Course objective● To have an overview of the NoSQL landscape with particular emphasis on analytics● Conceptual understanding of specifics of Cassandra/Hadoop/Pig● Understanding of some key tools used to work with Cassandra/Hadoop/Pig● Understanding of example use case based on Gowalla data
Schedule (revised!)8.45 - Registration/coffee9.30 - Intro - course overview9.35 - Overview of NoSQL landscape10.15 - Overview to Cassandra11.00 - Break11.15 - Introduction to Hadoop and Hadoop ecosystem12.00 - Introduction to Pig12.25 - Integration of Cassandra/Hadoop/Pig12.45 - Lunch13.30 - Description of example problem, design of data models13.45 - Design of cluster for this simple scenario - essential parameters14.00 - Cassandra walkthrough15.00 - Break15.30 - Hadoop Walkthrough16.00 - Pig walkthrough16.45 - Discussion/Q&A17.00 - Close
Further points● A lot of content to go through ○ In some cases dont have time/space to go into great detail● Chose specific technologies which are most widely used in this space ○ although there is a large set of technologies to choose from● Documentation on this topic is very distributed● For Cassandra/Hadoop, datastax is a leading company - chose not to use their