Houston Hadoop Meetup Series - Hybrid Big Data Solutions
Upcoming SlideShare
Loading in...5
×
 

Houston Hadoop Meetup Series - Hybrid Big Data Solutions

on

  • 685 views

This is the slide deck presented by Victor Cintron of Wisemen Consulting at a Houston Hadoop Meetup on September 11, http://www.meetup.com/Houston-Hadoop-Meetup-Group/events/133682522/

This is the slide deck presented by Victor Cintron of Wisemen Consulting at a Houston Hadoop Meetup on September 11, http://www.meetup.com/Houston-Hadoop-Meetup-Group/events/133682522/

Statistics

Views

Total Views
685
Views on SlideShare
685
Embed Views
0

Actions

Likes
1
Downloads
29
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Houston Hadoop Meetup Series - Hybrid Big Data Solutions Houston Hadoop Meetup Series - Hybrid Big Data Solutions Presentation Transcript

    • HYBRID BIG DATA SOLUTIONS Houston Hadoop Meetup September 11, 2013
    • Agenda • Introduction • InfoSphere BigInsights • Hadoop • InfoSphere Streams • real-time analytics • Parse.com • mobility • Perl • integration and automation
    • Introductions •Victor Cintron • Analytics Practice Manager for Wise Men Consultants •Daniel Lopez • Software Engineer at Wise Men Consultants
    • InfoSphere BigInsights •Management console •BigSheets •Accelerators (machine, social, telco) •Jaql (like a blend of Pig and Hive) •Add your own Hadoop flavor •Basic edition
    • InfoSphere BigInsights
    • InfoSphere Streams •Gov (NSA) + IBM as System S •Real-time analytics •Telecoms, Gov and others •Visual development environment •Program with SPL and Java
    • InfoSphere Streams •Thousands of real-time sources •Millions of events per second •Microsecond latency •Can use R, SPSS and BigInsights/Hadoop •Structured and unstructured data •RHEL or CentOS
    • Parse.com •Mobile PaaS •MongoDB •Dashboard •Cloud Code •Geo Queries •Multiple Platform API’s
    • Parse.com http://aws.amazon.com/solutions/case-studies/parse/
    • Perl •Available for almost every OS •17K+ Modules on CPAN •MediaWiki::DumpFile •Geo::Parse::OSM •Streaming MapReduce •PDL
    • Perl MapReduce •Map • While(<>){ • Process each line • Print out for reduce • } •Reduce • While(<>){ • Aggregate lines • Print output • }
    • Perl MapReduce $HADOOP_HOME/bin/hadoop jar $HADOOP_HOME/hadoop-streaming.jar -input myInputDirs -output myOutputDir -mapper /usr/’perl map.pl’ -reducer /usr/’perl reduce.pl’
    • Thank You •Questions?