• Email
  • Like
  • Save
  • Private Content
  • Embed
 

Big Data: tools and techniques for working with large data sets

by

  • 3,934 views

Working with thousands, millions, or billions of data records in high dimensions is increasingly becoming the reality for scientific research. What are some techniques to make this kind of data ...

Working with thousands, millions, or billions of data records in high dimensions is increasingly becoming the reality for scientific research. What are some techniques to make this kind of data volume tractable? How can parallel computing help? In this talk I'll review data management tools and infrastructures, languages, and paradigms that help in this regard. In particular, I'll discuss Hadoop, MapReduce, Python, NumPy, and Globus Online to provide a survey of ways in which researchers can manage their data and process it in parallel.

Accessibility

Categories

Upload Details

Uploaded via SlideShare as Apple Keynote

Usage Rights

© All Rights Reserved

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

Cancel

2 Embeds 9

http://www.linkedin.com 8
https://bb9dev.newcastle.edu.au 1

Statistics

Likes
8
Downloads
471
Comments
0
Embed Views
9
Views on SlideShare
3,925
Total Views
3,934
Post Comment
Edit your comment

Big Data: tools and techniques for working with large data sets Big Data: tools and techniques for working with large data sets Presentation Transcript