Connecting and Exploiting Big Data

886 views

Published on

Whilst big data may represent a step forward in business intelligence and analytics, we see added value in linking and utilizing big data for business benefit. Once we bring together numerous data sources to provide a single reference point can we start to derive new value. Until then, we only risk creating new data silos.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
886
On SlideShare
0
From Embeds
0
Number of Embeds
103
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Connecting and Exploiting Big Data

  1. 1. www.vestforsk.noConnecting and Exploiting Big DataRajendra Akerkarrak@vestforsk.no
  2. 2. www.vestforsk.no Whilst big data may represent a step forwardin business intelligence and analytics, we seeadded value in linking and utilizing big data forbusiness benefit. Once we bring together numerous datasources to provide a single reference point canwe start to derive new value. Until then, weonly risk creating new data silos.
  3. 3. www.vestforsk.noSource: Bloor Group
  4. 4. www.vestforsk.noHype around Big Data Today, the difference between success and failure is the ability to monetize anew class of data. It’s ironic that, despite billions of dollars spent on businessintelligence systems, we are still data‐bankrupt.– Roman Stanek, Founder and CEO of Good Data
  5. 5. www.vestforsk.noWhat is Big Data?Too big, moves too fast, or doesn’t fit the structures of your database architecture3 Dimensions: Variety, Velocity, and VolumeBigDatasocialvideos & photosmobile GPSemail
  6. 6. www.vestforsk.noThe rise and rise of Big Data
  7. 7. www.vestforsk.noShare of the digital universe by India and China
  8. 8. www.vestforsk.noDefinition(s) of “big data”Big Data is a term encompassing the use oftechniques to capture, process, analyse andvisualize potentially large datasets in areasonable timeframe not accessible tostandard IT technologies.By extension, the platform, tools andsoftware used for this purpose are collectivelycalled ‘Big Data technologies’(Networked European Software and Service Initiative, 2012). 
  9. 9. www.vestforsk.noAttributes VenueVeracityVocabularyWhat happens if the raw data you are injecting into your system is incomplete or formatted incorrectly from the get‐go?
  10. 10. www.vestforsk.noThe potential of Big Data Data contains information of enormous business value Extract those insights and make far better decisions ...but is data indeed that valuable?
  11. 11. www.vestforsk.noCorrelation versus Causation versus “What’s great for the job”Oncologists might benefit from seeing the similarities among cells in a biopsy, but targeting certain markers doesn’t guarantee you can cure someone’s cancer.Source: Columbia University
  12. 12. www.vestforsk.no
  13. 13. www.vestforsk.noAdvanced (intelligent) data analyticsIn-DatabaseanalyticsConventionalAdvancedanalyticsHadoopEvolution• Needs human intervention• Latency, compression and speed• Coverage is vital rather than thoroughness• Data can be Tbytes  Pbytes• Enhances the system performance by scale-out• Statistical data and data miningConventional• Fully automated thoroughnessis required• Restricted on kinds of data• Transaction management• Volumes of dataBig Data Future• New insight of multi-structured data• Real-time big data analytics• Process information in-memory, In-time, in-place• Enhanced speed with low latency• Semantic technologiesConventional Advanced(intelligent)Analytics – NLPand semantictechnologiesUnstructured databatch processing -HadoopIn-DatabaseanalyticsInformationApplicationsInfrastructureCohesiveInfrastructure
  14. 14. www.vestforsk.noSemantics of big data shifting from “data of action” to “data of intention.” The future of big data will be to use it as a tool to discover new segments & audiences, and invent new products.
  15. 15. www.vestforsk.noWhy Linked Data? Big data tends to be unstructured and metadata becomes important  for example, location data can help to make some sense of the data in that it provides particular structure.  Here, linked data provides some significant advantages in knotting together different records to provide a view of the bigger picture. linked data sees the Web as a giant database that can be mined to link to data, rather than document‐based resources.
  16. 16. www.vestforsk.noLinked Data ‐ Paradigm Use URIs as names for things Use HTTP URIs so that people can look up those names. When someone looks up a URI, provide useful information. Include links to other URIs. so that they can discover more things.
  17. 17. www.vestforsk.noLinked data  ̶ the next evolutionary stage for the database? Through linked data, data in unstructured databases can be linked to data in traditional data stores without changing existing schemas Using linked data provides a highly scalable solution, based on the same principles as unstructured data but by structuring the linked data around triples An agent! mapping and interconnecting, indexing and feeding real‐time information from a variety of sources
  18. 18. www.vestforsk.noImplementing big data and linked data • Data integrity• Understanding of the data• Integration• Data quality• Data storage and replication• Data migration• Data security
  19. 19. www.vestforsk.noBut, linked data is no remedy...if rigorous controls are not applied to the metamodel then it becomes yet another unstructured data source, making the problem worse, rather than better!
  20. 20. www.vestforsk.noPublisher: Taylor & Francis Group/CRC Presshttp://www.taylorandfrancis.com/books/details/9781466578371/ Read this book!

×