Big Data & Text Mining

Marketing Director at Michel Bruley
Jan. 16, 2014

Editor's Notes

  1. Input Data System: This part of the system covers data collection:
     - Getting data from the internet with a crawler
     - Getting data from online vendors
     - Getting data from internal data banks
     Regarding the input format (physical and logical), the data are physically reformatted into HTML and then loaded into an XML format.
  2. (1) Feature extraction tools: These recognize significant vocabulary items in documents and measure their importance to the document content.
     (2) Clustering tools: Clustering segments a document collection into subsets, called clusters.
     (3) Summarization tool: Summarization condenses a source text into a shorter version while preserving its information content.
     (4) Categorization tool: Categorization assigns objects to predefined categories, or classes, from a taxonomy.
  3. http://services.alphaworks.ibm.com/manyeyes/view/SWhH8QsOtha6qL3F~y5HQ2~
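
The input step and the feature extraction step from notes 1 and 2 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the notes do not name an XML schema or a weighting scheme, so the `document` and `text` element names, the helper functions `html_to_xml`, `tokenize` and `tf_idf`, and the choice of TF-IDF as the importance measure are all hypothetical stand-ins, not the system described in the slides.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the visible text of an HTML page, skipping script/style."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0  # depth inside <script>/<style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip and data.strip():
            self.parts.append(data.strip())


def html_to_xml(doc_id, html):
    """Load one HTML page into a hypothetical <document id=...><text> element."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    doc = ET.Element("document", id=doc_id)
    ET.SubElement(doc, "text").text = " ".join(parser.parts)
    return doc


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens only."""
    return re.findall(r"[a-z]+", text.lower())


def tf_idf(docs):
    """Return {doc_id: {term: weight}} with weight = tf * log(N / df)."""
    tokens = {d.get("id"): tokenize(d.findtext("text") or "") for d in docs}
    n_docs = len(tokens)
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokens.values() for term in set(toks))
    weights = {}
    for doc_id, toks in tokens.items():
        counts = Counter(toks)
        total = len(toks) or 1
        weights[doc_id] = {
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        }
    return weights


# Made-up sample pages, standing in for crawled or vendor-supplied data.
pages = {
    "a": "<html><body><p>Big data needs text mining</p></body></html>",
    "b": "<html><body><p>Big data needs storage</p><script>var x;</script></body></html>",
}
docs = [html_to_xml(doc_id, html) for doc_id, html in pages.items()]
weights = tf_idf(docs)
```

Terms that occur in every document (here "big", "data", "needs") receive a weight of zero, while terms specific to one document ("mining", "storage") receive a positive weight, which is one simple way to measure a vocabulary item's importance to a document. The resulting weight vectors could then serve as input to the clustering or categorization tools.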