SlideShare a Scribd company logo
1 of 32
The information 
SUPERNOVA 
THE THREAT AND THE CURE 
INTRODUCTION TO INFO ARAB ‘S 
TECHNOLOGIES 
BY; ALAA S. AL-AGAMAWI. 
SEPTEMBER 2014
Technologies impact on data sizes… 
sizes in algorithmic 
scale 
Invention of Paper 
{China 105AD} 
Invention of writing 
{IRAQ 5000BC} 
Invention of Printing 
{Europe 1400AD} 
Invention of ICT 
{2000AD} 
5000BC 4000BC 3000BC 2000BC 1000BC 0 1000AD 2000AD 
 Technology has always played the most major role in the growth of human knowledge. 
 With the ICT user generated phenomena, rate of growth is extraordinary.
Before the 
SUPERNOVA 
explodes !!! WHEN A STAR GOES OUT OF CONTROL IT 
EXPLODES AND EATS EVERYTHING AROUND! 
IMAGINE … 
THE INFORMATION SUPERNOVA. 
WHEN DATA SIZE IS OUT OF CONTROL. 
TO CONTROL IT … WE NEED TO UNDERSTAND IT.
The ICT era…
The social networks… 
What happens in one minute in the social networks 
requires over… 
1500 person days to read.
Strongest FRIEND and Toughest ENEMY!!! 
Whether you are a movie star, a politician, a governmental agency, a 
broker or a brand owner, you should give adequate attention to what 
people are saying about you. In most cases, traditional monitoring is 
not enough and requires tens and hundreds of person days. And still 
cannot cover everything thoroughly.
Your ONLY 
choice… 
2 1 
3 
Go 
Is to utilize ICT 
to know what 
the people are 
saying about 
you and ACT 
accordingly. 
Collect 
Organize 
Act 
Anticipate Analyze
The Winning TEAM… 
Collect Organize Analyze Anticipate Act 
BI 
Implementation 
Partner 
THIS 
You should 
do yourself
A typical solution architecture… 
Data Sources 
Data 
Connectors 
Enterprise Servers world wide web 
Business 
Presentation 
Layer 
Data Gathering 
Layer 
Results 
Layer 
Significs©
Proudly introducing 
Fetch Technologies©
Historical 
background… 
 Fetch PC© 2006. 
 Fetch enterprise© 2009. 
 Fetch cloud© 2012.
The dilemma… 
Relevant Area 
Irrelevant Area Irrelevant Area 
Typical crawling will bring many irrelevant data. That is… 
 Irrelevant information. 
 And waste of computer resources.
Traditional crawling techniques… 
v 
In many cases search results goes to an irrelevant area.
Structured & unstructured data…
Dealing with all types of Sources… 
Having an API… Does not have an API…
Only relevant data in structured format 
Title 
Brief 
Media 
Article 
body 
Publisher Author Date
Classifies with Social media interactions…
Never miss what you are looking for… 
You will never miss any mention that you are looking for with our semantic based 
search engines. Our engines are taking care of the following... 
 Basic search objects in a hierarchal format (e.g. a company, its senior 
managerial staff, its main brands, sub brands ...etc). 
 Totally morphed (i.e. in their singular, plural, masculine, feminine, adjectives 
...etc.). 
 With all their Arabic/English synonyms and antonyms. 
 Taking care of spelling mistakes and grammatical errors. 
 Taking care of object types and their components.
And everything is easily available… 
RSS 
XML 
JSON 
SaaS 
Timely 
In-site
As fast as you may dream of… 
The above figure (from Microsoft© Azure © control panel) represents a typical day of usage of Fetch News© semantic crawling 
processing. 
 As shown in the figure… 
 Exactly 129.64 Mega Bytes of sources’ contents were captured, analyzed, and converted to a structured text of 41.02 Mega 
Bytes in 281.96 seconds –i.e. about 500 Kilo Byte per second including connectivity time.. 
 Maximum memory usage was 129.75 Mega Bytes fulfilling the needs of 64 parallel threads running on a single core 
processor.
Proudly introducing 
Querying the text streams
Significs© Main focus… 
CONVERTING 
UNSTRUCTURED DATA 
TO STRUCTURED DATA 
AND 
EXTRACTING MEANINGFUL INDICATORS 
FOR THEIR CONTENTS WITH 
DISTINCT PERFORMANCE 
w.r.t various areas of significance.
Analysis Dimensions… 
Descript 
or 
Modifi 
er 
Indecenci 
es 
Prover 
bs 
Danger 
Safety 
Liking 
Disliking
Main Dimensions Foundations… 
Liking 
Disliking 
Safety 
Danger 
Basic 
Dimensions 
Time 
Place 
Subject 
Object 
Type 
Component 
Feature 
Descriptors 
Modifiers 
Indecencies 
Proverbs 
Sentiment 
Dimensions 
Needs 
Feelings 
Actions 
Attitude 
Object 
Dimensions 
Descriptor 
Dimensions
Deep Analysis vs. shallow… 
 Shallow Analysis does not depend on the specific 
linguistic rules of the target language; thus can’t 
guarantee the precise understanding of the textual 
contents. 
 Deep Analysis uses specific linguistic rules of the target 
language; accordingly maximizes the accuracy of the 
semantic analysis results. 
Info Arab® utilizes its wealth of linguistic technologies (e.g. 
speller, morphing, grammar, thesaurus …etc.) in order to 
avail a robust deep analysis engine.
3 Levels of Taxonomies… 
Generic 
Taxonomy 
Basic Objects’ types 
Basic Features 
Basic Components 
Basic Measures 
Basic Actions 
Basic Feeling 
Basic Needs 
Basic attitudes 
…etc. 
Domain 
Specific 
Domain 
specific 
• Object 
• Features 
• Measures 
• Actions 
• Feelings 
• Attitudes 
D• o…meatcin Specific… 
•Subjects 
•Issues 
•…etc 
User 
Specific 
Taxonomy 
User specific 
• Object 
• Features 
• Measures 
• Actions 
• Feelings 
• Attitudes 
• …etc 
User Specific… 
•Subjects 
•Issues 
•..etc
Supported languages… 
Significs© is currently able to analyze the following textual language… 
• Standard formal Arabic. 
• Egyptian common slang. 
• Transliterated Arabic texts (written in Latin characters). 
• Standard formal English. 
• GCC slangs. 
Significs© implements the Deep analysis approach to handle languages marked in 
BLUE while Shallow Analysis to handle languages marked in GREEN. Future 
versions of Significs© are planned to extend the deep analysis algorithms to the other 
languages marked in GREEN.
up to 20x1012 Areas of Significance 
Areas of significance represent the exact requirements of a certain user/customer. They 
could be represented in one or more (Correlated/Intersected) dimension. 
For example… 
 A celebrity may be interested to have a look on any mention about his/her objects 
(Persons, Places …etc.). 
 A company may want to automatically route all incoming mail that Needs Support of a 
certain Product to one of its affiliate companies and measure the customers Satisfaction. 
 A marketing agency may be interested to check the Feelings and Attitudes about a certain 
Component or Characteristic of one or all of its customers’ products. 
 A security agency may want to extract one or all Actions within a certain Subject that 
exceeds a certain threshold.
Available on almost all platforms… 
Significs© is available on the following formats… 
• Microsoft© Windows© .DLL. 
• MAC OSX, UNIX, LUNIX .LIB & .SO. 
• Web service. 
• WCF web service. 
• http request (with some limitations regarding input text size).
Unprecedented Performance… 
• 500’000 to 4’000’000 word per second depending 
on the analysis depth. Speed 
• Total data dictionaries sizes ranges from 20-30 
Mega Bytes. Storage 
• Core memory is about 80 Mega Bytes. 
• Typical memory per thread is about 2 Mega Bytes. Memory
Economic feasibility at a glance… 
$0.35 
$0.30 
$0.25 
$0.20 
$0.15 
$0.10 
$0.05 
$- 
For both Fetch-Cloud© and Significs© site licensing 
COST PER FEED ANALYSIS 
Manual cost Standard Professional Ultimate Open 
Cost per feed 
Number of feeds 
$0.001 / feed @ 200 
Million feeds. 
Edition 
Breakeven 
Number of 
feeds 
Standard 150K 
Professional 300K 
Ultimate 700K 
Open license 3.5M
Thank you 
Please feel free to contact us for 
any enquiry. 
We shall be delighted to arrange 
for a life demo at any time of 
your choice. 
Catering for the Arabic language needs in the Digital 
era 
Info Arab®, Arabic Information Systems, Alaa S. Al-Agamawi & Co. 
7 Mosadek Street, Dokki,, Cairo 12312, Egypt. 
Phone: +2 02 37 61 38 33 - Fax: +2 02 37 61 38 39 x115 
email: info@info-arab.com 
www.info-arab.com www.info-arab-technologies.com

More Related Content

Viewers also liked

Destacados febrero2016
Destacados febrero2016Destacados febrero2016
Destacados febrero2016Ramon Pineda
 
Endocrine dx co existing anesthesiology
Endocrine dx co existing anesthesiologyEndocrine dx co existing anesthesiology
Endocrine dx co existing anesthesiologyJingili Jingili
 
Destacados julio2016
Destacados julio2016Destacados julio2016
Destacados julio2016Ramon Pineda
 
My diary third term seminar
My diary third term seminarMy diary third term seminar
My diary third term seminarGerman Ramirez
 
Destacados enero2016
Destacados enero2016Destacados enero2016
Destacados enero2016Ramon Pineda
 
Large tolerance quality display unit for Mitutoyo Digimatic Gauges
Large tolerance quality display unit for Mitutoyo Digimatic GaugesLarge tolerance quality display unit for Mitutoyo Digimatic Gauges
Large tolerance quality display unit for Mitutoyo Digimatic GaugesKapayo New Delhi
 
Pel. 12 perkahwinan dalam islam
Pel. 12 perkahwinan dalam islamPel. 12 perkahwinan dalam islam
Pel. 12 perkahwinan dalam islamwanafidah
 
Fixture del Torneo de Primera División 2016/2017
Fixture del Torneo de Primera División 2016/2017Fixture del Torneo de Primera División 2016/2017
Fixture del Torneo de Primera División 2016/2017Gonzalo Reyes
 
Inotrope use in icu patient fink
Inotrope use in icu patient finkInotrope use in icu patient fink
Inotrope use in icu patient finkJingili Jingili
 

Viewers also liked (11)

Destacados febrero2016
Destacados febrero2016Destacados febrero2016
Destacados febrero2016
 
Hoja254
Hoja254Hoja254
Hoja254
 
Endocrine dx co existing anesthesiology
Endocrine dx co existing anesthesiologyEndocrine dx co existing anesthesiology
Endocrine dx co existing anesthesiology
 
Hoja253
Hoja253Hoja253
Hoja253
 
Destacados julio2016
Destacados julio2016Destacados julio2016
Destacados julio2016
 
My diary third term seminar
My diary third term seminarMy diary third term seminar
My diary third term seminar
 
Destacados enero2016
Destacados enero2016Destacados enero2016
Destacados enero2016
 
Large tolerance quality display unit for Mitutoyo Digimatic Gauges
Large tolerance quality display unit for Mitutoyo Digimatic GaugesLarge tolerance quality display unit for Mitutoyo Digimatic Gauges
Large tolerance quality display unit for Mitutoyo Digimatic Gauges
 
Pel. 12 perkahwinan dalam islam
Pel. 12 perkahwinan dalam islamPel. 12 perkahwinan dalam islam
Pel. 12 perkahwinan dalam islam
 
Fixture del Torneo de Primera División 2016/2017
Fixture del Torneo de Primera División 2016/2017Fixture del Torneo de Primera División 2016/2017
Fixture del Torneo de Primera División 2016/2017
 
Inotrope use in icu patient fink
Inotrope use in icu patient finkInotrope use in icu patient fink
Inotrope use in icu patient fink
 

Similar to The information supernova

ASAS 2014 - Simon Brown
ASAS 2014 - Simon BrownASAS 2014 - Simon Brown
ASAS 2014 - Simon BrownAvisi B.V.
 
Brownfield Domain Driven Design
Brownfield Domain Driven DesignBrownfield Domain Driven Design
Brownfield Domain Driven DesignNicolò Pignatelli
 
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found..."Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...Dataconomy Media
 
Introduction to ePublishing
Introduction to ePublishingIntroduction to ePublishing
Introduction to ePublishingApex CoVantage
 
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Karen Thompson
 
Domain Driven Design in the Browser - Cameron Edwards
Domain Driven Design in the Browser - Cameron EdwardsDomain Driven Design in the Browser - Cameron Edwards
Domain Driven Design in the Browser - Cameron EdwardsHakka Labs
 
Big Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-AriBig Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-AriDemi Ben-Ari
 
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Marko Grobelnik
 
.NET Architecture for Enterprises
.NET Architecture for Enterprises.NET Architecture for Enterprises
.NET Architecture for EnterprisesWade Wegner
 
Software Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and SecuritySoftware Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and SecurityTao Xie
 
EDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEuropean Data Forum
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015Bipin Singh
 
Gala Webminar September 2013
Gala Webminar September 2013Gala Webminar September 2013
Gala Webminar September 2013pangeanic
 
Choosing the right Technologies for your next unicorn.
Choosing the right Technologies for your next unicorn.Choosing the right Technologies for your next unicorn.
Choosing the right Technologies for your next unicorn.Gladson DSouza
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big DataFrank Kienle
 
Domain Driven Design Big Picture Strategic Patterns
Domain Driven Design Big Picture Strategic PatternsDomain Driven Design Big Picture Strategic Patterns
Domain Driven Design Big Picture Strategic PatternsMark Windholtz
 
The Yin and Yang of Software
The Yin and Yang of SoftwareThe Yin and Yang of Software
The Yin and Yang of Softwareelliando dias
 
Android application development
Android application developmentAndroid application development
Android application developmentLinh Vi Tường
 

Similar to The information supernova (20)

ASAS 2014 - Simon Brown
ASAS 2014 - Simon BrownASAS 2014 - Simon Brown
ASAS 2014 - Simon Brown
 
Microservices Architecture
Microservices ArchitectureMicroservices Architecture
Microservices Architecture
 
Brownfield Domain Driven Design
Brownfield Domain Driven DesignBrownfield Domain Driven Design
Brownfield Domain Driven Design
 
Let's talk about... Microservices
Let's talk about... MicroservicesLet's talk about... Microservices
Let's talk about... Microservices
 
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found..."Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
"Updates on Semantic Fingerprinting", Francisco Webber, Inventor and Co-Found...
 
Introduction to ePublishing
Introduction to ePublishingIntroduction to ePublishing
Introduction to ePublishing
 
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
Cis 555 Week 4 Assignment 2 Automated Teller Machine (Atm)...
 
Domain Driven Design in the Browser - Cameron Edwards
Domain Driven Design in the Browser - Cameron EdwardsDomain Driven Design in the Browser - Cameron Edwards
Domain Driven Design in the Browser - Cameron Edwards
 
Big Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-AriBig Data made easy in the era of the Cloud - Demi Ben-Ari
Big Data made easy in the era of the Cloud - Demi Ben-Ari
 
Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012Big Data Tutorial - Marko Grobelnik - 25 May 2012
Big Data Tutorial - Marko Grobelnik - 25 May 2012
 
.NET Architecture for Enterprises
.NET Architecture for Enterprises.NET Architecture for Enterprises
.NET Architecture for Enterprises
 
Software Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and SecuritySoftware Analytics: Data Analytics for Software Engineering and Security
Software Analytics: Data Analytics for Software Engineering and Security
 
EDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko GrobelnikEDF2013: Big Data Tutorial: Marko Grobelnik
EDF2013: Big Data Tutorial: Marko Grobelnik
 
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015TIBCO Advanced Analytics Meetup (TAAM) - June 2015
TIBCO Advanced Analytics Meetup (TAAM) - June 2015
 
Gala Webminar September 2013
Gala Webminar September 2013Gala Webminar September 2013
Gala Webminar September 2013
 
Choosing the right Technologies for your next unicorn.
Choosing the right Technologies for your next unicorn.Choosing the right Technologies for your next unicorn.
Choosing the right Technologies for your next unicorn.
 
Introduction Big Data
Introduction Big DataIntroduction Big Data
Introduction Big Data
 
Domain Driven Design Big Picture Strategic Patterns
Domain Driven Design Big Picture Strategic PatternsDomain Driven Design Big Picture Strategic Patterns
Domain Driven Design Big Picture Strategic Patterns
 
The Yin and Yang of Software
The Yin and Yang of SoftwareThe Yin and Yang of Software
The Yin and Yang of Software
 
Android application development
Android application developmentAndroid application development
Android application development
 

Recently uploaded

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 

Recently uploaded (20)

Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 

The information supernova

  • 1. The information SUPERNOVA THE THREAT AND THE CURE INTRODUCTION TO INFO ARAB ‘S TECHNOLOGIES BY; ALAA S. AL-AGAMAWI. SEPTEMBER 2014
  • 2. Technologies impact on data sizes… sizes in algorithmic scale Invention of Paper {China 105AD} Invention of writing {IRAQ 5000BC} Invention of Printing {Europe 1400AD} Invention of ICT {2000AD} 5000BC 4000BC 3000BC 2000BC 1000BC 0 1000AD 2000AD  Technology has always played the most major role in the growth of human knowledge.  With the ICT user generated phenomena, rate of growth is extraordinary.
  • 3. Before the SUPERNOVA explodes !!! WHEN A STAR GOES OUT OF CONTROL IT EXPLODES AND EATS EVERYTHING AROUND! IMAGINE … THE INFORMATION SUPERNOVA. WHEN DATA SIZE IS OUT OF CONTROL. TO CONTROL IT … WE NEED TO UNDERSTAND IT.
  • 5. The social networks… What happens in one minute in the social networks requires over… 1500 person days to read.
  • 6. Strongest FRIEND and Toughest ENEMY!!! Whether you are a movie star, a politician, a governmental agency, a broker or a brand owner, you should give adequate attention to what people are saying about you. In most cases, traditional monitoring is not enough and requires tens and hundreds of person days. And still cannot cover everything thoroughly.
  • 7. Your ONLY choice… 2 1 3 Go Is to utilize ICT to know what the people are saying about you and ACT accordingly. Collect Organize Act Anticipate Analyze
  • 8. The Winning TEAM… Collect Organize Analyze Anticipate Act BI Implementation Partner THIS You should do yourself
  • 9. A typical solution architecture… Data Sources Data Connectors Enterprise Servers world wide web Business Presentation Layer Data Gathering Layer Results Layer Significs©
  • 10. Proudly introducing Fetch Technologies©
  • 11. Historical background…  Fetch PC© 2006.  Fetch enterprise© 2009.  Fetch cloud© 2012.
  • 12. The dilemma… Relevant Area Irrelevant Area Irrelevant Area Typical crawling will bring many irrelevant data. That is…  Irrelevant information.  And waste of computer resources.
  • 13. Traditional crawling techniques… v In many cases search results goes to an irrelevant area.
  • 15. Dealing with all types of Sources… Having an API… Does not have an API…
  • 16. Only relevant data in structured format Title Brief Media Article body Publisher Author Date
  • 17. Classifies with Social media interactions…
  • 18. Never miss what you are looking for… You will never miss any mention that you are looking for with our semantic based search engines. Our engines are taking care of the following...  Basic search objects in a hierarchal format (e.g. a company, its senior managerial staff, its main brands, sub brands ...etc).  Totally morphed (i.e. in their singular, plural, masculine, feminine, adjectives ...etc.).  With all their Arabic/English synonyms and antonyms.  Taking care of spelling mistakes and grammatical errors.  Taking care of object types and their components.
  • 19. And everything is easily available… RSS XML JSON SaaS Timely In-site
  • 20. As fast as you may dream of… The above figure (from Microsoft© Azure © control panel) represents a typical day of usage of Fetch News© semantic crawling processing.  As shown in the figure…  Exactly 129.64 Mega Bytes of sources’ contents were captured, analyzed, and converted to a structured text of 41.02 Mega Bytes in 281.96 seconds –i.e. about 500 Kilo Byte per second including connectivity time..  Maximum memory usage was 129.75 Mega Bytes fulfilling the needs of 64 parallel threads running on a single core processor.
  • 21. Proudly introducing Querying the text streams
  • 22. Significs© Main focus… CONVERTING UNSTRUCTURED DATA TO STRUCTURED DATA AND EXTRACTING MEANINGFUL INDICATORS FOR THEIR CONTENTS WITH DISTINCT PERFORMANCE w.r.t various areas of significance.
  • 23. Analysis Dimensions… Descript or Modifi er Indecenci es Prover bs Danger Safety Liking Disliking
  • 24. Main Dimensions Foundations… Liking Disliking Safety Danger Basic Dimensions Time Place Subject Object Type Component Feature Descriptors Modifiers Indecencies Proverbs Sentiment Dimensions Needs Feelings Actions Attitude Object Dimensions Descriptor Dimensions
  • 25. Deep Analysis vs. shallow…  Shallow Analysis does not depend on the specific linguistic rules of the target language; thus can’t guarantee the precise understanding of the textual contents.  Deep Analysis uses specific linguistic rules of the target language; accordingly maximizes the accuracy of the semantic analysis results. Info Arab® utilizes its wealth of linguistic technologies (e.g. speller, morphing, grammar, thesaurus …etc.) in order to avail a robust deep analysis engine.
  • 26. 3 Levels of Taxonomies… Generic Taxonomy Basic Objects’ types Basic Features Basic Components Basic Measures Basic Actions Basic Feeling Basic Needs Basic attitudes …etc. Domain Specific Domain specific • Object • Features • Measures • Actions • Feelings • Attitudes D• o…meatcin Specific… •Subjects •Issues •…etc User Specific Taxonomy User specific • Object • Features • Measures • Actions • Feelings • Attitudes • …etc User Specific… •Subjects •Issues •..etc
  • 27. Supported languages… Significs© is currently able to analyze the following textual language… • Standard formal Arabic. • Egyptian common slang. • Transliterated Arabic texts (written in Latin characters). • Standard formal English. • GCC slangs. Significs© implements the Deep analysis approach to handle languages marked in BLUE while Shallow Analysis to handle languages marked in GREEN. Future versions of Significs© are planned to extend the deep analysis algorithms to the other languages marked in GREEN.
  • 28. up to 20x1012 Areas of Significance Areas of significance represent the exact requirements of a certain user/customer. They could be represented in one or more (Correlated/Intersected) dimension. For example…  A celebrity may be interested to have a look on any mention about his/her objects (Persons, Places …etc.).  A company may want to automatically route all incoming mail that Needs Support of a certain Product to one of its affiliate companies and measure the customers Satisfaction.  A marketing agency may be interested to check the Feelings and Attitudes about a certain Component or Characteristic of one or all of its customers’ products.  A security agency may want to extract one or all Actions within a certain Subject that exceeds a certain threshold.
  • 29. Available on almost all platforms… Significs© is available on the following formats… • Microsoft© Windows© .DLL. • MAC OSX, UNIX, LUNIX .LIB & .SO. • Web service. • WCF web service. • http request (with some limitations regarding input text size).
  • 30. Unprecedented Performance… • 500’000 to 4’000’000 word per second depending on the analysis depth. Speed • Total data dictionaries sizes ranges from 20-30 Mega Bytes. Storage • Core memory is about 80 Mega Bytes. • Typical memory per thread is about 2 Mega Bytes. Memory
  • 31. Economic feasibility at a glance… $0.35 $0.30 $0.25 $0.20 $0.15 $0.10 $0.05 $- For both Fetch-Cloud© and Significs© site licensing COST PER FEED ANALYSIS Manual cost Standard Professional Ultimate Open Cost per feed Number of feeds $0.001 / feed @ 200 Million feeds. Edition Breakeven Number of feeds Standard 150K Professional 300K Ultimate 700K Open license 3.5M
  • 32. Thank you Please feel free to contact us for any enquiry. We shall be delighted to arrange for a life demo at any time of your choice. Catering for the Arabic language needs in the Digital era Info Arab®, Arabic Information Systems, Alaa S. Al-Agamawi & Co. 7 Mosadek Street, Dokki,, Cairo 12312, Egypt. Phone: +2 02 37 61 38 33 - Fax: +2 02 37 61 38 39 x115 email: info@info-arab.com www.info-arab.com www.info-arab-technologies.com