“Making Mole-hills out of Mountains”
30-70% of Big Data is Unstructured
• Difficult to mine and analyze
• Ergo, Largely ignored
• Represents a potential gold
• NEED:: a seamless, structured
representation of unstructured
• Software and transformational processes that
uncovers business value in unstructured text
• Uses statistical, linguistic, machine learning, data
analysis and visualization techniques
• $2Bn market expected to grow @ 25% CAGR
Data Information Knowledge
WitnessTree: Text Analytics
Boost search accuracy
Analyze relevant data
Identify & Define themes
Content + contextual similarity
Dynamic categories, Named-Entity (people,
places, brands, dates), Facets (metadata –
real and derived)
WT Semantic Analysis Machine (SAM)
Thread Analyzer Topic Explorer Search & Facet
API/web service API/web service API/web service API/web service
Semantic Analysis Machine
with no prior
knowledge of docs
Reduce redundant docs by 40% to 60%
“on the fly”
Found 10,000 docs
WitnessTree hosted solution for legal eDiscovery
How to e-discover 10,000 from 1M?
“Find the Relevant. With intuitive ease."
boolean, proximity ,
Id’s Missing &
Application Platforms / Development Tools
WitnessTree Technology Stack
• Discover concepts.
• Cross-reference ideas.
• Connect the dots.
• Build relevant queries.
• Get results.
(Un)supervised Doc Clustering
• Clusters related documents
• Labels each cluster
• Detects recurring themes
• Filters based on relevancy
• Search Wide, Dig Deep
Named Entity Recognition
Crew members on the ISS will open the hatch
Monday and unload 2,780 pounds of supplies
and experiments, the news release said.
"From the men and women involved in the
design, integration and test, to those who
launched the Antares (rocket) and operated the
Cygnus, our whole team”, said David W.
Thompson, president and chief executive
officer of Orbital, in a written statement from
It will burn up during re-entry over the Pacific
Ocean, officials said.
Orbital has a $1.9 billion contract with NASA to
make eight flights to the space station under
the space agency's commercial supply
• Structured and unstructured (text) data
• API or web application
• Minimal training required.
• Web browser + internet connection
Easy to Use
• Hosted model, SaaS, Licensed in-houseFlexibility
• Document classification, visualization, categorization,
• State-of-the-art feature set, in placeRich Feature-set
• OEM, white-label, resellerPartnership Models