SlideShare a Scribd company logo
1 of 28
Spinque
A Story of Adaptation
Arjen P. de Vries
@spinque
ECIR Industry Day, 2015
background
Information Retrieval and DB integration
Cornacchia et al. Flexible and efficient IR using Array Databases. VLDB‘08 Journal
Mühleisen et al. Column Stores as an IR Prototyping Tool. ECIR’14 & SIGIR’14
Best student paper, ECIR 2007
background
concept
Information Retrieval and DB integration
Cornacchia et al. Flexible and efficient IR using Array Databases. VLDB‘08 Journal
Mühleisen et al. Column Stores as an IR Prototyping Tool. ECIR’14 & SIGIR’14
Search by Strategy
Alink et al. Searching CLEF-IP by strategy. CLEF’09
PatOlympics, 2010 and 2011
Norbert Fuhr, Thomas Rölleke. A Probabilistic Relational Algebra for the Integration of Information Retrieval and Database Systems (1994)
Search by Strategy
http://www.comsode.eu/index.php/2014/11/building-an-application-on-open-data-with-spinque/
Data silos
SQL
CSV
XML
HTML
OAI
JSON
Linked DataComplex Tasks
Generate links Linked DataSearch Strategies
Custom search engines
API
STRATEGY EDITOR COMPILERINDEXING PIPELINE
SQL
CSV
HTML
OAI
XML
APPLICATIONS
background
concept
product
Information Retrieval and DB integration
Cornacchia et al. Flexible and efficient IR using Array Databases. VLDB‘08 Journal
Mühleisen et al. Column Stores as an IR Prototyping Tool. ECIR’14 & SIGIR’14
Search by Strategy
Alink et al. Searching CLEF-IP by strategy. CLEF’09
PatOlympics, 2010 and 2011
Tailored access to connected datasets
February 2013
“What happens if an
entrepreneur approaches
an angel investor, a VC
company, a bank, or a
prospective buy-out
partner? What sort of
numbers are typical in the
life-cycle of a company?
What's this "Series A"
people go on about?”
Technology mindedness
Original internal codename: 5F (Flexibility, eFFiciency, and eFFectiveness)
Work on projects because we think they are cool (instead of work for a client with a business need)
Technology mindedness
Original internal codename: 5F (Flexibility, eFFiciency, and eFFectiveness)
Work on projects because we think they are cool (instead of work for a client with a business need)
Work on projects because we think they are cool (instead of work for a paying customer with a business need)
Target: partner with other, less technology minded companies
Look & feel
Effect not to be underestimated...
Method: partner with other, UI/UX minded companies
Community Platform
1. Combine recency and a theme’s terms
2. Include URLs
mentioned in
related Tweets
3. Include 300+ RSS
feeds related to a
theme
4. Emphasize the
tweets originating
from the
community
Bottlenecks…
Open source “competitors” are not zero cost
(they are zero cost entry!)
HTTP://XKCD.COM/292/
Bottlenecks…
Open source “competitors” are not zero cost
(they are zero cost entry)
Spinque creates search engines, but who
creates the index?!
Spinque LINK
National “alignment service” for the cultural
heritage sector in The Netherlands
www.beeldengeluid.nl www.comsode.eu www.spinque.com
Spinque Link
screencast

More Related Content

More from Arjen de Vries

Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6) Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6) Arjen de Vries
 
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)Arjen de Vries
 
Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineArjen de Vries
 
Information Retrieval and Social Media
Information Retrieval and Social MediaInformation Retrieval and Social Media
Information Retrieval and Social MediaArjen de Vries
 
Information Retrieval intro TMM
Information Retrieval intro TMMInformation Retrieval intro TMM
Information Retrieval intro TMMArjen de Vries
 
ACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC ChairsACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC ChairsArjen de Vries
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master SpecialisationArjen de Vries
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big DataArjen de Vries
 
Bigdata processing with Spark - part II
Bigdata processing with Spark - part IIBigdata processing with Spark - part II
Bigdata processing with Spark - part IIArjen de Vries
 
Bigdata processing with Spark
Bigdata processing with SparkBigdata processing with Spark
Bigdata processing with SparkArjen de Vries
 
TREC 2016: Looking Forward Panel
TREC 2016: Looking Forward PanelTREC 2016: Looking Forward Panel
TREC 2016: Looking Forward PanelArjen de Vries
 
The personal search engine
The personal search engineThe personal search engine
The personal search engineArjen de Vries
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationArjen de Vries
 
Better Contextual Suggestions by Applying Domain Knowledge
Better Contextual Suggestions by Applying Domain KnowledgeBetter Contextual Suggestions by Applying Domain Knowledge
Better Contextual Suggestions by Applying Domain KnowledgeArjen de Vries
 
ESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social MediaESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social MediaArjen de Vries
 
Looking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterpriseLooking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterpriseArjen de Vries
 
Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Arjen de Vries
 
Searching Political Data by Strategy
Searching Political Data by StrategySearching Political Data by Strategy
Searching Political Data by StrategyArjen de Vries
 
How to Search Annotated Text by Strategy?
How to Search Annotated Text by Strategy?How to Search Annotated Text by Strategy?
How to Search Annotated Text by Strategy?Arjen de Vries
 
How to build the next 1000 search engines?!
How to build the next 1000 search engines?! How to build the next 1000 search engines?!
How to build the next 1000 search engines?! Arjen de Vries
 

More from Arjen de Vries (20)

Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6) Beverwedstrijd Big Data (klas 3/4/5/6)
Beverwedstrijd Big Data (klas 3/4/5/6)
 
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
Beverwedstrijd Big Data (groep 5/6 en klas 1/2)
 
Web Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search EngineWeb Archives and the dream of the Personal Search Engine
Web Archives and the dream of the Personal Search Engine
 
Information Retrieval and Social Media
Information Retrieval and Social MediaInformation Retrieval and Social Media
Information Retrieval and Social Media
 
Information Retrieval intro TMM
Information Retrieval intro TMMInformation Retrieval intro TMM
Information Retrieval intro TMM
 
ACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC ChairsACM SIGIR 2017 - Opening - PC Chairs
ACM SIGIR 2017 - Opening - PC Chairs
 
Data Science Master Specialisation
Data Science Master SpecialisationData Science Master Specialisation
Data Science Master Specialisation
 
PUC Masterclass Big Data
PUC Masterclass Big DataPUC Masterclass Big Data
PUC Masterclass Big Data
 
Bigdata processing with Spark - part II
Bigdata processing with Spark - part IIBigdata processing with Spark - part II
Bigdata processing with Spark - part II
 
Bigdata processing with Spark
Bigdata processing with SparkBigdata processing with Spark
Bigdata processing with Spark
 
TREC 2016: Looking Forward Panel
TREC 2016: Looking Forward PanelTREC 2016: Looking Forward Panel
TREC 2016: Looking Forward Panel
 
The personal search engine
The personal search engineThe personal search engine
The personal search engine
 
Models for Information Retrieval and Recommendation
Models for Information Retrieval and RecommendationModels for Information Retrieval and Recommendation
Models for Information Retrieval and Recommendation
 
Better Contextual Suggestions by Applying Domain Knowledge
Better Contextual Suggestions by Applying Domain KnowledgeBetter Contextual Suggestions by Applying Domain Knowledge
Better Contextual Suggestions by Applying Domain Knowledge
 
ESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social MediaESSIR 2013 - IR and Social Media
ESSIR 2013 - IR and Social Media
 
Looking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterpriseLooking beyond plain text for document representation in the enterprise
Looking beyond plain text for document representation in the enterprise
 
Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?Recommendation and Information Retrieval: Two Sides of the Same Coin?
Recommendation and Information Retrieval: Two Sides of the Same Coin?
 
Searching Political Data by Strategy
Searching Political Data by StrategySearching Political Data by Strategy
Searching Political Data by Strategy
 
How to Search Annotated Text by Strategy?
How to Search Annotated Text by Strategy?How to Search Annotated Text by Strategy?
How to Search Annotated Text by Strategy?
 
How to build the next 1000 search engines?!
How to build the next 1000 search engines?! How to build the next 1000 search engines?!
How to build the next 1000 search engines?!
 

Spinque at the ECIR 2015 Industry Day

Editor's Notes

  1. Background: Spinque started as a spinoff company from CWI. Research on Information Retrieval on top of Databases. People not familiar with this work, one of the interesting advantages as that it creates a flexible environment where you can intuitively combine keyword search with structured queries. Implement XML search, Graph search, Feature-based search??? This flexibility is one of the key features from Spinque. Concept: Spinque started in the context of a project to search patents. Finding patents is a complex task performed by domain specialists. Patents are structured documents and linked to references and external documents. Specialists know how to search in which parts. Spinque provided the environment to express complex search “strategy”. Not as a query, but as the algorithm. Spinque won the PatOlympics twice, 2010, 2011. Evolved into a technology and toolset to make connected data accessible. Commercial and research projects to build search engines for ...