MarkLogic Overview Ron Avnur, Vice President of Engineering Version 1.3
About MarkLogic
- 200+ customers: Government, Media, Financial Services
- 200+ employees: Silicon Valley, Austin, Boston, Frankfurt, London, New York, Washington DC
- Flagship product is purpose-built DBMS for unstructured information
- Patented, award-winning technology
- MarkLogic is revolutionizing the way organizations leverage information
Leverage Valuable Information More Effectively
MarkLogic helps:
- Physicians better diagnose their patients
- Government agencies preserve and archive national treasures
- Traditional media companies move to new media
- Wall Street analyze complex derivatives positions
- Soldiers share information to perform their missions more effectively
Agenda
- Selected Customer Stories
- Core Technology
- New Features
Repurpose Content
- Over 25 reference texts
- Differential diagnosis tool
- Designed by pathologists for pathologists
Metadata Catalog
Information Sharing
Custom Publishing
Information Intelligence
For any set of articles:
- Find the most highly cited authors in a field
- Find and track hot topics in specific subject areas
- Check the most recent citation data on authors and articles
- Track and evaluate research trends
Social Applications
Mobile Applications
- Zinio: Full featured online magazines
- New England Journal of Medicine: Subscription-based application
- American Institute of Physics: Mobile access to journals
Agile Content Platform
Application Builder
Agenda
- Selected Customer Stories
- Core Technology
- New Features
The Industry’s Leading DBMS for Information Applications
MarkLogic enables agility, speed and scale
- MarkLogic Server
- Application Services
- Office Tool Kits
- SharePoint Connector
- Public and Private Cloud Deployment Options
Data Model
- A database for semi-structured information
- XML Data Model
[Diagram: document trees whose nodes include fpML Document, Trade, Product, Title, Author, Metadata, Cashflow, Section, ID, Last, First, TradeLeg, Amount, Event]
MarkLogic Server in 8 Adjectives
- XML-centric
- Transactional
- Search-centric
- Structure-aware
- Schema-free
- XQuery-driven
- Extremely fast
- Clustered
Database Server
The Universal Index
Range Indexes
UNIVERSAL INDEX (Term: Term List of Document References)
- “which”: 123, 127, 129, 152, 344, 791 . . .
- “uniquely”: 122, 125, 126, 129, 130, 167 . . .
- “identify”: 123, 126, 130, 142, 143, 167 . . .
- “each”: 123, 130, 131, 135, 162, 177 . . .
- “data base”: 126, 130, 167, 212, 219, 377 . . .
- <article>: . . .
- <article>/<abstract>: . . . 126, 130, 167, …
- <product>IMS</product>
- Directory(“/articles”)
- Collection(Red)
- Role:Editor + Action:Read
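The term lists on the Universal Index slide can be read as an inverted index: each term maps to a sorted list of document references, and a multi-term query is answered by intersecting those lists. The following is a minimal sketch of that idea in Python, not MarkLogic's implementation. The names `TERM_LISTS`, `intersect`, and `and_query` are hypothetical, and only the document references visible on the slide are used (the slide truncates every list with ". . .").

```python
from functools import reduce

# Term lists copied from the slide. The slide truncates each list,
# so only the visible document references appear here.
TERM_LISTS = {
    "which": [123, 127, 129, 152, 344, 791],
    "uniquely": [122, 125, 126, 129, 130, 167],
    "identify": [123, 126, 130, 142, 143, 167],
    "each": [123, 130, 131, 135, 162, 177],
    "data base": [126, 130, 167, 212, 219, 377],
}


def intersect(a, b):
    """Merge-intersect two sorted lists of document references."""
    i = j = 0
    out = []
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            out.append(a[i])
            i += 1
            j += 1
        elif a[i] < b[j]:
            i += 1
        else:
            j += 1
    return out


def and_query(index, terms):
    """Return the documents that contain every term (hypothetical helper)."""
    lists = [index.get(term, []) for term in terms]
    if not lists:
        return []
    # Starting with the shortest list keeps intermediate results small.
    lists.sort(key=len)
    return reduce(intersect, lists)


print(and_query(TERM_LISTS, ["uniquely", "identify", "data base"]))
# prints [126, 130, 167]
```

With the visible data, the three-term query resolves to documents 126, 130, and 167, the same references the slide shows next to the `<article>/<abstract>` entry.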
Shared-Nothing Architecture
[Diagram: hosts Host 1 through Host 6 and Host k; partitions partition1 through partition4 and partitionm]
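In a shared-nothing design, each host owns its partition of the data and a query is sent to every host, with the partial results merged afterwards. The sketch below illustrates that scatter-gather pattern only. The `Host` and `Cluster` classes, the CRC32-based placement, and the whitespace word matching are all assumptions made for illustration, not a description of how MarkLogic assigns or searches documents.

```python
import zlib


class Host:
    """A host that alone owns one partition of the documents."""

    def __init__(self, name):
        self.name = name
        self.docs = {}  # uri -> text, visible only to this host

    def search(self, word):
        # Each host searches only its own partition.
        return [uri for uri, text in self.docs.items() if word in text.split()]


class Cluster:
    """Hypothetical coordinator that scatters work and gathers results."""

    def __init__(self, host_names):
        self.hosts = [Host(name) for name in host_names]

    def _owner(self, uri):
        # Assumed placement rule: a deterministic hash of the URI.
        return self.hosts[zlib.crc32(uri.encode("utf-8")) % len(self.hosts)]

    def insert(self, uri, text):
        self._owner(uri).docs[uri] = text

    def search(self, word):
        hits = []
        for host in self.hosts:  # scatter the query to every host
            hits.extend(host.search(word))
        return sorted(hits)  # gather and merge the partial results


cluster = Cluster(["Host 1", "Host 2", "Host 3"])
cluster.insert("/articles/a.xml", "each data base")
cluster.insert("/articles/b.xml", "uniquely identify each")
print(cluster.search("each"))
```

Because no host reads another host's partition, adding hosts adds both storage and search capacity, at the cost of a merge step for every query.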
Agenda
- Selected Customer Stories
- Core Technology
- New Features
4.2 Highlights
XSLT support; Flexible replication; Shared-nothing failover; Collector framework; Filesystem collector; Proximity boost; Welcome page; Compartment security; Date/time parsing; Pluggable password policy; Client dictionaries for stemming; Server fields; AWS wrappers; File upload collector; New search API constraints; Easier to extend App Builder; Database rollback; XSLT rendering in App Builder; Date canonicalization pipeline; Renaming pipeline; Wildcard search optimizations; Distinctive terms; Result clustering; Min / max occurs query; Fast calculation of aggregates; XDQP metering & Merge priority; Search plan output; Near-query optimizations; TF normalization controls; Extended query cancelling; Range query caching; Fast sampling; Information Studio; XSLT and XQuery cross-import; cts:document-fragment-query; Standalone properties; Language detection; Encoding detection; Bulk load; Pipeline monitoring
Application Builder Extensions
New Tool: Information Studio
- Easier to load content
- Build applications faster
Features: Collect, Transform, Configure, Load, Monitor
Information Studio API
- Manage flows
- Monitor progress
- Collector framework
- Plug-in development
- Custom transforms
XSLT
4.2 Highlights: Search
XSLT support; Flexible replication; Shared-nothing failover; Collector framework; Filesystem collector; Proximity boost; Welcome page; Compartment security; Result clustering; Pluggable password policy; Client dictionaries for stemming; TF normalization controls; AWS wrappers; File upload collector; New search API constraints; Easier to extend App Builder; Database rollback; Standalone properties; XSLT rendering in App Builder; Date canonicalization pipeline; Renaming pipeline; Distinctive terms; Min / max occurs query; Date/time parsing; Encoding detection; XDQP metering & Merge priority; Search plan output; Near-query optimizations; Extended query cancelling; Range query caching; Server fields; Fast sampling; Information Studio; XSLT and XQuery cross-import; cts:document-fragment-query; Wildcard search optimizations; Language detection; Fast calculation of aggregates; Bulk load; Pipeline monitoring
Replication
Replication
Failover
[Diagram: hosts Host 1 through Host 6 and Host k; CFS & SAN storage holding partition1 through partition4]
Failover: A New Option
[Diagram: hosts Host 1 through Host 6 and Host k; partitions P1 through P4, each shown twice; Local Storage]
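The second failover slide shows each partition (P1 through P4) twice on local storage, which suggests that a copy on another host can take over when a host fails. The sketch below illustrates that routing decision under stated assumptions: the `PLACEMENT` table is hypothetical (the slide does not say which host holds which copy), and `serving_host` and `routing_table` are illustrative names rather than MarkLogic APIs.

```python
# Hypothetical placement: each partition has two copies on different hosts,
# listed in order of preference. The slide does not give the real mapping.
PLACEMENT = {
    "P1": ["Host 1", "Host 2"],
    "P2": ["Host 2", "Host 3"],
    "P3": ["Host 3", "Host 4"],
    "P4": ["Host 4", "Host 1"],
}


def serving_host(partition, failed_hosts):
    """Return the first host with a copy of the partition that is still up."""
    for host in PLACEMENT[partition]:
        if host not in failed_hosts:
            return host
    return None  # both copies are unavailable


def routing_table(failed_hosts=frozenset()):
    """Map every partition to the host that should currently serve it."""
    return {p: serving_host(p, failed_hosts) for p in PLACEMENT}


print(routing_table())            # every partition served by its first copy
print(routing_table({"Host 2"}))  # P2 moves to its copy on Host 3
```

In this sketch a single host failure never makes a partition unavailable, because no host holds both copies of the same partition.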
Point-In-Time Recovery
- Have: [timeline: Last Backup, Bad Transaction, Now]
- Want: [timeline: Last Backup, Just Before Bad Transaction]
Instantaneous Point-In-Time Recovery
- Replay journals [timeline: Last Backup, Just Before Bad Transaction]
- Recover to timestamp [timeline: Last Backup, Just Before Bad Transaction]
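The two recovery options named on this slide can be contrasted with a small sketch. The first function restores the last backup and replays journal entries up to just before the bad transaction. The second keeps timestamped versions of every value and simply discards the versions written after the chosen timestamp. Both are illustrations under assumed data structures: `replay_journal`, `VersionedStore`, the integer timestamps, and the use of `None` as a delete marker are hypothetical and do not represent MarkLogic's journal format or API.

```python
def replay_journal(backup, journal, stop_before):
    """Restore the backup, then replay journal entries older than stop_before.

    journal is a list of (timestamp, key, value) in timestamp order;
    a value of None marks a delete.
    """
    state = dict(backup)
    for ts, key, value in journal:
        if ts >= stop_before:
            break
        if value is None:
            state.pop(key, None)
        else:
            state[key] = value
    return state


class VersionedStore:
    """Keeps every timestamped version so recovery needs no replay."""

    def __init__(self):
        self.versions = {}  # key -> list of (timestamp, value), oldest first

    def write(self, ts, key, value):
        self.versions.setdefault(key, []).append((ts, value))

    def rollback_to(self, ts):
        """Discard every version written after the given timestamp."""
        for key in list(self.versions):
            kept = [(t, v) for t, v in self.versions[key] if t <= ts]
            if kept:
                self.versions[key] = kept
            else:
                del self.versions[key]

    def current(self):
        """Latest value per key, hiding keys whose latest version is a delete."""
        return {k: vs[-1][1] for k, vs in self.versions.items()
                if vs[-1][1] is not None}


# The bad transaction at timestamp 13 deletes "a".
backup = {"a": 1}  # state as of the last backup (timestamp 10)
journal = [(11, "b", 2), (12, "a", 3), (13, "a", None)]
print(replay_journal(backup, journal, stop_before=13))

store = VersionedStore()
for ts, key, value in [(10, "a", 1)] + journal:
    store.write(ts, key, value)
store.rollback_to(12)
print(store.current())
```

Both paths end in the same state. The difference is cost: replay time grows with the amount of journal written since the last backup, while the timestamp rollback only drops newer versions.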

Thank You
Ron Avnur, ron.avnur@marklogic.com

Editor's Notes

  1. Show e-nodes and d-nodes, then click and say that they’re the same binary