SlideShare a Scribd company logo
Leveraging SQL Server
     to Improve Vector
 Display through Point
             Clustering
The Problem…
 Due to Java Script limitation,
 Large Numbers of Vectors don’t
 draw in Web-Mapping
 Environments.
 Cutoff is ~ 300-500 Features
Current ways to solve…
1. Use Silverlight or Flash
   Increases capacity roughly an order of magnitude, but similar
    limitation exists.

2. Show Pictures, Query Features
   Has the capability to show and query ALL features for a
    pleasant user experience, but takes time to build tile cash (the
    pretty pictures).
   However, Tile Caches are Large and are, basically, redundant
    storage.

3. Use a WMS to feed ‘pictures’ dynamically and then use
   a separate, spatial query for feature attributes.
   Extra ‘moving’ parts in the solution.
   Possibly extra cost in licenses.
Why Invent a ‘Better’
     Solution???
 Original Design…
   Texas Hydrologic Information System (TexasHIS)
   www.waterdatafortexas.org
   Client desires immediate access to vector data
    and attributes

 Data Characteristics
   Lots of tightly packed ‘Sites’ ~ 300K+
   data with gobs of related attribute data ~
    5,000,000+ related records!
Factors Affecting a Good ‘Clustering’
 The Data
               Solution…
   Density
   Distribution

 The Scale
   Large – Basically Draw everything
   ‘Medium’ – Draw some, Cluster some – Lots of Code and
    Logic!
   Small – Basically Cluster everything
 Logic
   Thresholds/Limits of when to Cluster vs. When to Draw
   How to Carve the display to deal with Data Distribution
   Complex Algorithms to determine the optimal number of
    features to draw.
 Create a ‘smart’ solution that…
  Isn’t affected by the java script limit
  Only queries the database one time
   (other solutions can query the
   database for image creation and
   attribute retrieval.
  Is still capable of retrieving vector
   attributes for single features
Solution Basics…
1. Drawing Thresholds
    When to Draw everything
    When to Cluster
    When to take no action
2. Carving The Display
    At first glance, a 16 x 16 ‘Grid’ would be
     best. It would yield 256 Cluster ‘Features’
3. Clustering Logic
    Draw the Singles
    Cluster Everything Else
Solution Basics: Thresholds…
1. Drawing Thresholds
  Draw Everything – if the total number
   of Points to ‘cluster’ is below 500
  Cluster – If there are 500 (Lower Limit)
   to 5000 (Upper Limit)
  Above the Upper Limit – Take no
   action. The Clustering Query becomes
   too expensive. So much so that user
   experience is drastically diminished.
Solution Basics: Carving up the Display
     and Applying the Logic Rules…

 Spatial Indexes are Similar in concept
 16 x 16 creates 256 Cells – Near the upper limit of
  our display threshold
 Polygons with more than 1 Site are ‘Cluster
  Candidates’. The Centroid of the polygon will
  represent the cluster spatially.
 Threshold values are variables so that they are
  easily ‘tuned’ in a series of runs.
Solution Basics: How it works…
  The Envelope of the Display is passed as a parameter
  First Function, a Scalar-Valued Function, determines
   how ‘big’ the cells will be in the X and Y direction –
   Returns a comma separated number pair –
   “0.3456778,0.3456777”
  Second Function, a Table-Valued Function, returns a
   table of polygon cells that are ‘built’ from the output
   of the SVF and the starting point of the Display
   Envelope. A ‘cutter’ variable is used to build-out
   polygons with @var_Y rows and @var_X columns.
  The results of these functions are then used to
   0perate on the ‘Sites’ point dataset.
X
Y
At Long Last…




 DEMO!!!
Further Improvements…
 Add logic to always draw ‘up-to’
 the lower limit.
 Possibly add some ‘weight’ to the
 clustered point so that it doesn’t
 show in such a ‘regular’ way.
 Others…
TNRIS Information Services…

   Richard Wade – Team Lead
   Ryan Mitchell – Web Systems and Data Czar
   Yvette Giraud – Web Development
   Ragunath Jayabalakrishnan – Developer (Contract)
   Chris Williams – Database Administrator
www.foursquare.com

More Related Content

Similar to Leveraging sql server to improve vector display through point clustering

Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
andyman3000
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
Sanghamitra Deb
 
Clustrix Database Percona Ruby on Rails benchmark
Clustrix Database Percona Ruby on Rails benchmarkClustrix Database Percona Ruby on Rails benchmark
Clustrix Database Percona Ruby on Rails benchmark
Clustrix
 
How we evolved data pipeline at Celtra and what we learned along the way
How we evolved data pipeline at Celtra and what we learned along the wayHow we evolved data pipeline at Celtra and what we learned along the way
How we evolved data pipeline at Celtra and what we learned along the way
Grega Kespret
 
http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151
xlight
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streaming
Adam Doyle
 
Large Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache SparkLarge Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache Spark
Cloudera, Inc.
 
High Throughput Analytics with Cassandra & Azure
High Throughput Analytics with Cassandra & AzureHigh Throughput Analytics with Cassandra & Azure
High Throughput Analytics with Cassandra & Azure
DataStax Academy
 
AWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
AWS Summit 2013 | India - Big Data Analytics, Abhishek SinhaAWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
AWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
Amazon Web Services
 
Machine Learning with ML.NET and Azure - Andy Cross
Machine Learning with ML.NET and Azure - Andy CrossMachine Learning with ML.NET and Azure - Andy Cross
Machine Learning with ML.NET and Azure - Andy Cross
Andrew Flatters
 
Avoiding big data antipatterns
Avoiding big data antipatternsAvoiding big data antipatterns
Avoiding big data antipatterns
grepalex
 
Real Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with SparkReal Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with Spark
DataWorks Summit/Hadoop Summit
 
Real time machine learning visualization with spark -- Hadoop Summit 2016
Real time machine learning visualization with spark -- Hadoop Summit 2016Real time machine learning visualization with spark -- Hadoop Summit 2016
Real time machine learning visualization with spark -- Hadoop Summit 2016
Chester Chen
 
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Spark Summit
 
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Databricks
 
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Karthik Murugesan
 
SPL_ALL_EN.pptx
SPL_ALL_EN.pptxSPL_ALL_EN.pptx
SPL_ALL_EN.pptx
政宏 张
 
Data Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup GroupData Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup Group
Doug Needham
 
Cloudera Data Science Challenge
Cloudera Data Science ChallengeCloudera Data Science Challenge
Cloudera Data Science Challenge
Mark Nichols, P.E.
 
Web based interactive big data visualization
Web based interactive big data visualizationWeb based interactive big data visualization
Web based interactive big data visualization
Wenli Zhang
 

Similar to Leveraging sql server to improve vector display through point clustering (20)

Storage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems PresentationStorage Systems for High Scalable Systems Presentation
Storage Systems for High Scalable Systems Presentation
 
Computer Vision for Beginners
Computer Vision for BeginnersComputer Vision for Beginners
Computer Vision for Beginners
 
Clustrix Database Percona Ruby on Rails benchmark
Clustrix Database Percona Ruby on Rails benchmarkClustrix Database Percona Ruby on Rails benchmark
Clustrix Database Percona Ruby on Rails benchmark
 
How we evolved data pipeline at Celtra and what we learned along the way
How we evolved data pipeline at Celtra and what we learned along the wayHow we evolved data pipeline at Celtra and what we learned along the way
How we evolved data pipeline at Celtra and what we learned along the way
 
http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151http://www.hfadeel.com/Blog/?p=151
http://www.hfadeel.com/Blog/?p=151
 
Spark ml streaming
Spark ml streamingSpark ml streaming
Spark ml streaming
 
Large Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache SparkLarge Scale Machine Learning with Apache Spark
Large Scale Machine Learning with Apache Spark
 
High Throughput Analytics with Cassandra & Azure
High Throughput Analytics with Cassandra & AzureHigh Throughput Analytics with Cassandra & Azure
High Throughput Analytics with Cassandra & Azure
 
AWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
AWS Summit 2013 | India - Big Data Analytics, Abhishek SinhaAWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
AWS Summit 2013 | India - Big Data Analytics, Abhishek Sinha
 
Machine Learning with ML.NET and Azure - Andy Cross
Machine Learning with ML.NET and Azure - Andy CrossMachine Learning with ML.NET and Azure - Andy Cross
Machine Learning with ML.NET and Azure - Andy Cross
 
Avoiding big data antipatterns
Avoiding big data antipatternsAvoiding big data antipatterns
Avoiding big data antipatterns
 
Real Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with SparkReal Time Machine Learning Visualization with Spark
Real Time Machine Learning Visualization with Spark
 
Real time machine learning visualization with spark -- Hadoop Summit 2016
Real time machine learning visualization with spark -- Hadoop Summit 2016Real time machine learning visualization with spark -- Hadoop Summit 2016
Real time machine learning visualization with spark -- Hadoop Summit 2016
 
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)Advanced Data Science on Spark-(Reza Zadeh, Stanford)
Advanced Data Science on Spark-(Reza Zadeh, Stanford)
 
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
 
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
Apache Spark-Based Stratification Library for Machine Learning Use Cases at N...
 
SPL_ALL_EN.pptx
SPL_ALL_EN.pptxSPL_ALL_EN.pptx
SPL_ALL_EN.pptx
 
Data Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup GroupData Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup Group
 
Cloudera Data Science Challenge
Cloudera Data Science ChallengeCloudera Data Science Challenge
Cloudera Data Science Challenge
 
Web based interactive big data visualization
Web based interactive big data visualizationWeb based interactive big data visualization
Web based interactive big data visualization
 

More from Texas Natural Resources Information System

Txgio presentation rgsm_gps_pearson_062012
Txgio presentation rgsm_gps_pearson_062012Txgio presentation rgsm_gps_pearson_062012
Txgio presentation rgsm_gps_pearson_062012
Texas Natural Resources Information System
 
Usslsc cgsic regional austin 061312
Usslsc cgsic regional austin 061312Usslsc cgsic regional austin 061312
Usslsc cgsic regional austin 061312
Texas Natural Resources Information System
 
Using gps technology at the texas general land
Using gps technology at the texas general landUsing gps technology at the texas general land
Using gps technology at the texas general land
Texas Natural Resources Information System
 
Ussls austin civil utility - klein - 061312 - final
Ussls austin   civil utility - klein - 061312 - finalUssls austin   civil utility - klein - 061312 - final
Ussls austin civil utility - klein - 061312 - final
Texas Natural Resources Information System
 
Tx dot gps applications fuegner
Tx dot gps applications   fuegnerTx dot gps applications   fuegner
Tx dot gps applications fuegner
Texas Natural Resources Information System
 
Tnris 2012
Tnris 2012Tnris 2012
Nmc ussls charter 2012
Nmc ussls charter 2012Nmc ussls charter 2012
Nationwide dgps (ndgps) lt mendoza
Nationwide dgps (ndgps)   lt mendozaNationwide dgps (ndgps)   lt mendoza
Nationwide dgps (ndgps) lt mendoza
Texas Natural Resources Information System
 
Lyle tamucc 2
Lyle tamucc 2Lyle tamucc 2
Dirks cgsic brief 2012
Dirks cgsic brief 2012Dirks cgsic brief 2012
Connected vehicle highway network applications
Connected vehicle highway network applicationsConnected vehicle highway network applications
Connected vehicle highway network applications
Texas Natural Resources Information System
 
Cgsic presentation humphreys
Cgsic presentation humphreysCgsic presentation humphreys
Cgsic presentation humphreys
Texas Natural Resources Information System
 
Cgsic navcen
Cgsic navcenCgsic navcen
Gnss international policy regional cgsic (austin - jun2012)
Gnss international policy   regional cgsic (austin - jun2012)Gnss international policy   regional cgsic (austin - jun2012)
Gnss international policy regional cgsic (austin - jun2012)
Texas Natural Resources Information System
 
From creekology to rocket science the evolution of remote sensing gis in oilg...
From creekology to rocket science the evolution of remote sensing gis in oilg...From creekology to rocket science the evolution of remote sensing gis in oilg...
From creekology to rocket science the evolution of remote sensing gis in oilg...
Texas Natural Resources Information System
 
Early warning forecast of an oil spill bp deepwater horizon in the gulf
Early warning forecast of an oil spill bp deepwater horizon in the gulfEarly warning forecast of an oil spill bp deepwater horizon in the gulf
Early warning forecast of an oil spill bp deepwater horizon in the gulf
Texas Natural Resources Information System
 
We are the music makers and we are the dreamers of dreams
We are the music makers and we are the dreamers of dreamsWe are the music makers and we are the dreamers of dreams
We are the music makers and we are the dreamers of dreams
Texas Natural Resources Information System
 
Volunteered geographic information (vgi) for the national map
Volunteered geographic information (vgi) for the national mapVolunteered geographic information (vgi) for the national map
Volunteered geographic information (vgi) for the national map
Texas Natural Resources Information System
 
Uav image recognition technology and applications
Uav image recognition technology and applicationsUav image recognition technology and applications
Uav image recognition technology and applications
Texas Natural Resources Information System
 
Texas high water marks crowd sourcing history, culture, and geography
Texas high water marks   crowd sourcing history, culture, and geographyTexas high water marks   crowd sourcing history, culture, and geography
Texas high water marks crowd sourcing history, culture, and geography
Texas Natural Resources Information System
 

More from Texas Natural Resources Information System (20)

Txgio presentation rgsm_gps_pearson_062012
Txgio presentation rgsm_gps_pearson_062012Txgio presentation rgsm_gps_pearson_062012
Txgio presentation rgsm_gps_pearson_062012
 
Usslsc cgsic regional austin 061312
Usslsc cgsic regional austin 061312Usslsc cgsic regional austin 061312
Usslsc cgsic regional austin 061312
 
Using gps technology at the texas general land
Using gps technology at the texas general landUsing gps technology at the texas general land
Using gps technology at the texas general land
 
Ussls austin civil utility - klein - 061312 - final
Ussls austin   civil utility - klein - 061312 - finalUssls austin   civil utility - klein - 061312 - final
Ussls austin civil utility - klein - 061312 - final
 
Tx dot gps applications fuegner
Tx dot gps applications   fuegnerTx dot gps applications   fuegner
Tx dot gps applications fuegner
 
Tnris 2012
Tnris 2012Tnris 2012
Tnris 2012
 
Nmc ussls charter 2012
Nmc ussls charter 2012Nmc ussls charter 2012
Nmc ussls charter 2012
 
Nationwide dgps (ndgps) lt mendoza
Nationwide dgps (ndgps)   lt mendozaNationwide dgps (ndgps)   lt mendoza
Nationwide dgps (ndgps) lt mendoza
 
Lyle tamucc 2
Lyle tamucc 2Lyle tamucc 2
Lyle tamucc 2
 
Dirks cgsic brief 2012
Dirks cgsic brief 2012Dirks cgsic brief 2012
Dirks cgsic brief 2012
 
Connected vehicle highway network applications
Connected vehicle highway network applicationsConnected vehicle highway network applications
Connected vehicle highway network applications
 
Cgsic presentation humphreys
Cgsic presentation humphreysCgsic presentation humphreys
Cgsic presentation humphreys
 
Cgsic navcen
Cgsic navcenCgsic navcen
Cgsic navcen
 
Gnss international policy regional cgsic (austin - jun2012)
Gnss international policy   regional cgsic (austin - jun2012)Gnss international policy   regional cgsic (austin - jun2012)
Gnss international policy regional cgsic (austin - jun2012)
 
From creekology to rocket science the evolution of remote sensing gis in oilg...
From creekology to rocket science the evolution of remote sensing gis in oilg...From creekology to rocket science the evolution of remote sensing gis in oilg...
From creekology to rocket science the evolution of remote sensing gis in oilg...
 
Early warning forecast of an oil spill bp deepwater horizon in the gulf
Early warning forecast of an oil spill bp deepwater horizon in the gulfEarly warning forecast of an oil spill bp deepwater horizon in the gulf
Early warning forecast of an oil spill bp deepwater horizon in the gulf
 
We are the music makers and we are the dreamers of dreams
We are the music makers and we are the dreamers of dreamsWe are the music makers and we are the dreamers of dreams
We are the music makers and we are the dreamers of dreams
 
Volunteered geographic information (vgi) for the national map
Volunteered geographic information (vgi) for the national mapVolunteered geographic information (vgi) for the national map
Volunteered geographic information (vgi) for the national map
 
Uav image recognition technology and applications
Uav image recognition technology and applicationsUav image recognition technology and applications
Uav image recognition technology and applications
 
Texas high water marks crowd sourcing history, culture, and geography
Texas high water marks   crowd sourcing history, culture, and geographyTexas high water marks   crowd sourcing history, culture, and geography
Texas high water marks crowd sourcing history, culture, and geography
 

Recently uploaded

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
Pixlogix Infotech
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
名前 です男
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Zilliz
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Vladimir Iglovikov, Ph.D.
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
Octavian Nadolu
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
TIPNGVN2
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 

Recently uploaded (20)

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website20 Comprehensive Checklist of Designing and Developing a Website
20 Comprehensive Checklist of Designing and Developing a Website
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
みなさんこんにちはこれ何文字まで入るの?40文字以下不可とか本当に意味わからないけどこれ限界文字数書いてないからマジでやばい文字数いけるんじゃないの?えこ...
 
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
Introducing Milvus Lite: Easy-to-Install, Easy-to-Use vector database for you...
 
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIEnchancing adoption of Open Source Libraries. A case study on Albumentations.AI
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AI
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Artificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopmentArtificial Intelligence for XMLDevelopment
Artificial Intelligence for XMLDevelopment
 
Data structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdfData structures and Algorithms in Python.pdf
Data structures and Algorithms in Python.pdf
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 

Leveraging sql server to improve vector display through point clustering

  • 1. Leveraging SQL Server to Improve Vector Display through Point Clustering
  • 2. The Problem…  Due to Java Script limitation, Large Numbers of Vectors don’t draw in Web-Mapping Environments.  Cutoff is ~ 300-500 Features
  • 3. Current ways to solve… 1. Use Silverlight or Flash  Increases capacity roughly an order of magnitude, but similar limitation exists. 2. Show Pictures, Query Features  Has the capability to show and query ALL features for a pleasant user experience, but takes time to build tile cash (the pretty pictures).  However, Tile Caches are Large and are, basically, redundant storage. 3. Use a WMS to feed ‘pictures’ dynamically and then use a separate, spatial query for feature attributes.  Extra ‘moving’ parts in the solution.  Possibly extra cost in licenses.
  • 4. Why Invent a ‘Better’ Solution???  Original Design…  Texas Hydrologic Information System (TexasHIS)  www.waterdatafortexas.org  Client desires immediate access to vector data and attributes  Data Characteristics  Lots of tightly packed ‘Sites’ ~ 300K+  data with gobs of related attribute data ~ 5,000,000+ related records!
  • 5. Factors Affecting a Good ‘Clustering’  The Data Solution…  Density  Distribution  The Scale  Large – Basically Draw everything  ‘Medium’ – Draw some, Cluster some – Lots of Code and Logic!  Small – Basically Cluster everything  Logic  Thresholds/Limits of when to Cluster vs. When to Draw  How to Carve the display to deal with Data Distribution  Complex Algorithms to determine the optimal number of features to draw.
  • 6.  Create a ‘smart’ solution that…  Isn’t affected by the java script limit  Only queries the database one time (other solutions can query the database for image creation and attribute retrieval.  Is still capable of retrieving vector attributes for single features
  • 7. Solution Basics… 1. Drawing Thresholds  When to Draw everything  When to Cluster  When to take no action 2. Carving The Display  At first glance, a 16 x 16 ‘Grid’ would be best. It would yield 256 Cluster ‘Features’ 3. Clustering Logic  Draw the Singles  Cluster Everything Else
  • 8. Solution Basics: Thresholds… 1. Drawing Thresholds  Draw Everything – if the total number of Points to ‘cluster’ is below 500  Cluster – If there are 500 (Lower Limit) to 5000 (Upper Limit)  Above the Upper Limit – Take no action. The Clustering Query becomes too expensive. So much so that user experience is drastically diminished.
  • 9. Solution Basics: Carving up the Display and Applying the Logic Rules…  Spatial Indexes are Similar in concept  16 x 16 creates 256 Cells – Near the upper limit of our display threshold  Polygons with more than 1 Site are ‘Cluster Candidates’. The Centroid of the polygon will represent the cluster spatially.  Threshold values are variables so that they are easily ‘tuned’ in a series of runs.
  • 10. Solution Basics: How it works…  The Envelope of the Display is passed as a parameter  First Function, a Scalar-Valued Function, determines how ‘big’ the cells will be in the X and Y direction – Returns a comma separated number pair – “0.3456778,0.3456777”  Second Function, a Table-Valued Function, returns a table of polygon cells that are ‘built’ from the output of the SVF and the starting point of the Display Envelope. A ‘cutter’ variable is used to build-out polygons with @var_Y rows and @var_X columns.  The results of these functions are then used to 0perate on the ‘Sites’ point dataset.
  • 11. X Y
  • 12.
  • 13. At Long Last… DEMO!!!
  • 14. Further Improvements…  Add logic to always draw ‘up-to’ the lower limit.  Possibly add some ‘weight’ to the clustered point so that it doesn’t show in such a ‘regular’ way.  Others…
  • 15. TNRIS Information Services…  Richard Wade – Team Lead  Ryan Mitchell – Web Systems and Data Czar  Yvette Giraud – Web Development  Ragunath Jayabalakrishnan – Developer (Contract)  Chris Williams – Database Administrator