SlideShare a Scribd company logo
Analyzing LiDAR & SAR data
with Capella Space and TileDB
TileDB webinars - April 12, 2022
Founder & CEO of TileDB, Inc.
Stavros Papadopoulos
Deep roots at the intersection of HPC, databases and data science
Traction with telecoms, pharmas, hospitals and other scientific organizations
45+ members with expertise across all applications and domains
Who we are
TileDB was spun out from MIT and Intel Labs in 2017
WHERE IT ALL STARTED
Raised over $20M from world-class investors
INVESTORS
The Problem
Low productivity for data analysts and scientists
Huge TCO for organizations
Organizations are drowning in a data infrastructure mess
Too many domain-specific file formats
Difficult to handle data beyond tables and SQL
Overly complex metadata handling and data sharing
Numerous vendors and in-house solutions
Difficult to govern all data holistically
The Solution | Universal Database
All Data. Faster. Cheaper.
Securely manage all your data assets and
supercharge your analytics, data science and
machine learning with a universal database
All data in one place
Superior performance, at a lower cost
Analytics, data science and ML
Holistic governance and collaboration
The Universal Database Pillars
All data in one place
Manage any type of data – tables, files,
images, video, genomics, ML features,
metadata, even flat files and folders – in a
single powerful database.
Superior performance,
at a lower cost
Structure all your data in a canonical,
multi-dimensional array format, which adapts
to any data shape and workload for
maximum performance and minimum cost.
Analytics, data science
and ML
Run data science and machine learning
workloads in a single platform that unifies
data management with analytics and
scientific workloads.
Holistic governance and
collaboration
Securely control the access over all your
data assets, and enable collaboration and
reproducibility, while monitoring all activity
in a centralized way.
The Secret Sauce | The Data Model
Dense array
Store everything as dense or sparse multi-dimensional arrays
Sparse array
Arrays Subsume Dataframes
Applications
What can be modeled as an array
LiDAR (3D sparse)
SAR (2D or 3D dense)
Population genomics (3D sparse)
Single-cell genomics (2D dense or sparse)
Biomedical imaging (2D or 3D dense) Even flat files!!! (1D dense)
Time series (ND dense or sparse)
Weather (2D or 3D dense)
Graphs (2D sparse)
Video (3D dense)
Key-values (1D or ND sparse)
Tables (1D dense or ND sparse)
How we built a Universal Database
SQL ML & Data Science
Distributed Computing
Applications
APIs
Access control and logging
Serverless SQL, UDFs, task graphs
Jupyter notebooks and dashboards
C L O U D
Parallel IO, rapid reads and writes
Columnar, cloud-optimized
Data versioning and time traveling
E M B E D D E D
Open-source interoperable
storage with a universal
open-spec array format
Unified data management
and easy serverless
compute at global scale
Efficient APIs and tool Integrations with zero-copy techniques
Superior
performance
Built in C++
Fully-parallelized
Columnar format
Multiple compressors
R-trees for sparse arrays
TileDB Embedded
https://github.com/TileDB-Inc/TileDB
Open source:
Rapid updates
& data versioning
Immutable writes
Lock-free
Parallel reader / writer model
Time traveling
TileDB Embedded
https://github.com/TileDB-Inc/TileDB
Open source:
Extreme
interoperability
Numerous APIs
Numerous integrations
All backends
Optimized
for the cloud
Immutable writes
Parallel IO
Minimization of requests
TileDB Cloud
Universal storage Universal tooling
Universal data
.las .cog .vcf .csv
Universal scale
Management. Collaboration. Scalability
TileDB Cloud
Works as SaaS: https://cloud.tiledb.com
Works on premises
Currently on AWS, soon on any cloud
Built to work anywhere
Slicing, SQL, UDFs, task graphs
It is completely serverless
On-demand JupyterHub instances
Can launch Jupyter notebooks
Compute sent to the data
It is geo-aware
Authentication, compliance, etc.
It is secure
TileDB Cloud
Full marketplace (via Stripe)
Everything is monetizable
Access control inside and outside your
organization
Make any data and code public
Discover any public data and code
(central catalog)
Everything is shareable at global scale
Jupyter notebooks
UDFs and task graphs
ML models
Everything is an array!
Dashboards (e.g., R shiny apps)
All types of data (even flat files)
Full auditability (data, code, any action)
Everything is logged
SAR in TileDB
SAR data is stored in TileDB as 3D dense arrays
Rapid dense array slicing via implicit indexing on dimensions
Width, height, time are the dimensions
Cloud-native (rapid writes and reads)
Versioning and time traveling
Integration with GDAL
Visualization on TileDB Cloud
LiDAR in TileDB
LiDAR data is stored in TileDB as 3D sparse arrays
Efficient indexing with R-trees and Hilbert curves
Native float indexing - e.g, A[123.34:124.22, 30.23:31.00, :]
Cloud-native (rapid writes and reads)
Versioning and time traveling
Schema evolution
Integration with PDAL and PCL
Visualization on TileDB Cloud
A slicing query would just traverse the tree
top-down, visiting only nodes/MBRs that
intersect the slice
Indexing
Given the non-empty domain, the space tile extents and the
tile order, we can find easily that this slice overlaps the
second and fourth tile
1 2 3 4
5 6 7 8
9 10 11 12
13 14 15 16
row-major tile order
2x2
space
tiles
MBR1
MBR2
MBR3
MBR4
col-major tile order
row-major cell order
2x2
space
tiles
capacity
2
R-tree
(stored in fragment metadata)
MBR1 MBR2 MBR3 MBR4
Machine Learning in TileDB
Fusion of SAR with LiDAR data in a single platform
Integration with TensorFlow, PyTorch and more
Storage of ML models on TileDB Cloud
A full-fledged platform for exploration, analytics and ML
The Universal Database
Thank you
Capella Space
Jason Brown
SAR: A Window to See What Others Can't
Optical With SAR
Only observable
25% of the time
Observable
100% of the time
High Revisit
Low Latency
Cloud & Smoke
piercing visibility
Night Vision for
the planet’s activity
Capella Space is Changing Access to Earth Information
3
Any time,
Any Weather
Frequent Revisit Very High-
Resolution Imaging
Fastest From
Order to Delivery
High-cadencerevisit with
multiple imaging
opportunities per day at
various times of
day/night
Fully automated tasking
& data processing with
fastestorder-to-delivery
times available in market
Very High Resolution
(VHR) and
radiometrically enhanced
multi-looked imagery
with low noise
High-cadencerevisit with
multiple imaging
opportunities per day at
various times of
day/night
4
Capella SAR Imaging
Central Frequency X-Band
Polarization Single-Pol HH or VV
Imaging Bandwidth Up to 500 MHz
Acquisition Direction
Ascending+Descending Orbit Direction
Left+Right Look Direction
Imaging Modes
Spotlight
Sliding Spotlight
Stripmap
SAR Imagery Products
Spot (spotlight imaging mode)
Site (sliding spotlight imaging mode)
Strip (stripmap imaging mode)
SAR Imagery Product Scenes
5
VERY HIGH RESOLUTION
Spot| 5 km x 5 km | 0.5 m
VERY HIGH RESOLUTION
Site | 5 km x 10 km | 1.0 m
HIGH RESOLUTION
Strip| 5 km x 20 km| 1.2 m
SAR Imagery Product Modes
6
7
Capella Console
Simple-to-Use GUI
Task or purchase archived imagery via coordinates, AOI creation tool or
shapefile upload.
Fully automated and secure operations: Satellite ops, SAR processing
and data storage are cloud based, fully confidential.
Real-Time Status Updates
New tasking scheduling in ≤ 15 minutes and users are
provided real-time status updates.
Predicted time of collection displayed to enable
timely post-imaging operations.
Capella API Integration
● Tip-and-cue scenario for immediate responsiveness via API integration. Existing systemalerts can push task requests.
● React to emergencies in real-time. Deliver data to teams on the ground hours after image capture.
8
Task the Capella Constellation
Queue from Existing Systems Pull Scenes & Metadata

More Related Content

Similar to Analyzing LiDAR and SAR data with Capella Space and TileDB (TileDB webinars, 04-12-22).pdf

big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
VIKAS KATARE
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
Flavio Vit
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
Mansi Mehra
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟
datastack
 
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Andrey Vykhodtsev
 
Wasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming PlatformWasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming Platform
Paolo Platter
 
IMGS Geospatial User Group 2014 - Big data management with Apollo
IMGS Geospatial User Group 2014 - Big data management with ApolloIMGS Geospatial User Group 2014 - Big data management with Apollo
IMGS Geospatial User Group 2014 - Big data management with Apollo
IMGS
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
Mansi Mehra
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
eRic Choo
 
Hadoop_arunam_ppt
Hadoop_arunam_pptHadoop_arunam_ppt
Hadoop_arunam_ppt
jerrin joseph
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
Rohit Dubey
 
Big Data and High Performance Computing
Big Data and High Performance ComputingBig Data and High Performance Computing
Big Data and High Performance Computing
Abzetdin Adamov
 
Low-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
Low-Latency Analytics with NoSQL – Introduction to Storm and CassandraLow-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
Low-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
Caserta
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Safe Software
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
Cloudera, Inc.
 
Processing Drone data @Scale
Processing Drone data @ScaleProcessing Drone data @Scale
Processing Drone data @Scale
Dr Hajji Hicham
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
James Serra
 
1 mapreduce-fest
1 mapreduce-fest1 mapreduce-fest
1 mapreduce-fest
Genoveva Vargas-Solar
 
Real time analytics
Real time analyticsReal time analytics
Real time analytics
Leandro Totino Pereira
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
Mr. Ankit
 

Similar to Analyzing LiDAR and SAR data with Capella Space and TileDB (TileDB webinars, 04-12-22).pdf (20)

big data overview ppt
big data overview pptbig data overview ppt
big data overview ppt
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
 
عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟عصر کلان داده، چرا و چگونه؟
عصر کلان داده، چرا و چگونه؟
 
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
Big Data Essentials meetup @ IBM Ljubljana 23.06.2015
 
Wasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming PlatformWasp2 - IoT and Streaming Platform
Wasp2 - IoT and Streaming Platform
 
IMGS Geospatial User Group 2014 - Big data management with Apollo
IMGS Geospatial User Group 2014 - Big data management with ApolloIMGS Geospatial User Group 2014 - Big data management with Apollo
IMGS Geospatial User Group 2014 - Big data management with Apollo
 
Hadoop - A big data initiative
Hadoop - A big data initiativeHadoop - A big data initiative
Hadoop - A big data initiative
 
Scaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data ScienceScaling up with Cisco Big Data: Data + Science = Data Science
Scaling up with Cisco Big Data: Data + Science = Data Science
 
Hadoop_arunam_ppt
Hadoop_arunam_pptHadoop_arunam_ppt
Hadoop_arunam_ppt
 
Big Data PPT by Rohit Dubey
Big Data PPT by Rohit DubeyBig Data PPT by Rohit Dubey
Big Data PPT by Rohit Dubey
 
Big Data and High Performance Computing
Big Data and High Performance ComputingBig Data and High Performance Computing
Big Data and High Performance Computing
 
Low-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
Low-Latency Analytics with NoSQL – Introduction to Storm and CassandraLow-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
Low-Latency Analytics with NoSQL – Introduction to Storm and Cassandra
 
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial DataCloud Revolution: Exploring the New Wave of Serverless Spatial Data
Cloud Revolution: Exploring the New Wave of Serverless Spatial Data
 
Hadoop: An Industry Perspective
Hadoop: An Industry PerspectiveHadoop: An Industry Perspective
Hadoop: An Industry Perspective
 
Processing Drone data @Scale
Processing Drone data @ScaleProcessing Drone data @Scale
Processing Drone data @Scale
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
1 mapreduce-fest
1 mapreduce-fest1 mapreduce-fest
1 mapreduce-fest
 
Real time analytics
Real time analyticsReal time analytics
Real time analytics
 
Big Data and Hadoop
Big Data and HadoopBig Data and Hadoop
Big Data and Hadoop
 

Recently uploaded

Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
ElizabethGarrettChri
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
Sachin Paul
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
Timothy Spann
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
taqyea
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
hyfjgavov
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
y3i0qsdzb
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
Social Samosa
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
xclpvhuk
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
AlessioFois2
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
Lars Albertsson
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
wyddcwye1
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
Bill641377
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
jitskeb
 

Recently uploaded (20)

Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024Open Source Contributions to Postgres: The Basics POSETTE 2024
Open Source Contributions to Postgres: The Basics POSETTE 2024
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......Palo Alto Cortex XDR presentation .......
Palo Alto Cortex XDR presentation .......
 
DSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelinesDSSML24_tspann_CodelessGenerativeAIPipelines
DSSML24_tspann_CodelessGenerativeAIPipelines
 
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(harvard毕业证书)哈佛大学毕业证如何办理
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
一比一原版兰加拉学院毕业证(Langara毕业证书)学历如何办理
 
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
一比一原版巴斯大学毕业证(Bath毕业证书)学历如何办理
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
The Ipsos - AI - Monitor 2024 Report.pdf
The  Ipsos - AI - Monitor 2024 Report.pdfThe  Ipsos - AI - Monitor 2024 Report.pdf
The Ipsos - AI - Monitor 2024 Report.pdf
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
一比一原版(Unimelb毕业证书)墨尔本大学毕业证如何办理
 
A presentation that explain the Power BI Licensing
A presentation that explain the Power BI LicensingA presentation that explain the Power BI Licensing
A presentation that explain the Power BI Licensing
 
End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024End-to-end pipeline agility - Berlin Buzzwords 2024
End-to-end pipeline agility - Berlin Buzzwords 2024
 
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
原版一比一利兹贝克特大学毕业证(LeedsBeckett毕业证书)如何办理
 
Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...Population Growth in Bataan: The effects of population growth around rural pl...
Population Growth in Bataan: The effects of population growth around rural pl...
 
Experts live - Improving user adoption with AI
Experts live - Improving user adoption with AIExperts live - Improving user adoption with AI
Experts live - Improving user adoption with AI
 

Analyzing LiDAR and SAR data with Capella Space and TileDB (TileDB webinars, 04-12-22).pdf

  • 1. Analyzing LiDAR & SAR data with Capella Space and TileDB TileDB webinars - April 12, 2022 Founder & CEO of TileDB, Inc. Stavros Papadopoulos
  • 2. Deep roots at the intersection of HPC, databases and data science Traction with telecoms, pharmas, hospitals and other scientific organizations 45+ members with expertise across all applications and domains Who we are TileDB was spun out from MIT and Intel Labs in 2017 WHERE IT ALL STARTED Raised over $20M from world-class investors INVESTORS
  • 3. The Problem Low productivity for data analysts and scientists Huge TCO for organizations Organizations are drowning in a data infrastructure mess Too many domain-specific file formats Difficult to handle data beyond tables and SQL Overly complex metadata handling and data sharing Numerous vendors and in-house solutions Difficult to govern all data holistically
  • 4. The Solution | Universal Database All Data. Faster. Cheaper. Securely manage all your data assets and supercharge your analytics, data science and machine learning with a universal database All data in one place Superior performance, at a lower cost Analytics, data science and ML Holistic governance and collaboration
  • 5. The Universal Database Pillars All data in one place Manage any type of data – tables, files, images, video, genomics, ML features, metadata, even flat files and folders – in a single powerful database. Superior performance, at a lower cost Structure all your data in a canonical, multi-dimensional array format, which adapts to any data shape and workload for maximum performance and minimum cost. Analytics, data science and ML Run data science and machine learning workloads in a single platform that unifies data management with analytics and scientific workloads. Holistic governance and collaboration Securely control the access over all your data assets, and enable collaboration and reproducibility, while monitoring all activity in a centralized way.
  • 6. The Secret Sauce | The Data Model Dense array Store everything as dense or sparse multi-dimensional arrays Sparse array
  • 8. Applications What can be modeled as an array LiDAR (3D sparse) SAR (2D or 3D dense) Population genomics (3D sparse) Single-cell genomics (2D dense or sparse) Biomedical imaging (2D or 3D dense) Even flat files!!! (1D dense) Time series (ND dense or sparse) Weather (2D or 3D dense) Graphs (2D sparse) Video (3D dense) Key-values (1D or ND sparse) Tables (1D dense or ND sparse)
  • 9. How we built a Universal Database SQL ML & Data Science Distributed Computing Applications APIs Access control and logging Serverless SQL, UDFs, task graphs Jupyter notebooks and dashboards C L O U D Parallel IO, rapid reads and writes Columnar, cloud-optimized Data versioning and time traveling E M B E D D E D Open-source interoperable storage with a universal open-spec array format Unified data management and easy serverless compute at global scale Efficient APIs and tool Integrations with zero-copy techniques
  • 10. Superior performance Built in C++ Fully-parallelized Columnar format Multiple compressors R-trees for sparse arrays TileDB Embedded https://github.com/TileDB-Inc/TileDB Open source: Rapid updates & data versioning Immutable writes Lock-free Parallel reader / writer model Time traveling
  • 11. TileDB Embedded https://github.com/TileDB-Inc/TileDB Open source: Extreme interoperability Numerous APIs Numerous integrations All backends Optimized for the cloud Immutable writes Parallel IO Minimization of requests
  • 12. TileDB Cloud Universal storage Universal tooling Universal data .las .cog .vcf .csv Universal scale Management. Collaboration. Scalability
  • 13. TileDB Cloud Works as SaaS: https://cloud.tiledb.com Works on premises Currently on AWS, soon on any cloud Built to work anywhere Slicing, SQL, UDFs, task graphs It is completely serverless On-demand JupyterHub instances Can launch Jupyter notebooks Compute sent to the data It is geo-aware Authentication, compliance, etc. It is secure
  • 14. TileDB Cloud Full marketplace (via Stripe) Everything is monetizable Access control inside and outside your organization Make any data and code public Discover any public data and code (central catalog) Everything is shareable at global scale Jupyter notebooks UDFs and task graphs ML models Everything is an array! Dashboards (e.g., R shiny apps) All types of data (even flat files) Full auditability (data, code, any action) Everything is logged
  • 15. SAR in TileDB SAR data is stored in TileDB as 3D dense arrays Rapid dense array slicing via implicit indexing on dimensions Width, height, time are the dimensions Cloud-native (rapid writes and reads) Versioning and time traveling Integration with GDAL Visualization on TileDB Cloud
  • 16. LiDAR in TileDB LiDAR data is stored in TileDB as 3D sparse arrays Efficient indexing with R-trees and Hilbert curves Native float indexing - e.g, A[123.34:124.22, 30.23:31.00, :] Cloud-native (rapid writes and reads) Versioning and time traveling Schema evolution Integration with PDAL and PCL Visualization on TileDB Cloud
  • 17. A slicing query would just traverse the tree top-down, visiting only nodes/MBRs that intersect the slice Indexing Given the non-empty domain, the space tile extents and the tile order, we can find easily that this slice overlaps the second and fourth tile 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 row-major tile order 2x2 space tiles MBR1 MBR2 MBR3 MBR4 col-major tile order row-major cell order 2x2 space tiles capacity 2 R-tree (stored in fragment metadata) MBR1 MBR2 MBR3 MBR4
  • 18. Machine Learning in TileDB Fusion of SAR with LiDAR data in a single platform Integration with TensorFlow, PyTorch and more Storage of ML models on TileDB Cloud A full-fledged platform for exploration, analytics and ML
  • 21. SAR: A Window to See What Others Can't Optical With SAR Only observable 25% of the time Observable 100% of the time High Revisit Low Latency Cloud & Smoke piercing visibility Night Vision for the planet’s activity
  • 22. Capella Space is Changing Access to Earth Information 3 Any time, Any Weather Frequent Revisit Very High- Resolution Imaging Fastest From Order to Delivery High-cadencerevisit with multiple imaging opportunities per day at various times of day/night Fully automated tasking & data processing with fastestorder-to-delivery times available in market Very High Resolution (VHR) and radiometrically enhanced multi-looked imagery with low noise High-cadencerevisit with multiple imaging opportunities per day at various times of day/night
  • 23. 4 Capella SAR Imaging Central Frequency X-Band Polarization Single-Pol HH or VV Imaging Bandwidth Up to 500 MHz Acquisition Direction Ascending+Descending Orbit Direction Left+Right Look Direction Imaging Modes Spotlight Sliding Spotlight Stripmap SAR Imagery Products Spot (spotlight imaging mode) Site (sliding spotlight imaging mode) Strip (stripmap imaging mode)
  • 24. SAR Imagery Product Scenes 5 VERY HIGH RESOLUTION Spot| 5 km x 5 km | 0.5 m VERY HIGH RESOLUTION Site | 5 km x 10 km | 1.0 m HIGH RESOLUTION Strip| 5 km x 20 km| 1.2 m
  • 26. 7 Capella Console Simple-to-Use GUI Task or purchase archived imagery via coordinates, AOI creation tool or shapefile upload. Fully automated and secure operations: Satellite ops, SAR processing and data storage are cloud based, fully confidential. Real-Time Status Updates New tasking scheduling in ≤ 15 minutes and users are provided real-time status updates. Predicted time of collection displayed to enable timely post-imaging operations.
  • 27. Capella API Integration ● Tip-and-cue scenario for immediate responsiveness via API integration. Existing systemalerts can push task requests. ● React to emergencies in real-time. Deliver data to teams on the ground hours after image capture. 8 Task the Capella Constellation Queue from Existing Systems Pull Scenes & Metadata