SlideShare a Scribd company logo
1 of 18
Presented by:
Elephants, ibises and a more Pythonic
way to work with databases
May 12, 2023
Marlene Mhangami
2
● Harare, Zimbabwe
● Developer Advocate Voltron
Data
● Vice Chair ACM practitioner
board
● Previous Vice Chair and
Director for the Python
Software Foundation
@marlene_zw
marlene.bsky.social
marlenemhangami.com
Marlene Mhangami
3
Art Around
Malaga
post conference exploration
image of Malaga from spain.info
European fine arts, decorative arts and architecture from the 3rd to 19th centuries.
https://www.wga.hu/index_database.html
Let’s look at some code
and art!
6
Art Around New
York
post conference exploration
image of Time Square from frommers.com
7
The MET
post conference exploration
image of ASAP Rocky and Rihanna from the
New York Times 2023
The Metropolitan Museum of Art Collection (over 450000 works of art)
https://www.metmuseum.org/art/the-collection
9
Which tool should we use to
work with the MET dataset?
▶ Dataset on Kaggle is 36GB
▶ A typical laptop has 8GB of RAM
▶ 5-10X as much RAM as the dataset size
Add item
or phrase
we ❤️
pandas,
but what
do we do
when it
breaks?
10
Which tool should we use to
work with the MET dataset?
▶ Database engines are built to handle
data of any size
▶ Many data scientists use Python
▶ Ibis generates SQL and the database
engine (backend) does the heavy lifting
Add item
or phrase
generate
sql and let
the engine
do the
lifting🦾
11
- pandas-like
- compiles to sql
(executed efficiently)
* anything you can
write with a SELECT
statement you can
write in ibis
12
The MET on Google BigQuery
▶ 200,000 art pieces
▶ Data hosted in the cloud publicly
▶ Data is consistently updated
▶ Uses SQL to query
Let’s look at some code
and art!
14
With The Right Tools Big
Data Should Feel Small
15
Tools For Big Data in Python
▶ Ibis
▶ Polars
▶ Apache Arrow
▶ DuckDB
16
Try Ibis Out!
marlene@voltrondata.com
@marlene_zw
Contact
TRY THE WORKSHOP!
Twitter
SOCIAL MEDIA
GENERAL CONTACT
https://voltrondata-labs.github.io/2023-jonthebeach-ibis/
Thanks

More Related Content

Similar to Elephants J on the beach.pptx

Innovation, data and social responsibility
Innovation, data and social responsibilityInnovation, data and social responsibility
Innovation, data and social responsibilityMuseumInnovation
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬luruiyang
 
Final project posters for lis 653 spring 2014
Final project posters for lis 653 spring 2014Final project posters for lis 653 spring 2014
Final project posters for lis 653 spring 2014PrattSILS
 
SCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldSCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldBritish Science Association
 
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VUNetwerkDigitaalErfgoed
 
British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017benosteen
 
Aallbibframe em-20130714
Aallbibframe em-20130714Aallbibframe em-20130714
Aallbibframe em-20130714zepheiraorg
 
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...www.patkane.global
 
MementoMap Framework for Flexible and Adaptive Web Archive Profiling
MementoMap Framework for Flexible and Adaptive Web Archive ProfilingMementoMap Framework for Flexible and Adaptive Web Archive Profiling
MementoMap Framework for Flexible and Adaptive Web Archive ProfilingSawood Alam
 
Mid-Tennessee Region Future Technology Presentation
Mid-Tennessee Region Future Technology PresentationMid-Tennessee Region Future Technology Presentation
Mid-Tennessee Region Future Technology PresentationJason Griffey
 
Digital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldDigital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldaelang
 
Introduction to the International Image Interoperability Framework
Introduction to the International Image Interoperability FrameworkIntroduction to the International Image Interoperability Framework
Introduction to the International Image Interoperability FrameworkIIIF_io
 
museum-digital: making participation in europeana possible
museum-digital: making participation in europeana possiblemuseum-digital: making participation in europeana possible
museum-digital: making participation in europeana possiblemuseum-digital
 
Future of AI Smart Networks
Future of AI Smart NetworksFuture of AI Smart Networks
Future of AI Smart NetworksMelanie Swan
 
DH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task forceDH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task forcedhlab
 
Europeana Bcn Hackathon_2011
Europeana Bcn Hackathon_2011Europeana Bcn Hackathon_2011
Europeana Bcn Hackathon_2011Conxa Rodà
 
Hack4Europe! Barcelona 2011
Hack4Europe! Barcelona 2011Hack4Europe! Barcelona 2011
Hack4Europe! Barcelona 2011Museu Picasso
 

Similar to Elephants J on the beach.pptx (20)

Innovation, data and social responsibility
Innovation, data and social responsibilityInnovation, data and social responsibility
Innovation, data and social responsibility
 
杭州讲座 石田英敬
杭州讲座 石田英敬杭州讲座 石田英敬
杭州讲座 石田英敬
 
Final project posters for lis 653 spring 2014
Final project posters for lis 653 spring 2014Final project posters for lis 653 spring 2014
Final project posters for lis 653 spring 2014
 
Digital Scholarship at the British Library
Digital Scholarship at the British LibraryDigital Scholarship at the British Library
Digital Scholarship at the British Library
 
SCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual worldSCC2011 - Talking about e-Science in a virtual world
SCC2011 - Talking about e-Science in a virtual world
 
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU
2D3 Parellelsessie Week Digitaal Erfgoed: Lora Aroyo / VU
 
British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017British Library Labs - Overview Talk 2017
British Library Labs - Overview Talk 2017
 
Semantic Web for Life Sciences: vision, aims, tools, platforms
 Semantic Web for Life Sciences: vision, aims, tools, platforms  Semantic Web for Life Sciences: vision, aims, tools, platforms
Semantic Web for Life Sciences: vision, aims, tools, platforms
 
Music in the Archives
Music in the ArchivesMusic in the Archives
Music in the Archives
 
Aallbibframe em-20130714
Aallbibframe em-20130714Aallbibframe em-20130714
Aallbibframe em-20130714
 
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...
Soulitarian City: Looking for the Hacker Ethic in Glasgow by Pat Kane, The Pl...
 
MementoMap Framework for Flexible and Adaptive Web Archive Profiling
MementoMap Framework for Flexible and Adaptive Web Archive ProfilingMementoMap Framework for Flexible and Adaptive Web Archive Profiling
MementoMap Framework for Flexible and Adaptive Web Archive Profiling
 
Mid-Tennessee Region Future Technology Presentation
Mid-Tennessee Region Future Technology PresentationMid-Tennessee Region Future Technology Presentation
Mid-Tennessee Region Future Technology Presentation
 
Digital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the fieldDigital Humanities: A brief introduction to the field
Digital Humanities: A brief introduction to the field
 
Introduction to the International Image Interoperability Framework
Introduction to the International Image Interoperability FrameworkIntroduction to the International Image Interoperability Framework
Introduction to the International Image Interoperability Framework
 
museum-digital: making participation in europeana possible
museum-digital: making participation in europeana possiblemuseum-digital: making participation in europeana possible
museum-digital: making participation in europeana possible
 
Future of AI Smart Networks
Future of AI Smart NetworksFuture of AI Smart Networks
Future of AI Smart Networks
 
DH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task forceDH2013: Christine Sauter – Results of the task force
DH2013: Christine Sauter – Results of the task force
 
Europeana Bcn Hackathon_2011
Europeana Bcn Hackathon_2011Europeana Bcn Hackathon_2011
Europeana Bcn Hackathon_2011
 
Hack4Europe! Barcelona 2011
Hack4Europe! Barcelona 2011Hack4Europe! Barcelona 2011
Hack4Europe! Barcelona 2011
 

More from ssuser59b75e

PyCon Italia 2023.pptx
PyCon Italia 2023.pptxPyCon Italia 2023.pptx
PyCon Italia 2023.pptxssuser59b75e
 
PyCon MEA Arrow Slides.pptx
PyCon MEA Arrow Slides.pptxPyCon MEA Arrow Slides.pptx
PyCon MEA Arrow Slides.pptxssuser59b75e
 
Augmented Reality.pptx
Augmented Reality.pptxAugmented Reality.pptx
Augmented Reality.pptxssuser59b75e
 
Elephants Ibises and a more Pythonic way to work with databases
Elephants Ibises and a more Pythonic way to work with databases Elephants Ibises and a more Pythonic way to work with databases
Elephants Ibises and a more Pythonic way to work with databases ssuser59b75e
 
PyCon Colombia.pptx
PyCon Colombia.pptxPyCon Colombia.pptx
PyCon Colombia.pptxssuser59b75e
 
An Introduction to Apache Arrow for Python Programmers.pptx
An Introduction to Apache Arrow for Python Programmers.pptxAn Introduction to Apache Arrow for Python Programmers.pptx
An Introduction to Apache Arrow for Python Programmers.pptxssuser59b75e
 

More from ssuser59b75e (8)

PyCon LT .pptx
PyCon LT .pptxPyCon LT .pptx
PyCon LT .pptx
 
PyCon Italia 2023.pptx
PyCon Italia 2023.pptxPyCon Italia 2023.pptx
PyCon Italia 2023.pptx
 
PyCon MEA Arrow Slides.pptx
PyCon MEA Arrow Slides.pptxPyCon MEA Arrow Slides.pptx
PyCon MEA Arrow Slides.pptx
 
Augmented Reality.pptx
Augmented Reality.pptxAugmented Reality.pptx
Augmented Reality.pptx
 
Transcendance
Transcendance Transcendance
Transcendance
 
Elephants Ibises and a more Pythonic way to work with databases
Elephants Ibises and a more Pythonic way to work with databases Elephants Ibises and a more Pythonic way to work with databases
Elephants Ibises and a more Pythonic way to work with databases
 
PyCon Colombia.pptx
PyCon Colombia.pptxPyCon Colombia.pptx
PyCon Colombia.pptx
 
An Introduction to Apache Arrow for Python Programmers.pptx
An Introduction to Apache Arrow for Python Programmers.pptxAn Introduction to Apache Arrow for Python Programmers.pptx
An Introduction to Apache Arrow for Python Programmers.pptx
 

Recently uploaded

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 

Recently uploaded (20)

What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 

Elephants J on the beach.pptx