SlideShare a Scribd company logo
#WiDSAUB2023
#WiDSAUB2023
Transforming the Public Sector
with Tech & AI
Carole Alsharabati
Professor, Université Saint Joseph
Research Director, Siren Associates
OUTLINE
1. The Big Picture
2. Improving safety with hotspot analysis
3. Strengthening civic culture with anomaly detection
4. Ensuring inclusive aid distribution throught PMT
5. Constructive journalism with AraBERT and ChatGPT
6. Concluding Remarks
What was done
Where we were
CURRENT STATE CHALLENGES SOLUTIONS IMPACT
Weak institutions Inefficiency
ILP
Hotspot analysis
Reduced crimes
Rampant
clientelism
Favoritism
PMT
Multivariate regression
Fair aid distribution
Weak
civic culture
Fraud
Anomaly detection
Isolation forest
Blocked system
abusers
Media
capture
Disinformation
Deceit detection
AraBERT NLP
Better informed
citizen
LEBANON’S AI EXPERIENCES
Improving safety
with ILP
Hotspot Analysis
01
ILP to prevent crimes
Headquarters
Command and
Control Centers
Vehicle
and foot
patrols
Operation rooms &
police stations
Ops Room
activities
Incident and crime
reporting
Operational
implementation
Strategic and tactical
decisions
Data Analysis
Streamlined analytical products for decision-making
Strategic
Assessment
Tactical
Assessment
Problem Profile
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Total Crimes 2017 2018
1. Space-time clustering
2. Probability of occurrence based on past incidence
3. Identification of crime prone zones
4. Deploy patrols as preventive measures
Strengthening civic culture
with Machine Learning
Isolation Forest
02
Detecting and blocking exploits
• Government mandated lockdowns in Lebanon, with permits
processed by IMPACT
• 1M mobility requests per day with spikes of 1K requests per
second
• Excessive use and exploits of excuses that gave 4-5 hours of
allowed mobility
– i.e. hospitals, pharmacy, doctors…
• Need for a validation system
• Objective: Block requests from users that attempt to exploit the
platform
Machine Learning & Anomaly Detection
• Automatically flag anomalous behavior based on the
user’s request history (14 days).
• Assumptions:
– Anomalous behavior consist of a small percentage
of the total requests traffic
– Anomalous behavior are more unique and diverse
than normal user behavior.
• 4 key behavioral patterns were identified
– Unique destination count
– Requests frequency
– Diversity of mobility types (car, bus, by foot…)
– Unique Kadaa count
• Isolation forest: an unsupervised Tree-based Algorithm
Ensuring inclusive aid
distribution with PROXY
MEANS TEST
Multivariate Regression
03
ESSN to the Rescue
Since 2019, Lebanon has endured a severe and prolonged economic and financial crisis. The Emergency Social Safety Network
(ESSN), under MoSA and the PCM, was created as an add-on to the National Poverty Targeting Program to further extend social
assistance programming.
Value of the Lebanese Lira
PROXY MEANS TEST
Based on a statistical analysis of the population, different weights are assigned to the variables depending on their influence on
household consumption.​ The model includes more than 40 variables including income, assets, housing quality, occupation, and
demographic characteristics correlated with poverty.
Registration
Verification
Eligibility
through PMT
Payment
Did we target the extreme poor?
550,000 households registered, 250,000 visited, and 80,000 enrolled through PMT.
Survey was done on a sample of 1600 randomly picked respondent to validate the PMT results (Y4G summer of 2022).
ESSN
Paid
Registered
Not
Registered
At least one
member with
disability
24.6% 16.2% 12.4%
ESSN Paid Registered
Not
Registered
Average
dependency
ratio
3.91 3.16 2.88
The World Bank has defined extreme poverty as people
living on less than $2.15 a day, measured using the
international poverty line.
Average Monthly
Revenue per
capita of HH
receiving aid
29$
65%
Of those who
received aid said
that their living
conditions
improved
Constructive journalism with
NLP
When ChatGPT met AraBERT
04
Towards a constructive role of media: Detecting deceit
Media in Lebanon are serving politics.
The objective is to provide tools for journalists and citizens to detect disinformation.
Chat GPT and AraBERT Duo
Reliance on AraBERT (Antoun, Wissam, et al.)
More particularly AraBERTv02 large model, which contains 371 million parameters, equivalent to 1.38 gigabytes of data, and was
pre-trained on 77 gigabytes of Arabic corpus.
• Trained AraBERT to recognize Arabic language
propaganda and Subjectivity indicators
• Applied to local media
• Daily KPIs
• Labeling through open source, human, ChatGPT
• 7200 articles from various topics in the MENA
Labeled
20%
Bias
33%
Bias
Newspaper x
Newspaper y
11%
Propaganda
17%
Propaganda
The way forward: DALIL journalism
Going forward, the platform will be opened up to media with a dedicated space for journalists where they will be able, among
other things, to run their content through bias, propaganda and consensus detectors in order to check it before publication; think
Turnitin meets fact-checking.​
• Data-driven decision making processes
have huge potential in low resource
contexts
• There are champions of change in the
public sector who will quickly embrace
data-driven decision processes
• Once integrated into the processes, AI
can help gain efficiency, bring fairness,
control fraud, preempt deceit, and much
more.
SOLUTIONS IMPACT
ILP
Hotspot analysis
Reduced crimes
PMT
Multivariate regression
Fair aid distribution
Anomaly detection
Isolation forest
Blocked system
abusers
Deceit detection
AraBERT NLP
Better informed citizen
Concluding Remarks
Thank you!
Hotspot with Getis-Ord Gi*
1-Spatial weights matrix
for relationship between
data points
2-Incorporate temporal
component in weights.
3-Getis-Ord Gi* statistic
degree of surrounding
by other points with
high or low values.
3-Calculate z-score
Isolation Forest
1. Select a random subset of data
2. Select a random feature
3. Choose a random split point
4. Partition the data
5. Repeat the process recursively for each
subset until all the data points are isolated
6. Calculate anomaly score based on the
number of partitions required to isolate it
7. Anomalies are identified as data points with
lower anomaly scores.
8. Set threshold and classify anomalies.
PMT
Use data with income/expenditure + proxy variables
Estimate a model to predict household income/expenditure
based on the selected proxy variables.
Validate the model on separate sample of households.
Apply to new HH by collecting proxy variables and estimating
income/expenditure.
BERT and AraBERT (Bidirectional Encoder Representations from
Transformers)
• Bi-directional encoding (vectors) to
process text in both directions & capture
the context and meaning based on the
entire text.
• Self-attention mechanism to weigh the
importance of different words in a
sentence based on the context of the
sentence.
• Transformer architecture, a neural network
to capture long-range dependencies and
relationships between words and
sentences.

More Related Content

Similar to Carole Alsharabati Slides WIDS 2023

Applications of Artificial Intelligence for government
Applications of Artificial Intelligence for government Applications of Artificial Intelligence for government
Applications of Artificial Intelligence for government
HarshMishraMUSIC
 
Callcredit's Fraud Summit 2016 - Plenary session
Callcredit's Fraud Summit 2016 - Plenary sessionCallcredit's Fraud Summit 2016 - Plenary session
Callcredit's Fraud Summit 2016 - Plenary session
Callcredit123
 
Global Pulse Rk Activate Summit
Global Pulse Rk Activate SummitGlobal Pulse Rk Activate Summit
Global Pulse Rk Activate Summit
Robert Kirkpatrick
 
Intelligence Led Policing for Police Decision Makers
Intelligence Led Policing for Police Decision MakersIntelligence Led Policing for Police Decision Makers
Intelligence Led Policing for Police Decision Makers
Deborah Osborne
 
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open InnovationICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
msissine
 
Adjusting Your Security Controls: It’s the New Normal
Adjusting Your Security Controls: It’s the New NormalAdjusting Your Security Controls: It’s the New Normal
Adjusting Your Security Controls: It’s the New Normal
Priyanka Aash
 
SafetyScore
SafetyScoreSafetyScore
SafetyScore
Tav .
 
COMMON GOOD DIGITAL FRAMEWORK
COMMON GOOD DIGITAL FRAMEWORKCOMMON GOOD DIGITAL FRAMEWORK
COMMON GOOD DIGITAL FRAMEWORK
Boston Global Forum
 
Scot Secure 2019 Edinburgh (Day 1)
Scot Secure 2019 Edinburgh (Day 1)Scot Secure 2019 Edinburgh (Day 1)
Scot Secure 2019 Edinburgh (Day 1)
Ray Bugg
 
Tech for Good: Using Map-Based Apps to Connect Us During a Pandemic
Tech for Good: Using Map-Based Apps to Connect Us During a PandemicTech for Good: Using Map-Based Apps to Connect Us During a Pandemic
Tech for Good: Using Map-Based Apps to Connect Us During a Pandemic
TechSoup
 
Global pulse technology
Global pulse technologyGlobal pulse technology
Global pulse technology
Sara-Jayne Terp
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
dawnrk
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
dawnrk
 
FakeNewsDetector.pptx
FakeNewsDetector.pptxFakeNewsDetector.pptx
FakeNewsDetector.pptx
SANDEEPMISHRA607554
 
State of Internet 2015
State of Internet 2015State of Internet 2015
State of Internet 2015
Tuan Anh Nguyen
 
Who´s connected Who´s not - worldwide in 2016
Who´s connected Who´s not - worldwide in 2016Who´s connected Who´s not - worldwide in 2016
Who´s connected Who´s not - worldwide in 2016
Amalist Client Services
 
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary SlidedeckBig Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
UN Global Pulse
 
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
AIRCC Publishing Corporation
 
Encrypting User Data in Local Government 2016
Encrypting User Data in Local Government 2016Encrypting User Data in Local Government 2016
Encrypting User Data in Local Government 2016
Ben B
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency Management
BYTE Project
 

Similar to Carole Alsharabati Slides WIDS 2023 (20)

Applications of Artificial Intelligence for government
Applications of Artificial Intelligence for government Applications of Artificial Intelligence for government
Applications of Artificial Intelligence for government
 
Callcredit's Fraud Summit 2016 - Plenary session
Callcredit's Fraud Summit 2016 - Plenary sessionCallcredit's Fraud Summit 2016 - Plenary session
Callcredit's Fraud Summit 2016 - Plenary session
 
Global Pulse Rk Activate Summit
Global Pulse Rk Activate SummitGlobal Pulse Rk Activate Summit
Global Pulse Rk Activate Summit
 
Intelligence Led Policing for Police Decision Makers
Intelligence Led Policing for Police Decision MakersIntelligence Led Policing for Police Decision Makers
Intelligence Led Policing for Police Decision Makers
 
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open InnovationICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
ICT4D Principle 6 - Open Standards, Open Data, Open Source, & Open Innovation
 
Adjusting Your Security Controls: It’s the New Normal
Adjusting Your Security Controls: It’s the New NormalAdjusting Your Security Controls: It’s the New Normal
Adjusting Your Security Controls: It’s the New Normal
 
SafetyScore
SafetyScoreSafetyScore
SafetyScore
 
COMMON GOOD DIGITAL FRAMEWORK
COMMON GOOD DIGITAL FRAMEWORKCOMMON GOOD DIGITAL FRAMEWORK
COMMON GOOD DIGITAL FRAMEWORK
 
Scot Secure 2019 Edinburgh (Day 1)
Scot Secure 2019 Edinburgh (Day 1)Scot Secure 2019 Edinburgh (Day 1)
Scot Secure 2019 Edinburgh (Day 1)
 
Tech for Good: Using Map-Based Apps to Connect Us During a Pandemic
Tech for Good: Using Map-Based Apps to Connect Us During a PandemicTech for Good: Using Map-Based Apps to Connect Us During a Pandemic
Tech for Good: Using Map-Based Apps to Connect Us During a Pandemic
 
Global pulse technology
Global pulse technologyGlobal pulse technology
Global pulse technology
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
 
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdfIbm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
 
FakeNewsDetector.pptx
FakeNewsDetector.pptxFakeNewsDetector.pptx
FakeNewsDetector.pptx
 
State of Internet 2015
State of Internet 2015State of Internet 2015
State of Internet 2015
 
Who´s connected Who´s not - worldwide in 2016
Who´s connected Who´s not - worldwide in 2016Who´s connected Who´s not - worldwide in 2016
Who´s connected Who´s not - worldwide in 2016
 
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary SlidedeckBig Data for Development: Opportunities and Challenges, Summary Slidedeck
Big Data for Development: Opportunities and Challenges, Summary Slidedeck
 
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
Inflation-Crime Nexus: A Predictive Analysis of Crime Rate Using Inflationary...
 
Encrypting User Data in Local Government 2016
Encrypting User Data in Local Government 2016Encrypting User Data in Local Government 2016
Encrypting User Data in Local Government 2016
 
Big Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency ManagementBig Data and Social Media Mining in Crisis and Emergency Management
Big Data and Social Media Mining in Crisis and Emergency Management
 

Recently uploaded

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
panagenda
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Speck&Tech
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
SOFTTECHHUB
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
Quotidiano Piemontese
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
Adtran
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
Zilliz
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
Alpen-Adria-Universität
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 

Recently uploaded (20)

GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
HCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAUHCL Notes and Domino License Cost Reduction in the World of DLAU
HCL Notes and Domino License Cost Reduction in the World of DLAU
 
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0!
 
National Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practicesNational Security Agency - NSA mobile device best practices
National Security Agency - NSA mobile device best practices
 
TrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy SurveyTrustArc Webinar - 2024 Global Privacy Survey
TrustArc Webinar - 2024 Global Privacy Survey
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 
Pushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 daysPushing the limits of ePRTC: 100ns holdover for 100 days
Pushing the limits of ePRTC: 100ns holdover for 100 days
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
Full-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalizationFull-RAG: A modern architecture for hyper-personalization
Full-RAG: A modern architecture for hyper-personalization
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Video Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the FutureVideo Streaming: Then, Now, and in the Future
Video Streaming: Then, Now, and in the Future
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 

Carole Alsharabati Slides WIDS 2023

  • 3. Transforming the Public Sector with Tech & AI Carole Alsharabati Professor, Université Saint Joseph Research Director, Siren Associates
  • 4. OUTLINE 1. The Big Picture 2. Improving safety with hotspot analysis 3. Strengthening civic culture with anomaly detection 4. Ensuring inclusive aid distribution throught PMT 5. Constructive journalism with AraBERT and ChatGPT 6. Concluding Remarks
  • 5. What was done Where we were CURRENT STATE CHALLENGES SOLUTIONS IMPACT Weak institutions Inefficiency ILP Hotspot analysis Reduced crimes Rampant clientelism Favoritism PMT Multivariate regression Fair aid distribution Weak civic culture Fraud Anomaly detection Isolation forest Blocked system abusers Media capture Disinformation Deceit detection AraBERT NLP Better informed citizen LEBANON’S AI EXPERIENCES
  • 7. ILP to prevent crimes Headquarters Command and Control Centers Vehicle and foot patrols Operation rooms & police stations Ops Room activities Incident and crime reporting Operational implementation Strategic and tactical decisions Data Analysis
  • 8. Streamlined analytical products for decision-making Strategic Assessment Tactical Assessment Problem Profile Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Total Crimes 2017 2018 1. Space-time clustering 2. Probability of occurrence based on past incidence 3. Identification of crime prone zones 4. Deploy patrols as preventive measures
  • 9. Strengthening civic culture with Machine Learning Isolation Forest 02
  • 10. Detecting and blocking exploits • Government mandated lockdowns in Lebanon, with permits processed by IMPACT • 1M mobility requests per day with spikes of 1K requests per second • Excessive use and exploits of excuses that gave 4-5 hours of allowed mobility – i.e. hospitals, pharmacy, doctors… • Need for a validation system • Objective: Block requests from users that attempt to exploit the platform
  • 11. Machine Learning & Anomaly Detection • Automatically flag anomalous behavior based on the user’s request history (14 days). • Assumptions: – Anomalous behavior consist of a small percentage of the total requests traffic – Anomalous behavior are more unique and diverse than normal user behavior. • 4 key behavioral patterns were identified – Unique destination count – Requests frequency – Diversity of mobility types (car, bus, by foot…) – Unique Kadaa count • Isolation forest: an unsupervised Tree-based Algorithm
  • 12. Ensuring inclusive aid distribution with PROXY MEANS TEST Multivariate Regression 03
  • 13. ESSN to the Rescue Since 2019, Lebanon has endured a severe and prolonged economic and financial crisis. The Emergency Social Safety Network (ESSN), under MoSA and the PCM, was created as an add-on to the National Poverty Targeting Program to further extend social assistance programming. Value of the Lebanese Lira
  • 14. PROXY MEANS TEST Based on a statistical analysis of the population, different weights are assigned to the variables depending on their influence on household consumption.​ The model includes more than 40 variables including income, assets, housing quality, occupation, and demographic characteristics correlated with poverty. Registration Verification Eligibility through PMT Payment
  • 15. Did we target the extreme poor? 550,000 households registered, 250,000 visited, and 80,000 enrolled through PMT. Survey was done on a sample of 1600 randomly picked respondent to validate the PMT results (Y4G summer of 2022). ESSN Paid Registered Not Registered At least one member with disability 24.6% 16.2% 12.4% ESSN Paid Registered Not Registered Average dependency ratio 3.91 3.16 2.88 The World Bank has defined extreme poverty as people living on less than $2.15 a day, measured using the international poverty line. Average Monthly Revenue per capita of HH receiving aid 29$ 65% Of those who received aid said that their living conditions improved
  • 16. Constructive journalism with NLP When ChatGPT met AraBERT 04
  • 17. Towards a constructive role of media: Detecting deceit Media in Lebanon are serving politics. The objective is to provide tools for journalists and citizens to detect disinformation.
  • 18. Chat GPT and AraBERT Duo Reliance on AraBERT (Antoun, Wissam, et al.) More particularly AraBERTv02 large model, which contains 371 million parameters, equivalent to 1.38 gigabytes of data, and was pre-trained on 77 gigabytes of Arabic corpus. • Trained AraBERT to recognize Arabic language propaganda and Subjectivity indicators • Applied to local media • Daily KPIs • Labeling through open source, human, ChatGPT • 7200 articles from various topics in the MENA Labeled 20% Bias 33% Bias Newspaper x Newspaper y 11% Propaganda 17% Propaganda
  • 19. The way forward: DALIL journalism Going forward, the platform will be opened up to media with a dedicated space for journalists where they will be able, among other things, to run their content through bias, propaganda and consensus detectors in order to check it before publication; think Turnitin meets fact-checking.​
  • 20. • Data-driven decision making processes have huge potential in low resource contexts • There are champions of change in the public sector who will quickly embrace data-driven decision processes • Once integrated into the processes, AI can help gain efficiency, bring fairness, control fraud, preempt deceit, and much more. SOLUTIONS IMPACT ILP Hotspot analysis Reduced crimes PMT Multivariate regression Fair aid distribution Anomaly detection Isolation forest Blocked system abusers Deceit detection AraBERT NLP Better informed citizen Concluding Remarks
  • 22. Hotspot with Getis-Ord Gi* 1-Spatial weights matrix for relationship between data points 2-Incorporate temporal component in weights. 3-Getis-Ord Gi* statistic degree of surrounding by other points with high or low values. 3-Calculate z-score
  • 23. Isolation Forest 1. Select a random subset of data 2. Select a random feature 3. Choose a random split point 4. Partition the data 5. Repeat the process recursively for each subset until all the data points are isolated 6. Calculate anomaly score based on the number of partitions required to isolate it 7. Anomalies are identified as data points with lower anomaly scores. 8. Set threshold and classify anomalies.
  • 24. PMT Use data with income/expenditure + proxy variables Estimate a model to predict household income/expenditure based on the selected proxy variables. Validate the model on separate sample of households. Apply to new HH by collecting proxy variables and estimating income/expenditure.
  • 25. BERT and AraBERT (Bidirectional Encoder Representations from Transformers) • Bi-directional encoding (vectors) to process text in both directions & capture the context and meaning based on the entire text. • Self-attention mechanism to weigh the importance of different words in a sentence based on the context of the sentence. • Transformer architecture, a neural network to capture long-range dependencies and relationships between words and sentences.