Mark Quinsland, Sr. Field Engineer at Neo4j
Luxury yachts, football teams, and mansions are no longer safe havens for the illicit profits of Russian oligarchs with ties to Putin. Assets are being identified and seized, with benefits flowing to causes in Ukraine. This presentation covers:
- How friends and relatives of Putin are sheltering immense profits
- Graphs and other tools being used to identify sources & destinations of illicit wealth
- Latest asset seizures
- New regulations to expose hidden investors
Data Con LA 2022 - Who Owns That Yacht? How Graphs Are Used to Identify Assets for Sanctions
1. Neo4j, Inc. All rights reserved 2022
Who Owns That Yacht?
How Graphs Are Used to Identify Assets for Sanctions and Seizures
DataCon Los Angeles 2022
Mark Quinsland
Senior Field Engineer - Neo4j
mark.quinsland@neo4j.com
2. Welcome!
Mark Quinsland
Senior Field Engineer - Americas
mark.quinsland@neo4j.com
3. Agenda
● How the ultra-wealthy exploit loopholes to shield assets
● Ultimate Beneficial Ownership tracking - promise vs reality
● ICIJ
● How graph databases are being used
● Graph Data Science tools
● Questions
These slides (and additional bonus material) will be available for download
6. This is the story of an Egg
https://robbreport.com/lifestyle/news/faberge-egg-russian-superyacht-1234729438/
7. That was found on this yacht
8. That was owned by:
Suleyman Kerimov OR Eduard Khudainatov
9. Explore Visually
TODO: use Bloom to create a subgraph of this and build it out
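The ownership question raised on the preceding slides (who ultimately stands behind an asset held through layers of companies) can be sketched as a graph traversal. This is a minimal illustration only, not the speaker's method or a Neo4j query: the entity names, the ownership fractions, and the function `ultimate_owners` are all hypothetical, and a real investigation would run over leaked or registry data in a graph database.

```python
from collections import defaultdict

# Hypothetical ownership edges: (owner, owned entity, fraction held).
# None of these names or numbers come from the presentation.
OWNERSHIP = [
    ("Person A", "Holding Co 1", 1.0),
    ("Holding Co 1", "Shell Co 2", 0.6),
    ("Person B", "Shell Co 2", 0.4),
    ("Shell Co 2", "Yacht X", 1.0),
]


def ultimate_owners(asset, edges):
    """Walk ownership edges upward from an asset and return the
    effective share held by each entity that has no owner of its own."""
    owners_of = defaultdict(list)
    for owner, owned, fraction in edges:
        owners_of[owned].append((owner, fraction))

    totals = defaultdict(float)

    def walk(entity, share, path):
        parents = owners_of.get(entity, [])
        if not parents:
            # Nothing owns this entity, so treat it as an ultimate owner.
            totals[entity] += share
            return
        for owner, fraction in parents:
            if owner in path:
                continue  # ignore circular ownership loops
            walk(owner, share * fraction, path | {owner})

    walk(asset, 1.0, frozenset({asset}))
    return dict(totals)


shares = ultimate_owners("Yacht X", OWNERSHIP)
for name, share in sorted(shares.items()):
    print(f"{name}: {share:.0%}")
```

Multiplying fractions along each path is one common way to express indirect ownership; it assumes the recorded fractions are accurate, which is exactly what layered shell structures are designed to obscure.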
10. USA #2 Tax Haven
• Registering an LLC or trust is simple
• South Dakota and Delaware lead the US
https://www.visualcapitalist.com/worlds-biggest-private-tax-havens/
11. (Almost) Every Country is a Tax Haven
12. IOR (Vatican Bank)
• Founded in 1942 by Pius XII to transfer funds for church members and official purposes
• Vatican City-State status provided a secrecy shield
• Exploited to move money & gold for Nazis, Axis companies, war criminals, church officials, and other corrupt persons
• Ongoing process to provide transparency and follow international standards
https://www.simonandschuster.com/books/Gods-Bankers/Gerald-Posner/9781416576594
15. OpenScreening - Search & Explore
https://resources.linkurious.com/openscreening
16. OpenScreening - quick demo
https://resources.linkurious.com/openscreening
Danish Business Authority
• Fraudsters were hiding their ownership in order to avoid taxes.
• Goal was to reduce the yearly VAT deficit by 1 billion DKK (136M USD)
https://www.slideshare.net/neo4j/graphtalk-copenhagen-fraud-detection-with-graphs
Reduce VAT Tax Avoidance
• DBA combined insights from graph algorithms to improve the metadata about customers.
• Improved metadata was used as input to ML processes.
• VAT fraud was reduced ‘considerably’
https://www.slideshare.net/neo4j/graphtalk-copenhagen-fraud-detection-with-graphs
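The idea of turning graph-algorithm output into extra metadata for ML can be sketched in a few lines of Python. The slides do not say which algorithms DBA used, so this is only an illustration under assumptions: it computes two simple features per entity (degree and connected-component size) from a hypothetical list of ownership links. The function names and the sample edges are invented for this sketch.

```python
from collections import defaultdict


def connected_components(edges):
    """Map every node to a representative of its connected component (union-find)."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps lookups short
            x = parent[x]
        return x

    for a, b in edges:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    return {node: find(node) for node in list(parent)}


def graph_features(edges):
    """Return per-node features that could be appended to a customer record."""
    degree = defaultdict(int)
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1

    component = connected_components(edges)
    component_size = defaultdict(int)
    for root in component.values():
        component_size[root] += 1

    return {
        node: {
            "degree": degree[node],
            "component_size": component_size[component[node]],
        }
        for node in component
    }


# Hypothetical ownership links between companies (not real data).
EXAMPLE_EDGES = [("A", "B"), ("B", "C"), ("D", "E")]
EXAMPLE_FEATURES = graph_features(EXAMPLE_EDGES)
```

Each feature dictionary can then be joined onto the existing customer metadata before it is passed to an ML process.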
Money Laundering 101
The ‘Patriot’ Act Created a Massive Loophole
• Real Estate transactions were exempt from reporting requirements
• Multi-million $$ CASH transactions became the preferred way to buy property
• Sales of luxury condos to shell companies and foreign buyers skyrocketed
Out with the Old In with the New!
Trump + Condos + Russians + Shell Companies = $$$
Trump + Condos + Russians + Shell Companies
• 1/3 of units in Trump Tower sold to Russians or Russian shell companies
• One of 2 NYC buildings selling to anonymous buyers.
https://newrepublic.com/article/143586/trumps-russian-laundromat-trump-tower-luxury-high-rises-dirty-money-international-crime-syndicate
Financial Crimes Enforcement Network (FinCEN)
• Corporate Transparency Act passed in 2020
• Anyone forming a shell company in the US must list the owner’s name & TIN
• Banks must file Suspicious Activity Reports to report potentially illegal $$ movement
• Goal was to cut down on large cash payments for real estate
https://www.bbc.com/news/uk-54226107
FinCEN Files Leak
https://www.bbc.com/news/uk-54226107
• 2600 leaked documents including 2100 Suspicious Activity Reports (SAR)
• Most SARs resulted in no punitive measures.
• HSBC, JP Morgan, Barclays, Deutsche Bank all made the SAR ‘Greatest Hits’ offender list.
Beneficial Ownership - D&B
• Describe new regulation
https://www.facebook.com/watch/?v=281716713045688
https://neo4j.com/case-studies/dun-bradstreet/
Graph Example
• Use the yacht guys to show their relationships
• This will need some researching
• Visualize in Bloom, but use workbench first to show the concept
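Before any visualization, the concept can be shown with a small traversal. The Python sketch below is an assumption-laden illustration, not the presenter's example: it walks a hypothetical chain of shell companies upward from an asset and multiplies ownership fractions along each path to estimate who ultimately benefits. All entity names, fractions, and the `ultimate_owners` helper are invented placeholders.

```python
def ultimate_owners(owned_by, asset, is_person):
    """Sum the effective ownership share of every person reachable from an asset.

    owned_by maps an entity to a list of (owner, fraction) pairs.
    is_person decides whether an owner is a natural person (end of the chain).
    """
    totals = {}

    def walk(entity, share, path):
        for owner, fraction in owned_by.get(entity, []):
            if owner in path:
                continue  # guard against circular ownership structures
            effective = share * fraction
            if is_person(owner):
                totals[owner] = totals.get(owner, 0.0) + effective
            else:
                walk(owner, effective, path | {owner})

    walk(asset, 1.0, frozenset({asset}))
    return totals


# Hypothetical structure: a yacht held through two layers of shell companies.
EXAMPLE_OWNERSHIP = {
    "yacht": [("shell_a", 1.0)],
    "shell_a": [("shell_b", 0.5), ("person_x", 0.5)],
    "shell_b": [("person_y", 1.0)],
}
EXAMPLE_PERSONS = {"person_x", "person_y"}

EXAMPLE_RESULT = ultimate_owners(
    EXAMPLE_OWNERSHIP, "yacht", lambda name: name in EXAMPLE_PERSONS
)
```

In a graph database the same question becomes a variable-length path query, which is why layered ownership is easier to explore there than through repeated table joins.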
Morning: GDS Intro and GDS Internals
Graph Data Science
Graph Catalog
Graph Algorithms
Graph projection
Graph machine learning
Afternoon: Applying Graph Data Science
Graph Embeddings
Graph visualization
Best Practices
Fraud Detection
Neo4j GDS - End User Experience
Neo4j Browser
• Data Scientist and Developer direct access to Neo4j Graph Databases
• CRUD operations using Cypher query language
• Execute Neo4j GDS Algorithms
Neo4j Bloom
• Point-and-Click Graph Visualization for Data Scientists, Analysts
• Search, Visualize, Explore, and Discover
• Share and Collaborate with Business Stakeholders
Why Graph Databases Are Ideal for This Problem
Summary: Key Points
• Complex Digital Twin data are naturally and easily modeled as a graph.
• Digital Twin graphs can become very large, with potentially millions of connected data elements that require frequent near-real-time updates.
• Neo4j’s in-memory graph database provides the flexibility, performance and analytical capabilities needed to build, manage and query Digital Twins on enterprise scale.
• Enriching the Digital Twin graph with data unified across multiple sources opens up many related use cases and provides the most business value.
Update this
Director’s Cut / Bonus Material
Additional Slides skipped due to time restrictions
Neo4j: Making Sense of Highly-Connected Data - For Good!
Fraud Detection: ICIJ used Neo4j to analyze the world’s largest journalistic leak to date, the Panama Papers, exposing criminals, corruption and extensive tax evasion.
Knowledge Graph for humans: The US space agency uses Neo4j for its “Lessons Learned” database, connecting information to improve search effectiveness for space missions.
Knowledge Graph for humans & ML: The German Centre for Diabetes Research (DZD) uses Neo4j to make research more accessible from multiple perspectives, as well as for machine learning.
The Labeled Property Graph Model
Nodes
• Describe the persons, things, etc
Relationships
• Describe how the Nodes are related
• Can have names and properties
• Often more important than the nodes
Replace graphic with Yacht
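A minimal sketch of the labeled property graph model in plain Python, following the yacht theme of the talk. This is not Neo4j's API: the `PropertyGraph` class, its method names, and the example entities and properties are all hypothetical and exist only to show that nodes carry labels and properties while relationships carry a type and properties of their own.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    node_id: str
    labels: set
    properties: dict = field(default_factory=dict)


@dataclass
class Relationship:
    start: str
    rel_type: str
    end: str
    properties: dict = field(default_factory=dict)


class PropertyGraph:
    """Toy in-memory labeled property graph (illustration only)."""

    def __init__(self):
        self.nodes = {}
        self.relationships = []

    def add_node(self, node_id, labels, **properties):
        self.nodes[node_id] = Node(node_id, set(labels), dict(properties))

    def add_relationship(self, start, rel_type, end, **properties):
        if start not in self.nodes or end not in self.nodes:
            raise KeyError("both endpoints must exist before they are related")
        self.relationships.append(
            Relationship(start, rel_type, end, dict(properties))
        )

    def neighbors(self, node_id, rel_type=None):
        """Follow outgoing relationships, optionally filtered by type."""
        return [
            rel.end
            for rel in self.relationships
            if rel.start == node_id and (rel_type is None or rel.rel_type == rel_type)
        ]

    def nodes_with_label(self, label):
        return sorted(n.node_id for n in self.nodes.values() if label in n.labels)


def build_example():
    """Build a hypothetical person -> shell company -> yacht chain."""
    graph = PropertyGraph()
    graph.add_node("person_a", ["Person"], name="Person A")
    graph.add_node("shell_1", ["Company", "ShellCompany"], jurisdiction="unknown")
    graph.add_node("yacht_1", ["Asset", "Yacht"], name="Example Yacht")
    graph.add_relationship("person_a", "CONTROLS", "shell_1", since=2015)
    graph.add_relationship("shell_1", "OWNS", "yacht_1", share=1.0)
    return graph
```

Note that the relationship itself holds data (`since`, `share`), which is what makes questions such as "who controls the owner of this yacht?" a matter of following edges.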
... "we've seen SUCH a breadth of use cases emerging spanning nearly every industry."
And as the leader in graph technology, we've been fortunate to be able to engage so many of you in helping to solve some very complex data challenges.
So much so that we've been able to apply our graph expertise to help XXX
7 of top 10 Retailers
3 of top 5 Aircraft manufacturers
8 of the top 10 Insurance companies
... and ALL of America's top 20 banks to achieve their goals in leveraging graph technology.
We're fortunate to say that over 75% of the Fortune 100 use Neo4j.
To answer that question, I introduce "the labeled property graph model". It has nodes, relationships, and properties on each, all of which can be added via code and data science algorithms.
If this were a traditional relational database system, you would have three data points: the car, the person, and the other person. In a graph, you can add labels and properties to nodes, which essentially doubles, triples, or quadruples your ability to do analytics and data science and get more of a story from your data.
Graphs are the natural way to reveal, manage, use, and even discover those relationships. Graphs are designed to mirror the data and its patterns naturally. <<< NEXT >>