Data Visualization Kick Off #1 - Nov 3 2020 - Data for Good SaskatchewanData For Good Regina
This document announces a visualization challenge hosted by Data for Good to encourage the use of data skills to tell stories with data visualizations. Participants can choose any public dataset to visualize and must submit a single-page PDF by a deadline to be judged on understandability, how well it tells a story, and visual appeal. Bonus points will be given to visualizations that use local Saskatchewan data. Top submissions will be featured on the Data for Good social media and presented at an upcoming meetup. Contact information and examples of visualization tools and public datasets are provided.
Data for Good Regina talks about how it has used data to help organizations understand their data better so that they can further their mission. They talk about the United Way Summer Success Program and the datathon with the Distress Centre in Calgary.
Jason Barnard — Structuring, Visualizing and Evaluating Your E-A-TSemrush
These slides were presented at the SEMrush webinar "4 Hours of E-A-T | Structuring, Visualizing and Evaluating Your E-A-T". Video replay and transcript are available at https://www.semrush.com/webinars/4-hours-of-e-a-t-or-structuring-visualizing-and-evaluating-your-e-a-t/
The document discusses Regina's first data for good meetup that was hosted by ISM Canada. It provides information on how to get involved with data for good initiatives in Regina through roles like data ambassadors or participating in datathons and meetups. The goal is to use data science skills to contribute to social good and help non-profits through ongoing or short-term projects.
Google Analytics: Campaign Tracking for Smart PeopleLars von Sneidern
One of the core capabilities of Google Analytics is tracking what paid or earned campaign a visitor is referred by, and how their visit relates to goals established for your site.
Wait, you didn't know it could do that?! Guess what, it can...and it's a very cool thing. AND it's even easier now than ever with an instance that is implemented using Google Tag Manager.
This deck was presented at the Portland Google Analytics User Group on 11/9/16.
Reverse Engineering Google's Local Search AlgorithmDFWSEM
The document discusses local SEO ranking factors based on a study conducted by Dan Leibson. It identifies several key on-page and off-page ranking factors for local search results, including citations, links, websites, location, and Google My Business profiles. The presentation provides recommendations for prioritizing local SEO efforts on the most important ranking factors and links to additional resources for further information.
The document discusses 4 main strategies for building high quality, relevant backlinks for websites:
1. Researching a competitor's link profile to understand what strategies work best.
2. Releasing thought leadership content like studies and reports to attract links.
3. Establishing relationships with media sources to gain coverage.
4. Creating a wiki to track and organize ongoing link building strategies.
The bare naked truth about Joomla!'s data Jessica Dunbar
Are you interpreting Joomla's data? If you are, do you really understand what you're telling your clients? In this session, Jessica Dunbar will talk you through the ACTUAL data on how Joomla! is performing - the bare naked truths of what's really happening.
Data Visualization Kick Off #1 - Nov 3 2020 - Data for Good SaskatchewanData For Good Regina
This document announces a visualization challenge hosted by Data for Good to encourage the use of data skills to tell stories with data visualizations. Participants can choose any public dataset to visualize and must submit a single-page PDF by a deadline to be judged on understandability, how well it tells a story, and visual appeal. Bonus points will be given to visualizations that use local Saskatchewan data. Top submissions will be featured on the Data for Good social media and presented at an upcoming meetup. Contact information and examples of visualization tools and public datasets are provided.
Data for Good Regina talks about how it has used data to help organizations understand their data better so that they can further their mission. They talk about the United Way Summer Success Program and the datathon with the Distress Centre in Calgary.
Jason Barnard — Structuring, Visualizing and Evaluating Your E-A-TSemrush
These slides were presented at the SEMrush webinar "4 Hours of E-A-T | Structuring, Visualizing and Evaluating Your E-A-T". Video replay and transcript are available at https://www.semrush.com/webinars/4-hours-of-e-a-t-or-structuring-visualizing-and-evaluating-your-e-a-t/
The document discusses Regina's first data for good meetup that was hosted by ISM Canada. It provides information on how to get involved with data for good initiatives in Regina through roles like data ambassadors or participating in datathons and meetups. The goal is to use data science skills to contribute to social good and help non-profits through ongoing or short-term projects.
Google Analytics: Campaign Tracking for Smart PeopleLars von Sneidern
One of the core capabilities of Google Analytics is tracking what paid or earned campaign a visitor is referred by, and how their visit relates to goals established for your site.
Wait, you didn't know it could do that?! Guess what, it can...and it's a very cool thing. AND it's even easier now than ever with an instance that is implemented using Google Tag Manager.
This deck was presented at the Portland Google Analytics User Group on 11/9/16.
Reverse Engineering Google's Local Search AlgorithmDFWSEM
The document discusses local SEO ranking factors based on a study conducted by Dan Leibson. It identifies several key on-page and off-page ranking factors for local search results, including citations, links, websites, location, and Google My Business profiles. The presentation provides recommendations for prioritizing local SEO efforts on the most important ranking factors and links to additional resources for further information.
The document discusses 4 main strategies for building high quality, relevant backlinks for websites:
1. Researching a competitor's link profile to understand what strategies work best.
2. Releasing thought leadership content like studies and reports to attract links.
3. Establishing relationships with media sources to gain coverage.
4. Creating a wiki to track and organize ongoing link building strategies.
The bare naked truth about Joomla!'s data Jessica Dunbar
Are you interpreting Joomla's data? If you are, do you really understand what you're telling your clients? In this session, Jessica Dunbar will talk you through the ACTUAL data on how Joomla! is performing - the bare naked truths of what's really happening.
This document provides an overview of Statistics Canada data resources that can be used to understand communities, including a municipal data portal, proximity measures data viewer, and 2021 Census of Population. It summarizes census geography levels and tools, and provides examples of population counts and age distribution data for areas in and around Regina, Saskatchewan from the 2021 and previous censuses.
The talk discusses and demonstrate techniques for analyzing survey data. Survey data is useful data source to answer a wide range of questions, however, it often requires special analytical techniques to interpret. We'll discuss how to weight data to match known population parameters (such as StatsCan census data) using post-stratification and using the MICE algorithm to deal with missing data. These techniques are commonly used in political polling and social science research. I'll provide example code in R and explain all the steps using data from a survey of Canadians' values.
All companies want to use machine learning, but face many roadblocks to getting there. It can be hard for an organization to get the skills, technology and computing power necessary to build a working machine learning model, and deploy it as a pipeline. Modern Cloud providers have a host of tools to make machine learning easier than ever before and they have available computing power to back it up. In this learning focused session, Ryan will introduce you to some basics of data for machine learning and show how cloud services like Microsoft Azure Machine Learning have made building scalable and accurate Machine Learning pipelines as easy as pivoting a table in excel.
This is a presentation and workshop that Data for Good delivered during the Regina Food Summit put on by the City of Regina and the Regina Foodbank, on December 10, 2021.
Naiomi Borger, Director of Information Systems at Precision AI tells us all about her company's AI and drone technology and how that tech will impact the ag sector in the future.
Telecommunication networks are evolving through technologies like 5G, SDN, and NFV that will change how data analytics are performed. 5G networks in particular will provide higher speeds, lower latency and greater capacity that will support new applications in areas like smart cities, autonomous vehicles and industrial IoT. These network advances will decentralize storage and computing and better support technologies like AI, blockchain and edge/fog computing for data analytics. Challenges around data security, privacy and effective utilization will also need to be addressed.
Lance Dudar and Wendy Stone talk about TRiP is and how they provide young people and families access to resources in Regina by focusing on coordinated service support, reduction of barriers to pro-social activities, and school engagement
In this presentation, Economic Development Regina and Tourism Saskatchewan team up to showcase how they use data to target visitors inside and outside Regina.
This document is a presentation on carbon pricing by Brett Dolter, an assistant professor of economics. It includes 22 slides covering topics such as rising carbon dioxide concentrations, climate change impacts, greenhouse gas emissions sources, Canadian emissions trends, policy tools to reduce emissions, how carbon pricing works, evidence that carbon pricing reduces emissions, critiques of carbon pricing, and options for returning carbon pricing revenues. The document provides an overview of the issues surrounding carbon pricing and climate change policy.
This document provides information about the Regina Early Learning Centre, including its goals, programs, and attendance data. It operates three locations (Sacred Heart, Dr. Hanna, St. Matthew School) that provide early learning programs for infants through preschoolers, with a focus on education, parent support, health, and community connections. Attendance statistics from April to June 2019 show over 3,300 total visits across the locations. The centre collects family data to track attendance patterns and ensure services reflect the diversity of Regina.
ISM Environment Insights w/ Advanced Analytics - Data For GoodData For Good Regina
Manitoba Forage and Grassland Association
The project proponent Manitoba Forage and Grassland Association (MFGA) sought (1) to quantify the hydrologic effect of natural forage land use within the Assiniboine River Basin, and (2) to recommend land and water management practices that address various hydrologic issues present in the basin. Through ISM’s web-based delivery platform, highly technical hydrologic simulation results (provided by project partner Aquanty) are presented in a summarized and consumable format, intended for use by high level decision makers.
South Nation Conservation
South Nation Conservation (SNC) is a conservation authority responsible for watershed management outside of Ottawa, ON. In addition to having a need for a hydrologic understanding of their geography, SNC had a need for a full hydrologic forecasting platform to drive their business decisions. Daily ingestion of weather forecasts formed the foundational piece of this platform, giving SNC a continually updated prediction of potential hydrologic issues. ISM, Aquanty, and IBM’s The Weather Company partnered in this pioneering solution.
California Utility Company
ISM and IBM’s The Weather Company partnered to provide a predictive asset maintenance platform for a southern California energy utility. The client required real-time weather forecast models to be ingested, and fuel the prediction of “fire weather”, or places where wild fires are likely to occur. This allowed the client to identify which of their assets (power lines, sub stations, etc) may be at risk, and enables them to take proactive and preventive.
Robyn Edwards-Bentz walks through the way that the United Way in Regina helps young people keep up their literacy skills in their younger years to combat future educational issues.
ISM speaks about data maturity, the work they have done in Saskatchewan around analyzing the data behind human services, and how some of the biggest tech companies traffic data, not goods and services.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
This document provides an overview of Statistics Canada data resources that can be used to understand communities, including a municipal data portal, proximity measures data viewer, and 2021 Census of Population. It summarizes census geography levels and tools, and provides examples of population counts and age distribution data for areas in and around Regina, Saskatchewan from the 2021 and previous censuses.
The talk discusses and demonstrate techniques for analyzing survey data. Survey data is useful data source to answer a wide range of questions, however, it often requires special analytical techniques to interpret. We'll discuss how to weight data to match known population parameters (such as StatsCan census data) using post-stratification and using the MICE algorithm to deal with missing data. These techniques are commonly used in political polling and social science research. I'll provide example code in R and explain all the steps using data from a survey of Canadians' values.
All companies want to use machine learning, but face many roadblocks to getting there. It can be hard for an organization to get the skills, technology and computing power necessary to build a working machine learning model, and deploy it as a pipeline. Modern Cloud providers have a host of tools to make machine learning easier than ever before and they have available computing power to back it up. In this learning focused session, Ryan will introduce you to some basics of data for machine learning and show how cloud services like Microsoft Azure Machine Learning have made building scalable and accurate Machine Learning pipelines as easy as pivoting a table in excel.
This is a presentation and workshop that Data for Good delivered during the Regina Food Summit put on by the City of Regina and the Regina Foodbank, on December 10, 2021.
Naiomi Borger, Director of Information Systems at Precision AI tells us all about her company's AI and drone technology and how that tech will impact the ag sector in the future.
Telecommunication networks are evolving through technologies like 5G, SDN, and NFV that will change how data analytics are performed. 5G networks in particular will provide higher speeds, lower latency and greater capacity that will support new applications in areas like smart cities, autonomous vehicles and industrial IoT. These network advances will decentralize storage and computing and better support technologies like AI, blockchain and edge/fog computing for data analytics. Challenges around data security, privacy and effective utilization will also need to be addressed.
Lance Dudar and Wendy Stone talk about TRiP is and how they provide young people and families access to resources in Regina by focusing on coordinated service support, reduction of barriers to pro-social activities, and school engagement
In this presentation, Economic Development Regina and Tourism Saskatchewan team up to showcase how they use data to target visitors inside and outside Regina.
This document is a presentation on carbon pricing by Brett Dolter, an assistant professor of economics. It includes 22 slides covering topics such as rising carbon dioxide concentrations, climate change impacts, greenhouse gas emissions sources, Canadian emissions trends, policy tools to reduce emissions, how carbon pricing works, evidence that carbon pricing reduces emissions, critiques of carbon pricing, and options for returning carbon pricing revenues. The document provides an overview of the issues surrounding carbon pricing and climate change policy.
This document provides information about the Regina Early Learning Centre, including its goals, programs, and attendance data. It operates three locations (Sacred Heart, Dr. Hanna, St. Matthew School) that provide early learning programs for infants through preschoolers, with a focus on education, parent support, health, and community connections. Attendance statistics from April to June 2019 show over 3,300 total visits across the locations. The centre collects family data to track attendance patterns and ensure services reflect the diversity of Regina.
ISM Environment Insights w/ Advanced Analytics - Data For GoodData For Good Regina
Manitoba Forage and Grassland Association
The project proponent Manitoba Forage and Grassland Association (MFGA) sought (1) to quantify the hydrologic effect of natural forage land use within the Assiniboine River Basin, and (2) to recommend land and water management practices that address various hydrologic issues present in the basin. Through ISM’s web-based delivery platform, highly technical hydrologic simulation results (provided by project partner Aquanty) are presented in a summarized and consumable format, intended for use by high level decision makers.
South Nation Conservation
South Nation Conservation (SNC) is a conservation authority responsible for watershed management outside of Ottawa, ON. In addition to having a need for a hydrologic understanding of their geography, SNC had a need for a full hydrologic forecasting platform to drive their business decisions. Daily ingestion of weather forecasts formed the foundational piece of this platform, giving SNC a continually updated prediction of potential hydrologic issues. ISM, Aquanty, and IBM’s The Weather Company partnered in this pioneering solution.
California Utility Company
ISM and IBM’s The Weather Company partnered to provide a predictive asset maintenance platform for a southern California energy utility. The client required real-time weather forecast models to be ingested, and fuel the prediction of “fire weather”, or places where wild fires are likely to occur. This allowed the client to identify which of their assets (power lines, sub stations, etc) may be at risk, and enables them to take proactive and preventive.
Robyn Edwards-Bentz walks through the way that the United Way in Regina helps young people keep up their literacy skills in their younger years to combat future educational issues.
ISM speaks about data maturity, the work they have done in Saskatchewan around analyzing the data behind human services, and how some of the biggest tech companies traffic data, not goods and services.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
The Building Blocks of QuestDB, a Time Series Databasejavier ramirez
Talk Delivered at Valencia Codes Meetup 2024-06.
Traditionally, databases have treated timestamps just as another data type. However, when performing real-time analytics, timestamps should be first class citizens and we need rich time semantics to get the most out of our data. We also need to deal with ever growing datasets while keeping performant, which is as fun as it sounds.
It is no wonder time-series databases are now more popular than ever before. Join me in this session to learn about the internal architecture and building blocks of QuestDB, an open source time-series database designed for speed. We will also review a history of some of the changes we have gone over the past two years to deal with late and unordered data, non-blocking writes, read-replicas, or faster batch ingestion.
Natural Language Processing (NLP), RAG and its applications .pptxfkyes25
1. In the realm of Natural Language Processing (NLP), knowledge-intensive tasks such as question answering, fact verification, and open-domain dialogue generation require the integration of vast and up-to-date information. Traditional neural models, though powerful, struggle with encoding all necessary knowledge within their parameters, leading to limitations in generalization and scalability. The paper "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks" introduces RAG (Retrieval-Augmented Generation), a novel framework that synergizes retrieval mechanisms with generative models, enhancing performance by dynamically incorporating external knowledge during inference.
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Aggregage
This webinar will explore cutting-edge, less familiar but powerful experimentation methodologies which address well-known limitations of standard A/B Testing. Designed for data and product leaders, this session aims to inspire the embrace of innovative approaches and provide insights into the frontiers of experimentation!
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataKiwi Creative
Harness the power of AI-backed reports, benchmarking and data analysis to predict trends and detect anomalies in your marketing efforts.
Peter Caputa, CEO at Databox, reveals how you can discover the strategies and tools to increase your growth rate (and margins!).
From metrics to track to data habits to pick up, enhance your reporting for powerful insights to improve your B2B tech company's marketing.
- - -
This is the webinar recording from the June 2024 HubSpot User Group (HUG) for B2B Technology USA.
Watch the video recording at https://youtu.be/5vjwGfPN9lw
Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main
2. #dataforgood
What is a Visualization Challenge?
- Take a dataset and visualize it, tell a story
- Show off your data skills
- Build a portfolio
- Work with others
6. #dataforgood
- We will be using the dataset of bike accidents in Regina from 2010 -
2019
- You are allowed to use any other additional datasets you choose to
include (not necessary though)
- You are allowed to use any BI or visualization tool
- Your submission will be a single page in PDF format
- You can work individually or as part of a team
Rules
7. #dataforgood
- Send an email to regina@dataforgood.ca to enter the contest
- In your email, explain if you will be working solo, or want to be
placed on a team
- If you would like to be placed on a team, please explain your skills
and skill level so we can match you appropriately.
- Email your PDF file by midnight June 22nd, 2021 to be judged.
- If your visual has been selected as one of the top entries, you will
have the opportunity to present at the June 2021 Data For Good
Meetup on June 29th, 2021.
More Details
10. #dataforgood
Register to be in DVC
#2 by April 19th
DP-100 Azure Data Scientist
DP-300 Administering Relational Databases on Microsoft Azure
DA-100 PowerBI Azure Data Analyst
DA-100 PowerBI Azure Data Analyst
Be entered to win on Course Enrollment to
1 of the following courses: