#1 DataBeersBCN - Oscar Marin from Outliers.CollectiveDataBeersBCN
This document contains links to various projects and analyses done by Óscar Marín Miró including mood analysis of text using lexicons and corpora, geohashing to analyze events and locations on social media, viralization analysis, and profile mining of political affiliations on Twitter through lexicon and corpus analysis. Many of the links provide more information on analyzing emotions in text and geolocating social media posts.
The document discusses the income of average US teachers compared to Bill Gates. It notes that the average yearly income for a US teacher is $40,000, while Bill Gates' net worth was listed as $4 billion in Forbes in 2015. Charts on the pages compare the "Resistance to Pay" or willingness to spend money for common expenses like rice or a car between a teacher making $40,000 a year and Bill Gates with a net worth of $4 billion. The document also provides brief descriptions of the scientific method process.
This document discusses designing cities with consideration for urban smells. It mentions conducting "smell walks" in several cities to collect words associated with urban smells. A wheel was created to categorize both positive and negative urban smells into groups like nature, animals, and emissions. Data from London and Barcelona showed correlations between reported smells and air pollution levels. The document advocates that city planning should account for different urban smells in addition to other senses like sight and sound.
#6 DataBeersBCN -"The (Big) Data behind the brain"DataBeersBCN
This document discusses different methods for measuring and mapping the brain at micro, meso, and macro scales. It outlines technologies like electron microscopy, axonal tracing, Clarity brainbow techniques, MRI, CT, PET, and SPECT imaging. Important historical developments are noted, such as the invention of X-ray, CT, PET and MRI imaging technologies. Challenges and opportunities in open data initiatives and the emerging field of connectomics are also mentioned.
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"DataBeersBCN
1. The document discusses the history and evolution of data journalism, from early examples in the 1800s to modern practices using new digital tools.
2. It outlines key aspects of modern data journalism, such as multidisciplinary teams and making sources and methods transparent.
3. The author argues that data journalism is increasingly important for accountability by enabling investigative reporting using transparency laws and open data.
#5 DataBeersBCN -"Dos and Don'ts of Data Viz"DataBeersBCN
This document provides dos and don'ts for data visualization. It discusses how to properly scale and represent proportions in charts. Common misleading techniques are shown such as rescaling axes, omitting the y-axis origin, using different scales for the same axis, and including meaningless or invented data. The document advocates showing only relevant information without crowding plots. It also notes that the visualization should fit the intended audience and goal. While rules can be broken, the overall message is that visualizations should accurately and honestly portray the data.
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"DataBeersBCN
This document discusses the potential of sociothermodynamics to model human decision making and behavior using concepts from quantum mechanics. It provides several examples showing how human decisions can exhibit quantum-like effects, such as order effects and superposition of states. The document suggests this approach could help explain viral spreading of messages, electoral outcomes, and designing sustainable transportation networks by better understanding how individuals in a society interact similarly to quantum particles.
#1 DataBeersBCN - Oscar Marin from Outliers.CollectiveDataBeersBCN
This document contains links to various projects and analyses done by Óscar Marín Miró including mood analysis of text using lexicons and corpora, geohashing to analyze events and locations on social media, viralization analysis, and profile mining of political affiliations on Twitter through lexicon and corpus analysis. Many of the links provide more information on analyzing emotions in text and geolocating social media posts.
The document discusses the income of average US teachers compared to Bill Gates. It notes that the average yearly income for a US teacher is $40,000, while Bill Gates' net worth was listed as $4 billion in Forbes in 2015. Charts on the pages compare the "Resistance to Pay" or willingness to spend money for common expenses like rice or a car between a teacher making $40,000 a year and Bill Gates with a net worth of $4 billion. The document also provides brief descriptions of the scientific method process.
This document discusses designing cities with consideration for urban smells. It mentions conducting "smell walks" in several cities to collect words associated with urban smells. A wheel was created to categorize both positive and negative urban smells into groups like nature, animals, and emissions. Data from London and Barcelona showed correlations between reported smells and air pollution levels. The document advocates that city planning should account for different urban smells in addition to other senses like sight and sound.
#6 DataBeersBCN -"The (Big) Data behind the brain"DataBeersBCN
This document discusses different methods for measuring and mapping the brain at micro, meso, and macro scales. It outlines technologies like electron microscopy, axonal tracing, Clarity brainbow techniques, MRI, CT, PET, and SPECT imaging. Important historical developments are noted, such as the invention of X-ray, CT, PET and MRI imaging technologies. Challenges and opportunities in open data initiatives and the emerging field of connectomics are also mentioned.
#5 DataBeersBCN -"How to do Data Journalism… and not die trying"DataBeersBCN
1. The document discusses the history and evolution of data journalism, from early examples in the 1800s to modern practices using new digital tools.
2. It outlines key aspects of modern data journalism, such as multidisciplinary teams and making sources and methods transparent.
3. The author argues that data journalism is increasingly important for accountability by enabling investigative reporting using transparency laws and open data.
#5 DataBeersBCN -"Dos and Don'ts of Data Viz"DataBeersBCN
This document provides dos and don'ts for data visualization. It discusses how to properly scale and represent proportions in charts. Common misleading techniques are shown such as rescaling axes, omitting the y-axis origin, using different scales for the same axis, and including meaningless or invented data. The document advocates showing only relevant information without crowding plots. It also notes that the visualization should fit the intended audience and goal. While rules can be broken, the overall message is that visualizations should accurately and honestly portray the data.
#5 DataBeersBCN -"The gripping potentials of Sociothermodynamics"DataBeersBCN
This document discusses the potential of sociothermodynamics to model human decision making and behavior using concepts from quantum mechanics. It provides several examples showing how human decisions can exhibit quantum-like effects, such as order effects and superposition of states. The document suggests this approach could help explain viral spreading of messages, electoral outcomes, and designing sustainable transportation networks by better understanding how individuals in a society interact similarly to quantum particles.
#5 DataBeersBCN -"Location Based Business Oportunity Detector"DataBeersBCN
The document discusses a potential location-based business opportunity detector service called an LBBOD. The LBBOD would gather sensor, business, and fieldwork data around a user to detect opportunities near them and save them time spent searching. It would provide information on competitors and collaborators, the type of business environment, and the suitability of different locations for businesses. The LBBOD aims to answer questions about opportunities, risks, commercial activity in an area, and whether specific locations would work for a user's business.
#4 DataBeersBCN - "We know what you did last sonar" by Fernando CucchiettiDataBeersBCN
This document discusses using Cassandra DB for data storage, analytics, real-time processing, recommendations, aggregated statistics tracking, sensors, and visualization for business intelligence and t-shirts. It also mentions Sonar+D PlantaComplex Village, Sonardome Hall, and the number of iOS devices compared to people and addresses, with signals lasting less than 20 minutes. A schedule is listed for various days of the week.
#3 DataBeersBCN - "The impact of data in reality" by Karina GibertDataBeersBCN
The document discusses the impact of data on decision making in complex real-world domains. It notes that while data availability is growing exponentially, less than 1% of data is currently analyzed, leading to suboptimal decisions. Two main research fields—data mining and intelligent decision support systems—aim to extract better knowledge from data and support decision makers, but there remains a gap between these fields. The document advocates for an integrative, multidisciplinary approach combining data mining, modeling, and decision theory to develop intelligent decision support systems that can help decision makers understand data and make more informed choices.
#3 DataBeersBCN - "Big Fun Data" by Xavier GuardiolaDataBeersBCN
The document discusses how King.com uses big data and the scientific method to optimize user engagement and retention in their mobile and web games. It notes that King.com has 185 games with 364 million monthly unique players generating 14 billion rows of data per day. King focuses on key metrics like retention, engagement, monetization, conversion, and virality. The presentation explains how King designs experiments by making small changes to levels and uses data science to analyze results and improve the user experience.
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo AragonDataBeersBCN
The document analyzes how the political movement 15M transformed into a political party called Barcelona en Comú for the 2015 Barcelona City Council election. Through analysis of over 500k tweets, it finds that Barcelona en Comú exhibits both centralized and decentralized structures on Twitter. Specifically, it has a centralized and less resilient party cluster, as well as a decentralized and more resilient movement cluster, suggesting the party acts as an interface between minor political parties and 15M activists.
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...DataBeersBCN
This document discusses using data to create successful mobile games. It highlights the importance of analyzing funnels to track the percentage of users completing tutorials and engagement events. Natural language processing is also mentioned as a way to understand what players say about games to help guide improvements. The overall message is that collecting and leveraging data on player behavior and feedback is key to developing great mobile games.
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha CatalanDataBeersBCN
The document discusses a seismometer, an API that tracks daily changes, and tweets from Govern and others about opening data and requests for information. Visualizar15 thanked Concha Catalan, MVTango, LasaRux, Albert Carles, and others for a new design and open government in Catalonia.
#2 DataBeersBCN - "Why counting people at public transport" by Caterina FontDataBeersBCN
Counting passengers at public transportation helps transportation agencies understand capacity levels over time, passenger travel patterns including popular routes and times of trips, and passenger demographics. This data allows agencies to appropriately size vehicles, adjust prices, improve scheduling flexibility and route planning to better match capacity with demand. Mobility surveys using people counters provide passenger numbers, origin-destination data and other insights over periods of weeks to help transportation planning.
#2 DataBeersBCN - "Analog data visualitzation in the digital age" by Alberto ...DataBeersBCN
This document provides a summary of 6 examples of analog data visualization projects. It lists the names and website URLs of projects like Data Cuisine, Decoding Dom Perignon, Dear Data, and Handmade Dataviz Kit, which create non-digital visualizations of data through means like postcards, paper crafts and physical installations. The document serves to highlight creative approaches to representing data and insights in tangible, non-screen based forms.
Alea iacta est! Understanding historical dynamics using Monte Carlo simulations” wtih Xavier-Rubio Campillo, researcher at Barcelona Supercomputing Center - @xrubiocampillo
“Pear Campaigns -Publicity Effective AwaReness Campaigns. 1st prize BBVA Innova Challenge Mx” with David Solans from Centre Innovació i Tecnoglogia, UPC
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICSDataBeersBCN
This document provides information about Databeers, which are events that combine presentations on data-related topics with socializing over beers. The events follow a standard format of talks no longer than 6 minutes on data relationships, followed by more socializing. Anyone can give a talk, as long as it avoids code, formulas, and sells the presenter's data story. The document promotes following the Databeers social media accounts and attending future related events that combine data, innovation challenges, food, and drinks.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
#5 DataBeersBCN -"Location Based Business Oportunity Detector"DataBeersBCN
The document discusses a potential location-based business opportunity detector service called an LBBOD. The LBBOD would gather sensor, business, and fieldwork data around a user to detect opportunities near them and save them time spent searching. It would provide information on competitors and collaborators, the type of business environment, and the suitability of different locations for businesses. The LBBOD aims to answer questions about opportunities, risks, commercial activity in an area, and whether specific locations would work for a user's business.
#4 DataBeersBCN - "We know what you did last sonar" by Fernando CucchiettiDataBeersBCN
This document discusses using Cassandra DB for data storage, analytics, real-time processing, recommendations, aggregated statistics tracking, sensors, and visualization for business intelligence and t-shirts. It also mentions Sonar+D PlantaComplex Village, Sonardome Hall, and the number of iOS devices compared to people and addresses, with signals lasting less than 20 minutes. A schedule is listed for various days of the week.
#3 DataBeersBCN - "The impact of data in reality" by Karina GibertDataBeersBCN
The document discusses the impact of data on decision making in complex real-world domains. It notes that while data availability is growing exponentially, less than 1% of data is currently analyzed, leading to suboptimal decisions. Two main research fields—data mining and intelligent decision support systems—aim to extract better knowledge from data and support decision makers, but there remains a gap between these fields. The document advocates for an integrative, multidisciplinary approach combining data mining, modeling, and decision theory to develop intelligent decision support systems that can help decision makers understand data and make more informed choices.
#3 DataBeersBCN - "Big Fun Data" by Xavier GuardiolaDataBeersBCN
The document discusses how King.com uses big data and the scientific method to optimize user engagement and retention in their mobile and web games. It notes that King.com has 185 games with 364 million monthly unique players generating 14 billion rows of data per day. King focuses on key metrics like retention, engagement, monetization, conversion, and virality. The presentation explains how King designs experiments by making small changes to levels and uses data science to analyze results and improve the user experience.
#4 DataBeersBCN - "When a Movement Becomes a Party" by Pablo AragonDataBeersBCN
The document analyzes how the political movement 15M transformed into a political party called Barcelona en Comú for the 2015 Barcelona City Council election. Through analysis of over 500k tweets, it finds that Barcelona en Comú exhibits both centralized and decentralized structures on Twitter. Specifically, it has a centralized and less resilient party cluster, as well as a decentralized and more resilient movement cluster, suggesting the party acts as an interface between minor political parties and 15M activists.
#2 DataBeersBCN - "Using data to make great and succesful mobile games" by J...DataBeersBCN
This document discusses using data to create successful mobile games. It highlights the importance of analyzing funnels to track the percentage of users completing tutorials and engagement events. Natural language processing is also mentioned as a way to understand what players say about games to help guide improvements. The overall message is that collecting and leveraging data on player behavior and feedback is key to developing great mobile games.
#2 DataBeersBCN - "Govern Obert - Opengov.cat" by Concha CatalanDataBeersBCN
The document discusses a seismometer, an API that tracks daily changes, and tweets from Govern and others about opening data and requests for information. Visualizar15 thanked Concha Catalan, MVTango, LasaRux, Albert Carles, and others for a new design and open government in Catalonia.
#2 DataBeersBCN - "Why counting people at public transport" by Caterina FontDataBeersBCN
Counting passengers at public transportation helps transportation agencies understand capacity levels over time, passenger travel patterns including popular routes and times of trips, and passenger demographics. This data allows agencies to appropriately size vehicles, adjust prices, improve scheduling flexibility and route planning to better match capacity with demand. Mobility surveys using people counters provide passenger numbers, origin-destination data and other insights over periods of weeks to help transportation planning.
#2 DataBeersBCN - "Analog data visualitzation in the digital age" by Alberto ...DataBeersBCN
This document provides a summary of 6 examples of analog data visualization projects. It lists the names and website URLs of projects like Data Cuisine, Decoding Dom Perignon, Dear Data, and Handmade Dataviz Kit, which create non-digital visualizations of data through means like postcards, paper crafts and physical installations. The document serves to highlight creative approaches to representing data and insights in tangible, non-screen based forms.
Alea iacta est! Understanding historical dynamics using Monte Carlo simulations” wtih Xavier-Rubio Campillo, researcher at Barcelona Supercomputing Center - @xrubiocampillo
“Pear Campaigns -Publicity Effective AwaReness Campaigns. 1st prize BBVA Innova Challenge Mx” with David Solans from Centre Innovació i Tecnoglogia, UPC
#1 DataBeersBCN - Dani Villatoro from BBVA DATA ANALYTICSDataBeersBCN
This document provides information about Databeers, which are events that combine presentations on data-related topics with socializing over beers. The events follow a standard format of talks no longer than 6 minutes on data relationships, followed by more socializing. Anyone can give a talk, as long as it avoids code, formulas, and sells the presenter's data story. The document promotes following the Databeers social media accounts and attending future related events that combine data, innovation challenges, food, and drinks.
The Ipsos - AI - Monitor 2024 Report.pdfSocial Samosa
According to Ipsos AI Monitor's 2024 report, 65% Indians said that products and services using AI have profoundly changed their daily life in the past 3-5 years.
End-to-end pipeline agility - Berlin Buzzwords 2024Lars Albertsson
We describe how we achieve high change agility in data engineering by eliminating the fear of breaking downstream data pipelines through end-to-end pipeline testing, and by using schema metaprogramming to safely eliminate boilerplate involved in changes that affect whole pipelines.
A quick poll on agility in changing pipelines from end to end indicated a huge span in capabilities. For the question "How long time does it take for all downstream pipelines to be adapted to an upstream change," the median response was 6 months, but some respondents could do it in less than a day. When quantitative data engineering differences between the best and worst are measured, the span is often 100x-1000x, sometimes even more.
A long time ago, we suffered at Spotify from fear of changing pipelines due to not knowing what the impact might be downstream. We made plans for a technical solution to test pipelines end-to-end to mitigate that fear, but the effort failed for cultural reasons. We eventually solved this challenge, but in a different context. In this presentation we will describe how we test full pipelines effectively by manipulating workflow orchestration, which enables us to make changes in pipelines without fear of breaking downstream.
Making schema changes that affect many jobs also involves a lot of toil and boilerplate. Using schema-on-read mitigates some of it, but has drawbacks since it makes it more difficult to detect errors early. We will describe how we have rejected this tradeoff by applying schema metaprogramming, eliminating boilerplate but keeping the protection of static typing, thereby further improving agility to quickly modify data pipelines without fear.
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
Analysis insight about a Flyball dog competition team's performanceroli9797
Insight of my analysis about a Flyball dog competition team's last year performance. Find more: https://github.com/rolandnagy-ds/flyball_race_analysis/tree/main