Often talks about data science focus on tools and methods. Tools are important and it is really important to stay up to date with the latest tools and technologies (they are the “how”). But data science is also about finding good problems and solving them (the “why”). Good problems are ones that are both valuable (someone really wants an answer) and tractable (there is data you can find to help you answer it). Today, with modern technologies, many more problems are tractable than ever before; lots of data is freely available on the internet and sensors make it easy to collect ad hoc data sets. This talk will emphasize the importance of asking “why” by exploring a few examples from Datascope’s client work and our various side projects where, if we hadn’t asked “why”, the outcomes would have been far less useful.
Northwestern data visualization - why why whyGabriel Gaster
Divvy bikes have changed the way we can get around Chicago. This talk will demonstrate the impact of Divvy with an interactive visualization. Rather than focus on the tools and languages used to build it, the talk will emphasize design and content aspects of the visualization (at divvy.datasco.pe) as well as some recent work to quantify the similarity of bike stations. The talk will feature a live-demo of the visualization and the opportunity for attendees to share their own thoughts and hypotheses about bike trip patterns.
Creating Your Social Media Plan & Strategy_DTCC April 2015Lisa Flowers
Lisa Flowers presented a workshop on creating a social media plan and strategy. She outlined key steps including setting SMART goals, conducting an audit and SWOT analysis, deciding on top social media platforms, creating a content plan and editorial calendar. The presentation emphasized the importance of understanding audiences, curating and creating engaging content, and developing a sustainable long-term social media strategy.
This document discusses various free marketing tools available through Realtor.com for real estate agents, including a Realtor.com profile, mobile listing builder, housing trends newsletter, business and marketing plans, social media connections, listing presentations, HouseLogic content, and the Realtors Property Resource tool. It provides information on setting up and using each tool to promote listings and an agent's business.
The document discusses various Google tools and services including Gmail, Drive, Calendar, Chrome, Chromecast, and Google+. It provides tips and tricks for using each service more efficiently. For Gmail, it recommends using conversation view, setting up multiple accounts, storing contacts, creating filters, and canned responses. For Drive, it highlights the free storage, collaboration features, and ability to use documents on any device. For Calendar, it discusses setting up multiple calendars, event collaboration, sharing calendars, and reminder options. Finally, it presents Chrome and Chromecast for easier browsing and casting browser tabs to a TV.
The document discusses high-level concurrency and the actor model architecture. It notes several problems with concurrent programming like race conditions and deadlocks. The actor model uses isolated independent objects called actors that communicate asynchronously via message passing, avoiding concurrency issues. Examples shown include parallel processing, monad binding, and Instagram's use of actors.
Digital Analytics in South East Asia: what to expect in 2016Lisa Collins
Lisa Collins reviewed the state of analytics in 2015 and what to expect in 2016. In 2015, there was a focus on tag management, mobile app analytics, integrating second and third party data, and testing and personalization. Major trends to watch for in 2016 include increased attention to privacy regulations, development of algorithmic attribution models, and data management platforms to integrate first, second, and third party data. Data science and analytics functions are also expected to increasingly merge, relying more on machine learning.
Para crear una cuenta en Gmail, se debe ingresar a la página de Gmail, completar un formulario con datos personales y verificar la cuenta a través de un código de letras. Después de completar estos pasos, se podrá acceder a la nueva cuenta de correo electrónico de Gmail y enviar y recibir correos.
This document provides a summary of Sérgio Luis Garrido's qualifications and experience. It outlines his educational background which includes an MBA from FGV and various other degrees and specializations. His professional experience includes over 20 years working for General Motors do Brasil, where he currently serves as a Supplier Quality Launch Leader. Prior to this role, he held positions supporting quality engineering and suppliers. The document also lists other relevant roles and achievements throughout his career in the automotive and chemical industries.
Northwestern data visualization - why why whyGabriel Gaster
Divvy bikes have changed the way we can get around Chicago. This talk will demonstrate the impact of Divvy with an interactive visualization. Rather than focus on the tools and languages used to build it, the talk will emphasize design and content aspects of the visualization (at divvy.datasco.pe) as well as some recent work to quantify the similarity of bike stations. The talk will feature a live-demo of the visualization and the opportunity for attendees to share their own thoughts and hypotheses about bike trip patterns.
Creating Your Social Media Plan & Strategy_DTCC April 2015Lisa Flowers
Lisa Flowers presented a workshop on creating a social media plan and strategy. She outlined key steps including setting SMART goals, conducting an audit and SWOT analysis, deciding on top social media platforms, creating a content plan and editorial calendar. The presentation emphasized the importance of understanding audiences, curating and creating engaging content, and developing a sustainable long-term social media strategy.
This document discusses various free marketing tools available through Realtor.com for real estate agents, including a Realtor.com profile, mobile listing builder, housing trends newsletter, business and marketing plans, social media connections, listing presentations, HouseLogic content, and the Realtors Property Resource tool. It provides information on setting up and using each tool to promote listings and an agent's business.
The document discusses various Google tools and services including Gmail, Drive, Calendar, Chrome, Chromecast, and Google+. It provides tips and tricks for using each service more efficiently. For Gmail, it recommends using conversation view, setting up multiple accounts, storing contacts, creating filters, and canned responses. For Drive, it highlights the free storage, collaboration features, and ability to use documents on any device. For Calendar, it discusses setting up multiple calendars, event collaboration, sharing calendars, and reminder options. Finally, it presents Chrome and Chromecast for easier browsing and casting browser tabs to a TV.
The document discusses high-level concurrency and the actor model architecture. It notes several problems with concurrent programming like race conditions and deadlocks. The actor model uses isolated independent objects called actors that communicate asynchronously via message passing, avoiding concurrency issues. Examples shown include parallel processing, monad binding, and Instagram's use of actors.
Digital Analytics in South East Asia: what to expect in 2016Lisa Collins
Lisa Collins reviewed the state of analytics in 2015 and what to expect in 2016. In 2015, there was a focus on tag management, mobile app analytics, integrating second and third party data, and testing and personalization. Major trends to watch for in 2016 include increased attention to privacy regulations, development of algorithmic attribution models, and data management platforms to integrate first, second, and third party data. Data science and analytics functions are also expected to increasingly merge, relying more on machine learning.
Para crear una cuenta en Gmail, se debe ingresar a la página de Gmail, completar un formulario con datos personales y verificar la cuenta a través de un código de letras. Después de completar estos pasos, se podrá acceder a la nueva cuenta de correo electrónico de Gmail y enviar y recibir correos.
This document provides a summary of Sérgio Luis Garrido's qualifications and experience. It outlines his educational background which includes an MBA from FGV and various other degrees and specializations. His professional experience includes over 20 years working for General Motors do Brasil, where he currently serves as a Supplier Quality Launch Leader. Prior to this role, he held positions supporting quality engineering and suppliers. The document also lists other relevant roles and achievements throughout his career in the automotive and chemical industries.
Korea was first ruled by the Han Dynasty of China starting in 108 BC. Korean clans formed during this period but began declining around 108 AD. Over subsequent centuries, Korea experienced frequent changes in rule, first being taken over by various powers and establishing its own governments including the Silla kingdom in the 600s, followed by the Koryo dynasty from 935 to 1392 and finally the Joseon dynasty which collapsed later that year. Buddhism was adopted as the main religion in Korea starting in the 500s.
What goes into making a Data Visualization?Gabriel Gaster
The document discusses the design choices for visualizing transportation data in Chicago. It describes using Voronoi tessellation to divide the city into regions based on proximity to Divvy bike share stations. Color is used to represent bike trip volumes within each region, with binned coloring and transparent empty bins chosen over a gradient scale. The visualization aims to show how Divvy has changed where people can travel and explore differences between neighborhoods.
El documento describe una actividad individual con Quandary, un juego de toma de decisiones éticas. Los estudiantes jugarán el juego solo y reflexionarán sobre las decisiones difíciles que enfrentan los personajes y las consecuencias de esas decisiones. Después compartirán sus experiencias y perspectivas con la clase.
The document discusses the results of a study on the effects of a new drug on memory and cognitive function in older adults. The double-blind study involved 100 participants aged 65-80 and found that those given the drug performed significantly better on memory and problem-solving tests than the placebo group after 6 months. The drug was found to be safe and well-tolerated with no serious side effects reported.
Aerospace engineering involves designing and building aircraft and spacecraft. Geometry is used to design efficient aircraft parts and ensure they fit together properly. Aerospace engineers must consider the size of parts and how to assemble complex vehicles like airplanes and spacecraft from individual components.
Music Therapy Notes / More Reflecting, Less Wadingrachelmsmith
This set of slides outlines the Music Therapy Notes app which primarily is a tool to help music therapists organise audio and visual data recorded in music therapy sessions. It was created by @oxfordsing and @timdavies at the Berklee College of Music Music Therapy Hack 28th - 29th March 2014.
This document provides an overview of the water purifier industry in India. It discusses how people in India have become more health conscious in recent decades as around 80% of diseases are caused by water-borne microorganisms. The WHO estimates that simple water treatment techniques could save millions of lives each year from waterborne diseases. The water purifier industry provides mechanized and portable purification options to remove contaminants from raw water sources and meet the growing demand for safe drinking water in India.
This document discusses Ruby data types including numbers, text, arrays, hashes, ranges, symbols, and objects. It provides details on integer and float numbers, strings, string operations, arrays and common array methods, hashes and hash syntax, ranges and how they work, symbols and how they differ from strings, and Ruby objects and operations like equal?, conversions and tainting objects.
13 Trends and Challenges with School Wireless NetworksCorey Anderson
This document discusses trends and challenges schools face in supporting mobile learning and provides solutions for wireless network design. It introduces two speakers: Corey Anderson, a mobility specialist, and Michael McNamee, a chief network engineer with experience deploying wireless networks in over 300 schools. The document outlines trends like 1:1 initiatives, BYOD, cloud-based mobile learning and multimedia usage. It also discusses challenges such as ensuring security, managing growth and different environments. Solutions proposed include load balancing, dual-radio access points, and centralized wireless management with real-time device visibility. The speakers offer a free wireless network design assessment to help schools determine needs.
The document discusses different measures of central tendency (mean, median, mode) and how to determine which is most appropriate based on the type of data. It also covers measures of dispersion like range, standard deviation, and variance which provide information about how spread out values are from the central point. The mean is the most commonly used measure of central tendency but the median is less affected by outliers, while the mode represents the most frequent value.
Morphological and wavelet transform techniques were applied to enhance mammographic phantom images containing microcalcifications, nodules, and fibrils. Four observers evaluated the original and enhanced images using receiver operating characteristic analysis and subjective rating scales. While some techniques improved detection of certain structures over original images based on ROC curve analysis, subjective ratings indicated original images had better contrast, sharpness, and quality. Overall, the enhancement methods did not consistently increase detection performance. Future work should focus on improving enhancement algorithms to more effectively enhance image quality and visualization without altering structure morphology.
Pharmacodynamics is the study of how drugs act on biological systems and their mechanisms of action. Drugs can interact with receptors to mimic or block physiological messengers. Agonists activate receptors while antagonists reduce or prevent agonist effects. Receptors are often proteins that drugs bind to through covalent, ionic, or hydrogen bonds. Signaling mechanisms involve ligand-gated ion channels, G-protein coupled receptors, and second messengers such as cAMP or calcium ions. Understanding these interactions is important for elucidating drug actions at the cellular level.
The document discusses acid rain and its causes and effects. It explains that acid rain is caused by emissions of sulfur dioxide and nitrogen oxides reacting with water, moisture and oxygen in the air. When the rain falls to the ground it can harm plants, animals, and cause damage to statues and building materials. Acid rain also acidifies lakes and streams, and can kill fish and other aquatic animals.
This document analyzes the psychometric properties of the Swedish version of the Strengths and Difficulties Questionnaire (SDQ) among children and adolescents ages 12-16. It examines the internal consistency, factor structure, and validity of the SDQ using data from community and service contact samples. The results show good internal consistency for most scales. Factor analyses support a bifactor model over the original 5-factor model. Validity analyses find the Emotional Problems scale best distinguishes the samples, while other scales have lower accuracy. Further analyses are suggested to improve understanding of the SDQ's performance in Swedish populations.
Central gas equipment for industrial gases (2011 edition in english) uk619 10...PRITAM JADHAV
The document provides information about gas supply equipment for industrial installations, including central gas manifolds. It discusses the advantages of central gas systems such as improved safety, efficiency and cost savings. Central gas manifolds allow for continuous gas supply to work stations from one location, reducing transportation of gas cylinders within the workplace. The document also outlines requirements for gas installations including engineering according to relevant laws and regulations, applying for necessary permits, and conducting a risk analysis.
Social media trends and audiences: March 2105Bob Crawshaw
This document discusses social media trends in Australia based on various data sources. Some key findings include:
- 47% of online time is spent on social media, with 28 minutes out of every 60 spent on social platforms.
- 9 out of 10 people check their smartphones as part of their daily routine.
- Smartphone sales in Australia have increased significantly between 2011 and 2014.
- Most social network access is via smartphones, with 55% accessing daily.
- Facebook has 9 million daily active users in Australia, with 7.3 million accessing via mobile.
Korea was first ruled by the Han Dynasty of China starting in 108 BC. Korean clans formed during this period but began declining around 108 AD. Over subsequent centuries, Korea experienced frequent changes in rule, first being taken over by various powers and establishing its own governments including the Silla kingdom in the 600s, followed by the Koryo dynasty from 935 to 1392 and finally the Joseon dynasty which collapsed later that year. Buddhism was adopted as the main religion in Korea starting in the 500s.
What goes into making a Data Visualization?Gabriel Gaster
The document discusses the design choices for visualizing transportation data in Chicago. It describes using Voronoi tessellation to divide the city into regions based on proximity to Divvy bike share stations. Color is used to represent bike trip volumes within each region, with binned coloring and transparent empty bins chosen over a gradient scale. The visualization aims to show how Divvy has changed where people can travel and explore differences between neighborhoods.
El documento describe una actividad individual con Quandary, un juego de toma de decisiones éticas. Los estudiantes jugarán el juego solo y reflexionarán sobre las decisiones difíciles que enfrentan los personajes y las consecuencias de esas decisiones. Después compartirán sus experiencias y perspectivas con la clase.
The document discusses the results of a study on the effects of a new drug on memory and cognitive function in older adults. The double-blind study involved 100 participants aged 65-80 and found that those given the drug performed significantly better on memory and problem-solving tests than the placebo group after 6 months. The drug was found to be safe and well-tolerated with no serious side effects reported.
Aerospace engineering involves designing and building aircraft and spacecraft. Geometry is used to design efficient aircraft parts and ensure they fit together properly. Aerospace engineers must consider the size of parts and how to assemble complex vehicles like airplanes and spacecraft from individual components.
Music Therapy Notes / More Reflecting, Less Wadingrachelmsmith
This set of slides outlines the Music Therapy Notes app which primarily is a tool to help music therapists organise audio and visual data recorded in music therapy sessions. It was created by @oxfordsing and @timdavies at the Berklee College of Music Music Therapy Hack 28th - 29th March 2014.
This document provides an overview of the water purifier industry in India. It discusses how people in India have become more health conscious in recent decades as around 80% of diseases are caused by water-borne microorganisms. The WHO estimates that simple water treatment techniques could save millions of lives each year from waterborne diseases. The water purifier industry provides mechanized and portable purification options to remove contaminants from raw water sources and meet the growing demand for safe drinking water in India.
This document discusses Ruby data types including numbers, text, arrays, hashes, ranges, symbols, and objects. It provides details on integer and float numbers, strings, string operations, arrays and common array methods, hashes and hash syntax, ranges and how they work, symbols and how they differ from strings, and Ruby objects and operations like equal?, conversions and tainting objects.
13 Trends and Challenges with School Wireless NetworksCorey Anderson
This document discusses trends and challenges schools face in supporting mobile learning and provides solutions for wireless network design. It introduces two speakers: Corey Anderson, a mobility specialist, and Michael McNamee, a chief network engineer with experience deploying wireless networks in over 300 schools. The document outlines trends like 1:1 initiatives, BYOD, cloud-based mobile learning and multimedia usage. It also discusses challenges such as ensuring security, managing growth and different environments. Solutions proposed include load balancing, dual-radio access points, and centralized wireless management with real-time device visibility. The speakers offer a free wireless network design assessment to help schools determine needs.
The document discusses different measures of central tendency (mean, median, mode) and how to determine which is most appropriate based on the type of data. It also covers measures of dispersion like range, standard deviation, and variance which provide information about how spread out values are from the central point. The mean is the most commonly used measure of central tendency but the median is less affected by outliers, while the mode represents the most frequent value.
Morphological and wavelet transform techniques were applied to enhance mammographic phantom images containing microcalcifications, nodules, and fibrils. Four observers evaluated the original and enhanced images using receiver operating characteristic analysis and subjective rating scales. While some techniques improved detection of certain structures over original images based on ROC curve analysis, subjective ratings indicated original images had better contrast, sharpness, and quality. Overall, the enhancement methods did not consistently increase detection performance. Future work should focus on improving enhancement algorithms to more effectively enhance image quality and visualization without altering structure morphology.
Pharmacodynamics is the study of how drugs act on biological systems and their mechanisms of action. Drugs can interact with receptors to mimic or block physiological messengers. Agonists activate receptors while antagonists reduce or prevent agonist effects. Receptors are often proteins that drugs bind to through covalent, ionic, or hydrogen bonds. Signaling mechanisms involve ligand-gated ion channels, G-protein coupled receptors, and second messengers such as cAMP or calcium ions. Understanding these interactions is important for elucidating drug actions at the cellular level.
The document discusses acid rain and its causes and effects. It explains that acid rain is caused by emissions of sulfur dioxide and nitrogen oxides reacting with water, moisture and oxygen in the air. When the rain falls to the ground it can harm plants, animals, and cause damage to statues and building materials. Acid rain also acidifies lakes and streams, and can kill fish and other aquatic animals.
This document analyzes the psychometric properties of the Swedish version of the Strengths and Difficulties Questionnaire (SDQ) among children and adolescents ages 12-16. It examines the internal consistency, factor structure, and validity of the SDQ using data from community and service contact samples. The results show good internal consistency for most scales. Factor analyses support a bifactor model over the original 5-factor model. Validity analyses find the Emotional Problems scale best distinguishes the samples, while other scales have lower accuracy. Further analyses are suggested to improve understanding of the SDQ's performance in Swedish populations.
Central gas equipment for industrial gases (2011 edition in english) uk619 10...PRITAM JADHAV
The document provides information about gas supply equipment for industrial installations, including central gas manifolds. It discusses the advantages of central gas systems such as improved safety, efficiency and cost savings. Central gas manifolds allow for continuous gas supply to work stations from one location, reducing transportation of gas cylinders within the workplace. The document also outlines requirements for gas installations including engineering according to relevant laws and regulations, applying for necessary permits, and conducting a risk analysis.
Social media trends and audiences: March 2105Bob Crawshaw
This document discusses social media trends in Australia based on various data sources. Some key findings include:
- 47% of online time is spent on social media, with 28 minutes out of every 60 spent on social platforms.
- 9 out of 10 people check their smartphones as part of their daily routine.
- Smartphone sales in Australia have increased significantly between 2011 and 2014.
- Most social network access is via smartphones, with 55% accessing daily.
- Facebook has 9 million daily active users in Australia, with 7.3 million accessing via mobile.
9ª edição das Industry Sessions by EDIT. com a temática Digital Marketing, em Lisboa
Filipe Bernardes / Marketing & Lead Generations Manager / MOBIZY
Aquilo que "vemos" na web social é, normalmente, apenas a ponta de um icebergue cada vez mais complexo. É necessário repensar modelos de atuação mais passivos, e garantir a integração de informação preciosa nos processos de tomada de decisão de uma marca.
Winning with analytics - Data Innovation Summit 2015 - Made in BelgiumPython Predictions
Winning with (predictive) analytics is not much different from winning with beer.
15 minute key presentation by @pythongeert on Data Innovation Summit 2015 - Made in Belgium by datasciencebe.com
"Data Informed vs Data Driven" by Casper Sermsuksan (Kulina)Tech in Asia ID
Casper is currently the Head of Product & Growth at Kulina, an online food subscription service in Jakarta. Casper is responsible for driving product management and growth initiatives as well as leading marketing efforts. Previously, he led the product marketing teams at Product Madness in San Francisco. During his tenure at Product Madness, he helped the company's top app, Heart of Vegas achieve the record of $200M in annual revenue. Outside of his day-to-day work, he advises corporations and startups on product and growth, and writes frequently on Startup Grind, Mind the Product & Muzli. He graduated with a business degree from the University of Southern California in Los Angeles.
***
This slide was shared at Tech in Asia Product Development Conference 2017 (PDC'17) on 9-10 August 2017.
Get more insightful updates from TIA by subscribing techin.asia/updateselalu
Social Media for Non Profits DTCC Dover Oct 2015Lisa Flowers
The document outlines Lisa Flowers' presentation on social media for non-profits. It includes an agenda, introduction of Lisa Flowers, a video on social media, steps for non-profits to use social media effectively, and tools and best practices discussed. Key topics covered are developing goals and strategies, engaging audiences through different types of content, using social media for events, and measuring performance.
The document discusses Nvest, an investment recommendation platform. It notes that while many investors prefer human financial advice online, current recommendations can be unquantifiable and lacking transparency. Nvest aims to provide insightful and fully transparent stock recommendations through advanced analytical tools available for free. The document also outlines Nvest's business model of monetizing individual subscriptions and organizational private groups, and long term goals of launching stock analysis reports, mutual funds, and growing globally.
Data-driven Growth - Analytics & Attribution for Marketers in 2016 | Turing F...Andy Young
This document summarizes Andy Young's presentation on analytics tools and tips for marketers in 2016. The presentation covers why analytics are important for businesses, common analytics failures to avoid, key metrics to track at different stages of business and product growth, techniques for early-stage and growth-stage businesses, and considerations for selecting analytics tools and platforms. The goal is to provide marketers with guidance on using data-driven approaches to optimize growth.
Is your content working better for someone else? @jonearnshawJon Earnshaw
A presentation demonstrating how easy it is to lose visibility in the organic SERPS when another website takes only a small amount of your content. The impact is wide reaching and can be disastrous; even to resellers and channel partners.
(Big)Data_just use it 30 sep2015_Frank_VullersFrank Vullers
This document summarizes a presentation about using big data. It discusses four customer cases where companies gained insights from big data to improve their businesses. The cases demonstrate how analyzing online customer behavior, social media sentiment, cross-promotion affinity, and social media chatter can provide benefits. Key learnings are to start experimenting with data, involve the right people and processes, and create the right flexible architecture. The presentation encourages companies to just start using their data.
Big Data Expo 2015 - Teradata Big Data : Just use it!BigDataExpo
Veel Retailers hebben de data in huis maar weten niet hoe dit te analyseren, andere Retailers worstelen nog met het samenbrengen van de data . Toch zijn verschillende Retailers er wel in geslaagd om data te gebruiken en de concurrentie voor te blijven. Hoe hebben zij dat gedaan ? Wat kwamen zij tegen ? Wat hebben zij geleerd ? Een overzicht van succesvolle Internationale retailers die er in slaagden om de Big Data hype te verzilveren . De transformatie van gut feeling naar data driven is een boeiende reis, niet zonder hindernissen maar het resultaat telt.
Content Marketing Meet-up - What is content marketing it, Should it do it, Wh...Martijn Burgman
The term of “content marketing” has been buzzing for a while, but what does it really entail? Creating blogs, videos, e-books, infographic and white papers with thought-leadership content is only part of the equation. You also need to make sure the content supply chain and distribution channels are running smoothly, ready for market feedback.
Martijn Burgman - an expert of this topic who will share with you the foundation of content marketing, practical examples and actionable takeaways. Stay tuned for updates.
This webinar discussed the importance of developing a content strategy. It began with an overview of the key components of a content strategy: goals, audience, voice, content types, channels, and execution. The presenter then discussed each of these components in more detail. For goals, the content strategy should align with business strategy. For audience, it is important to understand who your audience is through research. Voice determines the tone and style of content. Content types and channels depend on what your audience prefers. Execution determines how the content will be created and distributed. The webinar emphasized starting by defining business goals and learning about your audience. It concluded with a question and answer session.
How Benchmarking is the Key to Marketing Success. Using competitive intelligence to plan and execute better marketing - Workshop by Owen Tyzack, Sales Director of SimilarWeb at the NOAH 2015 Conference in Berlin, Tempodrom on the 10th of June 2015.
This document discusses predictive analytics and the growing use of data science in industries. It describes how industries are using predictive analytics to predict disease propagation, perform direct marketing, and provide recommendations. The document also outlines emerging technologies like self-driving cars, health monitoring, smart homes and cities, and augmented reality recommendations. Finally, it discusses common algorithms, the importance of prototyping quickly, and getting results from predictive analytics projects.
We aim to create high-quality content. We really do. But, more-often-than-not, we fail. We understand that high-quality content must be clear, concise, and consistent in voice, tone, and terminology. We also know that it’s supposed to be easily findable, accessible, retrievable, and relevant those who need it—delivered when, where, and how they prefer it.
Crafting content that follows the rules (grammar, punctuation, linguistics) isn’t good enough. Our content also has to be helpful.
In this fast-paced talk, Scott Abel describes what it means to be helpful. You’ll discover how understanding the power of explanation
Presented November 27, 2018, at Quadrus Conference Center for Information Development World 2018.
We are pleased to share with you the latest VCOSA statistical report on the cotton and yarn industry for the month of March 2024.
Starting from January 2024, the full weekly and monthly reports will only be available for free to VCOSA members. To access the complete weekly report with figures, charts, and detailed analysis of the cotton fiber market in the past week, interested parties are kindly requested to contact VCOSA to subscribe to the newsletter.
Orchestrating the Future: Navigating Today's Data Workflow Challenges with Ai...Kaxil Naik
Navigating today's data landscape isn't just about managing workflows; it's about strategically propelling your business forward. Apache Airflow has stood out as the benchmark in this arena, driving data orchestration forward since its early days. As we dive into the complexities of our current data-rich environment, where the sheer volume of information and its timely, accurate processing are crucial for AI and ML applications, the role of Airflow has never been more critical.
In my journey as the Senior Engineering Director and a pivotal member of Apache Airflow's Project Management Committee (PMC), I've witnessed Airflow transform data handling, making agility and insight the norm in an ever-evolving digital space. At Astronomer, our collaboration with leading AI & ML teams worldwide has not only tested but also proven Airflow's mettle in delivering data reliably and efficiently—data that now powers not just insights but core business functions.
This session is a deep dive into the essence of Airflow's success. We'll trace its evolution from a budding project to the backbone of data orchestration it is today, constantly adapting to meet the next wave of data challenges, including those brought on by Generative AI. It's this forward-thinking adaptability that keeps Airflow at the forefront of innovation, ready for whatever comes next.
The ever-growing demands of AI and ML applications have ushered in an era where sophisticated data management isn't a luxury—it's a necessity. Airflow's innate flexibility and scalability are what makes it indispensable in managing the intricate workflows of today, especially those involving Large Language Models (LLMs).
This talk isn't just a rundown of Airflow's features; it's about harnessing these capabilities to turn your data workflows into a strategic asset. Together, we'll explore how Airflow remains at the cutting edge of data orchestration, ensuring your organization is not just keeping pace but setting the pace in a data-driven future.
Session in https://budapestdata.hu/2024/04/kaxil-naik-astronomer-io/ | https://dataml24.sessionize.com/session/667627
Build applications with generative AI on Google CloudMárton Kodok
We will explore Vertex AI - Model Garden powered experiences, we are going to learn more about the integration of these generative AI APIs. We are going to see in action what the Gemini family of generative models are for developers to build and deploy AI-driven applications. Vertex AI includes a suite of foundation models, these are referred to as the PaLM and Gemini family of generative ai models, and they come in different versions. We are going to cover how to use via API to: - execute prompts in text and chat - cover multimodal use cases with image prompts. - finetune and distill to improve knowledge domains - run function calls with foundation models to optimize them for specific tasks. At the end of the session, developers will understand how to innovate with generative AI and develop apps using the generative ai industry trends.
Build applications with generative AI on Google Cloud
Why Why Why
1. berlin pydata | @gabegaster | 2015 february
what is data science?
2. berlin pydata | @gabegaster | 2015 february
what is data science?
3. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
4. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
review of literature
5. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
review of literature
6. berlin pydata | @gabegaster | 2015 february
what is data science?
review of literature
7. berlin pydata | @gabegaster | 2015 february
what is data science?
review of literature
8. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
9. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
10. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
11. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
• repeatable analysis
12. berlin pydata | @gabegaster | 2015 february
what is data science?
who is a data scientist?
“a scientist who can code”
• lower barrier to attack new problems
• repeatable analysis
• freedom to think about problems new ways
13. berlin pydata | @gabegaster | 2015 february
what is data science?
14. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
15. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
16. berlin pydata | @gabegaster | 2015 february
which were difficult to answer before
17. berlin pydata | @gabegaster | 2015 february
computing has progressed
which were difficult to answer before
18. berlin pydata | @gabegaster | 2015 february
1950
computing has progressed
19. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
computing has progressed
20. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
computing has progressed
21. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
computing has progressed
22. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
v
computing has progressed
23. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
hoursv
v
computing has progressed
24. berlin pydata | @gabegaster | 2015 february
1950
cost of new
analysis
years
today
same person thinking about the problem
can conduct experiments to answer it
hoursv
v
computing has progressed
25. berlin pydata | @gabegaster | 2015 february
computing has progressed
26. berlin pydata | @gabegaster | 2015 february
open-source code
computing has progressed
27. berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
28. berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
29. berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
computing has progressed
30. berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
reinventing the wheel
computing has progressed
31. berlin pydata | @gabegaster | 2015 february
open-source code
standing on
shoulders of giants
reinventing the wheel
computing has progressed
32. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
33. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
knowing
what is possible
which were difficult to answer before
34. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
35. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
HOW
36. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
HOW WHY
37. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
38. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
tools
39. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools
40. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking why
tools
41. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools
42. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools WHY
43. berlin pydata | @gabegaster | 2015 february
what is data science?
using emerging technologies to approach
problems scientifically
which were difficult to answer before
knowing
what is possible
doing
something useful
using
new
good
the right
asking whytools WHY
WHY
44. berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
45. berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
science is about asking why
46. berlin pydata | @gabegaster | 2015 february
why why why
what is data science?
science is about asking why
start there
56. berlin pydata | @gabegaster | 2015 february
goal: save money
57. berlin pydata | @gabegaster | 2015 february
goal: save money
58. berlin pydata | @gabegaster | 2015 february
goal: save money
59. berlin pydata | @gabegaster | 2015 february
goal: save money
60. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
61. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
62. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
goal: save money
task: find needle in the haystack (without poking yourself)
63. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
goal: save money
task: find needle in the haystack (without poking yourself)
64. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
goal: save money
task: find needle in the haystack (without poking yourself)
65. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save money
task: find needle in the haystack (without poking yourself)
66. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
aboutpatent
not
aboutpatent
turn over to plaintiff
don’t
turn over to plaintiff
adverse inference
give away trade secrets
goal: save money
task: find needle in the haystack (without poking yourself)
67. berlin pydata | @gabegaster | 2015 februaryberlin pydata | @gabegaster | 2015 february
turn over to plaintiff
don’t
turn over to plaintiff
goal: save money
task: find needle in the haystack (without poking yourself)
81. berlin pydata | @gabegaster | 2015 february
classify schizophrenia w MRItask:
82. berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
83. berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
84. berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
how?
85. berlin pydata | @gabegaster | 2015 february
why?
classify schizophrenia w MRItask:
improve understanding of disease
how? … outside contest purview
86. berlin pydata | @gabegaster | 2015 february
why? outside contest purview
87. berlin pydata | @gabegaster | 2015 february
why? outside contest purview
88. berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
89. berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
getting data
&
making usable
90. berlin pydata | @gabegaster | 2015 february
why? outside contest purview
kaggle
getting data
&
making usable
WHY
91. berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
92. berlin pydata | @gabegaster | 2015 february
timeline of contest
AUC
Accuracy of Classification
93. berlin pydata | @gabegaster | 2015 february
what is AUC?
AUC
94. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
95. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve?
96. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
97. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
98. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
99. berlin pydata | @gabegaster | 2015 february
balances:
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
100. berlin pydata | @gabegaster | 2015 february
balances:
True Positive Rate
False Positive Rate
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
101. berlin pydata | @gabegaster | 2015 february
balances:
True Positive Rate
False Positive Rate
AUC
what is AUC? Area Under Curve
what curve? Receiver Operating
Characteristic
102. berlin pydata | @gabegaster | 2015 february
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
103. berlin pydata | @gabegaster | 2015 february
why?
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
104. berlin pydata | @gabegaster | 2015 february
why?
…
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
105. berlin pydata | @gabegaster | 2015 february
why?
…
upshot:
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
106. berlin pydata | @gabegaster | 2015 february
why?
…
choice of metric matters a LOT
upshot:
in practice
AUC
what is AUC?
balances:
True Positive Rate
False Positive Rate
Area Under Curve
what curve? Receiver Operating
Characteristic
107. berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
108. berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
random guess
109. berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
AUC
random guess
basic SVM
110. berlin pydata | @gabegaster | 2015 february
timeline of contest
goal?
Accuracy of Classification
AUC
random guess
basic SVM
111. berlin pydata | @gabegaster | 2015 february
timeline of contest
goal: depends on why
Accuracy of Classification
AUC
random guess
basic SVM
112. berlin pydata | @gabegaster | 2015 february
random guess
basic SVM
timeline of contest
Accuracy of Classification
AUC
113. berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
114. berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
turned out to place 9th — because overfitting
115. berlin pydata | @gabegaster | 2015 february
me
timeline of contest
Accuracy of Classification
AUC
turned out to place 9th — because overfitting
very common problem
116. berlin pydata | @gabegaster | 2015 february
timeline of contest
Accuracy of Classification
worth it?
AUC
126. berlin pydata | @gabegaster | 2015 february
an example
just for fun
127. berlin pydata | @gabegaster | 2015 february
Chicago Bike Share System
!
!
kind of like call-a-bike
128. berlin pydata | @gabegaster | 2015 february
Show what I like about Bike share
!
Chicago Bike Share System
!
!
kind of like call-a-bike
129. berlin pydata | @gabegaster | 2015 february
Show what I like about Bike share
!
Think about how bike share has changed geography
Chicago Bike Share System
!
!
kind of like call-a-bike
130. berlin pydata | @gabegaster | 2015 february
a typical trip for me
131. berlin pydata | @gabegaster | 2015 february
Bus transit
times
=
a LIE
132. berlin pydata | @gabegaster | 2015 february
Chicago is a grid city
133. berlin pydata | @gabegaster | 2015 february
Difficult
Public
Transit on
the grid
=+
Diagonals
134. berlin pydata | @gabegaster | 2015 february
Difficult
Public
Transit on
the grid
=+
Diagonals
2+ buses = FAIL
135. berlin pydata | @gabegaster | 2015 february
Adding bikes to
public transit
=
win
136. berlin pydata | @gabegaster | 2015 february
show how has divvy
changed where people
can go
viz Goal:
137. berlin pydata | @gabegaster | 2015 february
show how has divvy
changed where people
can go
show where people
actually go
viz Goal: