What data scientists really do, according to 50 data scientists | Hugo Bowne-Anderson
My talk at PyData NYC, 2018.
This is the abstract:
Hugo Bowne-Anderson, data scientist and host of the DataFramed podcast, will give you a view into the thinking of 50 leading data scientists from around the world about the trends driving the data science revolution. During his interviews with these thought leaders, Hugo discovered themes and lessons about the past, present, and future of data science.
A presentation on data science delivered by Mohammed Barakat at the 2nd Jordanian Continuous Improvement Open Day in Amman on 3rd October 2015.
Matthew Russell's "Unleashing Twitter Data for Fun and Insight" presentation from Strata 2011. See http://strataconf.com/strata2011/public/schedule/detail/17714 for an overview of the talk.
Data Driven PR: 8 Steps to Building Media Attention with Research | WalkerSands
Do you want to learn how your internal data can be used to gain media coverage in The New York Times, USA Today, and Mashable? Or how a simple consumer survey can lead to hundreds of new leads for your business?
Learn how in this presentation from Mike Santoro, President of tech PR firm Walker Sands, and Andrea Kempfer, Director of Marketing at market research firm Lab42.
The recorded presentation can be viewed at: http://www.walkersands.com/Data-Driven-PR-Webinar
Business Models in the Data Economy: A Case Study from the Business Partner D... | Boris Otto
Data management seems to be experiencing a renaissance today. One particular trend in the so-called data economy has been the emergence of business models based on the provision of high-quality data. In this context, the paper examines business models of business partner data providers. The paper explores how and why these business models differ. Based on a study of six cases, the paper identifies three different business model patterns. A resource-based view is taken to explore the details of these patterns. Furthermore, the paper develops a set of propositions that help understand why the different business models evolved and how they may develop in the future. Finally, the paper discusses the ongoing market transformation process, which indicates a shift from traditional value chains toward value networks, a change which, if it is sustainable, would seriously threaten the business models of well-established data providers such as Dun & Bradstreet.
Tip from IBM Connect 2014: Socialytics = Social Business, Big Social Data and... | SocialBiz UserGroup
In this tip, speaker Scott Padget explains how socialytics provides customer and competitive insights as well as real-time operational insights. He introduces the SIFT (Social Intelligence Fusion Toolkit) Solution that funnels big social data into actionable business intelligence. Scott also describes the lifecycle of socialytics and gives a live demo. Obviously, the slides don’t capture the exact live demo, but they do show some screenshot examples of the SIFT Solution in action.
Analyzing social conversation: a guide to data mining and data visualization | Tempero UK
These slides were presented by Mick Conroy of Tempero and Jonathan Stray of Associated Press/Overview Project as part of Social Media Week New York #smwnyc
In this talk we outline some of the key challenges in text analytics, describe some of Endeca's current research work in this area, examine the current state of the text analytics market and explore some of the prospects for the future.
A deck presented at the MRS 'Maximising the Value of Big Data' conference in London, January 2013.
Presents my view of big data and the potential it gives us for mapping the systems that we deal with on a day-to-day basis. Big data holds the promise of providing us with a meta-view of the systems that we all think we are so familiar with. I think we will find that the woods look nothing like the trees.
Learn How a New Kind of Marketing Mix Modeling is Better for Media Planning | ThinkVine
This presentation discusses the use of agent-based modeling and its proven advantages to media planners, including the abilities to create effective media plans based on consumer differences, accurately attribute results to media tactics, quantify long-term effects, and forecast sales and ROI results.
This presentation explains how brands can mine social media data, both text and images, to find insights about their customers and markets that can provide real business value.
Staying on the Right Side of the Fence when Analyzing Human Data | DataSift
Data is all around us and comes from many different sources. This data is generated by human behavior and it’s growing at an astonishing rate. Companies are collecting this data and using it in ways they could never have imagined.
This brings a sense of unease among people that their intimate information is no longer their own. Yet this data is central to companies' ability to better serve customers, so companies must find a balance that honors customers' privacy. How can we strike the balance?
Join this webinar and you will learn:
About the current and future challenges in this data-rich world
How to be a good guy, and still achieve your business objectives while analyzing Human Data
About PYLON for Facebook Topic Data and how you can build insights from Facebook while protecting user privacy
Agile Data Science is a lean methodology adapted from Agile Software Development. At its core, it centers on people, interactions, and building minimum viable products that ship fast and often to solicit customer feedback. In this presentation, I describe how this work was done in the past with examples. Get started today with our help by visiting http://www.alpinenow.com
Highlights and summary of long-running programmatic research on data science: practices, roles, tools, skills, organization models, workflow, outlook, etc. Profiles and persona definition for data scientist model. Landscape of org models for data science and drivers for capability planning. Secondary research materials.
Requirements Engineering for the Humanities | Shawn Day
This workshop explores how requirements engineering can be employed by digital and non-digital humanities scholars (and others) to conceptualise and communicate a research project.
As the field of digital humanities has evolved, one of the biggest challenges has been marrying technical expertise with humanities scholarly practice to successfully deliver sustainable and sound digital projects. At its core this is a communications exercise. However, communicating effectively demands an ability to translate, define and find clarity in your own mind.
Tactics and Decision Making for Successful Museum Digital Projects | Andrew Lewis
This paper discusses what tactics and decision-making mean in practice within museum digital technology projects. It offers practical suggestions for tactical approaches drawn from the author’s twelve years of experience managing digital projects and services.
Pathways to Technology Transfer and Adoption: Achievements and Challenges | Tao Xie
Dongmei Zhang and Tao Xie. Pathways to Technology Transfer and Adoption: Achievements and Challenges. In Proceedings of the 35th International Conference on Software Engineering (ICSE 2013), Software Engineering in Practice (SEIP), Mini-Tutorial, San Francisco, CA, May 2013. http://people.engr.ncsu.edu/txie/publications/icse13seip-techtransfer.pdf
Large language models in higher education | Peter Trkman
Discussing the possibilities of large language models for the automatic generation of academic content by students (e.g. a master's thesis), and the related need to change how students are educated and evaluated.
Toward a System Building Agenda for Data Integration(and Dat.docx | juliennehar
Toward a System Building Agenda for Data Integration
(and Data Science)
AnHai Doan, Pradap Konda, Paul Suganthan G.C., Adel Ardalan, Jeffrey R. Ballard, Sanjib Das,
Yash Govind, Han Li, Philip Martinkus, Sidharth Mudgal, Erik Paulson, Haojun Zhang
University of Wisconsin-Madison
Abstract
We argue that the data integration (DI) community should devote far more effort to building systems,
in order to truly advance the field. We discuss the limitations of current DI systems, and point out that
there is already an existing popular DI “system” out there, which is PyData, the open-source ecosystem
of 138,000+ interoperable Python packages. We argue that rather than building isolated monolithic DI
systems, we should consider extending this PyData “system”, by developing more Python packages that
solve DI problems for the users of PyData. We discuss how extending PyData enables us to pursue an
integrated agenda of research, system development, education, and outreach in DI, which in turn can
position our community to become a key player in data science. Finally, we discuss ongoing work at
Wisconsin, which suggests that this agenda is highly promising and raises many interesting challenges.
1 Introduction
In this paper we focus on data integration (DI), broadly interpreted as covering all major data preparation steps
such as data extraction, exploration, profiling, cleaning, matching, and merging [10]. This topic is also known
as data wrangling, munging, curation, unification, fusion, preparation, and more. Over the past few decades, DI
has received much attention (e.g., [37, 29, 31, 20, 34, 33, 6, 17, 39, 22, 23, 5, 8, 36, 15, 35, 4, 25, 38, 26, 32, 19,
2, 12, 11, 16, 2, 3]). Today, as data science grows, DI is receiving even more attention. This is because many
data science applications must first perform DI to combine the raw data from multiple sources, before analysis
can be carried out to extract insights.
Yet despite all this attention, today we do not really know whether the field is making good progress. The
vast majority of DI works (with the exception of efforts such as Tamr and Trifacta [36, 15]) have focused on
developing algorithmic solutions. But we know very little about whether these (ever-more-complex) algorithms
are indeed useful in practice. The field has also built mostly isolated system prototypes, which are hard to use and
combine, and are often not powerful enough for real-world applications. This makes it difficult to decide what
to teach in DI classes. Teaching complex DI algorithms and asking students to do projects using our prototype
systems can train them well for doing DI research, but are not likely to train them well for solving real-world DI
problems in later jobs. Similarly, outreach to real users (e.g., domain scientists) is difficult. Given that we have
...
Advancing Foundation and Practice of Software Analytics | Tao Xie
Vision Statement Presentation on "Advancing Foundation & Practice of Software Analytics" at the 2nd International NSF sponsored Workshop on Realizing Artificial Intelligence Synergies in Software Engineering (RAISE 2013) http://promisedata.org/raise/2013/
Academic Innovation Data Showcase 2-14-19 | umichiganai
On Thursday, February 14 from 9:30 a.m. to 12:00 p.m. the Office of Academic Innovation hosted our first Data Showcase - an event for all University of Michigan (U-M) community members to come take a tour through the data that power our work.
Usability testing: rapid results when you need them. Have a question about whether a new feature or design idea works for users? It’s easy to find out early, so your design process is as responsive as your code. We'll look at ways to run quick usability tests, how to find users in the wild, and when to add testing to your project plan. Yes, it can be fast, good, and cheap.
Presentation at the dotgov design conference - March 27, 2015
Similar to Influence mapping Toolbox Presentation London 2015 (20)
Pushing the limits of ePRTC: 100ns holdover for 100 days | Adtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
Threats to mobile devices are more prevalent and increasing in scope and complexity. Users of mobile devices want to take full advantage of the features available on those devices, but many of those features provide convenience and capability at the expense of security. This best practices guide outlines steps users can take to better protect personal devices and information.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Climate Impact of Software Testing at Nordic Testing Days | Kari Kakkonen
My slides at Nordic Testing Days 6.6.2024
The talk discusses the climate impact and sustainability of software testing. ICT and testing must carry their part of the global responsibility to help with climate warming. We can minimize the carbon footprint, but we can also have a carbon handprint, a positive impact on the climate. Sustainability can be added to the quality characteristics and then measured continuously. Test environments can be used less, at smaller scale, and on demand. Test techniques can be used to optimize or minimize the number of tests. Test automation can be used to speed up testing.
DevOps and Testing slides at DASA Connect | Kari Kakkonen
My and Rik Marselis's slides from the DASA Connect conference, 30.5.2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps is. We closed with a lovely workshop in which the participants explored different ways to think about quality and testing in different parts of the DevOps infinity loop.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf | Paige Cruz
Monitoring and observability aren’t traditionally found in software curriculums, and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is part of our current company’s observability stack.
While the dev and ops silo continues to crumble, many organizations still relegate monitoring and observability to ops, infra and SRE teams. This is a mistake: achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party, and will share these foundational concepts to build on:
UiPath Test Automation using UiPath Test Suite series, part 5 | DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with DevOps.
Topics covered:
CI/CD within UiPath
End-to-end overview of CI/CD pipeline with Azure DevOps
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
GridMate - End to end testing is a critical piece to ensure quality and avoid... | ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
UiPath Test Automation using UiPath Test Suite series, part 6 | DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 6. In this session, we will cover Test Automation with generative AI and OpenAI.
The UiPath Test Automation with generative AI and OpenAI webinar offers an in-depth exploration of leveraging cutting-edge technologies for test automation within the UiPath platform. Attendees will delve into the integration of generative AI, a test automation solution, with OpenAI's advanced natural language processing capabilities.
Throughout the session, participants will discover how this synergy empowers testers to automate repetitive tasks, enhance testing accuracy, and expedite the software testing life cycle. Topics covered include the seamless integration process, practical use cases, and the benefits of harnessing AI-driven automation for UiPath testing initiatives. By attending this webinar, testers and automation professionals can gain valuable insights into harnessing the power of AI to optimize their test automation workflows within the UiPath ecosystem, ultimately driving efficiency and quality in software development processes.
What will you get from this session?
1. Insights into integrating generative AI
2. Understanding how this integration enhances test automation within the UiPath platform
3. Practical demonstrations
4. Exploration of real-world use cases illustrating the benefits of AI-driven test automation for UiPath
Topics covered:
What is generative AI
Test Automation with generative AI and OpenAI
UiPath integration with generative AI
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Securing your Kubernetes cluster: a step-by-step guide to success! | KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
How to Get CNIC Information System with Paksim Ga.pptx | danishmna97
Pakdata Cf is a groundbreaking system designed to streamline and facilitate access to CNIC information. This innovative platform leverages advanced technology to provide users with efficient and secure access to their CNIC details.
Communications Mining Series - Zero to Hero - Session 1 | DianaGray10
This session provides an introduction to UiPath Communication Mining, its importance, and a platform overview. You will acquire a good understanding of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
Goodbye Windows 11: Make Way for Nitrux Linux 3.5.0! | SOFTTECHHUB
As the digital landscape continually evolves, operating systems play a critical role in shaping user experiences and productivity. The launch of Nitrux Linux 3.5.0 marks a significant milestone, offering a robust alternative to traditional systems such as Windows 11. This article delves into the essence of Nitrux Linux 3.5.0, exploring its unique features, advantages, and how it stands as a compelling choice for both casual users and tech enthusiasts.
Epistemic Interaction - tuning interfaces to provide information for AI support | Alan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
A tale of scale & speed: How the US Navy is enabling software delivery from l... | sonjaschweigert1
Rapid and secure feature delivery is a goal across every application team and every branch of the DoD. The Navy’s DevSecOps platform, Party Barge, has achieved:
- Reduction in onboarding time from 5 weeks to 1 day
- Improved developer experience and productivity through actionable findings and reduction of false positives
- Maintenance of superior security standards and inherent policy enforcement with Authorization to Operate (ATO)
Development teams can ship efficiently and ensure applications are cyber ready for Navy Authorizing Officials (AOs). In this webinar, Sigma Defense and Anchore will give attendees a look behind the scenes and demo secure pipeline automation and security artifacts that speed up application ATO and time to production.
We will cover:
- How to remove silos in DevSecOps
- How to build efficient development pipeline roles and component templates
- How to deliver security artifacts that matter for ATOs (SBOMs, vulnerability reports, and policy evidence)
- How to streamline operations with automated policy checks on container images
3. Open Integrity Initiative - Digital Security and Privacy
Open Oil Navigator - Oil and Gas Industries
Panic Initiative - Mobile App with Amnesty
Ultra-Rural Tech - Lake Tanganyika Medical Records
10. Journey Entry Points
Primary
● "How to identify data for your project?" (Practices/Projects/Tools)
● "How to organise your data?" (Practices/Tools)
● "How to make sense of your data?" (Practices/Tools)
● "How to present your data and findings?" (Practices/Tools)
Secondary
● "Take a tour of the tools available" (End User Tools)
● "What are the best approaches to build your own tools?" (Dev Tools)
● "Who's doing work like yours?" (Projects)
● "Influence mapping success stories" (Case Studies)
● "Influence mapping essentials" (Practices)
12. Structure
Projects: Existing influence mapping projects, including projects aiming to provide data to others
Tools: End-user tools, such as spreadsheets or online services that don't require development skills
Dev Tools: Libraries, frameworks, programming languages, database systems…
Practices: Activities, tasks (recipes) or methods that are linked to the practice of influence mapping
Case Studies: Detailed analysis of existing projects in order to help others learn about various concrete practices
Guides, Data Providers,...
16. Guides / Case Studies
Abstract Use Cases
Growing the data little by little by themselves
Big data dump (like a leak)
Lead generation (discovery, finding out about a new topic)
Concrete Use Cases
Police Corruption (Overview, topic mapping to "remove" and then manually analyse the ones that don't fit)
WSJ, Organic Farms violations (Overview)
Dealing with EU Data Protection requests (Open Corporates)
17. Contribute!
● Give us more feedback about structure
● Tell us if you want to contribute to the content (manage your project page, your own tool page, share your practices/recipes...)
● Get in touch if you have colleagues or partners that could test this in beta in about a month or two.
influencemapping@iilab.org