My recent presentation about what is Big Data, Why so much Hype now, Startling Facts, Opportunity, History, Important Research Papers such as GFS, Map-Reduce , Technology Platforms and Organizations , Hadoop, Cassandra, Introduction to Hadoop, Contribution of Indians to various Big Data technologies working in Google, Cloudera, Hortonworks, Yahoo, Facebook, Aadhar - "All your answers lie in data - @Sameer Sawhney"
Microsoft also recently launched the second try out of its other Hadoop substitute LINQ to HPC, formerly known as Dryad. LINQ/Dryad have been used for Google for some time, but not the various resources are available to users of Microsoft windows HPC Server 2008 groups.
My recent presentation about what is Big Data, Why so much Hype now, Startling Facts, Opportunity, History, Important Research Papers such as GFS, Map-Reduce , Technology Platforms and Organizations , Hadoop, Cassandra, Introduction to Hadoop, Contribution of Indians to various Big Data technologies working in Google, Cloudera, Hortonworks, Yahoo, Facebook, Aadhar - "All your answers lie in data - @Sameer Sawhney"
Microsoft also recently launched the second try out of its other Hadoop substitute LINQ to HPC, formerly known as Dryad. LINQ/Dryad have been used for Google for some time, but not the various resources are available to users of Microsoft windows HPC Server 2008 groups.
Mark McCrindle Communities Inspired Presentation Australian Communities ForumMark McCrindle
Mark McCrindle of McCrindle Research presented this analysis of the various types of community organisations and groups that we are a part of - from ones where the purposes is around belief and ethos versus those around interest and practice; and how some communities are formed with a relational people focus whereas others with a more functional people focus. Mark presented this as the final session of the Australian Communities Forum in November 2012.
Data Journalism - Data Visualisation with MapsBahareh Heravi
Data Journalism lecture - Week 10: Data Visualisation with Maps
Lecture date: 11 Nov 2015
MA in Journalism
National University of Ireland, Galway
Title slide image from The Data Journalism Handbook
Colman McMahon, DIT School of Computing: Getting Started with Data VisualisationDublinked .
Graduating with a BA from UCD in 1995, Colman emigrated to America to pursue a career that combined creativity, commerce and computers. Heading west to California, Colman worked for 11 years in Hollywood's visual effects (VFX) industry. During this time he worked mainly at The Walt Disney Co. and also as a. In 2006, Colman returned home to Ireland to undertake a . A short time after the conclusion of the course, while starting up his own , Colman was invited back to DIT as a part-time lecturer. In 2011, Colman was offered a PhD Fellowship at modeling and simulating the relationship between innovation and profit. This full-time study is under the direction of Prof. Petra Ahrweiler, Director UCD Innovation Research Unit and Professor of Technology and Innovation Management, Smurfit School of Business. In 2012, Colman designed and delivered the first iteration of a new Visualisation module as part of DIT's .
Details of Colman's research activities can be found at .
-Dubinked-
Drawing from a new module at DIT, Colman's presentation at Dublinked will be an introduction to the domain of visualisation and a demonstration of powerful yet "do-able" data visualisations. The ethos of the presentation is for people who have little or no visualisation experience but have an aptitude and appetite for using technical tools to surface meaning from data. The tools used will be R, R Studio and Inkscape.
Tucana HR Analytics Data Visualisation, April 2014 (London)Mark Lawrence
Introduction to effective data visualisation ( #datavis ) techniques, delivered to an HR Data Analytics conference hosted by Tucana, in London, April 2014. These slides have been adjusted since the original presentation, to mitigate confidentiality or privacy concerns. Data included herein, should be considered as Dummy Data. Ref's provided.
Disclaimer: Although I worked for IBM, and represented IBM at the time, subsequent opinions should not be considered to be in line with those of IBM.
Improving business outcomes through rapid data visualisationThoughtWorks Studios
Visualising data provides clarity, increases engagement and delivers unexpected insights. A rapid and adaptive approach to building visualisations can help you realise value with a minimal investment.
David and Ray shared thoughts and client stories from work in Perth and Melbourne at an evening briefing in Perth, Western Australia, on 29 October 2013. David is a lead management consultant with a mathematical visualisation bent (find him on LinkedIn or see his blog). Ray is a lead developer consultant who enjoys thinking up and building products (twitter @grassdog).
Mark McCrindle is a social researcher with an international reputation for tracking the emerging trends and analysing the diverse generations. He is the Director of McCrindle Research, whose clients include over 100 multinational organisations. His highly valued research and reports have developed his renown as a futurist, demographer and social commentator. Mark is highly regarded for his engaging presentations whether they be keynote addresses at conferences, boardroom briefings or facilitated workshops.
Andy Kirk's Webinar for Tableau (July 2016)Andy Kirk
These are the slides from the talk given by Andy Kirk (@visualisingdata) on a webinar hosted by Tableau Software on 20th July 2016. The title is 'Bringing Method to the Madness' and concerns a demonstration of a data visualisation design workflow.
Digital Transactors vs Digital Integrators: A QuizMark McCrindle
While all the generations use social media, new technologies and mobile devices, the age at which we are first exposed to the technologies determines how embedded they become in our lifestyles. That is why Baby Boomers and Gen Xers are Digital Transactors, using the new technology tools efficiently, structurally and productively. However Generations Y and Z are more likely Digital Integrators, seamlessly integrating the technology into their communications, behaviours and world. This quick quiz will test which of these digital types you are. See how you go in this A-Z activity with the goal to identify each of these digital platform logos just by looking at the first letter of them. Check your score at the end and see which generation you best fit!
Learning the Craft of Data VisualisationAmit Kapoor
Data visualisation is a cross disciplinary activity that requires us to operate at the intersection of a visual designer, data scientist and storyteller. It is both a science and an art.
So how does one realistically learn these multitude of skills needed to get good at the craft of data visualisation. In this slide deck, I discuss ideas about the possible path that a beginner could adopt to learn this craft through sustained practice.
This presentation was for Social Media Week Berlin on Tuesday, 24th September. It was targeted at NGOs, NPOs, activist organisations and charities who have important key messages to share with the community. The event will combine elements of a presentation and workshop. We will examine case studies of campaigns that have successfully used data visualisation in tandem with social media and content marketing techniques to spread information and ideas, and to counteract prevailing myths about climate change and renewable energy technology. We will then allow time for participants to split up into small working groups. Structured discussion tasks and group feedback will allow participants to investigate how these strategies can apply to their own organisation or issue. Participants will learn practical steps for identifying important messages, researching and developing content, incorporating data visualisation in a powerful and meaningful way, and promoting their data visualisation campaigns through social media and email outreach. In particular, the event will focus on developing powerful stories that will attract the support of influential sharers and thought leaders from a range of backgrounds, from activism through to industry, so as to maximise the campaign's reach and impact.
Mark McCrindle Communities Inspired Presentation Australian Communities ForumMark McCrindle
Mark McCrindle of McCrindle Research presented this analysis of the various types of community organisations and groups that we are a part of - from ones where the purposes is around belief and ethos versus those around interest and practice; and how some communities are formed with a relational people focus whereas others with a more functional people focus. Mark presented this as the final session of the Australian Communities Forum in November 2012.
Data Journalism - Data Visualisation with MapsBahareh Heravi
Data Journalism lecture - Week 10: Data Visualisation with Maps
Lecture date: 11 Nov 2015
MA in Journalism
National University of Ireland, Galway
Title slide image from The Data Journalism Handbook
Colman McMahon, DIT School of Computing: Getting Started with Data VisualisationDublinked .
Graduating with a BA from UCD in 1995, Colman emigrated to America to pursue a career that combined creativity, commerce and computers. Heading west to California, Colman worked for 11 years in Hollywood's visual effects (VFX) industry. During this time he worked mainly at The Walt Disney Co. and also as a. In 2006, Colman returned home to Ireland to undertake a . A short time after the conclusion of the course, while starting up his own , Colman was invited back to DIT as a part-time lecturer. In 2011, Colman was offered a PhD Fellowship at modeling and simulating the relationship between innovation and profit. This full-time study is under the direction of Prof. Petra Ahrweiler, Director UCD Innovation Research Unit and Professor of Technology and Innovation Management, Smurfit School of Business. In 2012, Colman designed and delivered the first iteration of a new Visualisation module as part of DIT's .
Details of Colman's research activities can be found at .
-Dubinked-
Drawing from a new module at DIT, Colman's presentation at Dublinked will be an introduction to the domain of visualisation and a demonstration of powerful yet "do-able" data visualisations. The ethos of the presentation is for people who have little or no visualisation experience but have an aptitude and appetite for using technical tools to surface meaning from data. The tools used will be R, R Studio and Inkscape.
Tucana HR Analytics Data Visualisation, April 2014 (London)Mark Lawrence
Introduction to effective data visualisation ( #datavis ) techniques, delivered to an HR Data Analytics conference hosted by Tucana, in London, April 2014. These slides have been adjusted since the original presentation, to mitigate confidentiality or privacy concerns. Data included herein, should be considered as Dummy Data. Ref's provided.
Disclaimer: Although I worked for IBM, and represented IBM at the time, subsequent opinions should not be considered to be in line with those of IBM.
Improving business outcomes through rapid data visualisationThoughtWorks Studios
Visualising data provides clarity, increases engagement and delivers unexpected insights. A rapid and adaptive approach to building visualisations can help you realise value with a minimal investment.
David and Ray shared thoughts and client stories from work in Perth and Melbourne at an evening briefing in Perth, Western Australia, on 29 October 2013. David is a lead management consultant with a mathematical visualisation bent (find him on LinkedIn or see his blog). Ray is a lead developer consultant who enjoys thinking up and building products (twitter @grassdog).
Mark McCrindle is a social researcher with an international reputation for tracking the emerging trends and analysing the diverse generations. He is the Director of McCrindle Research, whose clients include over 100 multinational organisations. His highly valued research and reports have developed his renown as a futurist, demographer and social commentator. Mark is highly regarded for his engaging presentations whether they be keynote addresses at conferences, boardroom briefings or facilitated workshops.
Andy Kirk's Webinar for Tableau (July 2016)Andy Kirk
These are the slides from the talk given by Andy Kirk (@visualisingdata) on a webinar hosted by Tableau Software on 20th July 2016. The title is 'Bringing Method to the Madness' and concerns a demonstration of a data visualisation design workflow.
Digital Transactors vs Digital Integrators: A QuizMark McCrindle
While all the generations use social media, new technologies and mobile devices, the age at which we are first exposed to the technologies determines how embedded they become in our lifestyles. That is why Baby Boomers and Gen Xers are Digital Transactors, using the new technology tools efficiently, structurally and productively. However Generations Y and Z are more likely Digital Integrators, seamlessly integrating the technology into their communications, behaviours and world. This quick quiz will test which of these digital types you are. See how you go in this A-Z activity with the goal to identify each of these digital platform logos just by looking at the first letter of them. Check your score at the end and see which generation you best fit!
Learning the Craft of Data VisualisationAmit Kapoor
Data visualisation is a cross disciplinary activity that requires us to operate at the intersection of a visual designer, data scientist and storyteller. It is both a science and an art.
So how does one realistically learn these multitude of skills needed to get good at the craft of data visualisation. In this slide deck, I discuss ideas about the possible path that a beginner could adopt to learn this craft through sustained practice.
This presentation was for Social Media Week Berlin on Tuesday, 24th September. It was targeted at NGOs, NPOs, activist organisations and charities who have important key messages to share with the community. The event will combine elements of a presentation and workshop. We will examine case studies of campaigns that have successfully used data visualisation in tandem with social media and content marketing techniques to spread information and ideas, and to counteract prevailing myths about climate change and renewable energy technology. We will then allow time for participants to split up into small working groups. Structured discussion tasks and group feedback will allow participants to investigate how these strategies can apply to their own organisation or issue. Participants will learn practical steps for identifying important messages, researching and developing content, incorporating data visualisation in a powerful and meaningful way, and promoting their data visualisation campaigns through social media and email outreach. In particular, the event will focus on developing powerful stories that will attract the support of influential sharers and thought leaders from a range of backgrounds, from activism through to industry, so as to maximise the campaign's reach and impact.
Data cleansing is the process of fixing wrong, corrupted, incorrectly formatted and duplicate files in the database to improve the functioning of your firm. Data cleansing services help in removing wrong data from the database of your firm in a professional manner. There are several tools that can simplify this process.
Data Science Provenance: From Drug Discovery to Fake FansJameel Syed
Knowledge work adds value to raw data; how this activity is performed is critical for how reliably results can be reproduced and scrutinized. With a brief diversion into epistemology, the presentation will outline the challenges for practitioners and consumers of Big Data analysis, and demonstrate how these were tackled at Inforsense (life sciences workflow analytics platform) and Musicmetric (social media analytics for music).
The talk covers the following issues with concrete examples:
- Representations of provenance
- Considerations to allow analysis computation to be recreated
- Reliable collection of noisy data from the internet
- Archiving of data and accommodating retrospective changes
- Using linked data to direct Big Data analytics
Data 2.0 - Harnessing New Data Visualization Tools CIL 2008Darlene Fichter
What happens when you combine data and Web 2.0?
Mountains of data are piling up in libraries and corporate intranets, but how do we say it visually and make data more useful? Fichter looks at new online tools that enable sharing, analysis, and data visualization to show trends, associations and new insights on your library website that quickly engage, inform, and empower your customers to create their own visualizations. Tools such as Many Eyes go beyond the data visualization and tap into the social dimension encouraging sharing, collaboration, and reuse. Fichter talks about what happens when data goes “social.” From visual dictionaries, textual analysis of works of fiction, to star size and nutritional components of Big Macs, hop on board for this fast-paced look at an amazing set of tools that are at your fingertips.
Overview of tools for data analysis and visualisation (2021)Marié Roux
This presentation gives a summary of important tools for data analysis and visualisation, for example to clean your data, do statistical analysis, visualisation application and programmes, qualitative analysis, GIS, temporal analysis, network analysis, etc.
Presentation by Ivan Schotsmans (DV Community) at the Data Vault Modelling an...Patrick Van Renterghem
The start of GDPR implementations in Europe was, for most organizations, also the start of rethinking their Data Warehouse strategy. The experience of past implementations gave a better view on the do's and don'ts. One of the important lessons learned was the approach of handling information quality. It's not something you handle on top of your data warehouse. To be successful, information quality goes hand in hand with your data warehouse implementation.
Preconference Overview of data visualisation and technologyJen Stirrup
In this module, we will look at an overview of theory and scientific evidence about data visualisation. Understanding the ‘why’ can help to make us better at the ‘how’, regardless of the technology.We will also look at an overview of the Power BI suite of tools.
From Data to Visualization: Emerging Tools for Research / Jan JohanssonPVC.ASIST
Traditionally, data analysis has required extensive expertise in math and statistics combined with training in complex software applications like SAS, Matlab, and the R programming language. Recently, however, new technologies and services have emerged that make it possible to work with raw data using web applications with simple visual interfaces. These visually compelling tools allow researchers to quickly see and communicate relationships between diverse trends, like the correlation between weather and burglary.This session will include a brief review of the visualizations built into traditional statistical software packages like R and Matlab followed by a demonstration of three new web-based tools applied to three real research topics.
Similar to See the forest AND the trees: Free tools for data visualisation (20)
Bots are automated pieces of software that carry out a set of operations with some degree of independence, often with behaviour mimicking a person. They’re becoming increasingly sophisticated and have been adopted for a range of purposes within the museum sector such as sharing random collection items and helping to answer visitors’ questions.
Serendipity and readability - Building an engaging online collection site wit...Paul Rowe
The Sarjeant Gallery's online collection site makes careful use of staff time and computer vision tools to provide multiple ways for visitors to explore their collection. The slides are a joint presentation from Paul Rowe and Jennifer Taylor Moore.
R2-D2 analysed your collection images and here's what he foundPaul Rowe
Techniques for the automated review of images continue to improve. Analysis of colours and patterns within your collection images can provide new connections between collection items. Artificial intelligence can work out the subject of an image with increasing accuracy, sometimes rivalling the descriptions created manually by cataloguers. This presentation looks at some of the emerging tools in this field and how they might be applied to your image collections.
How to cope when you open your digital doorsPaul Rowe
Museums are striving to share information about their collections online. What impact does this have on their organisations and how can they cope?
This presentation is the result of a survey of 16 different online collections where the organisations have chosen to publish the majority of their records online.
Using eHive to catalogue and share collection informationPaul Rowe
eHive is a web-based system to manage cataloguing and access to collection information. It is particularly suited to small museums and consortiums working together. It provides easy options for converting legacy data and for updating the content from anywhere with an Internet connection. You can share collection records on custom WordPress websites or with larger projects such as the Australian National Library's Trove website.
C.O.P.E. is a concept coined within National Public Radio as part of their new digital content strategy. How could this concept be applied to museum collection to promote better reuse of collection content. What problems are there with publishing museum collection content to many locations?
What do we want from online collections? - UK Museums on the Web 2012Paul Rowe
Keynote presentation from the annual UK Museums on the Web conference - Nov, 2012.
What can museums achieve by putting their collections online? What channels can the content be distributed on? How can they encourage sharing and engagement with their content? How can online collections provide more transparency around museum operations? How can the success of collections online be mesasured?
Places play a key part in the stories attached to a museum and its collections.
How are museums linking their collections to the outside world? Recent examples include Christchurch Art Gallery’s project to crowd-source subject geo-locations for their artworks, NZMuseums’ new Map View for viewing museum locations and Balboa Park’s mobile adventure game.
What could your museum be doing to connect to places and why is this important?
One year on, this presentation provides an overview of how the New Zealand Museums website has evolved. New museums continue to join NZMuseums and new collection records and images are published on the site daily. The demonstration will cover how museums can use the NZMuseums website, changes made to the web site over the last year (particularly the integration with public events from NZLive.com), and further development plans.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
UiPath Test Automation using UiPath Test Suite series, part 3DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 3. In this session, we will cover desktop automation along with UI automation.
Topics covered:
UI automation Introduction,
UI automation Sample
Desktop automation flow
Pradeep Chinnala, Senior Consultant Automation Developer @WonderBotz and UiPath MVP
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
Accelerate your Kubernetes clusters with Varnish CachingThijs Feryn
A presentation about the usage and availability of Varnish on Kubernetes. This talk explores the capabilities of Varnish caching and shows how to use the Varnish Helm chart to deploy it to Kubernetes.
This presentation was delivered at K8SUG Singapore. See https://feryn.eu/presentations/accelerate-your-kubernetes-clusters-with-varnish-caching-k8sug-singapore-28-2024 for more details.
Transcript: Selling digital books in 2024: Insights from industry leaders - T...BookNet Canada
The publishing industry has been selling digital audiobooks and ebooks for over a decade and has found its groove. What’s changed? What has stayed the same? Where do we go from here? Join a group of leading sales peers from across the industry for a conversation about the lessons learned since the popularization of digital books, best practices, digital book supply chain management, and more.
Link to video recording: https://bnctechforum.ca/sessions/selling-digital-books-in-2024-insights-from-industry-leaders/
Presented by BookNet Canada on May 28, 2024, with support from the Department of Canadian Heritage.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...UiPathCommunity
💥 Speed, accuracy, and scaling – discover the superpowers of GenAI in action with UiPath Document Understanding and Communications Mining™:
See how to accelerate model training and optimize model performance with active learning
Learn about the latest enhancements to out-of-the-box document processing – with little to no training required
Get an exclusive demo of the new family of UiPath LLMs – GenAI models specialized for processing different types of documents and messages
This is a hands-on session specifically designed for automation developers and AI enthusiasts seeking to enhance their knowledge in leveraging the latest intelligent document processing capabilities offered by UiPath.
Speakers:
👨🏫 Andras Palfi, Senior Product Manager, UiPath
👩🏫 Lenka Dulovicova, Product Program Manager, UiPath
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
See the forest AND the trees: Free tools for data visualisation
1. See the Forest AND the Trees
Free Data Visualisation Tools
Paul Rowe, @armchair_caver
National Digital Forum, Nov 2016
2. ““It doesn’t look like anything to me”It doesn’t look like anything to me”
Helping you spot patterns in big data setsHelping you spot patterns in big data sets
3. Use of Data VisualisationUse of Data Visualisation
Software: Typically 3 key stepsSoftware: Typically 3 key steps
1.1.Export to standard format.Export to standard format.
2.2.Clean up dataClean up data
3.3.Import/view in a visualisationImport/view in a visualisation
tooltool
5. Google’s Open Refine has powerfulGoogle’s Open Refine has powerful
tools for data cleanuptools for data cleanup
http://programminghistorian.org/lessons/cleaning-data-with-openrefine
6. Creating layers with Google Maps
https://support.google.com/mymaps/answer/3024933
7.
8.
9. Atlas of Living Australia: theAtlas of Living Australia: the
power of aggregation sitespower of aggregation sites
10. Google Analytics & Digital NZ’sGoogle Analytics & Digital NZ’s
metrics dashboardmetrics dashboard
23. Remember: Data is your friendRemember: Data is your friend
Want to find out more?Want to find out more?
Excel Pivot Tables:Excel Pivot Tables:
https://support.office.com/en-us/article/Create-a-PivotTable
Tableau Public:Tableau Public: https://public.tableau.comhttps://public.tableau.com
IBM’s Watson Analytics:IBM’s Watson Analytics:
http://www.ibm.com/analytics/watson-analytics/http://www.ibm.com/analytics/watson-analytics/
Wordle:Wordle: http://www.wordle.net
www.slideshare.net/PaulRowewww.slideshare.net/PaulRowe
Editor's Notes
Kia ora. I’m Paul Rowe, from Vernon Systems. We often work with large sets of collections data and we’ve been experimenting with different ways to view that data.
This is one example of a visualisation, but not a particularly useful one! This is a list of all the artworks acquired by Art Gallery of New South Wales in the 1990s. Today I’ll look at free software to view sets of data like this in graphical form.
There are three typical steps. Get your data in a common format, tidy it up and then start viewing in the visualisation software.
Excel is one tool that can be used at the cleanup stage. Here I’ve used a couple of Excel features to split the data in one text column and to extract just the year from the Acquisition Date column.
Google’s Open Refine is another option for data cleanup. The Programming Historian has written up an excellent summary of using it with collections data.
If you have map references for collection items then Google Maps provide options for viewing an interactive map of the data. Google Maps can manage multiple map layers and these can be kept private or shared with others.
Viewing the data on a map can help you see the trees, or in this case – a kiwi collected on its annual migration to the breeding grounds in the North Pacific Ocean.
That doesn’t seem quite right, so looking at the source data more closely it’s simply a case of the latitude being recorded as a positive, or northern hemisphere, coordinate. We can update the record and put the kiwi back in Stewart Island.
Contributing your data to wider projects can give you access to visualisation tools that others already built. Here we can the forest of Queensland Museum’s ¾ million published specimen records on the Atlas of Living Australia and we can instantly see where they were collected from.
Google Analytics provides options for visualising data about web pages. Digital NZ has also developed a dashboard which gives stats about the records you’ve supplied, providing another good reason for contributing to these higher level projects.
So you now have tidy data in a standard format. Microsoft Excel provides options for visualising the data using the Pivot Tables & Charts features. We could compare the media of the artworks against the historic period when they were made.
Here I have drilled down into the chart for Chinese ceramics and can see that the Gallery’s collection concentrates on a few periods, including the Ming Dynasty.
There are many data analysis services coming onto the market. IBM’s Watson Analytics is one example.
After importing your collection data Watson Analytics indicates the quality of each data column. Here the Acquisition Year is the best quality data. It can be understood as dates and is available for all records.
Watson has a clever feature where it suggests questions that it might be able to answer based on your data, or alternatively you can construct your own questions.
This is the graph for the question ‘How does the number of rows compare with the acquisition year’. We can immediately see 1990 was a particularly busy year.
We could break it down further, colour coding by departments within the Gallery. Now we can see that the majority of the 1990 items were in the Australian Art Department.
Watson provides a wide range of options for viewing the information. Here we see the proportion of the collection for each media category and it’s much easier to spot the most popular media.
You can also build your own layouts to present multiple charts and graphs.
These can be downloaded in several formats and could be used in documents like annual reports. They can also be combined to build up an interactive dashboard. Unfortunately, multi-user access to the dashboards is only possible with the paid Watson Analytics Pro version.
There are some really simple tools. The Wordle site lets you copy in a block of text to find the most popular words. This example plots the words from the mission statements of a set of museums.
This last example uses Tableau Public. We can see trends over time, in this case George Harrison’s song writing contributions increasing over the series of Beatles albums. Tableau is one of the more complex products but it can produce beautiful views of the data.
That’s a lightning overview of some options for data visualisation. Thank you.