Jose Luis Fernandez-Marquez (UNIGE) presenting CCL tracker framework at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
NGIPS Market Trends and Group Test ResultsNSS Labs
Next Generation Intrusion Prevention Systems are designed to protect against a new generation of threats that move faster and are more evasive than ever. Join NSS Labs as we explain NGIPS market trends, group test results, and the critical features to consider when purchasing a NGIPS.
Tina Phillips (Cornell Lab of Ornithology) - the DEVISE projectCitizenCyberlab
Tina Phillips (Cornell Lab of Ornithology) presenting the DEVISE project, and learning in citizen science research at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
How Celtra Optimizes its Advertising Platformwith DatabricksGrega Kespret
Leading brands such as Pepsi and Macy’s use Celtra’s technology platform for brand advertising. To inform better product design and resolve issues faster, Celtra relies on Databricks to gather insights from large-scale, diverse, and complex raw event data. Learn how Celtra uses Databricks to simplify their Spark deployment, achieve faster project turnaround time, and empower people to make data-driven decisions.
In this webinar, you will learn how Databricks helps Celtra to:
- Utilize Apache Spark to power their production analytics pipeline.
- Build a “Just-in-Time” data warehouse to analyze diverse data sources such as Elastic Load Balancer access logs, raw tracking events, operational data, and reportable metrics.
- Go beyond simple counting and group events into sequences (i.e., sessionization) and perform more complex analysis such as funnel analytics.
NGIPS Market Trends and Group Test ResultsNSS Labs
Next Generation Intrusion Prevention Systems are designed to protect against a new generation of threats that move faster and are more evasive than ever. Join NSS Labs as we explain NGIPS market trends, group test results, and the critical features to consider when purchasing a NGIPS.
Tina Phillips (Cornell Lab of Ornithology) - the DEVISE projectCitizenCyberlab
Tina Phillips (Cornell Lab of Ornithology) presenting the DEVISE project, and learning in citizen science research at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
How Celtra Optimizes its Advertising Platformwith DatabricksGrega Kespret
Leading brands such as Pepsi and Macy’s use Celtra’s technology platform for brand advertising. To inform better product design and resolve issues faster, Celtra relies on Databricks to gather insights from large-scale, diverse, and complex raw event data. Learn how Celtra uses Databricks to simplify their Spark deployment, achieve faster project turnaround time, and empower people to make data-driven decisions.
In this webinar, you will learn how Databricks helps Celtra to:
- Utilize Apache Spark to power their production analytics pipeline.
- Build a “Just-in-Time” data warehouse to analyze diverse data sources such as Elastic Load Balancer access logs, raw tracking events, operational data, and reportable metrics.
- Go beyond simple counting and group events into sequences (i.e., sessionization) and perform more complex analysis such as funnel analytics.
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...Precisely
Enterprises with mainframes and Cloud/server architectures face unique issues and challenges and if your enterprise delivers a service whose operation spans mainframe and distributed and/or Cloud infrastructures (e.g. a mobile banking/customer app), this webinar is for you.
See how you can gain unique business and service-relevant context using your own machine data, including that from your z/OS mainframe. Implicitly learn patterns, eliminate costly false alerts, identify anomalies, and baseline normal operations by employing advanced analytics driven by machine learning. You’ll also see and learn about:
• Accelerating root-cause analysis and getting ahead of customer-impacting outages and slow-downs for your service
• “Glass Table” view for clickable visualization of the entire service-relevant infrastructure
• Machine Learning in IT Service Intelligence
• The Machine Learning Toolkit available today
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
There was a time when the Enterprise Data Warehouse (EDW) was the only way to provide a 360-degree analytical view of the business. In recent years many organizations have deployed disparate analytics alternatives to the EDW, including: cloud data warehouses, machine learning frameworks, graph databases, geospatial tools, and other technologies. Often these new deployments have resulted in the creation of analytical silos that are too complex to integrate, seriously limiting global insights and innovation.
Join guest speaker, 451 Research’s Jim Curtis and Pivotal’s Jacque Istok for an interactive discussion about some of the overarching trends affecting the data warehousing market, as well as how to build a next generation data platform to accelerate business innovation. During this webinar you will learn:
- The significance of a multi-cloud, infrastructure-agnostic analytics
- What is working and what isn’t, when it comes to analytics integration
- The importance of seamlessly integrating all your analytics in one platform
- How to innovate faster, taking advantage of open source and agile software
Speakers: James Curtis, Senior Analyst, Data Platforms & Analytics, 451 Research & Jacque Istok, Head of Data, Pivotal
There are only TWO substantial phases in a life of a digital service: either they are being built or they are being optimized. How to improve your UX with digital analytics?
Big Data Ecosystem for Data-Driven Decision MakingAbzetdin Adamov
The extremely fast grow of Internet Services, Web and Mobile Applications and advance of the related Pervasive, Ubiquity and Cloud Computing concepts have stumulated production of tremendous amounts of data partially available online (call metadata, texts, emails, social media updates, photos, videos, location, etc.). Even with the power of today’s modern computers it still big challenge for business and government organizations to manage, search, analyze, and visualize this vast amount of data as information. Data-Intensive computing which is intended to address this problems become quite intense during the last few years yielding strong results. Data intensive computing framework is a complex system which includes hardware, software, communications, and Distributed File System (DFS) architecture.
Just small part of this huge amount is structured (Databases, XML, logs) or semistructured (web pages, email), over 90% of this information is unstructured, what means data does not have predefined structure and model. Generally, unstructured data is useless unless applying data mining and analysis techniques. At the same time, just in case if you can process and understand your data, this data worth anything, otherwise it becomes useless.
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
Sensor devices and their associated data streams are rapidly becoming a big source of differentiation for organizations that can effectively harness this information to drive new insights and take action. The breakthrough is enabled by new solutions for applying visual data discovery to streaming data in motion. This session will focus on industrial analytics and how best to apply new technologies that drive synergies between IT and OT.
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...BigData_Europe
Presentation by Martin Kaltenböck, Semantic Web Company, at the first workshop of Societal Challlenge 6 in the BigDataEurope project, taking place in Luxembourg on 18 November 2015.
http://www.big-data-europe.eu/social-sciences/
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
Building a Real-Time Security Application Using Log Data and Machine Learning- Karthik Aaravabhoomi
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
We explain the history of our agile organization with a focus on the latest round of evolution of our Product and Engineering organization, moving from business-oriented feature teams to mission teams.
Before vs After: Redesigning a Website to be Useful and Informative for Devel...Teresa Giacomini
There are so many fun challenges in creating a useful website for a developer audience today: you’ve got to empathize with your audience, nail the voice, understand the “jobs” your site’s visitors are trying to accomplish, make sure you anticipate (and answer!) the questions people are likely to have. In this quick lightning talk, I’ll share some before vs. after pics of a recent Citus Data site redesign—and will share some of the best practices we used, based on my years as a developer, software engineering manager, product manager, and now, as a marketer.
Michael Weber - Rechenkraft.net - From Volunteers to ScientistsCitizenCyberlab
Michael Weber presenting Rechenkraft.net - From Volunteers to Scientists, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Brainstorming sessions at Citizen Cyber SummitCitizenCyberlab
Brainstorming sessions at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE). EpiCollect, CitizenGrid, Virtual Atom Smasher (VAS), GeoTag-X.
More Related Content
Similar to Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker
Machine Learning & IT Service Intelligence for the Enterprise: The Future is ...Precisely
Enterprises with mainframes and Cloud/server architectures face unique issues and challenges and if your enterprise delivers a service whose operation spans mainframe and distributed and/or Cloud infrastructures (e.g. a mobile banking/customer app), this webinar is for you.
See how you can gain unique business and service-relevant context using your own machine data, including that from your z/OS mainframe. Implicitly learn patterns, eliminate costly false alerts, identify anomalies, and baseline normal operations by employing advanced analytics driven by machine learning. You’ll also see and learn about:
• Accelerating root-cause analysis and getting ahead of customer-impacting outages and slow-downs for your service
• “Glass Table” view for clickable visualization of the entire service-relevant infrastructure
• Machine Learning in IT Service Intelligence
• The Machine Learning Toolkit available today
Analytical Innovation: How to Build the Next Generation Data PlatformVMware Tanzu
There was a time when the Enterprise Data Warehouse (EDW) was the only way to provide a 360-degree analytical view of the business. In recent years many organizations have deployed disparate analytics alternatives to the EDW, including: cloud data warehouses, machine learning frameworks, graph databases, geospatial tools, and other technologies. Often these new deployments have resulted in the creation of analytical silos that are too complex to integrate, seriously limiting global insights and innovation.
Join guest speaker, 451 Research’s Jim Curtis and Pivotal’s Jacque Istok for an interactive discussion about some of the overarching trends affecting the data warehousing market, as well as how to build a next generation data platform to accelerate business innovation. During this webinar you will learn:
- The significance of a multi-cloud, infrastructure-agnostic analytics
- What is working and what isn’t, when it comes to analytics integration
- The importance of seamlessly integrating all your analytics in one platform
- How to innovate faster, taking advantage of open source and agile software
Speakers: James Curtis, Senior Analyst, Data Platforms & Analytics, 451 Research & Jacque Istok, Head of Data, Pivotal
There are only TWO substantial phases in a life of a digital service: either they are being built or they are being optimized. How to improve your UX with digital analytics?
Big Data Ecosystem for Data-Driven Decision MakingAbzetdin Adamov
The extremely fast grow of Internet Services, Web and Mobile Applications and advance of the related Pervasive, Ubiquity and Cloud Computing concepts have stumulated production of tremendous amounts of data partially available online (call metadata, texts, emails, social media updates, photos, videos, location, etc.). Even with the power of today’s modern computers it still big challenge for business and government organizations to manage, search, analyze, and visualize this vast amount of data as information. Data-Intensive computing which is intended to address this problems become quite intense during the last few years yielding strong results. Data intensive computing framework is a complex system which includes hardware, software, communications, and Distributed File System (DFS) architecture.
Just small part of this huge amount is structured (Databases, XML, logs) or semistructured (web pages, email), over 90% of this information is unstructured, what means data does not have predefined structure and model. Generally, unstructured data is useless unless applying data mining and analysis techniques. At the same time, just in case if you can process and understand your data, this data worth anything, otherwise it becomes useless.
Streaming and Visual Data Discovery for the Internet of ThingsDatawatchCorporation
Sensor devices and their associated data streams are rapidly becoming a big source of differentiation for organizations that can effectively harness this information to drive new insights and take action. The breakthrough is enabled by new solutions for applying visual data discovery to streaming data in motion. This session will focus on industrial analytics and how best to apply new technologies that drive synergies between IT and OT.
SC6 Workshop 1: Big Data Europe platform requirements and draft architecture:...BigData_Europe
Presentation by Martin Kaltenböck, Semantic Web Company, at the first workshop of Societal Challlenge 6 in the BigDataEurope project, taking place in Luxembourg on 18 November 2015.
http://www.big-data-europe.eu/social-sciences/
Building a Real-Time Security Application Using Log Data and Machine Learning...Sri Ambati
Building a Real-Time Security Application Using Log Data and Machine Learning- Karthik Aaravabhoomi
- Powered by the open source machine learning software H2O.ai. Contributors welcome at: https://github.com/h2oai
- To view videos on H2O open source machine learning software, go to: https://www.youtube.com/user/0xdata
We explain the history of our agile organization with a focus on the latest round of evolution of our Product and Engineering organization, moving from business-oriented feature teams to mission teams.
Before vs After: Redesigning a Website to be Useful and Informative for Devel...Teresa Giacomini
There are so many fun challenges in creating a useful website for a developer audience today: you’ve got to empathize with your audience, nail the voice, understand the “jobs” your site’s visitors are trying to accomplish, make sure you anticipate (and answer!) the questions people are likely to have. In this quick lightning talk, I’ll share some before vs. after pics of a recent Citus Data site redesign—and will share some of the best practices we used, based on my years as a developer, software engineering manager, product manager, and now, as a marketer.
Similar to Jose Luis Fernandez-Marquez (UNIGE) - CCL tracker (20)
Michael Weber - Rechenkraft.net - From Volunteers to ScientistsCitizenCyberlab
Michael Weber presenting Rechenkraft.net - From Volunteers to Scientists, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Brainstorming sessions at Citizen Cyber SummitCitizenCyberlab
Brainstorming sessions at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE). EpiCollect, CitizenGrid, Virtual Atom Smasher (VAS), GeoTag-X.
John Darlington (Imperial College London) - CitizenGridCitizenCyberlab
John Darlington (Imperial College London) presenting a user-friendly crowdsourcing platform, CitizenGrid, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Peter Hanappe (Sony Computer Science Lab, Paris) - Agroecology as citizen sci...CitizenCyberlab
Peter Hanappe (Sony Computer Science Lab, Paris) presenting Agroecology as citizen science at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
En-Te-Hwu (Academia Sinica, Taiwan) presenting DIY Atomic Force Microscope & Lifelong learning at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Eleanor Rusack (UNITAR) presenting GeoTag-X, a crowdsourcing application for humanitarian crises, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Charlene Jennet (UCL) - Creativity in citizen scienceCitizenCyberlab
Charlene Jennet (UCL) presenting findings about creativity in citizen science projects, within and beyond Citizen Cyberlab, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Bruno Strasser (UNIGE) - Bioscope & History of Citizen ScienceCitizenCyberlab
Bruno Strasser (UNIGE) presenting Bioscope (a bio-research based outreach project to the public) & History of Citizen Science at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
David Aanensen (Imperial College London) - Epicollect (demo)CitizenCyberlab
David Aanensen (Imperial College London) presenting a demo of Epicollect at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Juan Hindo (IBM) presented the research about learning & engagement of volunteers at World Community Grid, at the Citizen Cyberlab Summit, 17-18 September 2015, University of Geneva (UNIGE).
Opendatabay - Open Data Marketplace.pptxOpendatabay
Opendatabay.com unlocks the power of data for everyone. Open Data Marketplace fosters a collaborative hub for data enthusiasts to explore, share, and contribute to a vast collection of datasets.
First ever open hub for data enthusiasts to collaborate and innovate. A platform to explore, share, and contribute to a vast collection of datasets. Through robust quality control and innovative technologies like blockchain verification, opendatabay ensures the authenticity and reliability of datasets, empowering users to make data-driven decisions with confidence. Leverage cutting-edge AI technologies to enhance the data exploration, analysis, and discovery experience.
From intelligent search and recommendations to automated data productisation and quotation, Opendatabay AI-driven features streamline the data workflow. Finding the data you need shouldn't be a complex. Opendatabay simplifies the data acquisition process with an intuitive interface and robust search tools. Effortlessly explore, discover, and access the data you need, allowing you to focus on extracting valuable insights. Opendatabay breaks new ground with a dedicated, AI-generated, synthetic datasets.
Leverage these privacy-preserving datasets for training and testing AI models without compromising sensitive information. Opendatabay prioritizes transparency by providing detailed metadata, provenance information, and usage guidelines for each dataset, ensuring users have a comprehensive understanding of the data they're working with. By leveraging a powerful combination of distributed ledger technology and rigorous third-party audits Opendatabay ensures the authenticity and reliability of every dataset. Security is at the core of Opendatabay. Marketplace implements stringent security measures, including encryption, access controls, and regular vulnerability assessments, to safeguard your data and protect your privacy.
Explore our comprehensive data analysis project presentation on predicting product ad campaign performance. Learn how data-driven insights can optimize your marketing strategies and enhance campaign effectiveness. Perfect for professionals and students looking to understand the power of data analysis in advertising. for more details visit: https://bostoninstituteofanalytics.org/data-science-and-artificial-intelligence/
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
1. CCLTracker
Framework
Monitoring
users
learning
and
ac7vity
in
web
based
ci7zen
science
projects
Jose
Luis
Fernandez-‐Marquez
Joseluis.fernandez@unige.ch
3. Mo7va7on
Bounce
rate,
pages
views,
avg.
7me
per
session,
etc…
might
not
be
relevant
to
measure
user
engagement,
or
relevant
in
our
ci7zen
science
project.
11. Segmenta7on,
crossing
data
and
adding
filters
• Give
me
previous
stats
focusing
on
visitors
coming
from
reddit.
• Give
me
previous
stats
focusing
on
female
visitors,
between
18
and
24
years,
linux
users,
living
in
Switzerland.
• Infinite
number
of
possible
combina7ons.
12. Segmenta7on,
crossing
data
and
adding
filters
• Top
10
countries
by
Female
users.
• Top
10
referrals
(reddit,
facebook,
…)
gathering
female
users.
• Top
10
referrals
gathering
visitors
who
immediately
run
away
from
the
site.
14. Bounce
rate,
pages
views,
and
avg.
4me
per
session
might
not
be
relevant
for
a
ci7zen
science
project.
– We
need
to
know
the
ac7ons
performed
in
the
site
to
measure
par7cipants
contribu7on.
– We
need
to
be
able
to
make
public
analy7c
data.
– We
need
to
be
able
to
create
advance
data
aggrega7on.
I.e.
clustering
analysis,
advance
engagement
func7ons.
15. CCLTracker
framework
CCLTracker
JS Library
Google
Analytics
Google Tag
Manager
RGA Library
R
Monitoring Storing, aggregating,
reporting
Advance Aggregation
and reporting
Google
Super Proxy
16. CCLTracker
events
-‐ Is
the
user
scrolling
down
on
the
web
site
(0%,
25%,50%,75%100%)
-‐
is
the
user
clicking
new
projects,
about,
forum,
etc?
19. Segmenta7on,
crossing
data
and
filtering
• Stats
only
for
par7cipants
who
has
properly
installed
the
CERN
VM.
(segmenta7on)
• Comparing
two
segments
(e.g.
all
sessions
vs
sessions
running
the
virtual
machine.
26. Advance
Aggrega7on
Visitors
per
age
group
Visitors
successful
running
VM
per
age
group
0
10
20
30
40
50
60
18-‐24
25-‐34
35-‐44
45-‐54
55-‐64
>65
%
of
sessions
Users'
age
User
engagement
per
age
group
29. Why
do
we
want
all
this
data?
• Increase
the
number
of
par7cipants
• Increase
par7cipants’
engagement.
• Improve
the
navigability,
and
accessibility
of
the
website.
• Improve
users’
learning
experience.
Are
users
improving
the
quality
of
their
contribu7ons?
30. What
did
we
learn
using
CCLTracker?
• Transla7on
to
different
languages
is
important
to
reach
a
large
audience.
E.g.
Russian
referral.
• Addressing
technology
sec7ons
in
newspapers.
• Web
site
naviga7on
is
not
trivial.
• Low
engagement.
• Low
female
par7cipa7on.
31. 2nd
CERN
Public
Compu7ng
Challenge
• Increase
the
number
of
par7cipants:
– Addressing
female
par7cipa7on.
– Transla7ng
the
website
to
different
languages.
– Pos7ng
the
informa7on
in
data
hubs,
scien7fic
sec7ons
of
newspapers,
etc…
• Segmen7ng
data
by
different
level
of
engagements:
– Visitors
who
do
not
run
the
VM.
– Visitors
who
compute
10
jobs,
20
jobs,
…
n
jobs.
– Visitors
who
compute
for
1
h,
5
h,
…
n
hours.
– Visitors
who
par7cipate
over
the
whole
challenge…
32. “top
10%
of
contributors
responsible
for
almost
80%
of
total
classifica7ons.”
Open
ques7on