SlideShare a Scribd company logo
1 of 28
Jesus-Enrique Aldana-Sigona
October 2017
DYNAMIC USER PROFILING IN OPEN SCIENCE INFRASTRUCTURES
RESEARCH CENTER
Open Digital Innovation
Open science
Open Science
Tools
Open
Reproducible
Research
Doing science supported by scientific digital
infrastructures has changed the way of
doing science
Source: strategyzer.com
Open access
Open Data
Limited access,
data, tools
DIGITAL
SCIENTIFIC
INFRASTRUCTU
RES
Current challenge for the Science 2.0 platforms
Scientific platforms help the users on scientific related activities
Source: The Center for Open Science
Different infrastructures rely their impact and operational dynamics diferrent. In
particular, the creation infrastructures are more sensible to the community creation for
the continuity of the developed creations. The lack of engagement increases the
resistance adoption of Open science practices according to a survey from European
Commission (2015).
Digital Scientific output challenges
Traditional Scientific outputs.
E.g. Journal publications
Digital Scientific outputs. E.g.
scientific-oriented programming code,
repositories, etc.
Yields to…
Source: strategyzer.com
Communities of creation
What is engagement?
Behavioral
Engagementfactors[1]PlatformCommunity
Cognitive
Emotional
Qualitative Quantitative
Typeof
studies
Cognitive
Emotional
Qualitative
Physical
Log behavior on the
platform
Networks have help to understand human
behaviors in the past, why not here?
[1] Fredricks, J., Blumenfeld, P.C., Paris, A.H.: School engagement: potential
of the concept, state of the evidence. Rev. Educ. Res. 74, 59–109 (2004)
(?)
Engagement
Psychology
PhilosophySociology
Computer
Science
Engagement metrics
Lifespan
ACTIVITY RATIO:
LURKING RATIO
Σ + Σ
𝑙𝑖𝑓𝑒 𝑠𝑝𝑎𝑛
Σ
𝑙𝑖𝑓𝑒 𝑠𝑝𝑎𝑛
Blocs
Variation in
periodicity
STD ( Blocks of )
Aims
Identify users’ infrastructure life journeys where the engagement increased
above average infrastructure population
Generate a predictive model for new users where the current behavior
indicates the next possible behavior state in order to optimize the
engagement levels
METHODOLOGY
Profiling using density clustering method
Big Data Implications on profiling user’s Behavior
In addition to density clustering methods for the detection of profiles in continuous data,
we will use the hierarchical form to expose embedded types of behavior
User type A
User type A mixed
with B User type B
?
Where to
cut?
Users living X number of days
Embedded
types
1. Maria Aristeidou, Eileen Scanlon, Mike Sharples, Profiles of engagement in online communities of citizen science
participation, In Computers in Human Behavior, Volume 74, 2017, Pages 246-256, ttps://doi.org/10.1016/j.chb.2017.04.044.
2. Shirkhorshidi A.S., Aghabozorgi S., Wah T.Y., Herawan T. (2014) Big Data Clustering: A Review. In: Murgante B. et al.
(eds) Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol
8583. Springer, Cham
Cluster tendency
validation
Hierarchical
Clustering
Internal
Validation
Pattern
recognition
Network
implications
Classification
Methodology
1 2 3
6 5 4
Preliminary results
In progress
CASE OF STUDY: NANOHUB
Case of Study
Nanohub
> 5700
Resources
> 500
simulation
tools
> 300,000
Tutorial/
Lecture
Users
In 172
countries
> 1.4 M
visitors
> 16, 000
Simulation
Users
NanoHub represents a novel form of virtual collective science production
NanoHub Research
Ecosystem
Students
Tool
developers
Educators
Socio-technical infrastructure
Publication
network
Social & organizational
behavior
Technologies
Research Ecosystem
Open Science
Brunswicker, et al 2016
Nanohub
> 5700
Resources
> 500
simulation
tools
> 300,000
Tutorial/
Lecture
Users
In 172
countries
> 1.4 M
visitors
> 16, 000
Simulation
Users
PRELIMINARY RESULTS
1. Cluster tendency Validation
Validation of clustering tendency using Hopkins statistic
Source: Yerpo, Population distribution
Natural dispersion of among communities’ data
Engagement
behavior dataset*
has presented a
strong clustering
tendency
0.01 0.30 0.50 0.70 1.000.99
2-Ddataview
Dataset: > 40,000 registered users in the range between 2010 - 2016
17
PCA Results
Goal: Identify the impact of each proven
metrics from nanohub dataset.
Metrics Percentage of
Variance explained
Standard deviation of absence days
between logins
Discussion
1. The periodicity var value was not making
sense because that meant that the user
engagement is just defined by the
download behavior.
2. The session Creation value was zero
because the number of users who create
simulation in contrast with the number of
users who download is smaller. Therfor,
the influence of download activity was
based on an unbalance proportion
• Conclusion: The current variables are
not presenting sufficiently significant
based on the unbalance dataset.
TSE representation of populations
User’s life representation
User Behaviors across populations created by lifespanDistribution of users by lifespan
The current high dimensional representation
of the user’s behavior showed non regular
shapes that supports the usage of density
based clustering algorithms
Engagement Behavior has a path
1 2 3
1
2
3
Similarity Matrix Example
populations
Population 2 & 3 tested in Two-sample KS
test and had shown non significant
difference
Younger day population behave
significantly different behavior but
elder populations become similar
Not similar
Strongly
similar
Lifespan populations
MERGING ALL THE POPULATIONS THAT ARE SIMILAR BETWEEN THEMSELVES
Not similar
Strongly
similar
Despite the statistical similarity
A = B =C => A=C FALSE
MERGING MULTIPLE
LIFESPANS
GRAPH BASED SOLUTION:
MAXIMAL CLIQUES
A
B
C
E
D
Contributions
Evidence of non spherical behavioral
engagement
The effects of unbalance data over time for
profiling analysis
Evidence of similarity among high engaged
populations
Next steps
Pattern recognition over time
Day 1 Day 2 Day 4
Snapshot populations
User 1
User 2
User behavior overtime
A
C
B
A
C
B
A
C
B
Network of Shared knowledge interests
A BUsers
Common
interest
represented
by X
resource
Critical behavior for longer
engagement
Thanks
Special thank you to …
Dr. Gerhard
Klimeck
Dr. Michael
Zentner
Dr. Sabine
Brunswicker
Dr. Satyam
Mukherjee
National
Science
Foundation
CONACYT Mexico Government
Babak Ravandi
24
Thank you!
BACKUP SLIDES
Continuous login behavior
Goal: Identify if the habit
concept as a significant
variables for further
analysis.
Q: How many users with
every day logins are
expected to be seen after x
days?
A: 7 is the support value of
max continuous days that
more than 1 user is seen.
Correlation matrix
Correlation matrix of variables of interest
Goal: Select the
variables with more
influence across the
set of variables
Results: Not
conclusive yet
CriteriaClustering t Hierarchical Density-
based
Non-hierarchical
No apriori # of
clusters
YES NO
Convex clusters YES NO
Dif size of the
clusters
YES Some
Scalabe NO YES
Convergance YES NO
Nested clusters YES NO
Unique cluster YES NO
Noise detection YES Algorithm dependant
Parameters
robustness
NO Algorithm dependant
Non parametric Method selection
Method selection
Source: Jain et al 1999
Clustering methods
Non-spherical
clusters
Internal clustering validation
Silhouette Index
Taken from Kenn State DataMining course ‘08

More Related Content

What's hot

Open Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen ScienceOpen Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen ScienceAndrea Wiggins
 
Spreading processes on temporal networks
Spreading processes on temporal networksSpreading processes on temporal networks
Spreading processes on temporal networksPetter Holme
 
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2Carter Craft
 
Effects of Network Structure, Competition and Memory Time on Social Spreading...
Effects of Network Structure, Competition and Memory Time on Social Spreading...Effects of Network Structure, Competition and Memory Time on Social Spreading...
Effects of Network Structure, Competition and Memory Time on Social Spreading...James Gleeson
 
Activating Research Collaboratories with Collaboration Patterns
Activating Research Collaboratories with Collaboration PatternsActivating Research Collaboratories with Collaboration Patterns
Activating Research Collaboratories with Collaboration PatternsCommunitySense
 
Visually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeVisually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeHarish Vaidyanathan
 
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?Todd Suomela
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networksFrancisco Restivo
 
Important spreaders in networks: exact results on small graphs
Important spreaders in networks: exact results on small graphsImportant spreaders in networks: exact results on small graphs
Important spreaders in networks: exact results on small graphsPetter Holme
 
Network Structures For A Better Twitter Community
Network Structures For A Better Twitter CommunityNetwork Structures For A Better Twitter Community
Network Structures For A Better Twitter CommunityWeiai Wayne Xu
 
Predicting opinion leadership on twitter
Predicting opinion leadership on twitter   Predicting opinion leadership on twitter
Predicting opinion leadership on twitter Weiai Wayne Xu
 
Kwon Ph.D. Dissertation 2016
Kwon Ph.D. Dissertation 2016Kwon Ph.D. Dissertation 2016
Kwon Ph.D. Dissertation 2016Karl Kwon, Ph.D.
 
Mobile Sensors in the City
Mobile Sensors in the CityMobile Sensors in the City
Mobile Sensors in the CityNeal Lathia
 
Academic Article you are mentioned in
Academic Article you are mentioned inAcademic Article you are mentioned in
Academic Article you are mentioned inPortia Thomas
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-ResearchEric Meyer
 
Online Communities in Citizen Science & BirdCams
Online Communities in Citizen Science & BirdCamsOnline Communities in Citizen Science & BirdCams
Online Communities in Citizen Science & BirdCamsAndrea Wiggins
 
Complex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma TreComplex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma TreMatteo Moci
 
Big data divided (24 march2014)
Big data divided (24 march2014)Big data divided (24 march2014)
Big data divided (24 march2014)Han Woo PARK
 

What's hot (20)

Open Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen ScienceOpen Source, Open Science, & Citizen Science
Open Source, Open Science, & Citizen Science
 
24 The Evolution of Network Thinking
24 The Evolution of Network Thinking24 The Evolution of Network Thinking
24 The Evolution of Network Thinking
 
Spreading processes on temporal networks
Spreading processes on temporal networksSpreading processes on temporal networks
Spreading processes on temporal networks
 
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2
Living Labs Roundtable / NYC Climate Week 2020/ Part 2 of 2
 
Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013Kno.e.sis Review: late 2012 to mid 2013
Kno.e.sis Review: late 2012 to mid 2013
 
Effects of Network Structure, Competition and Memory Time on Social Spreading...
Effects of Network Structure, Competition and Memory Time on Social Spreading...Effects of Network Structure, Competition and Memory Time on Social Spreading...
Effects of Network Structure, Competition and Memory Time on Social Spreading...
 
Activating Research Collaboratories with Collaboration Patterns
Activating Research Collaboratories with Collaboration PatternsActivating Research Collaboratories with Collaboration Patterns
Activating Research Collaboratories with Collaboration Patterns
 
Visually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of LifeVisually Exploring Social Participation in Encyclopedia of Life
Visually Exploring Social Participation in Encyclopedia of Life
 
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?
DigiCCurr 2013 PhD Workshop - Citizen Science and Data Curation: Who needs what?
 
Community detection in social networks
Community detection in social networksCommunity detection in social networks
Community detection in social networks
 
Important spreaders in networks: exact results on small graphs
Important spreaders in networks: exact results on small graphsImportant spreaders in networks: exact results on small graphs
Important spreaders in networks: exact results on small graphs
 
Network Structures For A Better Twitter Community
Network Structures For A Better Twitter CommunityNetwork Structures For A Better Twitter Community
Network Structures For A Better Twitter Community
 
Predicting opinion leadership on twitter
Predicting opinion leadership on twitter   Predicting opinion leadership on twitter
Predicting opinion leadership on twitter
 
Kwon Ph.D. Dissertation 2016
Kwon Ph.D. Dissertation 2016Kwon Ph.D. Dissertation 2016
Kwon Ph.D. Dissertation 2016
 
Mobile Sensors in the City
Mobile Sensors in the CityMobile Sensors in the City
Mobile Sensors in the City
 
Academic Article you are mentioned in
Academic Article you are mentioned inAcademic Article you are mentioned in
Academic Article you are mentioned in
 
The End(s) of e-Research
The End(s) of e-ResearchThe End(s) of e-Research
The End(s) of e-Research
 
Online Communities in Citizen Science & BirdCams
Online Communities in Citizen Science & BirdCamsOnline Communities in Citizen Science & BirdCams
Online Communities in Citizen Science & BirdCams
 
Complex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma TreComplex Networks Analysis @ Universita Roma Tre
Complex Networks Analysis @ Universita Roma Tre
 
Big data divided (24 march2014)
Big data divided (24 march2014)Big data divided (24 march2014)
Big data divided (24 march2014)
 

Similar to Enrique RCODI presentation symposium 2017

Towards Collaboration Translucence: Giving Meaning to Multimodal Group Data
Towards Collaboration Translucence: Giving Meaning to Multimodal Group DataTowards Collaboration Translucence: Giving Meaning to Multimodal Group Data
Towards Collaboration Translucence: Giving Meaning to Multimodal Group DataSimon Buckingham Shum
 
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...Ramine Tinati
 
Online Communities in Citizen Science
Online Communities in Citizen ScienceOnline Communities in Citizen Science
Online Communities in Citizen ScienceAndrea Wiggins
 
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederOII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederEric Meyer
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of DataPaul Groth
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksSymeon Papadopoulos
 
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Micah Altman
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBryan Heidorn
 
British columbia september2014_final
British columbia september2014_finalBritish columbia september2014_final
British columbia september2014_finalcarolbumd
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeLizLyon
 
Towards a systematic framework for analysis of tools and approaches to integr...
Towards a systematic framework for analysis of tools and approaches to integr...Towards a systematic framework for analysis of tools and approaches to integr...
Towards a systematic framework for analysis of tools and approaches to integr...Aberdeen CES
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 
Social media as a tool for researchers
Social media as a tool for researchersSocial media as a tool for researchers
Social media as a tool for researchersJari Laru
 
Media X at Stanford University - Description
Media X at Stanford University - DescriptionMedia X at Stanford University - Description
Media X at Stanford University - DescriptionMartha Russell
 
WEBINAR: Joining the "buzz": the role of social media in raising research vi...
WEBINAR:  Joining the "buzz": the role of social media in raising research vi...WEBINAR:  Joining the "buzz": the role of social media in raising research vi...
WEBINAR: Joining the "buzz": the role of social media in raising research vi...HELIGLIASA
 
Research-Open Access-Social Media: a winning combination
Research-Open Access-Social Media: a winning combinationResearch-Open Access-Social Media: a winning combination
Research-Open Access-Social Media: a winning combinationRhodes University Library
 
Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”Katrien Verbert
 
The role of learning in citizen science
The role of learning in citizen scienceThe role of learning in citizen science
The role of learning in citizen scienceMuki Haklay
 
Social Machines of Scholarly Collaboration
Social Machines of Scholarly CollaborationSocial Machines of Scholarly Collaboration
Social Machines of Scholarly CollaborationDavid De Roure
 

Similar to Enrique RCODI presentation symposium 2017 (20)

Towards Collaboration Translucence: Giving Meaning to Multimodal Group Data
Towards Collaboration Translucence: Giving Meaning to Multimodal Group DataTowards Collaboration Translucence: Giving Meaning to Multimodal Group Data
Towards Collaboration Translucence: Giving Meaning to Multimodal Group Data
 
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
WSI Stimulus Project: Centre for longitudinal studies of online citizen parti...
 
Online Communities in Citizen Science
Online Communities in Citizen ScienceOnline Communities in Citizen Science
Online Communities in Citizen Science
 
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & SchroederOII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
OII Summer Doctoral Programme 2010: Global brain by Meyer & Schroeder
 
Thinking About the Making of Data
Thinking About the Making of DataThinking About the Making of Data
Thinking About the Making of Data
 
Learning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction NetworksLearning to Classify Users in Online Interaction Networks
Learning to Classify Users in Online Interaction Networks
 
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
Brown Bag: New Models of Scholarly Communication for Digital Scholarship, by ...
 
Biodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary ChallengeBiodiversity Informatics: An Interdisciplinary Challenge
Biodiversity Informatics: An Interdisciplinary Challenge
 
British columbia september2014_final
British columbia september2014_finalBritish columbia september2014_final
British columbia september2014_final
 
Acting as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decadeActing as Advocate? Seven steps for libraries in the data decade
Acting as Advocate? Seven steps for libraries in the data decade
 
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
CIC Networked Learning Practices Workshop - Caroline HaythornthwaiteCIC Networked Learning Practices Workshop - Caroline Haythornthwaite
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
 
Towards a systematic framework for analysis of tools and approaches to integr...
Towards a systematic framework for analysis of tools and approaches to integr...Towards a systematic framework for analysis of tools and approaches to integr...
Towards a systematic framework for analysis of tools and approaches to integr...
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 
Social media as a tool for researchers
Social media as a tool for researchersSocial media as a tool for researchers
Social media as a tool for researchers
 
Media X at Stanford University - Description
Media X at Stanford University - DescriptionMedia X at Stanford University - Description
Media X at Stanford University - Description
 
WEBINAR: Joining the "buzz": the role of social media in raising research vi...
WEBINAR:  Joining the "buzz": the role of social media in raising research vi...WEBINAR:  Joining the "buzz": the role of social media in raising research vi...
WEBINAR: Joining the "buzz": the role of social media in raising research vi...
 
Research-Open Access-Social Media: a winning combination
Research-Open Access-Social Media: a winning combinationResearch-Open Access-Social Media: a winning combination
Research-Open Access-Social Media: a winning combination
 
Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”Interactive recommender systems: opening up the “black box”
Interactive recommender systems: opening up the “black box”
 
The role of learning in citizen science
The role of learning in citizen scienceThe role of learning in citizen science
The role of learning in citizen science
 
Social Machines of Scholarly Collaboration
Social Machines of Scholarly CollaborationSocial Machines of Scholarly Collaboration
Social Machines of Scholarly Collaboration
 

Recently uploaded

Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝soniya singh
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 

Recently uploaded (20)

Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
Model Call Girl in Narela Delhi reach out to us at 🔝8264348440🔝
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 

Enrique RCODI presentation symposium 2017

  • 1. Jesus-Enrique Aldana-Sigona October 2017 DYNAMIC USER PROFILING IN OPEN SCIENCE INFRASTRUCTURES RESEARCH CENTER Open Digital Innovation
  • 2. Open science Open Science Tools Open Reproducible Research Doing science supported by scientific digital infrastructures has changed the way of doing science Source: strategyzer.com Open access Open Data Limited access, data, tools DIGITAL SCIENTIFIC INFRASTRUCTU RES
  • 3. Current challenge for the Science 2.0 platforms Scientific platforms help the users on scientific related activities Source: The Center for Open Science Different infrastructures rely their impact and operational dynamics diferrent. In particular, the creation infrastructures are more sensible to the community creation for the continuity of the developed creations. The lack of engagement increases the resistance adoption of Open science practices according to a survey from European Commission (2015). Digital Scientific output challenges Traditional Scientific outputs. E.g. Journal publications Digital Scientific outputs. E.g. scientific-oriented programming code, repositories, etc. Yields to… Source: strategyzer.com Communities of creation
  • 4. What is engagement? Behavioral Engagementfactors[1]PlatformCommunity Cognitive Emotional Qualitative Quantitative Typeof studies Cognitive Emotional Qualitative Physical Log behavior on the platform Networks have help to understand human behaviors in the past, why not here? [1] Fredricks, J., Blumenfeld, P.C., Paris, A.H.: School engagement: potential of the concept, state of the evidence. Rev. Educ. Res. 74, 59–109 (2004) (?) Engagement Psychology PhilosophySociology Computer Science
  • 5. Engagement metrics Lifespan ACTIVITY RATIO: LURKING RATIO Σ + Σ 𝑙𝑖𝑓𝑒 𝑠𝑝𝑎𝑛 Σ 𝑙𝑖𝑓𝑒 𝑠𝑝𝑎𝑛 Blocs Variation in periodicity STD ( Blocks of )
  • 6. Aims Identify users’ infrastructure life journeys where the engagement increased above average infrastructure population Generate a predictive model for new users where the current behavior indicates the next possible behavior state in order to optimize the engagement levels
  • 8. Profiling using density clustering method Big Data Implications on profiling user’s Behavior In addition to density clustering methods for the detection of profiles in continuous data, we will use the hierarchical form to expose embedded types of behavior User type A User type A mixed with B User type B ? Where to cut? Users living X number of days Embedded types 1. Maria Aristeidou, Eileen Scanlon, Mike Sharples, Profiles of engagement in online communities of citizen science participation, In Computers in Human Behavior, Volume 74, 2017, Pages 246-256, ttps://doi.org/10.1016/j.chb.2017.04.044. 2. Shirkhorshidi A.S., Aghabozorgi S., Wah T.Y., Herawan T. (2014) Big Data Clustering: A Review. In: Murgante B. et al. (eds) Computational Science and Its Applications – ICCSA 2014. ICCSA 2014. Lecture Notes in Computer Science, vol 8583. Springer, Cham
  • 10. CASE OF STUDY: NANOHUB
  • 11. Case of Study Nanohub > 5700 Resources > 500 simulation tools > 300,000 Tutorial/ Lecture Users In 172 countries > 1.4 M visitors > 16, 000 Simulation Users NanoHub represents a novel form of virtual collective science production
  • 12. NanoHub Research Ecosystem Students Tool developers Educators Socio-technical infrastructure Publication network Social & organizational behavior Technologies Research Ecosystem Open Science Brunswicker, et al 2016 Nanohub > 5700 Resources > 500 simulation tools > 300,000 Tutorial/ Lecture Users In 172 countries > 1.4 M visitors > 16, 000 Simulation Users
  • 14. 1. Cluster tendency Validation Validation of clustering tendency using Hopkins statistic Source: Yerpo, Population distribution Natural dispersion of among communities’ data Engagement behavior dataset* has presented a strong clustering tendency 0.01 0.30 0.50 0.70 1.000.99 2-Ddataview Dataset: > 40,000 registered users in the range between 2010 - 2016
  • 15. 17 PCA Results Goal: Identify the impact of each proven metrics from nanohub dataset. Metrics Percentage of Variance explained Standard deviation of absence days between logins Discussion 1. The periodicity var value was not making sense because that meant that the user engagement is just defined by the download behavior. 2. The session Creation value was zero because the number of users who create simulation in contrast with the number of users who download is smaller. Therfor, the influence of download activity was based on an unbalance proportion • Conclusion: The current variables are not presenting sufficiently significant based on the unbalance dataset.
  • 16. TSE representation of populations User’s life representation User Behaviors across populations created by lifespanDistribution of users by lifespan The current high dimensional representation of the user’s behavior showed non regular shapes that supports the usage of density based clustering algorithms
  • 17. Engagement Behavior has a path 1 2 3 1 2 3 Similarity Matrix Example populations Population 2 & 3 tested in Two-sample KS test and had shown non significant difference Younger day population behave significantly different behavior but elder populations become similar Not similar Strongly similar
  • 18. Lifespan populations MERGING ALL THE POPULATIONS THAT ARE SIMILAR BETWEEN THEMSELVES Not similar Strongly similar Despite the statistical similarity A = B =C => A=C FALSE MERGING MULTIPLE LIFESPANS GRAPH BASED SOLUTION: MAXIMAL CLIQUES A B C E D
  • 19. Contributions Evidence of non spherical behavioral engagement The effects of unbalance data over time for profiling analysis Evidence of similarity among high engaged populations
  • 20. Next steps Pattern recognition over time Day 1 Day 2 Day 4 Snapshot populations User 1 User 2 User behavior overtime A C B A C B A C B Network of Shared knowledge interests A BUsers Common interest represented by X resource Critical behavior for longer engagement
  • 21. Thanks Special thank you to … Dr. Gerhard Klimeck Dr. Michael Zentner Dr. Sabine Brunswicker Dr. Satyam Mukherjee National Science Foundation CONACYT Mexico Government Babak Ravandi
  • 24. Continuous login behavior Goal: Identify if the habit concept as a significant variables for further analysis. Q: How many users with every day logins are expected to be seen after x days? A: 7 is the support value of max continuous days that more than 1 user is seen.
  • 25. Correlation matrix Correlation matrix of variables of interest Goal: Select the variables with more influence across the set of variables Results: Not conclusive yet
  • 26. CriteriaClustering t Hierarchical Density- based Non-hierarchical No apriori # of clusters YES NO Convex clusters YES NO Dif size of the clusters YES Some Scalabe NO YES Convergance YES NO Nested clusters YES NO Unique cluster YES NO Noise detection YES Algorithm dependant Parameters robustness NO Algorithm dependant Non parametric Method selection
  • 27. Method selection Source: Jain et al 1999 Clustering methods Non-spherical clusters
  • 28. Internal clustering validation Silhouette Index Taken from Kenn State DataMining course ‘08