Presentation at the 5th International Conference on eSocial Science, part of a workshop on the law and ethics of eSocial Science research. It outlines three domains I am currently researching and some of the ethical issues I have encountered, including reporting on a third party (Facebook), deception (craigslist) and information access (grouphug.us).
Ethical challenges for online social science research: Networks, rentals and confessionals
1. Ethical challenges for online social
science research: Networks,
Rentals and Confessionals
Bernie Hogan
Research Fellow, Oxford Internet Institute
NCeSS - 5th International Conference on e-Social Science
June 24, 2009. Cologne, Germany
Wednesday, June 24, 2009 1
2. Three unethical
studies?
• Facebook network research
• Craigslist audit study
• Grouphug.us
4. What are the techniques?
• Spidering - Technically fussy, often considered
inappropriate by data controller
• API - Technically restrictive, gives false sense of data
ownership (See Facebook Developer Terms of Use
Section 2.A.6)
• Datadump - Facebook gives you the data
• Someone else’s application - May not give data, but only
a picture.
• Handcoding - Spidering for masochists
5. Who gets the data?
• Golder, S., Wilkinson, D. M., and Huberman, B. A. (2007).
Rhythms of social interaction: Messaging within a
massive online network. In 3rd International Conference on
Communities and Technologies, East Lansing, MI. Springer.
• Traud, A., Kelsic, E., Mucha, P., and Porter, M. (2008). Community
structure in online collegiate networks. Working paper.
• Lewis, K., Kaufman, J., Gonzalez, M., Wimmer, A., and Christakis, N.
(2008). Tastes, ties, and time: A new social network
dataset using facebook.com. Social Networks, 30(4):330–342.
6. But isn’t it anonymous? No.
• Backstrom, L., Dwork, C., and Kleinberg, J. (2007).
Wherefore art thou r3579x? : anonymized social
networks, hidden patterns, and structural
steganography. In Proceedings of the 16th international
conference on World Wide Web, pages 181–190. ACM New
York, NY, USA.
• Direct attack needs ~ sqrt(log(n)) nodes.
• Narayanan, A. and Shmatikov, V. (2009). De-anonymizing
social networks. Forthcoming: IEEE C&S.
• Starting with even less and matching to existing network
can get over 90% of the network accurately.
7. Or simply use this guy
Zimmer, Michael. 2009.
“But the Data is Already
Public”: On the Ethics of
Research in Facebook.
8th International
Conference of Computer
Ethics: Philosophical
Enquiry. Corfu, Greece.
8. The only anonymous
network is one where
you don’t know
the network structure.
This is unrealistic.
9. So what’s the precedent?
• Personal networks with informed consent.
• Name generators have historically asked individuals
to report data on their friends.
• They jump through an ethical loop-hole vis-a-vis the fact
that this is recall data.
• Information networks, however, permit not only data
created by an individual, but the friend of a friend data
that is merely accessible, not created, by the respondent.
10. Facebook properties enable you to
report on your friends to a third party.
Respondent
Friend 1 ? Friend 2
13. Methods
• This is a University of Toronto ethics board-approved
audit study.
• We selected craigslist.org, a highly popular free online
classifieds site.
• From March to June 2007 we selected approximately 10
new ads each day for inclusion in the study.
• Each landlord was emailed 5 messages. Each message
included one of five ethnicities randomly assigned with
one of five message bodies. Each experiment used one
gender only.
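The assignment step described above can be sketched as follows. This is a minimal illustration, not the study's software: it assumes that each landlord receives every ethnicity and every message body exactly once, paired at random. The ethnicity labels are taken from the results table later in this talk; the message body names and the function name are hypothetical placeholders.

```python
import random

# Ethnicity labels as they appear in the results table of this talk.
ETHNICITIES = ["Arab", "Black", "SE Asian", "Caucasian", "Jewish"]
# Hypothetical placeholders; the actual message texts are not given in the slides.
MESSAGE_BODIES = ["body_1", "body_2", "body_3", "body_4", "body_5"]


def assign_messages(rng):
    """Pair each ethnicity with a distinct, randomly chosen message body.

    Assumption: one landlord gets five messages, and each ethnicity and
    each message body is used exactly once per landlord.
    """
    ethnicities = ETHNICITIES[:]
    bodies = MESSAGE_BODIES[:]
    rng.shuffle(ethnicities)  # random sending order of ethnicities
    rng.shuffle(bodies)       # random pairing with message bodies
    return list(zip(ethnicities, bodies))


if __name__ == "__main__":
    for ethnicity, body in assign_messages(random.Random()):
        print(ethnicity, body)
```

Passing a seeded `random.Random` makes an assignment reproducible, which helps when auditing which landlord received which combination.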
14. 1. Price and number of bedrooms almost always in header.
2. Masked email address.
3. Well-formed date.
4. PostingID - key to linking data.
5. Link to well-formed Google map, or failing that, nearest intersection.
15. Jitter means that messages are sent at a random time within "5" minutes of the specified time. Makes batches of messages look more realistic.
We send messages out one day after the posting (rather than immediately) at short regular intervals. The parameters can be tuned.
By default we alternate between male and female names.
This window shows the five name / message combinations that will be sent out.
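The scheduling rule on this slide (a one-day delay, regular intervals, and a random jitter) might look roughly like the sketch below. It is an illustration under stated assumptions rather than the tool shown in the screenshot: the 30-minute interval, the symmetric uniform jitter, and all function and parameter names are hypothetical.

```python
import random
from datetime import datetime, timedelta


def schedule_with_jitter(posted_at, n_messages=5,
                         delay=timedelta(days=1),
                         interval=timedelta(minutes=30),
                         jitter_minutes=5.0,
                         rng=None):
    """Return send times for one ad.

    Messages start `delay` after the posting and are spaced `interval`
    apart; each one is then shifted by a random offset of at most
    `jitter_minutes` in either direction (assumed uniform).
    """
    rng = rng or random.Random()
    send_times = []
    for i in range(n_messages):
        nominal = posted_at + delay + i * interval
        offset = rng.uniform(-jitter_minutes, jitter_minutes)
        send_times.append(nominal + timedelta(minutes=offset))
    return send_times


if __name__ == "__main__":
    for t in schedule_with_jitter(datetime(2007, 3, 1, 9, 0)):
        print(t.isoformat(timespec="seconds"))
```

Keeping the delay, interval, and jitter as parameters mirrors the slide's note that these values can be tuned.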
16. Date
Email address.
1 of 5 different message bodies.
1 of 5 female Arabic names
Secret posting ID: ddhfegjfb = 337546951
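The single example on this slide (ddhfegjfb = 337546951) is consistent with a simple digit-for-letter substitution in which a = 0, b = 1, and so on up to j = 9. The sketch below shows that mapping; whether the study's software encoded posting IDs exactly this way is an assumption inferred from that one example, and the function names are hypothetical.

```python
import string

# Assumed mapping, inferred from the one example on the slide: 0->a, 1->b, ..., 9->j.
LETTERS = string.ascii_lowercase[:10]
_ENCODE = str.maketrans(string.digits, LETTERS)
_DECODE = str.maketrans(LETTERS, string.digits)


def encode_posting_id(posting_id: int) -> str:
    """Turn a numeric posting ID into a letter token to embed in a message."""
    return str(posting_id).translate(_ENCODE)


def decode_posting_id(token: str) -> int:
    """Recover the numeric posting ID from a letter token found in a reply."""
    return int(token.translate(_DECODE))


if __name__ == "__main__":
    print(encode_posting_id(337546951))    # ddhfegjfb
    print(decode_posting_id("ddhfegjfb"))  # 337546951
```

A token like this lets a reply be linked back to the original ad without showing a recognizable ID number to the landlord.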
17. Map of rentals in
Greater Toronto Area
Geographic distribution
of rental ads
(97% showing)
18. Ranked responses for names by ethnicity and gender
• We ranked each of the 50 names from 1 (least responses) to 50 (most responses).
• The table shows the sum of the ranks for all 5 names used in each ethnicity-gender combination.
             Male   Female
Total        519    756
Arab         31     113
Black        97     129
SE Asian     88     179
Caucasian    146    164
Jewish       157    171
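The figures on this slide can be checked arithmetically. If 50 names are ranked from 1 to 50, the ranks must sum to 50 × 51 / 2 = 1275, so the male and female column totals (519 and 756) should add up to that number. The short sketch below reproduces the check; the variable names are illustrative only.

```python
# Rank sums per ethnicity as (male, female), copied from the slide.
RANK_SUMS = {
    "Arab": (31, 113),
    "Black": (97, 129),
    "SE Asian": (88, 179),
    "Caucasian": (146, 164),
    "Jewish": (157, 171),
}

N_NAMES = 50  # 5 ethnicities x 2 genders x 5 names

male_total = sum(male for male, _ in RANK_SUMS.values())
female_total = sum(female for _, female in RANK_SUMS.values())
expected_total = N_NAMES * (N_NAMES + 1) // 2  # sum of ranks 1..50

if __name__ == "__main__":
    print("male:", male_total)        # 519
    print("female:", female_total)    # 756
    print("all:", male_total + female_total, "expected:", expected_total)
```

Because a higher rank means more responses, the larger female total (756 against 519) indicates that female names drew more replies overall, and the Arab male names have the lowest rank sum of any group (31).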
19. Issues
• Racism is often difficult to assess through
direct questioning.
• Deception in this study is necessary.
• There is no direct personal harm, and no
direct manipulation.
21. Online confessional site
• What constitutes anonymity?
• Grouphug is a website of approximately
one million posts (approximately 95%
unique).
• Does not store IP, actively discourages
quoting other posts and encodes the
entries in non-sequential strings
(timestamps exist but are hidden)
22. Nothing here to see...
(catch 22)
23. Ok, here are some examples
• “I am so happy that I can confess again. I don't
even care about seeing my confessions on here,
it's just the feeling of getting it off your chest and
sending it away!” (136158003)
• “I pee in the shower because I hate everyone I
live with.” (255678370)
24. Some worse examples
• “I paid my friend 200 dollars to do over 400 pages of
homework for the year, so that i can ditch school as
much as i want, while lying to my mother and saying im
still going to school” (194778021)
• “I have HPV, its a std. I have known about it for 7
years, but that has not stopped me from having sex with
9 people with out a condom. 4 of the girls where
married. I have never told anyone about my std. I have
no idea how many people are infected because of me,
it keeps me up at night.” (275447713)
25. So...
• Do we ignore anonymous confessionals as too
toxic, or treat them as insight to the id?
• Can we even analyze this data or merely view
it as passive bystanders? Are there legal
implications, especially dealing with data
designed to resist tracking? What is my
responsibility if I can do nothing to follow up
(or even confirm the veracity of the
statement)?
26. Summary
• Facebook - the ethics of capturing someone else’s
relationships is ambiguous. The network I see is not mine -
it is what I am allowed to see. I defer to Facebook’s terms
of use.
• Craigslist - the ethics of understanding racism as it
actually operates online is problematic. I defer to utilitarian
arguments and approval from the ethics board.
• Grouphug - the ethics of viewing and storing, let alone
analyzing, confessionals is ambiguous. How can we assure
no personally identifying information without looking for
it? How can we anonymize a million entries?
27. Opportunities
• We can get unprecedented access to
society in the wild.
• But is this fair? Is it justified?
• How close to ‘the social good’ must one be
to justify this work?
28. Thank You
Bernie Hogan
bernie.hogan@oii.ox.ac.uk