This document summarizes a presentation given by Ponnurangam Kumaraguru on credibility, identity resolution, privacy, and policing on online social media. Some key points include:
- Kumaraguru is an associate professor at IIIT-Delhi whose research interests include social computing, computational social science, and complex networks related to human behavior and security/privacy.
- His team has developed methods for credibility modeling of tweets during crisis events using machine learning techniques. They also built a system called TweetCred to analyze tweet credibility.
- The presentation discussed challenges around user identity resolution across multiple online social networks and described some of Kumaraguru's research in this area, including using profile attributes and network connections to find
Keynote at 4th International Symposium on Secuirty in Computing at Communicat...IIIT Hyderabad
With increase in usage of the Internet, there has been an exponential increase in the use of online social media on the Internet. Websites like Facebook, Google+, YouTube, Orkut, Twitter and Flickr have changed the way Internet is being used. There is a dire need to investigate, study and characterize privacy and security on online social media from various perspectives (computational, cultural, psychological). Real world scalable systems need to be built to detect and defend security and privacy issues on online social media. I will describe briefly some cool ongoing projects that we have: Twit-Digest, MultiOSN, Finding Nemo, OCEAN, Privacy in India, and Call Me MayBe. Many of our research work is made available for public use through tools or online services. Our work derives techniques from Data Mining, Text Mining, Statistics, Network Science, Public Policy, Complex networks, Human Computer Interaction, and Psychology. In particular, in this talk, I will focus on the following: (1) Twit-Digest is a tool to extract intelligence from Twitter which can be useful to security analysts. Twit-Digest is backed by award-winning research publications in international and national venues. (2) MultiOSN is a platform to analyze multiple OSM services to gain intelligence on a given topic / event of interest (2) OCEAN: Open source Collation of eGovernment data and Networks Here, we show how publicly available information on Government services can be used to profile citizens in India. This work obtained the Best Poster Award at Security and Privacy Symposium at IIT Kanpur, 2013 and it has gained a lot of traction in Indian media. (3) In Finding Nemo, given an identity in one online social media, we are interested in finding the digital foot print of the user in other social media services, this is also called digital identity stitching problem. This work is also backed by award-winning research publication. I will be more than happy to clarify, discuss, any of our work indetail, as required, after the talk.
2014 TheNextWeb-Mapping connections with NodeXLMarc Smith
Slides from a talk at the 2014 TheNextWeb in Amsterdam.
NodeXL social media network analysis of Twitter reveals six common structures in Twitter networks.
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...Marc Smith
Networks are a powerful way to understand social media.
This talk reviews the ways the NodeXL application can be used to reveal the social media networks structures around topics.
Anatoliy Gruzd and Philip Mai
Workshop presented at the TTRA Annual International Conference in Quebec City (June 20, 2017)
https://2017ttraannualinternationalconfe.sched.com/event/9yCg/social-listening-how-to-do-it-and-how-to-use-it-veille-sociale-comment-faire-et-comment-lutiliser?iframe=no&w=100%&sidebar=no&bg=no
Keynote at 4th International Symposium on Secuirty in Computing at Communicat...IIIT Hyderabad
With increase in usage of the Internet, there has been an exponential increase in the use of online social media on the Internet. Websites like Facebook, Google+, YouTube, Orkut, Twitter and Flickr have changed the way Internet is being used. There is a dire need to investigate, study and characterize privacy and security on online social media from various perspectives (computational, cultural, psychological). Real world scalable systems need to be built to detect and defend security and privacy issues on online social media. I will describe briefly some cool ongoing projects that we have: Twit-Digest, MultiOSN, Finding Nemo, OCEAN, Privacy in India, and Call Me MayBe. Many of our research work is made available for public use through tools or online services. Our work derives techniques from Data Mining, Text Mining, Statistics, Network Science, Public Policy, Complex networks, Human Computer Interaction, and Psychology. In particular, in this talk, I will focus on the following: (1) Twit-Digest is a tool to extract intelligence from Twitter which can be useful to security analysts. Twit-Digest is backed by award-winning research publications in international and national venues. (2) MultiOSN is a platform to analyze multiple OSM services to gain intelligence on a given topic / event of interest (2) OCEAN: Open source Collation of eGovernment data and Networks Here, we show how publicly available information on Government services can be used to profile citizens in India. This work obtained the Best Poster Award at Security and Privacy Symposium at IIT Kanpur, 2013 and it has gained a lot of traction in Indian media. (3) In Finding Nemo, given an identity in one online social media, we are interested in finding the digital foot print of the user in other social media services, this is also called digital identity stitching problem. This work is also backed by award-winning research publication. I will be more than happy to clarify, discuss, any of our work indetail, as required, after the talk.
2014 TheNextWeb-Mapping connections with NodeXLMarc Smith
Slides from a talk at the 2014 TheNextWeb in Amsterdam.
NodeXL social media network analysis of Twitter reveals six common structures in Twitter networks.
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...Marc Smith
Networks are a powerful way to understand social media.
This talk reviews the ways the NodeXL application can be used to reveal the social media networks structures around topics.
Anatoliy Gruzd and Philip Mai
Workshop presented at the TTRA Annual International Conference in Quebec City (June 20, 2017)
https://2017ttraannualinternationalconfe.sched.com/event/9yCg/social-listening-how-to-do-it-and-how-to-use-it-veille-sociale-comment-faire-et-comment-lutiliser?iframe=no&w=100%&sidebar=no&bg=no
Think Link: Network Insights with No Programming SkillsMarc Smith
Networks are everywhere, but the tools for end users to access, analyze, visualize and share insights into connected structures have been absent. NodeXL, the network overview discovery and exploration add-in for Excel makes network analysis as easy as making a pie chart.
A workshop about Social Network Analysis and Condor software developed by Peter Gloor, MIT. The workshop aimed to provide an introduction for design students into SNA and this free software tool for SNA.
Professor Hendrik Speck - Social and Virtual. - An Analysis Framework for Lar...Hendrik Speck
Professor Hendrik Speck - Social and Virtual. - An Analysis Framework for Large Scale Communities. Commercial Communities Conference. Technical University of Berlin. Institute of Sociology, October 30 - 31 2008, Berlin, Germany, User Generated Content, Interaction, Third Party Associations and Content, Access and Connectivity, API's, Beacons, and Data Feeds, Merger of Social, Mobile and Local, social network analysis, social network visualization, Audience and Participants, Relational Data, Mathematical Models, Analytical Framework, Processing, Computing Power, Computer Mediated Communication, Visualization Algorithms, Interest, Use Cases, Marketing, Commerce, Web Services, Type of Data, Attribute, Ideational, Relational, Research Method, Survey Research, Surveys and Interviews, Ethographic Research, Observations, Field Studies, Documentary Research
Logfiles, Texts and Archives, Type of Analysis, Variable, Typological, Network, User Profiles. Name, Age, Links, Interests, Hobbies, City, Country, Category, Videos Headline, Content, Descriptions, Tags, Playlists, Video Comments Author, Text, Tags, Themes, Ranking of Users and Channels Views or Subscriptions by Time and Category, Rankings of Videos Ratings or Views by Time and Category, Interaction Friends, Subscription, Comments, FollowUps
Betweenness, Centrality Closeness, Centrality Degree, Flow Betweenness Centrality, Centrality Eigenvector, Centralization, Clustering Coefficient, Cohesion, Contagion, Density, Integration, Path Length, Radiality, Reach, Structural Equivalence, Structural Hole, Islands
Scholarly Identity 2.0: What does the Web say about your research?Michael Habib
Congress Center Hotel Zira
Belgrade, Serbia – October 30, 2009
Hosted by University of Belgrade...
Blog post describing presentation and proposed concept model:
http://mchabib.com/2009/11/04/scholarly-identity-2-0-matrix-concept-model-and-presentation/
A video of the presentation is located here:
http://bit.ly/6VpsbX
From Academic Library 2.0 to (Literature) Research 2.0Michael Habib
Congress Center Hotel Zira
Belgrade, Serbia – October 29, 2009
Hosted by University of Belgrade...
Blog post discussing the presentation and the proposed Research 2.0 Concept Model:
http://mchabib.com/2009/11/04/research-2-0-concept-model-presentation/
A video of the lecture is now available here: http://bit.ly/6VpsbX
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Michael Habib
Session 6: Wissenschaftskommunikation 2.0 – Social Software @ WorkSchloss Mickeln, Düsseldorf, 29. September 2009 Abstract: Social media provides scholars with unprecedented opportunities to promote their accomplishments and expertise. Conversely, social media creates more identity information to for scholars to manage. Different facets of scholar identity online will be introduced. Within this framework, new types of identity content produced by social software and the challenges this creates will be discussed. Lastly, opportunities for using social software to manage scholarly reputation will be explored.
Social media is now the place where people are gathering en masse to discuss the news with their friends, neighbors and complete strangers. This change in news consumers’ behavior is proving to be a challenge for local news, but it is also an opportunity. Users and system generated data from social media can also be a boon for content creators. This presentation will feature a case study showing how publishers can use social media analytics to gain insights into their audience and how to use this information to foster a stronger sense of community around their brand of journalism. The case study will focus on how to use Netlytic, a cloud-based social media analytics tool, to mine the public Facebook interactions of the readers of BlogTO, a regional, Canadian-based media outlet, to find out what their readers are interested in and what engages them.
Professor Hendrik Speck - Information Mining in the Social Web. Empolis Execu...Hendrik Speck
Professor Hendrik Speck - Information Mining in the Social Web. Empolis Executive Forum, June 8th 9th 2009 Berlin Germany. social networks, social media, web 2.0, social network analysis, usage, audience, user, markets, revenues, google, youtube, myspace, wikipedia, attributes, search engines, marketing, lobbying, information mining, information retrieval, risk, law, security, branding, marketing, privacy, private sphere, public sphere, anonymity, surveillance, panopticon, sousveillance, hype, history, features, examples, captcha, security, cracking, data portability, decentralization
Keynote address by Anatoliy Gruzd at the 2017 Altmetrics Conference in Toronto, Canada (Sep 27, 2017)
Abstract
Arguably, even the most innovative ideas take time to catch on. Ideas that seem obvious today, at one point were obscure oddities known only to a select few. Washing your hands, airbags in cars, the internet - none of these ideas were accepted immediately. New ideas need time to incubate, the process of switching from old ideas to new is not seamless nor is it linear. In today’s social media-connected world, even though ideas can spread quickly and more efficiently than ever before, they are now competing for attention with a multitude of other ideas, memes, tweets, snaps, YouTube videos and news (fake and real). Conceptually, if social media is a network of highways on which ideas and people travel, altmetrics are the billboard or traffic signs on these highways that can help interested parties to discover new ideas or re-discover ideas left on the side of the road. While often neglected, the above metaphor is meant to illuminate the important role of altmetrics for researchers, innovators and funders seeking to track the impacts of new ideas, as well as for the many idea consumers looking for emerging and novel insights.
This talk will outline the current state of altmetrics research and how altmetrics are being commonly calculated and used by different stakeholders. It will also explore the social network properties of ideas and how these properties might be used to customize altmetrics for different audiences and uses. The keynote will conclude by calling for the development of training strategies to provide learning opportunities for researchers and administrators from various fields to acquire necessary digital literacy skills so that they better understand how altmetrics are measured and how they can be interpreted for decision making. The keynote will also call on altmetrics developers and researchers to create algorithms and data collection strategies that are less prone to manipulation by the rapid rise of social bots.
Best Practices in Social Media: Part IEric Hodgson
Communicating with prospective students, current students, and alumni has become a full-time, multi-media job, and your current web site is one-dimensional for ongoing conversations. Facebook, twitter, flickr and YouTube are the places your audience goes to learn about everything, including you. Haven’t become an active participant in social media yet? During this two-part series, we will look at the right way to get started, using examples in admissions, student services, alumni relations, and individual academic departments.
2010 june - personal democracy forum - marc smith - mapping political socia...Marc Smith
Marc Smith's presentation to the Personal Democracy Forum 2010 in New York City on June 4th, 2010 about the use of NodeXL, a social media network analysis tool, to map political topics in services like Twitter.
NodeXL is available from http://nodexl.codeplex.com
This workshop will introduce some of the main principles and techniques of Social Network Analysis (SNA). We will use examples from organizational and social media-based networks to understand concepts such as network density, diameter, centrality measures, community detection algorithms, etc. The session will also introduce Gephi, a popular program for SNA. Gephi is a free and open-source tool that is available for both Mac and PC computers.
By the end of the session, you will develop a general understanding of what SNA is, what research questions it can help you answer, and how it can be applied to your own research. You will also learn how to use Gephi to visualize and examine networks using various layout and community detection algorithms.
Instructor’s Bio: Dr. Anatoliy Gruzd is a Canada Research Chair in Social Media Data Stewardship, Associate Professor at the Ted Rogers School of Management at Ryerson University, and Director of Research at the Social Media Lab. Anatoliy is also a Member of the Royal Society of Canada’s College of New Scholars, Artists and Scientists; a co-editor of a multidisciplinary journal on Big Data and Society; and a founding co-chair of the International Conference on Social Media and Society. His research initiatives explore how social media platforms are changing the ways in which people and organizations communicate, collaborate and disseminate information and how these changes impact the norms and structures of modern society.
Think Link: Network Insights with No Programming SkillsMarc Smith
Networks are everywhere, but the tools for end users to access, analyze, visualize and share insights into connected structures have been absent. NodeXL, the network overview discovery and exploration add-in for Excel makes network analysis as easy as making a pie chart.
A workshop about Social Network Analysis and Condor software developed by Peter Gloor, MIT. The workshop aimed to provide an introduction for design students into SNA and this free software tool for SNA.
Professor Hendrik Speck - Social and Virtual. - An Analysis Framework for Lar...Hendrik Speck
Professor Hendrik Speck - Social and Virtual. - An Analysis Framework for Large Scale Communities. Commercial Communities Conference. Technical University of Berlin. Institute of Sociology, October 30 - 31 2008, Berlin, Germany, User Generated Content, Interaction, Third Party Associations and Content, Access and Connectivity, API's, Beacons, and Data Feeds, Merger of Social, Mobile and Local, social network analysis, social network visualization, Audience and Participants, Relational Data, Mathematical Models, Analytical Framework, Processing, Computing Power, Computer Mediated Communication, Visualization Algorithms, Interest, Use Cases, Marketing, Commerce, Web Services, Type of Data, Attribute, Ideational, Relational, Research Method, Survey Research, Surveys and Interviews, Ethographic Research, Observations, Field Studies, Documentary Research
Logfiles, Texts and Archives, Type of Analysis, Variable, Typological, Network, User Profiles. Name, Age, Links, Interests, Hobbies, City, Country, Category, Videos Headline, Content, Descriptions, Tags, Playlists, Video Comments Author, Text, Tags, Themes, Ranking of Users and Channels Views or Subscriptions by Time and Category, Rankings of Videos Ratings or Views by Time and Category, Interaction Friends, Subscription, Comments, FollowUps
Betweenness, Centrality Closeness, Centrality Degree, Flow Betweenness Centrality, Centrality Eigenvector, Centralization, Clustering Coefficient, Cohesion, Contagion, Density, Integration, Path Length, Radiality, Reach, Structural Equivalence, Structural Hole, Islands
Scholarly Identity 2.0: What does the Web say about your research?Michael Habib
Congress Center Hotel Zira
Belgrade, Serbia – October 30, 2009
Hosted by University of Belgrade...
Blog post describing presentation and proposed concept model:
http://mchabib.com/2009/11/04/scholarly-identity-2-0-matrix-concept-model-and-presentation/
A video of the presentation is located here:
http://bit.ly/6VpsbX
From Academic Library 2.0 to (Literature) Research 2.0Michael Habib
Congress Center Hotel Zira
Belgrade, Serbia – October 29, 2009
Hosted by University of Belgrade...
Blog post discussing the presentation and the proposed Research 2.0 Concept Model:
http://mchabib.com/2009/11/04/research-2-0-concept-model-presentation/
A video of the lecture is now available here: http://bit.ly/6VpsbX
Scholarly Reputation Management Online: The Challenges and Opportunities of ...Michael Habib
Session 6: Wissenschaftskommunikation 2.0 – Social Software @ WorkSchloss Mickeln, Düsseldorf, 29. September 2009 Abstract: Social media provides scholars with unprecedented opportunities to promote their accomplishments and expertise. Conversely, social media creates more identity information to for scholars to manage. Different facets of scholar identity online will be introduced. Within this framework, new types of identity content produced by social software and the challenges this creates will be discussed. Lastly, opportunities for using social software to manage scholarly reputation will be explored.
Social media is now the place where people are gathering en masse to discuss the news with their friends, neighbors and complete strangers. This change in news consumers’ behavior is proving to be a challenge for local news, but it is also an opportunity. Users and system generated data from social media can also be a boon for content creators. This presentation will feature a case study showing how publishers can use social media analytics to gain insights into their audience and how to use this information to foster a stronger sense of community around their brand of journalism. The case study will focus on how to use Netlytic, a cloud-based social media analytics tool, to mine the public Facebook interactions of the readers of BlogTO, a regional, Canadian-based media outlet, to find out what their readers are interested in and what engages them.
Professor Hendrik Speck - Information Mining in the Social Web. Empolis Execu...Hendrik Speck
Professor Hendrik Speck - Information Mining in the Social Web. Empolis Executive Forum, June 8th 9th 2009 Berlin Germany. social networks, social media, web 2.0, social network analysis, usage, audience, user, markets, revenues, google, youtube, myspace, wikipedia, attributes, search engines, marketing, lobbying, information mining, information retrieval, risk, law, security, branding, marketing, privacy, private sphere, public sphere, anonymity, surveillance, panopticon, sousveillance, hype, history, features, examples, captcha, security, cracking, data portability, decentralization
Keynote address by Anatoliy Gruzd at the 2017 Altmetrics Conference in Toronto, Canada (Sep 27, 2017)
Abstract
Arguably, even the most innovative ideas take time to catch on. Ideas that seem obvious today, at one point were obscure oddities known only to a select few. Washing your hands, airbags in cars, the internet - none of these ideas were accepted immediately. New ideas need time to incubate, the process of switching from old ideas to new is not seamless nor is it linear. In today’s social media-connected world, even though ideas can spread quickly and more efficiently than ever before, they are now competing for attention with a multitude of other ideas, memes, tweets, snaps, YouTube videos and news (fake and real). Conceptually, if social media is a network of highways on which ideas and people travel, altmetrics are the billboard or traffic signs on these highways that can help interested parties to discover new ideas or re-discover ideas left on the side of the road. While often neglected, the above metaphor is meant to illuminate the important role of altmetrics for researchers, innovators and funders seeking to track the impacts of new ideas, as well as for the many idea consumers looking for emerging and novel insights.
This talk will outline the current state of altmetrics research and how altmetrics are being commonly calculated and used by different stakeholders. It will also explore the social network properties of ideas and how these properties might be used to customize altmetrics for different audiences and uses. The keynote will conclude by calling for the development of training strategies to provide learning opportunities for researchers and administrators from various fields to acquire necessary digital literacy skills so that they better understand how altmetrics are measured and how they can be interpreted for decision making. The keynote will also call on altmetrics developers and researchers to create algorithms and data collection strategies that are less prone to manipulation by the rapid rise of social bots.
Best Practices in Social Media: Part IEric Hodgson
Communicating with prospective students, current students, and alumni has become a full-time, multi-media job, and your current web site is one-dimensional for ongoing conversations. Facebook, twitter, flickr and YouTube are the places your audience goes to learn about everything, including you. Haven’t become an active participant in social media yet? During this two-part series, we will look at the right way to get started, using examples in admissions, student services, alumni relations, and individual academic departments.
2010 june - personal democracy forum - marc smith - mapping political socia...Marc Smith
Marc Smith's presentation to the Personal Democracy Forum 2010 in New York City on June 4th, 2010 about the use of NodeXL, a social media network analysis tool, to map political topics in services like Twitter.
NodeXL is available from http://nodexl.codeplex.com
This workshop will introduce some of the main principles and techniques of Social Network Analysis (SNA). We will use examples from organizational and social media-based networks to understand concepts such as network density, diameter, centrality measures, community detection algorithms, etc. The session will also introduce Gephi, a popular program for SNA. Gephi is a free and open-source tool that is available for both Mac and PC computers.
By the end of the session, you will develop a general understanding of what SNA is, what research questions it can help you answer, and how it can be applied to your own research. You will also learn how to use Gephi to visualize and examine networks using various layout and community detection algorithms.
Instructor’s Bio: Dr. Anatoliy Gruzd is a Canada Research Chair in Social Media Data Stewardship, Associate Professor at the Ted Rogers School of Management at Ryerson University, and Director of Research at the Social Media Lab. Anatoliy is also a Member of the Royal Society of Canada’s College of New Scholars, Artists and Scientists; a co-editor of a multidisciplinary journal on Big Data and Society; and a founding co-chair of the International Conference on Social Media and Society. His research initiatives explore how social media platforms are changing the ways in which people and organizations communicate, collaborate and disseminate information and how these changes impact the norms and structures of modern society.
Presentation at the Workshop on "Small Data and Big Data Controversies and Alternatives: Perspectives from The Sage Handbook of Social Media Research Methods" with Anabel Quan-Haase, Luke Sloan, Diane Rasmussen Pennington, et al.
LINK: http://sched.co/7G5N
Digital Forces - Social: Future Trends, Student Projects Highlight, Software and More. Talk by Dr. Ponnurangam Kumaraguru (PK), Associate Professor at IIIT-Delhi, as part of TCS iON Faculty Development Programme. Sept. 7, 2016.
2010 Catalyst Conference - Trends in Social Network AnalysisMarc Smith
Review of trends related to social network analysis in the enterprise. Presented at the 2010 Catalyst Conference in San Diego, CA july 29, 2010. Presented with Mike Gotta, Gartner Group.
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...Farida Vis
Keynote delivered at the SRA Social Media in Social Research conference, London, 24 June, 2013. The presentation highlights some thoughts on sampling, tools, data, ethics and user requirements for Twitter analytics, including an overview of a series of recent tools.
Extracting Social Network Data and Multimedia Communications from Social Medi...Shalin Hai-Jew
This presentation provides an overview of some of the data extractions that may be achieved on social media platforms using their respective APIs and a free open-source tool (NodeXL).
Eavesdropping on the Twitter Microblogging SiteShalin Hai-Jew
Research analysts go to Twitter to capture the general trends of public conversations, identify and profile influential accounts, and extract subgroups within larger collectives and larger discourses; they also go to eavesdrop on individual self-talk and individual-to-individual conversations. So what is technically in your tweets, asked Dave Rosenberg famously in a CNET article (2010). The answer: a whole lot more than 140 characters. How are the most influential social media accounts identified through #hashtag graphs? How are themes extracted? How are sentiments understood? How can users be profiled through their Tweetstreams? How can locations be mapped in terms of the Twitter conversations occurring in particular physical areas? How can live and trending issues be identified and categorized in terms of sentiment (positive, negative, and neutral)? This presentation will summarize some of the free and open-source tools as well as commercial and proprietary ones that enable increased knowability.
E-Learn 2014 Abstract: Today digital footprints are left all over the Internet for others to find. This article reviews the means through which scholars can organize research and connect digital scholarship for increased visibility and impact. A survey of the literature on scholarship tools to provide connections for publishing records, academic citations, and digital identity management was done. The authors reviewed Researcher ID, ORCID, and Google Scholar Citations. The numbers of portals for synthesizing research output and related identity management platforms are increasing; however, understanding what this research impact might look like in the digital age can provide questions for assessment for understanding these traces of scholarship online.
Identify, Inspect and Intervene Multimodal Fake NewsIIIT Hyderabad
Fake news refers to intentionally and verifiably false stories created to manipulate people’s perceptions of reality.
The concept of fake news is not new and has marked its presence dating back to AD 1475, affecting the citizens of Italy on eastern Sunday to the COVID-19 pandemic in 2020. Fake news has gained traction among audiences, created a buzz online, and faced repercussions offline. For instance, intruding hyperbolized fake articles into political campaigns or health and climate studies is havoc. In addition, the proliferation of fabricated stories has played a crucial role in inflaming or suppressing a social event. In conclusion, fake news is destructive and can lead to hatred against religion, politics, celebrities or organizations, resulting in riots/protests or even death.
The massive growth in the proliferation of fake news online might result from numerous technological advancements. Fake news seems to be the permanent reality, with social media being a primary conduit for its creation and dissemination. Despite the difficulty in identifying, tracking, and controlling unreliable content, there must be an effort to halt its expansion. Our research endeavors contribute to tackling various aspects of fake news, encompassing identification, inspection, and intervention. The premise of our thesis is firmly placed at the point where we analyze multiple facets of user-generated content produced online in the form of text and visuals to investigate the field of fake news.
First, we focus on devising different methods to Identify, a.k.a. detect fake news online, by extracting different feature sets from the given information. By designing foundational detection mechanisms, our work accelerates research innovations. Second, our research closely Inspects the fake stories from two perspectives. First, from the information point of view, one can inspect fabricated content to identify the patterns of false reports disseminating over the web, the modality used to create the fabricated content and the platform used for dissemination. Next, from the model point of view, we inspect detection mechanisms used in prior work and their generalizability to other datasets. The thesis also suggests Intervention techniques to help internet users broaden their comprehension of fake news. We discuss potential practical implications for social media platform owners and policymakers.
Data Science for Social Good: #MentalHealth #CodeMix #LegalNLP #AISafetyIIIT Hyderabad
Discuss work on using technology for Judiciary, Lawyers, etc. Analyse social media data, music listening habits for mental health. Bias and Safety in AI Systems.
Papers are available at https://precog.iiit.ac.in/pages/publications.html
Beyond the Surface: A Computational Exploration of Linguistic AmbiguityIIIT Hyderabad
We investigate two specific forms of linguistic ambiguities - polysemy, which is the multiplicity of meanings for a specific word, and tautology, which are seemingly uninformative and ambiguous phrases used in conversations. Both phenomena are widely-known manifestations of linguistic ambiguity at the lexical and pragmatic level, respectively.
The first part of the thesis focuses on addressing this challenge by proposing a new method for quantifying the degree of polysemy in words, which refers to the number of distinct meanings that a word can have. The proposed approach is a novel, unsupervised framework to compute and estimate polysemy scores for words in multiple languages, infusing syntactic knowledge in the form of dependency structures. The proposed framework is tested on curated datasets controlling for different sense distributions of words in three typologically diverse languages - English, French, and Spanish. The framework leverages contextual language models and syntactic structures to empirically support the widely held theoretical linguistic notion that syntax is intricately linked to ambiguity/polysemy.
The second part of the thesis explores how language models handle colloquial tautologies, a type of redundancy commonly used in conversational speech. We first present a dataset of colloquial tautologies and evaluate several state-of-the-art language models on this dataset using perplexity scores. We conduct probing experiments while controlling for the noun type, context and form of tautologies. The results reveal that BERT and GPT2 perform better with modal forms and human nouns, which aligns with previous literature and human intuition.
Data Science for Social Good: #LegalNLP #AlgorithmicBias...IIIT Hyderabad
Talk describes legal NLP idea discusses the following papers:
HLDC: Hindi Legal Documents Corpus https://precog.iiit.ac.in/pubs/HLDC_ACL_2022.pdf
Drug consumption: https://precog.iiit.ac.in/pubs/Effect_oF_Feedback_on_Drug_Consumption_Disclosures_on_Social_Media___ICWSM2023___16Sept1730hrs.pdf
Don’t Wait: Write
Importance of outline
Have Shepherds to review
Writing a Good Literature Review
Having a good title
Writing a Good Introduction
Active voice
Latex tips
Writing Rebuttal
I discussed our work on #LegalAI #CodeMixing #FakeNews #Elections and other cool projects that we are currently working on at https://precog.iiit.ac.in/
Modeling Online User Interactions and their Offline effects on Socio-Technica...IIIT Hyderabad
Do online interactions trigger reactions back in the offline world? How can these reactions be detected and quantified? Specifically, what insights can be extracted for users, platform owners, and policymakers to minimize the potential harm of such reactions?
Society functions based on the complex interactions between individuals, communities, and organizations. The advent of the Internet has enabled these interactions to move online. A website or an application that facilitates the digitization of social interactions is called a socio-technical platform. For instance, individuals converse with each other via direct messaging applications (e.g., WhatsApp, Telegram), share thoughts, and gather feedback from communities (e.g., Reddit, Twitter, Youtube). Trade of goods occurs via e-commerce (e.g., Flipkart, Amazon) and online marketplaces (e.g., Google Play store). At times interactions happening in the online world, trigger reactions in the offline world, which we call overflow. Such overflows can have either a positive or negative impact. Socio-technical platforms save every interaction and associated metadata, providing a unique opportunity to analyze rich data at scale. Discover interaction patterns, detect and quantify overflow of interactions, and extract insights for users and policymakers.
This report aims to study the interactions by keeping the individual as the focal point. We focus on two broad forms of interactions - i) the effect online community feedback can have on individual offline actions and ii) organizations leveraging individual customers' online presence to optimize business processes. In the first part, we work on two scenarios - (a) How does community feedback affect an individual future drug consumption frequency in a drug community forum? and (b) What changes does an individual undergo immediately after getting sudden popularity in Online social media? What actions help in maintaining popularity for longer? In the second part, we leverage online information about a customer to improve the prediction of Return-to-Origin in the e-commerce platform.
Development of Stress Induction and Detection System to Study its Effect on B...IIIT Hyderabad
Stress has become a significant mental health problem of the 21st century. The number of people suffering from stress is increasing rapidly. Thus, easy-to-use, inexpensive, and accurate biomarkers are needed to detect stress during its inception. Early detection of stress-related diseases allows people to access healthcare services. This thesis focuses on the development of stress stimuli and the detection of stress induced by these stimuli. Identifying brain regions affected while exposing the subject to these stressful stimuli has also been done. Three different stimuli, viz. videos, gamified application, and a game, are investigated to study their effect as stress induction stimuli. To this end, in this thesis, a system is proposed to classify participants into stressed and non-stressed categories using machine learning, deep learning, and statistical techniques. The statistical significance between stressed and non-stressed was found using Higuchi Fractal Dimensions (HFD) feature extracted from EEG. This feature also helped identify the brain’s most affected region due to stress. Another outcome of this thesis is the extra annotation of the ground truth which further helps to validate the participant’s experience under the influence of stressful stimuli. This annotation was performed by evaluating participant performance under time pressure. In addition, a technique based on in-game analytics is presented to complement the betterment of self-reported data. Further, another dimension utilizing signatures from WiFi Media Access Control (MAC) layer traffic is presented to detect stress indicators in a device-agnostic way.
A Framework for Automatic Question Answering in Indian LanguagesIIIT Hyderabad
The distribution of research efforts done in the field of Natural Language
Processing (NLP) has not been uniform across all natural languages. It has
been observed that there is a significant gap between the development of
NLP tools in Indic languages (indic-NLP), and in European languages. We
aim to explore different directions to develop an automatic question answering system for Indic languages. We built a FAQ-retrieval based chatbot for
healthcare workers and young mothers of India. It supported Hindi language in either Devanagri script or Roman script. We observed that, in our
FAQ database, if there exists a question similar to the query asked by the
user, then the developed chatbot is able to find a relevant Question-Answer
pair (QnA) among its top-3 suggestions 70% of the time. We also observed
that performance of our chatbot is dependent on the diversity in the FAQ
database. Since database creation requires substantial manual efforts, we decided to explore other ways to curate knowledge from raw text irrespective
of domain.
We developed an Open Information Extraction (OIE) tool for Indic languages. During the preprocessing, chunking of text is performed with our
fine-tuned chunker, and the phrase-level dependency tree was constructed
using the predicted chunks. In order to generate triples, various rules were
handcrafted using the dependency relations in Indic languages. Our method
performed better than other multilingual OIE tools on manual and automatic evaluations. The contextual embeddings used in this work does not
take syntactic structure of sentence into consideration. Hence, we devised
an architecture that takes the dependency tree of the sentence into consideration to calculate Dependency-aware Transformer (DaT) embeddings.
Since the dependency tree is also a graph, we used Graph Convolution
Network (GCN) to incorporate the dependency information into the contextual embeddings, thus producing DaT embeddings. We used a hate-speech
detection task to evaluate the effectiveness of DaT embeddings. Our future
plan is to evaluate the applicability of DaT embeddings for the task of chunking. Moreover, the broader aim for the future is to develop an end-to-end
pronoun resolution model to improve the quality of triples and DaT embeddings. We also aim to explore the applicability of all our works to solve the
problem of long-context question answering.
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
Andreas Schleicher presents at the OECD webinar ‘Digital devices in schools: detrimental distraction or secret to success?’ on 27 May 2024. The presentation was based on findings from PISA 2022 results and the webinar helped launch the PISA in Focus ‘Managing screen time: How to protect and equip students against distraction’ https://www.oecd-ilibrary.org/education/managing-screen-time_7c225af4-en and the OECD Education Policy Perspective ‘Students, digital devices and success’ can be found here - https://oe.cd/il/5yV
Ethnobotany and Ethnopharmacology:
Ethnobotany in herbal drug evaluation,
Impact of Ethnobotany in traditional medicine,
New development in herbals,
Bio-prospecting tools for drug discovery,
Role of Ethnopharmacology in drug evaluation,
Reverse Pharmacology.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
How to Split Bills in the Odoo 17 POS ModuleCeline George
Bills have a main role in point of sale procedure. It will help to track sales, handling payments and giving receipts to customers. Bill splitting also has an important role in POS. For example, If some friends come together for dinner and if they want to divide the bill then it is possible by POS bill splitting. This slide will show how to split bills in odoo 17 POS.
The Indian economy is classified into different sectors to simplify the analysis and understanding of economic activities. For Class 10, it's essential to grasp the sectors of the Indian economy, understand their characteristics, and recognize their importance. This guide will provide detailed notes on the Sectors of the Indian Economy Class 10, using specific long-tail keywords to enhance comprehension.
For more information, visit-www.vavaclasses.com
Credibility, Identity Resolution, Privacy, and Policing in Online Social Media
1. Credibility,
Identity
Resolution,
Privacy,
and
Policing
on
Online
Social
Media
IIT
Guwahati
Sept
26,
2016
Ponnurangam
Kumaraguru
(“PK”)
Associate
Professor
ACM
Distinguished
Speaker
fb/ponnurangam.kumaraguru,
@ponguru
2. Who
am
I?
– Associate
Professor,
IIIT-‐Delhi
– Ph.D.
from
School
of
Computer
Science,
Carnegie
Mellon
University
(CMU)
– Research
interests
-Social
Computing,
Computational
Social
Science,
Complex
Networks
pertaining
to
Human
Behavior,
specifically
in
the
context
of
Security
&
Privacy
– Co-‐ordinate
and
manage
Precog,
precog.iiitd.edu.in
– ACM
Distinguished
Speaker
2
7. Training
Data
– 500
Tweets
per
event
– Used
CrowdFlower
7
Event Tweets Users
Boston
Marathon
Blasts
(2013) 7,888,374 3,677,531
Typhoon Haiyan /
Yolanda
(2013) 671,918 368,269
Cyclone
Phailin (2013) 76,136 34,776
Washington
Navy yard shootings (2013) 484,609 257,682
Polar
vortex cold wave (2014) 143,959 116,141
Oklahoma
Tornadoes (2013) 809,154 542,049
Total
10,074,150 4,996,448
8. Credibility
Modeling
8
Feature
set
Features (45)
Tweet
meta-‐data
Number
of
seconds
since
the
tweet;
Source
of
tweet
(mobile
/
web/
etc);
Tweet
contains
geo-‐coordinates
Tweet
content
(simple)
Number
of
characters;
Number
of
words;
Number
of
URLs;
Number
of
hashtags;
Number
of
unique
characters;
Presence
of
stock
symbol;
Presence
of
happy
smiley;
Presence
of
sad
smiley;
Tweet
contains
`via';
Presence
of
colon
symbol
Tweet
content
(linguistic)
Presence
of
swear
words;
Presence
of
negative
emotion
words;
Presence
of
positive
emotion
words;
Presence
of
pronouns;
Mention
of
self
words
in
tweet
(I;
my;
mine)
Tweet
author
Number
of
followers;
friends;
time
since
the
user
if
on
Twitter;
etc.
Tweet
network
Number
of
retweets;
Number
of
mentions;
Tweet
is
a
reply;
Tweet
is
a
retweet
Tweet links
WOT
score
for
the
URL;
Ratio
of
likes
/
dislikes
for
a
YouTube
video
15. Challenges
15
ProfessionalOpinion
Dating
Heterogeneous
OSNs
Personal
Degree
of
Details
Quality
and
descriptive
personal
And
professional
information
Little
personal
information
Descriptive
opinions
Attribute
Evolution
Time
Information
evolved
on
one
but
not
on
other
{jainpari,
Bangalore}
Registration
with
same
information
on
both
OSNs
{paridhij,
New
Delhi}
17. Heuristic
Identity
Search
17
cerc.iiitd.ac.in
Profile
Content
Self-mention
Network
Syntactic
and Image
Search Linking
If self-identified /
returned by
more than one
search method
No
Yes
Candidate
Identities
name,
location,
username
mobile no,
post,
friends,
followers
Paridhi
Jain,
Ponnurangam Kumaraguru,
and
Anupam Joshi.
2013.
@I
seek
‘fb.me’:
Identifying
Users
across
Multiple
Online
Social
Networks.
In
Proceedings
of
the
22nd
International
Conference
on
World
Wide
Web,
WWW
’13
Companion.
ACM,
New
York,
NY,
USA,
1259-‐ 1268.
DOI=http://dx.doi.org/10.1145/2487788.2488160
[Honorable
Mention
Award}
20. 20
How
many
of
you
have
posted
mobile
numbers
on
Online
Social
Networks?
How
many
of
you
have
seen
mobile
numbers
being
posted
on
Online
Social
Networks?
29. Takeaways
– Online
Social
Media
is
a
different
beast
in
terms
of
privacy,
identity,
and
credibility
-Research
/
technologies
should
be
developed
– Multiple
interesting
research,
engineering,
and
innovation
waiting
to
be
done
– Interested
in
hosting
students
– B.Tech.,
M.Tech.,
Ph.D.
29