The document discusses detecting and modeling user temporal intention in social media. It presents a dissertation plan to analyze how shared resources change over time as links and content are updated. The plan includes collecting data from archives and social media, analyzing factors like shortened URLs and intention, developing models to predict intention, and creating applications to maintain temporal consistency. The goal is to better understand how users' intentions change as shared content changes and to build tools that preserve the intended version.
This document outlines the plan for a dissertation that aims to estimate users' temporal intention when sharing and accessing resources on social media. The plan involves analyzing archived web coverage, shortened URI persistence, the loss of shared social media resources over time, and contextual factors that influence intention. The goal is to build a model that can predict intention and induce preservation of vulnerable content to maintain temporal consistency. Key steps include collecting intention-based datasets, extracting intention features, training and evaluating predictive models, and developing applications. Publications will analyze coverage of archives, shortened URIs, and the persistence of shared social media resources.
This document provides an overview of resources for students to use when conducting research for an annotated bibliography. It discusses using primary and secondary sources, databases for finding scholarly articles, the difference between general and subject specific databases, interlibrary loans, and RefWorks citation management software. The librarian emphasizes exploring different search techniques and using library resources to develop an effective research strategy.
The document provides an overview of resources and strategies for research for an annotated bibliography, including an introduction to using primary and secondary sources, field research methods, databases, and evaluating internet sources. It also outlines the services and collections available at the Heterick Memorial Library, including knowledgeable librarians, RefWorks citation management software, and resources for finding books and articles.
Who Will Archive the Archives? Thoughts About the Future of Web ArchivingMichael Nelson
The document discusses the challenges of archiving the web at scale. It notes that while much of the web has been archived, temporal drift and gaps remain issues. Memento provides a framework for accessing content across multiple archives. To fully archive the web, more copies stored in diverse archives are needed due to the risk of any single archive becoming unavailable.
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 eswcsummerschool
This document discusses combining the social web and semantic web through crowdsourcing. It defines key concepts like the social web, crowdsourcing, and semantic technologies. It then provides examples of how semantic tasks can be crowdsourced, such as annotating research papers, mapping topics to ontologies, and curating linked data. Challenges with crowdsourcing semantic tasks are also explored, such as how to optimally structure tasks and validate crowd responses.
Discovering and Navigating Memes in Social MediaMatthew Lease
Invited talk at SBP 2012: Intl. Conf. on Social Computing, Behavioral-Cultural Modeling, & Prediction (April 3, 2012). Based on paper by Ryu, Lease, and Woodward, to appear at ACM HyperText 2012. Joint work with Hohyon Ryu and Nicholas Woodward.
The document proposes a community platform for service innovation using a multi-space framework. It describes a scenario where individuals like Joe, Sarah and Ben discover new connections and ideas by annotating and tagging web documents in different spaces. Their annotations and tags are connected through stigmergy, allowing Ben to make a new discovery. The framework includes spaces for fast individual thinking, slower collaborative thinking, and public memory. Tools like mind maps, concept maps and topic maps are recommended for structuring knowledge in the different spaces.
Building and Managing Social Media CollectionsJason Casden
Presenters:
Laura Wrubel
Jason Casden
Presented at DLF Forum 2015 on October 27th, 2015.
As venues for discourse and creation, social media platforms such as Twitter and Instagram are important source material for scholarly research. Future access to social media data will allow researchers to develop historical assessments based on materials representing the voices of a large and diverse set of participants. Much of this critical and ephemeral content may be lost if cultural heritage institutions are not collecting and preserving it, yet creating and managing these collections presents challenges around collecting mechanisms, curation, legal and ethical issues, and preservation.
This workshop will include the following components:
• A review of technical tools for collecting and guidelines for selecting an approach that works best for your institution and users
• A guided discussion of ethical and legal considerations in taking on this work and parallels with established archival practices
• A review of some existing use cases of libraries' social media data collecting followed by a group discussion of possible community-specific use cases and needs for supporting services.
• A demonstration of possible archival collecting workflows using NCSU Libraries' Social Media Combine collecting system (which includes NCSU Libraries' lentil system for Instagram harvesting and George Washington University's Social Feed Manager for Twitter harvesting). Participants who wish to follow along with their own instance may install it ahead of time.
Participants will leave with an awareness of the major components of a new social media collecting program, including available tools, research use cases, ethical and legal considerations, supporting resources, as well as a better understanding of how to integrate social media into existing practices and workflows. There will be opportunities to share collecting ideas with each other at the end of the workshop.
This document outlines the plan for a dissertation that aims to estimate users' temporal intention when sharing and accessing resources on social media. The plan involves analyzing archived web coverage, shortened URI persistence, the loss of shared social media resources over time, and contextual factors that influence intention. The goal is to build a model that can predict intention and induce preservation of vulnerable content to maintain temporal consistency. Key steps include collecting intention-based datasets, extracting intention features, training and evaluating predictive models, and developing applications. Publications will analyze coverage of archives, shortened URIs, and the persistence of shared social media resources.
This document provides an overview of resources for students to use when conducting research for an annotated bibliography. It discusses using primary and secondary sources, databases for finding scholarly articles, the difference between general and subject specific databases, interlibrary loans, and RefWorks citation management software. The librarian emphasizes exploring different search techniques and using library resources to develop an effective research strategy.
The document provides an overview of resources and strategies for research for an annotated bibliography, including an introduction to using primary and secondary sources, field research methods, databases, and evaluating internet sources. It also outlines the services and collections available at the Heterick Memorial Library, including knowledgeable librarians, RefWorks citation management software, and resources for finding books and articles.
Who Will Archive the Archives? Thoughts About the Future of Web ArchivingMichael Nelson
The document discusses the challenges of archiving the web at scale. It notes that while much of the web has been archived, temporal drift and gaps remain issues. Memento provides a framework for accessing content across multiple archives. To fully archive the web, more copies stored in diverse archives are needed due to the risk of any single archive becoming unavailable.
Tutorial: Social Semantic Web and Crowdsourcing - E. Simperl - ESWC SS 2014 eswcsummerschool
This document discusses combining the social web and semantic web through crowdsourcing. It defines key concepts like the social web, crowdsourcing, and semantic technologies. It then provides examples of how semantic tasks can be crowdsourced, such as annotating research papers, mapping topics to ontologies, and curating linked data. Challenges with crowdsourcing semantic tasks are also explored, such as how to optimally structure tasks and validate crowd responses.
Discovering and Navigating Memes in Social MediaMatthew Lease
Invited talk at SBP 2012: Intl. Conf. on Social Computing, Behavioral-Cultural Modeling, & Prediction (April 3, 2012). Based on paper by Ryu, Lease, and Woodward, to appear at ACM HyperText 2012. Joint work with Hohyon Ryu and Nicholas Woodward.
The document proposes a community platform for service innovation using a multi-space framework. It describes a scenario where individuals like Joe, Sarah and Ben discover new connections and ideas by annotating and tagging web documents in different spaces. Their annotations and tags are connected through stigmergy, allowing Ben to make a new discovery. The framework includes spaces for fast individual thinking, slower collaborative thinking, and public memory. Tools like mind maps, concept maps and topic maps are recommended for structuring knowledge in the different spaces.
Building and Managing Social Media CollectionsJason Casden
Presenters:
Laura Wrubel
Jason Casden
Presented at DLF Forum 2015 on October 27th, 2015.
As venues for discourse and creation, social media platforms such as Twitter and Instagram are important source material for scholarly research. Future access to social media data will allow researchers to develop historical assessments based on materials representing the voices of a large and diverse set of participants. Much of this critical and ephemeral content may be lost if cultural heritage institutions are not collecting and preserving it, yet creating and managing these collections presents challenges around collecting mechanisms, curation, legal and ethical issues, and preservation.
This workshop will include the following components:
• A review of technical tools for collecting and guidelines for selecting an approach that works best for your institution and users
• A guided discussion of ethical and legal considerations in taking on this work and parallels with established archival practices
• A review of some existing use cases of libraries' social media data collecting followed by a group discussion of possible community-specific use cases and needs for supporting services.
• A demonstration of possible archival collecting workflows using NCSU Libraries' Social Media Combine collecting system (which includes NCSU Libraries' lentil system for Instagram harvesting and George Washington University's Social Feed Manager for Twitter harvesting). Participants who wish to follow along with their own instance may install it ahead of time.
Participants will leave with an awareness of the major components of a new social media collecting program, including available tools, research use cases, ethical and legal considerations, supporting resources, as well as a better understanding of how to integrate social media into existing practices and workflows. There will be opportunities to share collecting ideas with each other at the end of the workshop.
The document discusses using paper prototyping to design mobile applications for college libraries. It describes how the organization supports over 80 college libraries and aims to redesign their mobile interfaces to better align with how students conduct research. They conducted focus groups and user testing with students to inform the redesign. Paper prototyping benefits include fast interface design, low cost, and early user feedback. Next steps include further user testing and refining interfaces before development and approval.
#mytweet via Instagram: Exploring User Behaviour Across Multiple Social NetworksBang Hui Lim
We study how users of multiple online social net- works (OSNs) employ and share information by studying a common user pool that use six OSNs – Flickr, Google+, Instagram, Tumblr, Twitter, and YouTube. We analyze the temporal and topical signature of users’ sharing behaviour, showing how they exhibit distinct behaviorial patterns on different networks. We also examine cross-sharing (i.e., the act of user broadcasting their activity to multiple OSNs near-simultaneously), a previously unstudied behaviour and demonstrate how certain OSNs play the roles of originating source and destination sinks.
This document discusses learning analytics and ways of visualizing educational data. It outlines understandings of learning analytics and describes the data landscape in education including institutional data from systems like LMSs, individual social media data, and personal learning environments. The document discusses tools for reproducible research, looking for trends in large datasets, and visualizing data beyond dashboards. It provides examples of learning analytics tools and visualizations used at institutions like Purdue and explores analyzing social media and networks. The document concludes by discussing future scenarios for learning analytics research.
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
Big Data challenges in developing repositories include:
- Collections like web archives and historic newspapers contain billions of files and grow quickly, requiring constant processing and large-scale infrastructure.
- Researchers want to analyze entire collections using algorithms and computational methods rather than accessing individual items.
- Repository services need to support self-serve access, full-text search of entire collections, and APIs to enable computational research methods.
- Ingesting and providing access to collections measured in petabytes and containing highly diverse content and metadata requires normalization and standardization.
Lecture 5: Mining, Analysis and VisualisationMarieke van Erp
This is the fourth lecture in the Social Web course at the VU University Amsterdam
Visit the website for more information: <a>Social Web 2012</a>
Introduction to Information Architecture & Design - SVA Workshop 06/21/14Robert Stribley
The document provides an introduction to an information architecture and design workshop. It includes an agenda for the workshop that covers background on information architecture, the design process, a project overview, user research including surveys and competitive reviews, developing personas, and design deliverables like site maps, wireframes and prototypes. Key aspects of information architecture are defined, including the combination of organization, labeling and navigation to facilitate accessing content. The history of the field and example methodologies for user research, competitive reviews and developing personas are also outlined.
Introduction to Information Architecture & Design - SVA Workshop 03/22/14Robert Stribley
Events.com wants to revamp their website to become the go-to online resource for attending and promoting events across the US. The information architect conducted user research including surveys and interviews, reviewed competitors, and created personas to understand user needs. Key activities in the define phase included card sorting to organize content, creating site maps and wireframes, and designing the navigation and page types.
Charleston 2013: The Social Side of ResearchWilliam Gunn
The document discusses the changing landscape of scholarly communication and research workflows. It describes how early technologies like chat forums and Usenet allowed reaching beyond local environments, and how music and blogging industries were disrupted by new models. Libraries have been empowered by open access while publishers hesitated to embrace online formats. Mendeley brought user-friendly tools to scholarly sharing and discovery by creating an open platform. It has grown significantly, instrumenting the research workflow and enabling new discovery methods and analytics that support various stakeholders.
This illustrated lesson provides students with many illustrations. hyperlinked articles, and essential questions that can be used to create their own PowerPoint project about the issue of privacy or public safety.
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...SEAD
This document discusses research data management and the role of university libraries. It describes the SEAD (Sustainable Environment Actionable Data) project, which provides data services like curation, preservation, and a social community network to support research data across its lifecycle. SEAD aims to support interdisciplinary research by allowing researchers to define and manage related collections of data and metadata called Research Objects in a scalable way. The document argues that research organizations are best positioned to provide comprehensive long-term data services that integrate across the entire research process.
How can we mine, analyse and visualise the Social Web?
In this lecture, you will learn about mining social web data for analysis. Data preparation and gathering basic statistics on your data.
Towards Cognitive Agents for BigData DiscoveryJack Park
1) The document discusses developing cognitive agents to improve deep question answering and discovery by coupling two platforms: the Berkeley Data Analytics Stack (BDAS) for big data analysis and SolrSherlock for literature-based discovery.
2) It describes how these agents could harvest and represent patterns, contexts, and relations from literature to discover new processes and connections between concepts.
3) The goal is to augment existing methods by allowing hypothesis formation and evidence gathering across both structured data and unstructured literature at scale.
Introducing PRIME:Publisher, Repository and Institutional Metadata ExchangeBrian Hole
"Introducing PRIME:Publisher, Repository and Institutional Metadata Exchange" – Brian Hole, Ubiquity Press.
OpenAIRE Interoperability Workshop - University of Minho, Braga, Portugal, 8 February 2013
This illustrated lesson provides students with many illustrations, hyperlinked articles, and essential questions that can be used to create their own PowerPoint project about the challenges of technology.
This document discusses challenges with the current scientific publishing system and proposes a vision for next generation scientific publishing (NGSP). Some key problems include retractions due to misconduct, lack of reproducibility, and non-reusable data and methods. NGSP would feature transparent and computable data and methods, open annotation of narratives and objects, and no restrictions on text mining or remixing. It would move information more quickly and allow verification through an open, service-oriented system without walled gardens. Taking NGSP forward will require collaboration across stakeholders in research communications.
Linked Open Data in Libraries, Archives & MuseumsJon Voss
This document provides an overview of Linked Open Data for libraries, archives, and museums. It discusses the growing movement of LODLAM and how it allows these cultural institutions to represent their data as graphs using triples that describe entities in a machine-readable format. Key concepts covered include the use of URIs, RDF, vocabularies, and different legal tools for publishing open data.
This document provides an introduction and overview for a course on machine learning. It outlines the course structure, assignments, and expectations. The course will cover topics including linear regression, classification, model selection, and dimensionality reduction. It will teach students how to analyze data, preprocess it, extract features, train models, and evaluate model performance. The goal is for students to understand core machine learning algorithms and concepts. Required materials include an introduction to statistical learning textbook.
The dissertation defense presentation summarized Hany SalahEldeen's dissertation research on detecting, modeling, and predicting user temporal intention in social media. The research aimed to estimate the temporal intention of authors when sharing content and readers when accessing content. It also sought to model intention over time, predict how shared resources change over time, and implement models to preserve at-risk social media content and provide smooth temporal navigation of the social web. Key aspects of the research included analyzing loss and persistence of shared URLs over time, measuring existence and disappearance as a function of time, and using social context to find replacements for missing resources.
The document discusses using paper prototyping to design mobile applications for college libraries. It describes how the organization supports over 80 college libraries and aims to redesign their mobile interfaces to better align with how students conduct research. They conducted focus groups and user testing with students to inform the redesign. Paper prototyping benefits include fast interface design, low cost, and early user feedback. Next steps include further user testing and refining interfaces before development and approval.
#mytweet via Instagram: Exploring User Behaviour Across Multiple Social NetworksBang Hui Lim
We study how users of multiple online social net- works (OSNs) employ and share information by studying a common user pool that use six OSNs – Flickr, Google+, Instagram, Tumblr, Twitter, and YouTube. We analyze the temporal and topical signature of users’ sharing behaviour, showing how they exhibit distinct behaviorial patterns on different networks. We also examine cross-sharing (i.e., the act of user broadcasting their activity to multiple OSNs near-simultaneously), a previously unstudied behaviour and demonstrate how certain OSNs play the roles of originating source and destination sinks.
This document discusses learning analytics and ways of visualizing educational data. It outlines understandings of learning analytics and describes the data landscape in education including institutional data from systems like LMSs, individual social media data, and personal learning environments. The document discusses tools for reproducible research, looking for trends in large datasets, and visualizing data beyond dashboards. It provides examples of learning analytics tools and visualizations used at institutions like Purdue and explores analyzing social media and networks. The document concludes by discussing future scenarios for learning analytics research.
Leslie Johnston: Library Big Data Repository Services, Open Repositories 2012lljohnston
Big Data challenges in developing repositories include:
- Collections like web archives and historic newspapers contain billions of files and grow quickly, requiring constant processing and large-scale infrastructure.
- Researchers want to analyze entire collections using algorithms and computational methods rather than accessing individual items.
- Repository services need to support self-serve access, full-text search of entire collections, and APIs to enable computational research methods.
- Ingesting and providing access to collections measured in petabytes and containing highly diverse content and metadata requires normalization and standardization.
Lecture 5: Mining, Analysis and VisualisationMarieke van Erp
This is the fourth lecture in the Social Web course at the VU University Amsterdam
Visit the website for more information: <a>Social Web 2012</a>
Introduction to Information Architecture & Design - SVA Workshop 06/21/14Robert Stribley
The document provides an introduction to an information architecture and design workshop. It includes an agenda for the workshop that covers background on information architecture, the design process, a project overview, user research including surveys and competitive reviews, developing personas, and design deliverables like site maps, wireframes and prototypes. Key aspects of information architecture are defined, including the combination of organization, labeling and navigation to facilitate accessing content. The history of the field and example methodologies for user research, competitive reviews and developing personas are also outlined.
Introduction to Information Architecture & Design - SVA Workshop 03/22/14Robert Stribley
Events.com wants to revamp their website to become the go-to online resource for attending and promoting events across the US. The information architect conducted user research including surveys and interviews, reviewed competitors, and created personas to understand user needs. Key activities in the define phase included card sorting to organize content, creating site maps and wireframes, and designing the navigation and page types.
Charleston 2013: The Social Side of ResearchWilliam Gunn
The document discusses the changing landscape of scholarly communication and research workflows. It describes how early technologies like chat forums and Usenet allowed reaching beyond local environments, and how music and blogging industries were disrupted by new models. Libraries have been empowered by open access while publishers hesitated to embrace online formats. Mendeley brought user-friendly tools to scholarly sharing and discovery by creating an open platform. It has grown significantly, instrumenting the research workflow and enabling new discovery methods and analytics that support various stakeholders.
This illustrated lesson provides students with many illustrations. hyperlinked articles, and essential questions that can be used to create their own PowerPoint project about the issue of privacy or public safety.
Data Sets, Ensemble Cloud Computing, and the University Library:Getting the ...SEAD
This document discusses research data management and the role of university libraries. It describes the SEAD (Sustainable Environment Actionable Data) project, which provides data services like curation, preservation, and a social community network to support research data across its lifecycle. SEAD aims to support interdisciplinary research by allowing researchers to define and manage related collections of data and metadata called Research Objects in a scalable way. The document argues that research organizations are best positioned to provide comprehensive long-term data services that integrate across the entire research process.
How can we mine, analyse and visualise the Social Web?
In this lecture, you will learn about mining social web data for analysis. Data preparation and gathering basic statistics on your data.
Towards Cognitive Agents for BigData DiscoveryJack Park
1) The document discusses developing cognitive agents to improve deep question answering and discovery by coupling two platforms: the Berkeley Data Analytics Stack (BDAS) for big data analysis and SolrSherlock for literature-based discovery.
2) It describes how these agents could harvest and represent patterns, contexts, and relations from literature to discover new processes and connections between concepts.
3) The goal is to augment existing methods by allowing hypothesis formation and evidence gathering across both structured data and unstructured literature at scale.
Introducing PRIME:Publisher, Repository and Institutional Metadata ExchangeBrian Hole
"Introducing PRIME:Publisher, Repository and Institutional Metadata Exchange" – Brian Hole, Ubiquity Press.
OpenAIRE Interoperability Workshop - University of Minho, Braga, Portugal, 8 February 2013
This illustrated lesson provides students with many illustrations, hyperlinked articles, and essential questions that can be used to create their own PowerPoint project about the challenges of technology.
This document discusses challenges with the current scientific publishing system and proposes a vision for next generation scientific publishing (NGSP). Some key problems include retractions due to misconduct, lack of reproducibility, and non-reusable data and methods. NGSP would feature transparent and computable data and methods, open annotation of narratives and objects, and no restrictions on text mining or remixing. It would move information more quickly and allow verification through an open, service-oriented system without walled gardens. Taking NGSP forward will require collaboration across stakeholders in research communications.
Linked Open Data in Libraries, Archives & MuseumsJon Voss
This document provides an overview of Linked Open Data for libraries, archives, and museums. It discusses the growing movement of LODLAM and how it allows these cultural institutions to represent their data as graphs using triples that describe entities in a machine-readable format. Key concepts covered include the use of URIs, RDF, vocabularies, and different legal tools for publishing open data.
This document provides an introduction and overview for a course on machine learning. It outlines the course structure, assignments, and expectations. The course will cover topics including linear regression, classification, model selection, and dimensionality reduction. It will teach students how to analyze data, preprocess it, extract features, train models, and evaluate model performance. The goal is for students to understand core machine learning algorithms and concepts. Required materials include an introduction to statistical learning textbook.
The dissertation defense presentation summarized Hany SalahEldeen's dissertation research on detecting, modeling, and predicting user temporal intention in social media. The research aimed to estimate the temporal intention of authors when sharing content and readers when accessing content. It also sought to model intention over time, predict how shared resources change over time, and implement models to preserve at-risk social media content and provide smooth temporal navigation of the social web. Key aspects of the research included analyzing loss and persistence of shared URLs over time, measuring existence and disappearance as a function of time, and using social context to find replacements for missing resources.
This document provides an overview of Hany SalahEldeen's research goals and background. It discusses detecting, modeling, and predicting user temporal intention in social media posts. Specifically, it aims to:
1. Estimate the author's intention at time of posting to maintain consistency for readers.
2. Address issues like linked resources disappearing or changing over time, which could compromise the historical integrity of social media posts.
3. Develop models of author temporal intention and tools to match the predicted intention, such as retrieving the closest archived version of a changed resource.
The document provides examples of intention issues and outlines the angles of investigation, which include analyzing archived content, resource age and states, and detecting/modeling
Reading the Correct History? Modeling Temporal Intention in Resource Sharingheinestien
The document presents a model called the Temporal Intention Relevancy Model (TIRM) to detect inconsistencies between the intended and actual states of web resources shared on Twitter. It describes collecting data on tweet-link pairs using Amazon Mechanical Turk to train the model, extracting features related to the links, tweets, and archives, and using a random forest classifier that achieved 90.32% accuracy. Key findings were that over 25% of resources shared had changed by the time readers clicked the link, and features like celebrity mentions, number of archives, and text similarity were most predictive of intention.
Carbon Dating The Web: Estimating the Age of Web Resourcesheinestien
This document discusses methods for estimating the age of web resources by analyzing trails left over time related to their existence. It proposes that a resource's creation date can be estimated through examining: 1) the last modified date header, 2) the first appearance of backlinks from other pages or social media mentions, and 3) the earliest memento of the resource in web archives. The methods aim to provide a generic approach without relying on page templates or infrastructure by tracing the timeline of associated events.
This document summarizes Hany SalahEldeen's doctoral research proposal on detecting, modeling, and predicting user temporal intention in social media. The proposal aims to analyze how shared resources in social media change over time, estimate users' intentions in sharing content, and develop models to predict intentions and maintain temporal consistency. The research goals are to analyze archived coverage of social media, understand shortened URL resolving, and estimate loss of shared resources. The proposed work includes gathering data, extracting features of intention, modeling intention, evaluation, and applications for prediction and preservation.
Losing My Revolution Long Paper TPDL2012heinestien
This document summarizes research on analyzing social media resources shared during six socially significant events. Data was gathered from Twitter, websites and books related to the Egyptian revolution and other events. Analysis was conducted to determine the uniqueness of URLs shared, how many were still active on the live web, and how many were archived in public web archives. The results found that while most resources were unique, 10-35% were missing from the live web and around 30-40% were archived. This research aims to understand how long social media resources are maintained and if backups exist.
LAND USE LAND COVER AND NDVI OF MIRZAPUR DISTRICT, UPRAHUL
This Dissertation explores the particular circumstances of Mirzapur, a region located in the
core of India. Mirzapur, with its varied terrains and abundant biodiversity, offers an optimal
environment for investigating the changes in vegetation cover dynamics. Our study utilizes
advanced technologies such as GIS (Geographic Information Systems) and Remote sensing to
analyze the transformations that have taken place over the course of a decade.
The complex relationship between human activities and the environment has been the focus
of extensive research and worry. As the global community grapples with swift urbanization,
population expansion, and economic progress, the effects on natural ecosystems are becoming
more evident. A crucial element of this impact is the alteration of vegetation cover, which plays a
significant role in maintaining the ecological equilibrium of our planet.Land serves as the foundation for all human activities and provides the necessary materials for
these activities. As the most crucial natural resource, its utilization by humans results in different
'Land uses,' which are determined by both human activities and the physical characteristics of the
land.
The utilization of land is impacted by human needs and environmental factors. In countries
like India, rapid population growth and the emphasis on extensive resource exploitation can lead
to significant land degradation, adversely affecting the region's land cover.
Therefore, human intervention has significantly influenced land use patterns over many
centuries, evolving its structure over time and space. In the present era, these changes have
accelerated due to factors such as agriculture and urbanization. Information regarding land use and
cover is essential for various planning and management tasks related to the Earth's surface,
providing crucial environmental data for scientific, resource management, policy purposes, and
diverse human activities.
Accurate understanding of land use and cover is imperative for the development planning
of any area. Consequently, a wide range of professionals, including earth system scientists, land
and water managers, and urban planners, are interested in obtaining data on land use and cover
changes, conversion trends, and other related patterns. The spatial dimensions of land use and
cover support policymakers and scientists in making well-informed decisions, as alterations in
these patterns indicate shifts in economic and social conditions. Monitoring such changes with the
help of Advanced technologies like Remote Sensing and Geographic Information Systems is
crucial for coordinated efforts across different administrative levels. Advanced technologies like
Remote Sensing and Geographic Information Systems
9
Changes in vegetation cover refer to variations in the distribution, composition, and overall
structure of plant communities across different temporal and spatial scales. These changes can
occur natural.
How to Setup Warehouse & Location in Odoo 17 InventoryCeline George
In this slide, we'll explore how to set up warehouses and locations in Odoo 17 Inventory. This will help us manage our stock effectively, track inventory levels, and streamline warehouse operations.
हिंदी वर्णमाला पीपीटी, hindi alphabet PPT presentation, hindi varnamala PPT, Hindi Varnamala pdf, हिंदी स्वर, हिंदी व्यंजन, sikhiye hindi varnmala, dr. mulla adam ali, hindi language and literature, hindi alphabet with drawing, hindi alphabet pdf, hindi varnamala for childrens, hindi language, hindi varnamala practice for kids, https://www.drmullaadamali.com
বাংলাদেশের অর্থনৈতিক সমীক্ষা ২০২৪ [Bangladesh Economic Review 2024 Bangla.pdf] কম্পিউটার , ট্যাব ও স্মার্ট ফোন ভার্সন সহ সম্পূর্ণ বাংলা ই-বুক বা pdf বই " সুচিপত্র ...বুকমার্ক মেনু 🔖 ও হাইপার লিংক মেনু 📝👆 যুক্ত ..
আমাদের সবার জন্য খুব খুব গুরুত্বপূর্ণ একটি বই ..বিসিএস, ব্যাংক, ইউনিভার্সিটি ভর্তি ও যে কোন প্রতিযোগিতা মূলক পরীক্ষার জন্য এর খুব ইম্পরট্যান্ট একটি বিষয় ...তাছাড়া বাংলাদেশের সাম্প্রতিক যে কোন ডাটা বা তথ্য এই বইতে পাবেন ...
তাই একজন নাগরিক হিসাবে এই তথ্য গুলো আপনার জানা প্রয়োজন ...।
বিসিএস ও ব্যাংক এর লিখিত পরীক্ষা ...+এছাড়া মাধ্যমিক ও উচ্চমাধ্যমিকের স্টুডেন্টদের জন্য অনেক কাজে আসবে ...
Philippine Edukasyong Pantahanan at Pangkabuhayan (EPP) CurriculumMJDuyan
(𝐓𝐋𝐄 𝟏𝟎𝟎) (𝐋𝐞𝐬𝐬𝐨𝐧 𝟏)-𝐏𝐫𝐞𝐥𝐢𝐦𝐬
𝐃𝐢𝐬𝐜𝐮𝐬𝐬 𝐭𝐡𝐞 𝐄𝐏𝐏 𝐂𝐮𝐫𝐫𝐢𝐜𝐮𝐥𝐮𝐦 𝐢𝐧 𝐭𝐡𝐞 𝐏𝐡𝐢𝐥𝐢𝐩𝐩𝐢𝐧𝐞𝐬:
- Understand the goals and objectives of the Edukasyong Pantahanan at Pangkabuhayan (EPP) curriculum, recognizing its importance in fostering practical life skills and values among students. Students will also be able to identify the key components and subjects covered, such as agriculture, home economics, industrial arts, and information and communication technology.
𝐄𝐱𝐩𝐥𝐚𝐢𝐧 𝐭𝐡𝐞 𝐍𝐚𝐭𝐮𝐫𝐞 𝐚𝐧𝐝 𝐒𝐜𝐨𝐩𝐞 𝐨𝐟 𝐚𝐧 𝐄𝐧𝐭𝐫𝐞𝐩𝐫𝐞𝐧𝐞𝐮𝐫:
-Define entrepreneurship, distinguishing it from general business activities by emphasizing its focus on innovation, risk-taking, and value creation. Students will describe the characteristics and traits of successful entrepreneurs, including their roles and responsibilities, and discuss the broader economic and social impacts of entrepreneurial activities on both local and global scales.
Chapter wise All Notes of First year Basic Civil Engineering.pptxDenish Jangid
Chapter wise All Notes of First year Basic Civil Engineering
Syllabus
Chapter-1
Introduction to objective, scope and outcome the subject
Chapter 2
Introduction: Scope and Specialization of Civil Engineering, Role of civil Engineer in Society, Impact of infrastructural development on economy of country.
Chapter 3
Surveying: Object Principles & Types of Surveying; Site Plans, Plans & Maps; Scales & Unit of different Measurements.
Linear Measurements: Instruments used. Linear Measurement by Tape, Ranging out Survey Lines and overcoming Obstructions; Measurements on sloping ground; Tape corrections, conventional symbols. Angular Measurements: Instruments used; Introduction to Compass Surveying, Bearings and Longitude & Latitude of a Line, Introduction to total station.
Levelling: Instrument used Object of levelling, Methods of levelling in brief, and Contour maps.
Chapter 4
Buildings: Selection of site for Buildings, Layout of Building Plan, Types of buildings, Plinth area, carpet area, floor space index, Introduction to building byelaws, concept of sun light & ventilation. Components of Buildings & their functions, Basic concept of R.C.C., Introduction to types of foundation
Chapter 5
Transportation: Introduction to Transportation Engineering; Traffic and Road Safety: Types and Characteristics of Various Modes of Transportation; Various Road Traffic Signs, Causes of Accidents and Road Safety Measures.
Chapter 6
Environmental Engineering: Environmental Pollution, Environmental Acts and Regulations, Functional Concepts of Ecology, Basics of Species, Biodiversity, Ecosystem, Hydrological Cycle; Chemical Cycles: Carbon, Nitrogen & Phosphorus; Energy Flow in Ecosystems.
Water Pollution: Water Quality standards, Introduction to Treatment & Disposal of Waste Water. Reuse and Saving of Water, Rain Water Harvesting. Solid Waste Management: Classification of Solid Waste, Collection, Transportation and Disposal of Solid. Recycling of Solid Waste: Energy Recovery, Sanitary Landfill, On-Site Sanitation. Air & Noise Pollution: Primary and Secondary air pollutants, Harmful effects of Air Pollution, Control of Air Pollution. . Noise Pollution Harmful Effects of noise pollution, control of noise pollution, Global warming & Climate Change, Ozone depletion, Greenhouse effect
Text Books:
1. Palancharmy, Basic Civil Engineering, McGraw Hill publishers.
2. Satheesh Gopi, Basic Civil Engineering, Pearson Publishers.
3. Ketki Rangwala Dalal, Essentials of Civil Engineering, Charotar Publishing House.
4. BCP, Surveying volume 1
Beyond Degrees - Empowering the Workforce in the Context of Skills-First.pptxEduSkills OECD
Iván Bornacelly, Policy Analyst at the OECD Centre for Skills, OECD, presents at the webinar 'Tackling job market gaps with a skills-first approach' on 12 June 2024
1. Detecting, Modeling, & Predicting
User Temporal Intention
in Social Media
Hany M. SalahEldeen
Old Dominion University
Advisor: Dr. Michael L. Nelson
JCDL ‘12 Doctoral Consortium
2. Michael Jackson Dies
Snapshot on: June 25th 2009
http://web.archive.org/web/20090625232522/http://www.cnn.com/
3. Jeff tweets about it…
Published on: June 25th 2009
https://twitter.com/mdnitehk/status/2333993907
4. Jenny is off the grid
Jeff’s friend Jenny was on a vacation in Hawaii
for a month…
5. Jenny starts catching up a month later
Read on: July26th 2009
When she came back she checked Jeff’s tweets and was
shocked!
https://twitter.com/mdnitehk/status/2333993907
6. Jenny follows the link on July 26th
CNN page on: July 26th 2009
http://web.archive.org/web/20090726234411/http://www.cnn.com/
7. Jenny is confused!
• Implication:
– Jenny thought Jeff is making a joke about her
favorite singer and she got mad at him
• Problem:
– The tweet and the resource the tweet links to
have become unsynchronized.
9. Reading about it on Storify in
March 2012….
http://storify.com/maq4sure/egypts-revolution
10. I noticed some shared images are missing
http://storify.com/maq4sure/egypts-revolution
11. Some tweets are still intact…
https://twitter.com/miss_amy_qb/status/32477898581483521
12. …and some lost their meaning with the
disappearance of the images
https://twitter.com/aishes/status/32485352102952960
Missing ?
https://twitter.com/omar_chaaban/status/32203697597452289
13. The tweet remains but the shared
image disappeared…
http://yfrog.com/h5923xrvbqqvgzj
14. Cairo….we have a problem
• Implication:
– The reader cannot understand what the author of
the tweet meant because the image is not
available.
• Problem:
– The post is available but the linked resource
(image) is completely missing.
16. The Anatomy of a Tweet
Author’s username
Other user mention
Social
Post Tweet Body
Interaction Publishing Shortened URL Hash Tag
options timestamp to resource
Shared Resource
20. User’s Temporal Intention
The Focus of our research Instrumented shortener
Share time Implicit Explicit
Click time Implicit Explicit
Instrumented web client
Out of our scope
Purview of Facebook, Engineering problem
Twitter, Google, …etc
Solved by providing
tools
21. Sometimes you want a
previous version
The Correct Temporal
Intention
CNN.com at the closest time to the tweet: 25th June 2009 ~ 7pm
22. Sometimes you want the
current version
The Correct Temporal
Intention
In this case the current state of the press releases page
23. Research Question
Can we estimate the users’
intention at the time of posting
and reading to predict and
maintain temporal consistency?
24. Research Goals
• Detect the temporal intention of the:
1. Author upon sharing time
2. The reader upon dereferencing time
• Model this intention as a function of time, nature of the resource,
and its context.
• Predict how resources change with time and the intention behind
sharing them to minimize inconsistency.
• Implement the prediction model to automatically preserve
vulnerable social content that is prone to change or loss.
• Create an environment implementing this framework that
provides a smooth temporal navigation of the social web.
25. Related Work
• User’s Web Search Intention • Persistence of shared resources
– A. Ashkan ECIR ’09 – M. Nelson D-Lib ‘02
– C. Lee AINA ‘05 – R. Sanderson OR’11
– A. Loser IRSW ‘08 – F. McCown JCDL ‘07
– L. Azzopardi ECIR ‘09
– R. Baeza-Yates SPIR‘06
– N. Dai HT ’11
• URL Shortening
– D. Antoniades WWW ’11
• Commercial Intention
– Q. Guo SIGIR ’10 • Tweeting, Micro-blogging and Popularity
– A. Benczur AIRWeb ’07
– S. Wu WWW ’11
– A. Java SNA-KDD ’07
• Sentiment Analysis
– H. Kwak WWW ’10
– G. Mishne AAAI ‘06
– J. Bollen JCS ‘11
• Social Networks Growth and Evolution
• Access to Archives
– B. Meeder WWW ’11
– H. Van de Sompel OR‘09
26. Dissertation Plan
BEGIN
Read Literature
Collect Datasets
Analyze Archives Coverage
Analyze Shortened URIs
Prototype Application
Analyze Shared Resources Persistence and Coverage
Current
Analyze Contextual Intention
State
Create Intention-based dataset
Extract Intention Features
Train a Parametric Model to predict intention
Evaluate, test, cross-validate the model
Create a mockup application
Extend the model to induce preservation
Finish Writing the Dissertation
PhD Defense
27. Dissertation Plan
BEGIN
Read Literature
Collect Datasets
Analyze Archives Coverage
Analyze Shortened URIs
Prototype Application
Analyze Shared Resources Persistence and Coverage
Analyze Contextual Intention
Create Intention-based dataset
Extract Intention Features
Train a Parametric Model to predict intention
Evaluate, test, cross-validate the model
Create a mockup application
Extend the model to induce preservation
Finish Writing the Dissertation
PhD Defense
28. Estimating Web Archiving Coverage
• Goal: Estimate how much of the public web is present in the public archives
and how many copies are available?
• Action:
– Getting 4 different datasets from 4 different sources:
• Search Engines Indices
• Bit.ly
• DMOZ
• Delicious.
• Results: *
• Publications:
– How much of the web is archived? JCDL '11
* Table Courtesy of Ahmed AlSum JCDL 2011
29. Dissertation Plan
BEGIN
Read Literature
Collect Datasets
Analyze Archives Coverage
Analyze Shortened URIs
Prototype Application
Analyze Shared Resources Persistence and Coverage
Analyze Contextual Intention
Create Intention-based dataset
Extract Intention Features
Train a Parametric Model to predict intention
Evaluate, test, cross-validate the model
Create a mockup application
Extend the model to induce preservation
Finish Writing the Dissertation
PhD Defense
30. Shortened URI analysis
• Goal: Have a better understanding of URI shortening and resolving,
understand the effect of time on this process and the correlation between
the page’s features and characteristics, and its resolution.
• Action:
– Fresh Bit.lys
– Get hourly clicklogs, rate of change, social networking spread, and other
contextual information
– Longitudinal study
• Evaluation:
– Compare results with frequency of change analysis of Cho and Garcia-
Molina.
– Compare results with Antoniades et al. WWW 2011.
31. Dissertation Plan
BEGIN
Read Literature
Collect Datasets
Analyze Archives Coverage
Analyze Shortened URIs
Prototype Application
Analyze Shared Resources Persistence and Coverage
Analyze Contextual Intention
Create Intention-based dataset
Extract Intention Features
Train a Parametric Model to predict intention
Evaluate, test, cross-validate the model
Create a mockup application
Extend the model to induce preservation
Finish Writing the Dissertation
PhD Defense
32. Estimating Loss of Shared Resources
in Social Media
• Goal: Estimate how much of the public web is present in the public archives
and how many copies are available?
• Action:
– Sampling from 6 public events
– Events spanning 3 years
– Existence in the current web
– Existence in the public archives
– Find relation with time
• Results:
– After 1st year ~11% will be lost
– After that we will continue on losing 0.02% daily
• Publications:
– A year after the Egyptian revolution, 10% of the social media documentation is gone.
http://ws-dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html
– Losing my revolution: How Many Resources Shared on Social Media Have Been Lost?
TPDL '12
33. Dissertation Plan
BEGIN
Read Literature
Collect Datasets
Analyze Archives Coverage
Analyze Shortened URIs
Prototype Application
Analyze Shared Resources Persistence and Coverage
User Intention Analysis
Create Intention-based dataset
Extract Intention Features
Train a Parametric Model to predict intention
Evaluate, test, cross-validate the model
Create a mockup application
Extend the model to induce preservation
Finish Writing the Dissertation
PhD Defense
34. User Intention Analysis
• Goal: Have a better understanding of User Intention and what factors affect
it. Also create a new testing and training set.
• Action:
– Get a sample set of tweets selected at random
– Extract the URIs
– Get closest Memento
– Download the snapshot & current version
– Use Amazon’s Mechanical Turk in choosing the best version
• Evaluation:
– Measure cross-rater agreement and confidence.
35. Proposed Work
• Data Gathering
• Feature Extraction
• Modeling the intention engine
• Evaluation
• Application: Prediction and Preservation
37. Possible Solution for Jenny
The resource has changed since last time it was shared
Do you wish to see the version the author intended or
the current version?
Current Version Intended Version
38. Proposed Framework
Archived Version
Feature
Classifier
Extraction
Example Features: Current Version
- Tweet Content
- Click Logs
- Other Tweets
- Shared Resource
- Timemaps
44. My Publications
• S. G. Ainsworth, A. Alsum, H. SalahEldeen, M. C. Weigle, and M. L. Nelson. How
much of the web is archived? In Proceedings of the 11th annual international
ACM/IEEE joint conference on Digital libraries, JCDL '11, pages 133{136, 2011.
• H. SalahEldeen and M. L. Nelson. Losing my revolution: How much social media
content has been lost? Accepted in TPDL 2012
• H. SalahEldeen and M. L. Nelson. Losing my revolution: A year after the Egyptian
revolution, 10% of the social media documentation is gone. http://ws-
dl.blogspot.com/2012/02/2012-02-11-losing-my-revolution-year.html.
45. References
• D. Antoniades, I. Polakis, G. Kontaxis, E. Athanasopoulos, S. Ioannidis, E. P. Markatos, and T. Karagiannis. we.b: the web of short
urls. In Proceedings of the 20th international conference on World wide web, WWW '11, pages 715 {724, New York, NY, USA,
2011. ACM.
• A. Ashkan, C. L. Clarke, E. Agichtein, and Q. Guo. Classifying and characterizing query intent. In Proceedings of the 31th
European Conference on IR Research on Advances in Information Retrieval, ECIR '09, pages 578{586, Berlin, Heidelberg, 2009.
Springer-Verlag.
• L. Azzopardi and M. de Rijke. Query intention acquisition: A case study on automatically inferring structured queries. In
Proceedings DIR-2006, 2006.
• R. Baeza-Yates, L. Calderon-Benavides, and C. Gonzalez-Caro. The intention behind web queries. In F. Crestani, P. Ferragina, and
M. Sanderson, editors, String Processing and Information Retrieval, volume 4209 of Lecture Notes in Computer Science, pages
98{109. Springer Berlin / Heidelberg, 2006. 10.1007/11880561 9.
• A. Benczur, I. Bro, K. Csalogany, and T. Sarlos. Web spam detection via commercial intent analysis. In Proceedings of the 3rd
international workshop on Adversarial information retrieval on the web, AIRWeb '07, pages 89{92, New York, NY, USA, 2007.
ACM.
• J. Bollen, H. Mao, and X.-J. Zeng. Twitter mood predicts the stock market. CoRR, abs/1010.3003, 2010.
• N. Dai, X. Qi, and B. D. Davison. Bridging link and query intent to enhance web search. In Proceedings of the 22nd ACM
conference on Hypertext and hypermedia, HT '11, pages 17{26, New York, NY, USA, 2011. ACM.
• N. Dai, X. Qi, and B. D. Davison. Enhancing web search with entity intent. In Proceedings of the 20 th international conference
companion on World wide web, WWW '11, pages 29{30, New York, NY, USA, 2011. ACM.
• K. Durant and M. Smith. Predicting the political sentiment of web log posts using supervised machine learning techniques
coupled with feature selection. In O. Nasraoui, M. Spiliopoulou, J. Srivastava, B. Mobasher, and B. Masand, editors, Advances in
Web Mining and Web Usage Analysis, volume 4811 of Lecture Notes in Computer Science, pages 187{206. Springer Berlin /
Heidelberg, 2007. 10.1007/978-3-540-77485-3 11.
46. References
• Q. Guo and E. Agichtein. Ready to buy or just browsing?: detecting web searcher goals from interaction data. In Proceedings of the 33rd
international ACM SIGIR conference on Research and development in information retrieval, SIGIR '10, pages 130{137, New York, NY, USA,
2010. ACM.
• A. Java, X. Song, T. Finin, and B. Tseng. Why we twitter: understanding microblogging usage and communities. In Proceedings of the 9th
WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis, WebKDD/SNA-KDD '07, pages 56{65, New York, NY,
USA, 2007. ACM.
• H. Kwak, C. Lee, H. Park, and S. Moon. What is twitter, a social network or a news media? In Proceedings of the 19th international
conference on World wide web, WWW '10, pages 591{600, New York, NY, USA, 2010. ACM.
• C.-H. L. Lee and A. Liu. Modeling the query intention with goals. In Proceedings of the 19th International Conference on Advanced
Information Networking and Applications - Volume 2, AINA '05, pages 535{540, Washington, DC, USA, 2005. IEEE Computer Society.
• A. Loser, W. M. Barczynski, and F. Brauer. What's the intention behind your query? a few observations from a large developer community.
In IRSW, 2008.
• F. McCown, N. Diawara, and M. L. Nelson. Factors aecting website reconstruction from the web infrastructure. In JCDL '07: Proceedings of
the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, pages 39{48, 2007.
• B. Meeder, B. Karrer, A. Sayedi, R. Ravi, C. Borgs, and J. Chayes. We know who you followed last summer: inferring social link creation times
in twitter. In Proceedings of the 20th international conference on World wide web, WWW '11, pages 517{526, New York, NY, USA, 2011.
ACM.
• G. Mishne. Predicting movie sales from blogger sentiment. In In AAAI 2006 Spring Symposium on Computational Approaches to Analysing
Weblogs (AAAI-CAAW), 2006.
• M. L. Nelson and B. D. Allen. Object persistence and availability in digital libraries. D-Lib Magazine, 8(1), 2002.
• R. Sanderson, M. Phillips, and H. Van de Sompel. Analyzing the persistence of referenced web resources with memento. CoRR,
abs/1105.3459, 2011.
• H. Van de Sompel, M. L. Nelson, R. Sanderson, L. Balakireva, S. Ainsworth, and H. Shankar. Memento: Time travel for the web. CoRR,
abs/0911.1112, 2009.
• S. Wu, J. M. Hofman, W. A. Mason, and D. J. Watts. Who says what to whom on twitter. In Proceedings of the 20th international conference
on World wide web, WWW '11, pages 705{714, New York, NY, USA, 2011. ACM.