This document proposes and compares two approaches to tag recommendation in social bookmarking sites. The first approach (STaR) indexes existing tags by users and combines title, resource, and user profile recommendations. It had issues with recall and precision without crawling links. The final approach models it as a ranking problem, extracts candidate tags from descriptions, user tags, and other link tags. It constructs SVM features on tag frequencies and assigns training tags. A ranking SVM then ranks candidates, selecting the top K tags. It performed better than the first approach by including link crawling and a supervised learning model.
video link => http://youtu.be/D9PBX8FmtpQ
Tweets Classifier which categorises tweets into these 6 categories:
Business
Politics
Music
Health
Sports
Technology
Tag recommendation is very useful system in detecting the type of messages like GMAIL has divided its inbox in three tabs PRIMARY, SOCIAL, PROMOTIONS and there are some other labels like SPAM, Important etc. It can be used in other categories like Social Bookmarking, Search Engines etc.
Active Learning in Collaborative Filtering Recommender Systems : a SurveyUniversity of Bergen
In collaborative filtering recommender systems user’s preferences are expressed as ratings for items, and each additional rating extends the knowledge of the system and affects the system’s recommendation accuracy. In general, the more ratings are elicited from the users, the more effective the recommendations are. However, the usefulness of each rating may vary significantly, i.e., different ratings may bring a different amount and type of information about the user’s tastes. Hence, specific techniques, which are defined as “active learning strategies”, can be used to selectively choose the items to be presented to the user for rating. In fact, an active learning strategy identifies and adopts criteria for obtaining data that better reflects users’ preferences and enables to generate better recommendations.
Structural syntactic metrics for RDF Datasets that correlate with high level quality deficiencies.
The vision of the Linked Open Data (LOD) initiative is to provide a model for publishing data and meaningfully interlinking such dispersed but related data. Despite the importance of data quality for the successful growth of the LOD, only limited attention has been focused on quality of data prior to their publication on the LOD. This paper focuses on the systematic assessment of the quality of datasets prior to publication on the LOD cloud. To this end, we identify important quality deficiencies that need to be avoided and/or resolved prior to the publication of a dataset. We then propose a set of metrics to measure and identify these quality deficiencies in a dataset. This way, we enable the assessment and identification of undesirable quality characteristics of a dataset through our proposed metrics.
Slides for paper presentation at DEXA 2015:
Behshid Behkamal, Mohsen Kahani, Ebrahim Bagheri:
Quality Metrics for Linked Open Data. DEXA (1) 2015: 144-152
video link => http://youtu.be/D9PBX8FmtpQ
Tweets Classifier which categorises tweets into these 6 categories:
Business
Politics
Music
Health
Sports
Technology
Tag recommendation is very useful system in detecting the type of messages like GMAIL has divided its inbox in three tabs PRIMARY, SOCIAL, PROMOTIONS and there are some other labels like SPAM, Important etc. It can be used in other categories like Social Bookmarking, Search Engines etc.
Active Learning in Collaborative Filtering Recommender Systems : a SurveyUniversity of Bergen
In collaborative filtering recommender systems user’s preferences are expressed as ratings for items, and each additional rating extends the knowledge of the system and affects the system’s recommendation accuracy. In general, the more ratings are elicited from the users, the more effective the recommendations are. However, the usefulness of each rating may vary significantly, i.e., different ratings may bring a different amount and type of information about the user’s tastes. Hence, specific techniques, which are defined as “active learning strategies”, can be used to selectively choose the items to be presented to the user for rating. In fact, an active learning strategy identifies and adopts criteria for obtaining data that better reflects users’ preferences and enables to generate better recommendations.
Structural syntactic metrics for RDF Datasets that correlate with high level quality deficiencies.
The vision of the Linked Open Data (LOD) initiative is to provide a model for publishing data and meaningfully interlinking such dispersed but related data. Despite the importance of data quality for the successful growth of the LOD, only limited attention has been focused on quality of data prior to their publication on the LOD. This paper focuses on the systematic assessment of the quality of datasets prior to publication on the LOD cloud. To this end, we identify important quality deficiencies that need to be avoided and/or resolved prior to the publication of a dataset. We then propose a set of metrics to measure and identify these quality deficiencies in a dataset. This way, we enable the assessment and identification of undesirable quality characteristics of a dataset through our proposed metrics.
Slides for paper presentation at DEXA 2015:
Behshid Behkamal, Mohsen Kahani, Ebrahim Bagheri:
Quality Metrics for Linked Open Data. DEXA (1) 2015: 144-152
Textual information exchanged among users on online social network platforms provides deep understanding into users' interest and behavioral patterns. However, unlike traditional text-dominant settings such as online publishing, one distinct feature for online social network is users' rich interactions with the textual content, which, unfortunately, has not yet been well incorporated in the existing topic modeling frameworks.
In this paper, we propose an LDA-based behavior-topic
model (B-LDA) which jointly models user topic interests and behavioral patterns. We focus the study of the model on on-line social network settings such as microblogs like Twitter where the textual content is relatively short but user inter-actions on them are rich. We conduct experiments on real Twitter data to demonstrate that the topics obtained by our model are both informative and insightful. As an application of our B-LDA model, we also propose a Twitter followee rec-ommendation algorithm combining B-LDA and LDA, which we show in a quantitative experiment outperforms LDA with a signicant margin.
Recommendation and Information Retrieval: Two Sides of the Same Coin?Arjen de Vries
Status update on our current understanding of how collaborative filtering relates far more closely to information retrieval than usually thought. Includes work by Jun Wang and Alejandro Bellogín. This presentation has been given at the Siks PhD student course on computational intelligence, May 24th, 2013
In this emerging trend, it is necessary to understand the recent developments taking place in the
field of opinion mining and sentiment analysis (OMSA) as part of text mining in social networks,
which plays an important role for decision making process to the organization or company,
Government and general public. In this paper, we present the recent role of OMSA in Social
Networks with different frameworks such as data collection process, text pre-processing,
classification algorithms, and performance evaluation results. The achieved accuracy level is
compared and shown for different frameworks. Finally, we conclude the present challenges and future developments of OMSA.
Models for Information Retrieval and RecommendationArjen de Vries
Online information services personalize the user experience by applying recommendation systems to identify the information that is most relevant to the user. The question how to estimate relevance has been the core concept in the field of information retrieval for many years. Not so surprisingly then, it turns out that the methods used in online recommendation systems are closely related to the models developed in the information retrieval area. In this lecture, I present a unified approach to information retrieval and collaborative filtering, and demonstrate how this let’s us turn a standard information retrieval system into a state-of-the-art recommendation system.
Task oriented word embedding for text classificationPC LO
Liu, Q., Huang, H., Gao, Y., Wei, X., Tian, Y., & Liu, L. (2018). Task-oriented Word Embedding for Text Classification. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 2023–2032). aclweb.org.
Design patterns are acknowledged as powerful conceptual tools to improve design quality and to reduce the time and cost of design
by effect of the reuse of “good” solutions. In many fields such as software engineering, web engineering, and interface design,
patterns are widely used by practitioners and are also investigated from a research perspective. Still, the concept of design pattern
has received marginal attention in the arena of user interfaces (UIs) for Recommender Systems (RSs). To our knowledge, a little
is known about the use of patterns in this specific class of applications, in spite of their increasing popularity, and no RS
specific interface pattern is available in existing pattern languages. We have performed a systematic analysis of 28 real-world RSs in
a variety of sectors, in order to: (i) discover occurrences of existing general (i.e., domain independent) UI patterns; (ii)
identify recurrent UI design solutions for RS specific features; (iii) elicit a set of new UI patterns for RS interfaces. The analysis
of patterns occurrences highlights the degree at which “good” UI design solutions are adopted in RSs for the different sectors. The
new patterns can be used by UI designers of RSs to improve the UX of their systems.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
A Survey on Decision Support Systems in Social MediaEditor IJCATR
Web 3.0 is the upcoming phase in web evolution. Web 3.0 will be about “feeding you the information that you want, when
you want it” i.e. personalization of the web. In web 3.0 the basic principle is linking, integrating and analyzing data from various data
sources into new information streams by means of semantic technology. So, we can say that Web 3.0 comprises of two platforms
semantic technologies and social computing environment. Recommender system is a subclass of decision support system.
Recommendations in social web are used to personalize the web [20]. Social Tagging System is one type of social media. In this
paper we present the survey of various recommendations in Social Tagging Systems (STSs) like tag, item, user and unified
recommendations along with semantic web and also discussed about major thrust areas of research in each category.
A Survey on Decision Support Systems in Social MediaEditor IJCATR
Web 3.0 is the upcoming phase in web evolution. Web 3.0 will be about “feeding you the information that you want, when you want it” i.e. personalization of the web. In web 3.0 the basic principle is linking, integrating and analyzing data from various data sources into new information streams by means of semantic technology. So, we can say that Web 3.0 comprises of two platforms semantic technologies and social computing environment. Recommender system is a subclass of decision support system. Recommendations in social web are used to personalize the web [20]. Social Tagging System is one type of social media. In this paper we present the survey of various recommendations in Social Tagging Systems (STSs) like tag, item, user and unified recommendations along with semantic web and also discussed about major thrust areas of research in each category.
Textual information exchanged among users on online social network platforms provides deep understanding into users' interest and behavioral patterns. However, unlike traditional text-dominant settings such as online publishing, one distinct feature for online social network is users' rich interactions with the textual content, which, unfortunately, has not yet been well incorporated in the existing topic modeling frameworks.
In this paper, we propose an LDA-based behavior-topic
model (B-LDA) which jointly models user topic interests and behavioral patterns. We focus the study of the model on on-line social network settings such as microblogs like Twitter where the textual content is relatively short but user inter-actions on them are rich. We conduct experiments on real Twitter data to demonstrate that the topics obtained by our model are both informative and insightful. As an application of our B-LDA model, we also propose a Twitter followee rec-ommendation algorithm combining B-LDA and LDA, which we show in a quantitative experiment outperforms LDA with a signicant margin.
Recommendation and Information Retrieval: Two Sides of the Same Coin?Arjen de Vries
Status update on our current understanding of how collaborative filtering relates far more closely to information retrieval than usually thought. Includes work by Jun Wang and Alejandro Bellogín. This presentation has been given at the Siks PhD student course on computational intelligence, May 24th, 2013
In this emerging trend, it is necessary to understand the recent developments taking place in the
field of opinion mining and sentiment analysis (OMSA) as part of text mining in social networks,
which plays an important role for decision making process to the organization or company,
Government and general public. In this paper, we present the recent role of OMSA in Social
Networks with different frameworks such as data collection process, text pre-processing,
classification algorithms, and performance evaluation results. The achieved accuracy level is
compared and shown for different frameworks. Finally, we conclude the present challenges and future developments of OMSA.
Models for Information Retrieval and RecommendationArjen de Vries
Online information services personalize the user experience by applying recommendation systems to identify the information that is most relevant to the user. The question how to estimate relevance has been the core concept in the field of information retrieval for many years. Not so surprisingly then, it turns out that the methods used in online recommendation systems are closely related to the models developed in the information retrieval area. In this lecture, I present a unified approach to information retrieval and collaborative filtering, and demonstrate how this let’s us turn a standard information retrieval system into a state-of-the-art recommendation system.
Task oriented word embedding for text classificationPC LO
Liu, Q., Huang, H., Gao, Y., Wei, X., Tian, Y., & Liu, L. (2018). Task-oriented Word Embedding for Text Classification. In Proceedings of the 27th International Conference on Computational Linguistics (pp. 2023–2032). aclweb.org.
Design patterns are acknowledged as powerful conceptual tools to improve design quality and to reduce the time and cost of design
by effect of the reuse of “good” solutions. In many fields such as software engineering, web engineering, and interface design,
patterns are widely used by practitioners and are also investigated from a research perspective. Still, the concept of design pattern
has received marginal attention in the arena of user interfaces (UIs) for Recommender Systems (RSs). To our knowledge, a little
is known about the use of patterns in this specific class of applications, in spite of their increasing popularity, and no RS
specific interface pattern is available in existing pattern languages. We have performed a systematic analysis of 28 real-world RSs in
a variety of sectors, in order to: (i) discover occurrences of existing general (i.e., domain independent) UI patterns; (ii)
identify recurrent UI design solutions for RS specific features; (iii) elicit a set of new UI patterns for RS interfaces. The analysis
of patterns occurrences highlights the degree at which “good” UI design solutions are adopted in RSs for the different sectors. The
new patterns can be used by UI designers of RSs to improve the UX of their systems.
Sentiment Analysis/Opinion Mining of Twitter Data on Unigram/Bigram/Unigram+Bigram Model using:
1. Machine Learning
2. Lexical Scores
3. Emoticon Scores
YouTube Video: https://youtu.be/VuR16P87yPE
Link to the WebPage: http://akirato.github.io/Twitter-Sentiment-Analysis-Tool
Github Page: https://github.com/Akirato/Twitter-Sentiment-Analysis-Tool
A Survey on Decision Support Systems in Social MediaEditor IJCATR
Web 3.0 is the upcoming phase in web evolution. Web 3.0 will be about “feeding you the information that you want, when
you want it” i.e. personalization of the web. In web 3.0 the basic principle is linking, integrating and analyzing data from various data
sources into new information streams by means of semantic technology. So, we can say that Web 3.0 comprises of two platforms
semantic technologies and social computing environment. Recommender system is a subclass of decision support system.
Recommendations in social web are used to personalize the web [20]. Social Tagging System is one type of social media. In this
paper we present the survey of various recommendations in Social Tagging Systems (STSs) like tag, item, user and unified
recommendations along with semantic web and also discussed about major thrust areas of research in each category.
A Survey on Decision Support Systems in Social MediaEditor IJCATR
Web 3.0 is the upcoming phase in web evolution. Web 3.0 will be about “feeding you the information that you want, when you want it” i.e. personalization of the web. In web 3.0 the basic principle is linking, integrating and analyzing data from various data sources into new information streams by means of semantic technology. So, we can say that Web 3.0 comprises of two platforms semantic technologies and social computing environment. Recommender system is a subclass of decision support system. Recommendations in social web are used to personalize the web [20]. Social Tagging System is one type of social media. In this paper we present the survey of various recommendations in Social Tagging Systems (STSs) like tag, item, user and unified recommendations along with semantic web and also discussed about major thrust areas of research in each category.
A Survey on Decision Support Systems in Social MediaEditor IJCATR
Web 3.0 is the upcoming phase in web evolution. Web 3.0 will be about “feeding you the information that you want, when
you want it” i.e. personalization of the web. In web 3.0 the basic principle is linking, integrating and analyzing data from various data
sources into new information streams by means of semantic technology. So, we can say that Web 3.0 comprises of two platforms
semantic technologies and social computing environment. Recommender system is a subclass of decision support system.
Recommendations in social web are used to personalize the web [20]. Social Tagging System is one type of social media. In this
paper we present the survey of various recommendations in Social Tagging Systems (STSs) like tag, item, user and unified
recommendations along with semantic web and also discussed about major thrust areas of research in each category.
Movie Recommender System Using Artificial Intelligence Shrutika Oswal
In recent years, a huge amount of information is available on the internet and it is very difficult for the user to collect the relevant information. While purchasing any product also a lot of choices available and the user is confused about what to choose. This will be a time-consuming process as well. The search engine will solve this problem to some extent by but it will fail in giving a personalized recommendation. In this presentation, I will describe the different types and working of the recommender system how they gather the data, build recommender, generate recommendations from it, evaluate the performance and effectiveness of the recommender system. The further part of the presentation will describe how to build a movie recommender system using python.
This is part 1 of the tutorial Xavier and Deepak gave at Recsys 2016 this year. You can find the second part http://www.slideshare.net/xamat/recsys-2016-tutorial-lessons-learned-from-building-reallife-recommender-systems
Profile Analysis of Users in Data Analytics DomainDrjabez
Data Analytics and Data Science is in the fast forward
mode recently. We see a lot of companies hiring people for data
analysis and data science, especially in India. Also, many
recruiting firms use stackoverflow to fish their potential
candidates. The industry has also started to recruit people based
on the shapes of expertise. Expertise of a personal is
metaphorically outlined by shapes of letters like I, T, M and
hyphen betting on her experiencein a section (depth) and
therefore the variety of areas of interest (width).This proposal
builds upon the work of mining shapes of user expertise in a
typical online social Question and Answer (Q&A) community
where expert users often answer questions posed by other
users.We have dealt with the temporal analysis of the expertise
among the Q&A community users in terms how the user/ expert
have evolved over time.
Keywords— Shapes of expertise, Graph communities, Expertise
evolution, Q&A community
Nesta palestra no evento GDG DataFest, apresentei uma introdução prática sobre as principais técnicas de sistemas de recomendação, incluindo arquiteturas recentes baseadas em Deep Learning. Foram apresentados exemplos utilizando Python, TensorFlow e Google ML Engine, e fornecidos datasets para exercitarmos um cenário de recomendação de artigos e notícias.
A Proposal on Social Tagging Systems Using Tensor Reduction and Controlling R...ijcsa
Social Tagging System is the process in which user makes their interest by tagging on a particular item. These STS are in associated with web 2.0 and has sourceful information for the users with their recommendations. It provides different types of recommendations are modeled by a 3-order tensor, on which multiway latent semantic analysis and dimensionality reduction is performed using both the Higher Order Singular Value Decomposition (HOSVD) method and the KernelSVD smoothing technique. We provide now with the 4-order tensor approach, which we named as Tensor Reduction. Here the items that are tagged can be viewed by the user who are recommended the same item and tagged over it. There by can improve the social tagging recommendations efficiency and also the unwanted request has been controlled. The results show significant improvements in terms of effectiveness.
AN EFFICIENT FEATURE SELECTION IN CLASSIFICATION OF AUDIO FILEScscpconf
In this paper we have focused on an efficient feature selection method in classification of audio files.
The main objective is feature selection and extraction. We have selected a set of features for further
analysis, which represents the elements in feature vector. By extraction method we can compute a
numerical representation that can be used to characterize the audio using the existing toolbox. In this
study Gain Ratio (GR) is used as a feature selection measure. GR is used to select splitting attribute
which will separate the tuples into different classes. The pulse clarity is considered as a subjective
measure and it is used to calculate the gain of features of audio files. The splitting criterion is
employed in the application to identify the class or the music genre of a specific audio file from
testing database. Experimental results indicate that by using GR the application can produce a
satisfactory result for music genre classification. After dimensionality reduction best three features
have been selected out of various features of audio file and in this technique we will get more than
90% successful classification result.
Similar to Tag recommendation in social bookmarking sites like deli (20)
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
Securing your Kubernetes cluster_ a step-by-step guide to success !KatiaHIMEUR1
Today, after several years of existence, an extremely active community and an ultra-dynamic ecosystem, Kubernetes has established itself as the de facto standard in container orchestration. Thanks to a wide range of managed services, it has never been so easy to set up a ready-to-use Kubernetes cluster.
However, this ease of use means that the subject of security in Kubernetes is often left for later, or even neglected. This exposes companies to significant risks.
In this talk, I'll show you step-by-step how to secure your Kubernetes cluster for greater peace of mind and reliability.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Key Trends Shaping the Future of Infrastructure.pdfCheryl Hung
Keynote at DIGIT West Expo, Glasgow on 29 May 2024.
Cheryl Hung, ochery.com
Sr Director, Infrastructure Ecosystem, Arm.
The key trends across hardware, cloud and open-source; exploring how these areas are likely to mature and develop over the short and long-term, and then considering how organisations can position themselves to adapt and thrive.
Key Trends Shaping the Future of Infrastructure.pdf
Tag recommendation in social bookmarking sites like deli
1. Tag Recommendation in Social
Bookmarking sites like Deli.cio.us
Varun Ahuja ()
Vinay Singri ()
Tanuj Sharma ( 201101138 )
2. Introduction
Automated process of suggesting
relevant keywords given a
dataset
Given link L, description D, and
user U, a set of personalized tags
CT(L) are suggested with help
from given dataset.
3. First Approach – STaR ( Social Tag
Recommender System )
Divided in 3 major steps – Pre-processing,
Indexing and Recommendation
Pre-processing – Remove useless tags, Case
Folding, Spam Removal
Indexing – Index existing tags against users.
Recommendation – Combine outputs of Title to
Tag, Resource Profile, User Profile
Recommender.
4. Problems in First Approach
Not all tags from the dataset
appeared.
Low Precision and Low Recall
Without crawling the given link,
this approach gives low accuracy
5. Final Approach – Supervised Learning Model
Modelled as a ranking problem of
candidate tags of a given URL
Consists of 3 stages –
◦Candidates Tag Extraction
◦SVM Features Construction
◦Ranking Process
Ranking SVM is used for ranking candidate
tags.
6. Candidates Tag Extraction
Extracted from –
◦Description field of link L
◦Tags assigned by the same user U
previously
◦Tags to assigned to the same link L by other
users
Given link L, user U, candidate tags
CT{L} = { description(L) union Tags(U) union
Tags(L) }
7. SVM Features Construction
5 features used for each Candidate Tag ( CT ) –
Candidate Tag's Term Frequency (TF) in link's description
terms
Candidate Tag's Term Frequency (TF) in link's URL terms
Candidate Tag’s Term Frequency (TF) in T{Rj} (tags
assigned to the same URL in the training data).
Candidate Tag’s Term Frequency (TF) in T{Ui} (tags
assigned previously by user in the training data.)
Times of candidate tag being assigned as a tag in the
training data.
8. Ranking
For any link in test dataset, Candidate
Tags are extracted
Features stored for each candidate tag.
SVM ranking model ranks the candidate
tags from top to bottom
Top K tags selected
10. Future Work
Extension to various datasets
Giving more enriched
recommendation for the seed URL
Candidate Tags can be expanded
using content similarity based KNN
model.
11. References
STaR: a Social Tag Recommender System Cataldo
Musto, Fedelucio Narducci, Marco de Gemmis,
Pasquale Lops, and Giovanni Semeraro
Department of Computer Science, University of
Bari, Italy
• Social Tag Prediction Base on
Supervised Ranking Model
Hao Cao, Maoqiang Xie, Lian Xue, Chunhua Liu, Fei
Teng and Yalou Huang
College of Software, Nankai University, Tianjin,
P.R.China