This document discusses social media analytics and sentiment analysis tools. It notes that while there is a lot of data on social media, analyzing and making sense of it is challenging due to issues like ambiguous language, sarcasm and misspellings. Commonly used tools often don't work well on social media texts. The document then introduces GATE and GATE Cloud as tools that can help address these challenges through approaches like knowledge-based linguistic analysis and pre-processing tailored for social media. GATE Cloud also offers the benefits of scalability, no hardware costs and easy integration of text analytics workflows into applications.
GATE: a text analysis tool for social mediaDiana Maynard
Short tutorial about how and why to use GATE for text analysis of social media, given at the Big Social Data workshop at Reading University in April 2015.
GATE: a text analysis tool for social mediaDiana Maynard
Short tutorial about how and why to use GATE for text analysis of social media, given at the Big Social Data workshop at Reading University in April 2015.
Multimodal opinion mining from social mediaDiana Maynard
Presentation at the BCS SGAI 2013 conference in Cambridge, December 2013, describing the combination of opinion mining from text and multimedia from social media.
20 Years of Text Mining Applications with GATE: from Donald Trump to curing c...Diana Maynard
Talk given at the Data Pioneers 1st meetup in London, 27 July 2017.
Abstract:
The GATE open source NLP toolkit has now been in continuous development for 20 years at the University of Sheffield. Originally funded by a small EPSRC research grant, it now involves a team of 12 researchers working on it, and has been downloaded by hundreds of thousands of users all over the world. Its users range from solitary research students to multinational companies and government institutions. In this talk, I will give an overview of my work with GATE, giving examples of real-life case studies, ranging from analysing polarised opinions in online political debates (Brexit, the UK, French and US elections) through to finding a new cause of cancer by analysing information in the biomedical domain.
This presentation is about GATE which is a Natural Language Processing Platform That supports many Languages. It also mentions Mimir which is an Indexing server for GATE that enables its users to search in a corpus of documents
What do you really mean when you tweet? Challenges for opinion mining on soci...Diana Maynard
This talk, given at BRACIS 2013, introduces the topics of opinion mining and social media analytics, in particular looking at the challenges they impose for an NLP system. It investigates the impact of non-standard text in social media, use of sarcasm, swear words, non-words, short sentences, multiple languages and so on, which impede the success of current NLP tools to perform good analysis, and examines tools being developed in some current cutting-edge research projects, including not only text-based research but also multimedia analysis.
Social media & sentiment analysis splunk conf2012Michael Wilde
This presentation was delivered at Splunk's User Conference (conf2012). It covers info about social media data, how to index / use it with Splunk and a lot of content around Sentiment Analysis.
Using Data Science for Social Good: Fighting Human TraffickingAnidata
In this talk, Vincent Emanuele tells the story of how Anidata has applied data science techniques in collaboration with law enforcement to fight human trafficking.
Dark Data and Improving Human Rights in Fulton CountyAnidata
In this talk, Dr Baxley discusses the application of data science techniques to fight human trafficking. He gives a brief overview of Anidata, then dives into the technical details of the implementation of the graph-based entity resolution algorithm developed and implemented using Python, Luigi, and NetworkX.
Artificial Assistants: How can I help you? by Christopher CurrinChristopher Currin
Chatbots are not equal; with different forms permeating our lives more and more. Virtual assistants are increasingly relevant for businesses and our day-to-day lives. Chatbots have become ubiquitous for interactions, yet ‘reasonable’ intelligence remains elusive.
In this talk, we explore and explain their underlying architectures and capabilities to understand what makes them work, their weaknesses, and future improvements.
Design principles from a technology and human perspective will be disseminated with examples of current production systems and their impact. Furthermore, the audience will have the opportunity to advance these best practices.
Resources will be made available, so the technology is relevant, practical, and accessible.
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...CSCJournals
In the era of technology and internet, people use online social media services like Twitter, Instagram, Facebook, Reddit, etc. to express their emotions. The idea behind this paper is to understand people’s emotion on Twitter and their opinion towards Presidential Election 2020. We collected 1.2 million tweets in total with keyword like “RealDonaldTrump”, “JoeBiden”, “Election2020” and other election related keywords using Twitter API and then processed them with natural language processing toolkit. A Bidirectional Long Short-Term Memory (BiLSTM) model has been trained and we have achieved 93.45% accuracy on our test dataset. We then used our trained model to perform sentiment analysis on the rest of our dataset. With the sentiment analysis results and comparison with 2016 Presidential Election, we have made predictions on who could win the US Presidential Election in 2020 with pre-election twitter data. We have also analyzed the impact of COVID-19 on people’s sentiment about the election.
Multimodal opinion mining from social mediaDiana Maynard
Presentation at the BCS SGAI 2013 conference in Cambridge, December 2013, describing the combination of opinion mining from text and multimedia from social media.
20 Years of Text Mining Applications with GATE: from Donald Trump to curing c...Diana Maynard
Talk given at the Data Pioneers 1st meetup in London, 27 July 2017.
Abstract:
The GATE open source NLP toolkit has now been in continuous development for 20 years at the University of Sheffield. Originally funded by a small EPSRC research grant, it now involves a team of 12 researchers working on it, and has been downloaded by hundreds of thousands of users all over the world. Its users range from solitary research students to multinational companies and government institutions. In this talk, I will give an overview of my work with GATE, giving examples of real-life case studies, ranging from analysing polarised opinions in online political debates (Brexit, the UK, French and US elections) through to finding a new cause of cancer by analysing information in the biomedical domain.
This presentation is about GATE which is a Natural Language Processing Platform That supports many Languages. It also mentions Mimir which is an Indexing server for GATE that enables its users to search in a corpus of documents
What do you really mean when you tweet? Challenges for opinion mining on soci...Diana Maynard
This talk, given at BRACIS 2013, introduces the topics of opinion mining and social media analytics, in particular looking at the challenges they impose for an NLP system. It investigates the impact of non-standard text in social media, use of sarcasm, swear words, non-words, short sentences, multiple languages and so on, which impede the success of current NLP tools to perform good analysis, and examines tools being developed in some current cutting-edge research projects, including not only text-based research but also multimedia analysis.
Social media & sentiment analysis splunk conf2012Michael Wilde
This presentation was delivered at Splunk's User Conference (conf2012). It covers info about social media data, how to index / use it with Splunk and a lot of content around Sentiment Analysis.
Using Data Science for Social Good: Fighting Human TraffickingAnidata
In this talk, Vincent Emanuele tells the story of how Anidata has applied data science techniques in collaboration with law enforcement to fight human trafficking.
Dark Data and Improving Human Rights in Fulton CountyAnidata
In this talk, Dr Baxley discusses the application of data science techniques to fight human trafficking. He gives a brief overview of Anidata, then dives into the technical details of the implementation of the graph-based entity resolution algorithm developed and implemented using Python, Luigi, and NetworkX.
Artificial Assistants: How can I help you? by Christopher CurrinChristopher Currin
Chatbots are not equal; with different forms permeating our lives more and more. Virtual assistants are increasingly relevant for businesses and our day-to-day lives. Chatbots have become ubiquitous for interactions, yet ‘reasonable’ intelligence remains elusive.
In this talk, we explore and explain their underlying architectures and capabilities to understand what makes them work, their weaknesses, and future improvements.
Design principles from a technology and human perspective will be disseminated with examples of current production systems and their impact. Furthermore, the audience will have the opportunity to advance these best practices.
Resources will be made available, so the technology is relevant, practical, and accessible.
Twitter Based Sentiment Analysis of Each Presidential Candidate Using Long Sh...CSCJournals
In the era of technology and internet, people use online social media services like Twitter, Instagram, Facebook, Reddit, etc. to express their emotions. The idea behind this paper is to understand people’s emotion on Twitter and their opinion towards Presidential Election 2020. We collected 1.2 million tweets in total with keyword like “RealDonaldTrump”, “JoeBiden”, “Election2020” and other election related keywords using Twitter API and then processed them with natural language processing toolkit. A Bidirectional Long Short-Term Memory (BiLSTM) model has been trained and we have achieved 93.45% accuracy on our test dataset. We then used our trained model to perform sentiment analysis on the rest of our dataset. With the sentiment analysis results and comparison with 2016 Presidential Election, we have made predictions on who could win the US Presidential Election in 2020 with pre-election twitter data. We have also analyzed the impact of COVID-19 on people’s sentiment about the election.
Are you a Twitter Star? Do you want to be? No matter what your level of interaction on social media, we'll have the hints and tips on how to make your CPSW events a success.
Hugo el abrazo es un regalo solidario para quienes tienes cerca que llega muy lejos. Colabora con un proyecto solidario comprando Hugos. El beneficio es 100% destinado a que la Fundación Vistare ayude a quienes más lo necesitan.
http://www.cuatro.com/hugo/
OpenStack, a view from sysadmin. Ver. 0.1Hazzim Anaya
Regularly when we hear talk about OpenStack is from the point of view of a developer, consultant or end user, but what about the technical side of the Sysadmin ?, In this talk we will try to give an idea of the necessary skills to be a sysadmin OpenStack.
Desde 1879 "Tonnellerie du Sud Ouest" realiza encargos especiales para los chateaux franceses. Mundialmente famosa por sus barricas de acacia para blancos, TSO produce además solamente barricas de roble francés de una calidad excepcional.
[ El camino al éxito, empieza con la mejor formación ] - @ide_cesem
IDE-CESEM, Instituto Directivo de Empresas, Madrid. Masters, MBA, Cursos y Formación para Empresas
- Dirección Estratégica De Las Finanzas: http://goo.gl/sio6oS
- Facefook: https://www.facebook.com/escuela.negocios.idecesem
- Visita nuestra oferta formativa en: http://goo.gl/sio6oS
- Más información en: www.ide-cesem.com
[ Programas Formativos ]
- Masters:
http://www.ide-cesem.com/Master/masters.asp
- Cursos:
http://www.ide-cesem.com/Cursos/cursos.asp
- Formación para Empresas:
http://www.ide-cesem.com/empresas.asp
Almost everyone agrees: Social media is important. But in today’s climate of budget constraints and overworked staffs, it can be overwhelming to add yet another task to overfull plates. UNCG, a mid-sized regional public university with 17,000 students, has built a thriving social media community of more than 25,000 followers with existing staff and virtually no budget. Our presentation will show how to create successful social media platforms by your bootstraps, using existing resources and personnel and powered by grassroots enthusiasm.
Focusing on some of the most popular social media platforms — Twitter, Facebook and YouTube — we’ll relate our experiences of getting buy-in from key administrators and launching branded sites. We’ll share free resources we’ve used that will help other communicators create, manage, maintain and promote their social offerings. Our presentation will offer strategies for divvying the workload among team members based on their strengths. And we’ll divulge our missteps along the way — like the need to reclaim our desired usernames from well-intended alumni and students before we could even begin.
Many conference social media presentations focus on big name, big budget success stories. Many conference attendees don’t have that. We’ll show that with little money and limited staff time, these vital communication avenues can be launched and grown.
Whether your office is responsible for university-wide communications or sharing the story of a single department, our listeners will take away step-by-step tools, tips and best practices to help strategize, launch and cultivate social media.
Presented at 2014 annual HighEdWeb Conference by L. Danielle Baldwin (@LDBaldwin)
Presented at 2014 CASE III Conference by L. Danielle Baldwin (@LDBaldwin), Lanita Withers Goins, Debbie Schallock
Newsroom culture change: Beating the blockers Alison Gow
Slides from my presentation to WAN IFRA's 13th International Newsroom Summit on the theme of changing the culture of a newsroom and overcoming blockers to change.
http://blog.wan-ifra.org/2014/10/13/how-do-you-overcome-newsroom-culture-blockers-answers-at-the-international-newsroom-summi
Webinar - Measuring What Matters with Google Analytics - 2015-12-3TechSoup
George Weiner, founder of Whole Whale, helps make sense of Google Analytics. He'll show you how to track social, content, and user behavior; analyze the numbers; and become an action-oriented, data-driven organization.
Drupal's Credit System and Evidence-Based Open-Source SustainabilityAll Things Open
Presented by: Matthew Tift & Tim Lehnen
Presented at the All Things Open 2021
Raleigh, NC, USA
Raleigh Convention Center
Abstract: Sustaining open-source projects is the decade's challenge. Understanding how contributions are made—by volunteers, sponsored by an organization, or both—can create incentives for ongoing support.
The Drupal project has measured contributions in Drupal.org issues since 2015 using a credit system that captures contributions—including code, documentation, speaking at events, security review, etc.—from both individuals and organizations. This insight is used to shape incentives in the Drupal ecosystem.
Attendees will learn how Drupal’s contribution credit system works and why we would like to bring it to GitLab and other platforms. We hope that other open-source projects and orgs who want to understand their return on investment in open source can model their approach on this credit system and benefit from the insights we’ll be sharing during this presentation.
This presentation was provided by Bill Trippe of Publishing Technology Partners, during the NISO event "Project Management for the Information Community: Managing and Communicating the Process, Session Six," held on Friday, March 29, 2019.
Debra Dixon-Anderson from Light of Gold PR Co-Facilitated with Pierre DeBois of Zimana, this powerful webinar for the Greater NY Chamber! We dropped gems and tips on the power of Social Media, Web, Video and Lives and Analytics! Check out what you missed!
Please share this presentation with all who may benefit from it!
March 19, 2015
--- Webinar Information ---
This webinar will outline social media strategies for non profit organizations. You will learn how to define your goals, improve your content, and make use of available tools in order to expand your audience and ensure your voice is heard online. You will also learn about some common practices that should be avoided.
--- Presenter Profile ---
Matthew Handy is CCSN's website and social media coordinator. He has nearly 2 years of experience working on social media in the non-profit world and has hosted multiple webinars on the topic. He recently graduated from Carleton University with a bachelor's degree in Communications.
Title: How Do You Know if Your Project Is Any Good?
Presented at All Things Open 2022
Presented by Avi Press & Emily Omier
Abstract: Are you, like many maintainers, struggling to get good data about who is actually using your project, how they are using it and why they downloaded it in the first place? Do you know how many users the project has, and whether those users even like it? Do you know what other technologies they use, what kinds of applications or workloads they use your project for or even what exactly they like (or dislike) about your project? In this talk, Avi Press will discuss ways to get quantitative data to get insights into who is using your project and what they are doing with it, and Emily Omier will talk about how to gather qualitative data on your project’s value and triggers that inspired adoption. Together, they’ll talk about how to use these two types of data to make better decisions about your outreach efforts, project roadmap and ultimate goals for the project.
Your Guide to Content Marketing for NonprofitsTechSoup
Social media, blogs, webinars, infographics — there are so many types of content, but which ones does your nonprofit need? Nonprofits are already strapped for time and money, so how can you create an efficient but effective content strategy? In this webinar, we will go over how you can set up a content plan to help your nonprofit increase visibility, expand impact, and engage with both community and donors. We will cover the following:
The different types of content your nonprofit should be thinking about and why — blog, social, webinar, podcasts, website
Prioritizing content for maximum reach
How to effectively create a content strategy
Tools and best practices to make your work easier and more effective
Measuring success and understanding what metrics matter most
Webinar - SEO for Beginners: Simple Steps for Nonprofits and Libraries - 2016...TechSoup
SEO – search engine optimization – is the practice of improving, and promoting a website in order to increase the number of visitors the site receives from search engines. The majority of traffic to your organization or library website may come from the three major search engines - Google, Yahoo, and Bing.
In this free webinar with Whole Whale, learn some basic SEO tips for beginners to help your organization's site and content rank higher and be found more consistently, helping you grow your reach and supporters.
There are over 200 factors that translate in the the Google Search algorithm that handles over 1 trillion searches each day. This session gives a simple history of how we got here and the basics of the algorithm. We cover the main topics and key terms you should know, as well as the guiding principles of the system. This overview will help your team start to decode the nice versus necessary elements of SEO your organization can use to increase organic traffic.
Takeaways:
-- Keyword research
-- Link-building basics to increase traffic
-- Understanding the on-page and off-page principles of the algorithm
G3 Partners, one of Asia’s top marketing agencies is providing tips about blockchain/ICO marketing. This deck will be useful for anyone who's running a blockchain startup, anyone who's planning a token sale, or anyone who's interested in blockchain marketing. Share with your friends.
This project has been realized during the 2015-2016 master “Business Intelligence and Big Data Analytics” at Università di Milano - Bicocca.
Authors: Marco Fusi @marco_fusi, Raffaele Lorusso @rlorusso76
Similar to Social media analytics as a service: tools from GATE (20)
Practical Opinion Mining for Social MediaDiana Maynard
This tutorial will introduce the concepts of sentiment analysis and opinion mining from unstructured text in social media, looking at why they are useful and what tools and techniques are available. It will cover both rule-based and machine learning techniques, provide some background information on the key underlying NLP processes required, and look in detail at some of the major problems and solutions, such as detection of sarcasm, use of informal language, spam opinion detection, trustworthiness of opinion holders, and so on. The techniques will be demonstrated with real applications developed in GATE, an open-source language processing toolkit. Links are provided to some hands-on material to try at home.
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesSanjeev Rampal
Talk presented at Kubernetes Community Day, New York, May 2024.
Technical summary of Multi-Cluster Kubernetes Networking architectures with focus on 4 key topics.
1) Key patterns for Multi-cluster architectures
2) Architectural comparison of several OSS/ CNCF projects to address these patterns
3) Evolution trends for the APIs of these projects
4) Some design recommendations & guidelines for adopting/ deploying these solutions.
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC
Ellisha Heppner, Grant Management Lead, presented an update on APNIC Foundation to the PNG DNS Forum held from 6 to 10 May, 2024 in Port Moresby, Papua New Guinea.
1.Wireless Communication System_Wireless communication is a broad term that i...JeyaPerumal1
Wireless communication involves the transmission of information over a distance without the help of wires, cables or any other forms of electrical conductors.
Wireless communication is a broad term that incorporates all procedures and forms of connecting and communicating between two or more devices using a wireless signal through wireless communication technologies and devices.
Features of Wireless Communication
The evolution of wireless technology has brought many advancements with its effective features.
The transmitted distance can be anywhere between a few meters (for example, a television's remote control) and thousands of kilometers (for example, radio communication).
Wireless communication can be used for cellular telephony, wireless access to the internet, wireless home networking, and so on.
This 7-second Brain Wave Ritual Attracts Money To You.!nirahealhty
Discover the power of a simple 7-second brain wave ritual that can attract wealth and abundance into your life. By tapping into specific brain frequencies, this technique helps you manifest financial success effortlessly. Ready to transform your financial future? Try this powerful ritual and start attracting money today!
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBrad Spiegel Macon GA
Brad Spiegel Macon GA’s journey exemplifies the profound impact that one individual can have on their community. Through his unwavering dedication to digital inclusion, he’s not only bridging the gap in Macon but also setting an example for others to follow.
# Internet Security: Safeguarding Your Digital World
In the contemporary digital age, the internet is a cornerstone of our daily lives. It connects us to vast amounts of information, provides platforms for communication, enables commerce, and offers endless entertainment. However, with these conveniences come significant security challenges. Internet security is essential to protect our digital identities, sensitive data, and overall online experience. This comprehensive guide explores the multifaceted world of internet security, providing insights into its importance, common threats, and effective strategies to safeguard your digital world.
## Understanding Internet Security
Internet security encompasses the measures and protocols used to protect information, devices, and networks from unauthorized access, attacks, and damage. It involves a wide range of practices designed to safeguard data confidentiality, integrity, and availability. Effective internet security is crucial for individuals, businesses, and governments alike, as cyber threats continue to evolve in complexity and scale.
### Key Components of Internet Security
1. **Confidentiality**: Ensuring that information is accessible only to those authorized to access it.
2. **Integrity**: Protecting information from being altered or tampered with by unauthorized parties.
3. **Availability**: Ensuring that authorized users have reliable access to information and resources when needed.
## Common Internet Security Threats
Cyber threats are numerous and constantly evolving. Understanding these threats is the first step in protecting against them. Some of the most common internet security threats include:
### Malware
Malware, or malicious software, is designed to harm, exploit, or otherwise compromise a device, network, or service. Common types of malware include:
- **Viruses**: Programs that attach themselves to legitimate software and replicate, spreading to other programs and files.
- **Worms**: Standalone malware that replicates itself to spread to other computers.
- **Trojan Horses**: Malicious software disguised as legitimate software.
- **Ransomware**: Malware that encrypts a user's files and demands a ransom for the decryption key.
- **Spyware**: Software that secretly monitors and collects user information.
### Phishing
Phishing is a social engineering attack that aims to steal sensitive information such as usernames, passwords, and credit card details. Attackers often masquerade as trusted entities in email or other communication channels, tricking victims into providing their information.
### Man-in-the-Middle (MitM) Attacks
MitM attacks occur when an attacker intercepts and potentially alters communication between two parties without their knowledge. This can lead to the unauthorized acquisition of sensitive information.
### Denial-of-Service (DoS) and Distributed Denial-of-Service (DDoS) Attacks
2. We are all connected to each other...
● Information,
thoughts and
opinions are
shared
prolifically on
the social web
these days
● 72% of online
adults use
social
networking sites
3. ● In Britain and the
US, approx 1 hour
a day on social media
● 90% of marketers use
social media channels
for business
4. Popularity of Social Networking Sites
Twitter
● 284 million monthly active users
● 100 million daily active users
● 80% of world leaders use Twitter
Facebook
● 1.35 billion monthly active users. 864 million
daily active, 10 billion messages a day
● 30% of Americans get their news from Facebook
● Facebook has more users than the whole of the
Internet did in 2005
● Google+: 300 million monthly active users
● LinkedIn: 332 million users
● MySpace: 36 million users
● 44% of “users” have
never sent a tweet
● 390 million users have
no followers
● Google+: only 7 minutes
per month
5. Your grandmother is three times as likely to
use a social networking site now as in 2009
6. Why analyse social media?
● Contrary to popular belief, Twitter isn't just full of tweets about
Justin Bieber.
● In an emergency, one in two people would use social media to
let people know they were safe or to find out more information
● Less than 24 hours after the recent Nepal trekking disaster hit,
Facebook and Twitter accounts had been set up to provide
information channels, missing persons register etc.
● For companies, sentiment analysis tools are critical to keep
track of the market pulse, customer feedback, etc.
● Fast-growing, highly dynamic and high volume source of data
● Reflects language and current views of today's society
● Analysing social media is far more efficient than e.g. youGov
polls
7. Opinion mining from social media
● Understanding customer reviews and so on is a huge business
● But also:
● Tracking political opinions: what events make people change
their minds?
● How does public mood influence the stock market, consumer
choices etc?
● How are opinions distributed in relation to demographics?
● Who are the opinion influencers?
● SMA tools are crucial in order to make sense of all the
information
8. Social media analysis for journalists
● Twitter is immensely valuable to news
professionals
● gauging opinion on breaking news
● discovering new stories
● first hand reports from disasters,
war zones, ...
● Issues of veracity: London Eye on Fire!
9. Analysing language in social media is hard
● Grundman:politics makes #climatechange scientific
issue,people don’t like knowitall rational voice tellin em wat 2do
● @adambation Try reading this article , it looks like it would be
really helpful and not obvious at all. http://t.co/mo3vODoX
● Want to solve the problem of #ClimateChange? Just #vote for a
#politician! Poof! Problem gone! #sarcasm #TVP #99%
● Human Caused #ClimateChange is a Monumental Scam!
http://www.youtube.com/watch?v=LiX792kNQeE … F**k yes!!
Lying to us like MOFO's Tax The Air We Breath! F**k Them!
10. We need tools for hashtag analysis
● Hashtags need unravelling:
● #gasprices
● And disambiguating:
● #therapist
● #nowthatcherisdead
14. NER on Tweets
● NER on Tweets much harder than on longer text
● Very short, so ambiguous terms hard to interpret
● Poor grammar and spelling, use of abbreviations, shorthands
● Twitter-specific features: hashtags, @mentions, etc.
● Tools designed for longer texts do very badly on Twitter
System P R F1 F0.5
OpenCalais 68.59 67.17 67.87 68.30
Lupedia 70.93 44.17 54.44 63.27
TextRazor 59.12 83.83 69.34 62.82
TwitIE 69.69 61.03 65.07 67.76
Zemanta 29.64 29.31 29.47 29.57
15. Tools for Sentiment Analysis
● There are lots of tools for sentiment analysis around
● Many of them don't work well at more than a very basic level
● They mainly use dictionary lookup for positive and negative words
● ML methods only works for text that's similar in style to the
training data, and it's hard to understand when it goes wrong
● Things like sarcasm tend not to get picked up
● They classify the tweets as positive or negative, but not with
respect to the keyword you're searching for
● keyword search just retrieves any tweet mentioning it, but not
necessarily about it as a topic
● no correlation between the keyword and the sentiment
16. Sentiment Analysis in GATE
● Knowledge-based linguistic approach based on entity
detection for opinion holders and targets
● Sentiment words have to be in a linguistic relation to the
opinion holder and target
● Use linguistic analysis to deal with scope issues
(negation, hashtags, sarcasm etc)
● Sentiment word scores are modified incrementally
● Easy understanding of errors and adaptation of the rules
● Twitter-specific pre-processing using TwitIE
17. This all sounds like it would be hard
to set up on my system!
18. GATE Cloud to the Rescue
● What?
● end-to-end text and web processing solutions from the
GATE family running on cloud computing infrastructures.
● Why?
● Solve any sort of text processing problem: web, text or
opinion mining; indexing and search (fulltext, boolean,
conceptual, structural); information extraction; semantic
annotation; sentiment analysis; ontology population; etc.
● Run large-scale jobs without investing in server hardware
or other fixed costs.
● Exploit a 15-year R&D programme, the expertise of the
GATE community and a defined and repeatable process.
19. Benefits of Gate Cloud
Text Analytics Consumer
Cloud Large scale, no CAPEX,
no system admin, no commitment
Open Source No vendor lock-in
TA Services Twitter, News, BioMed, Sentiment, etc.
low-level pre-processing support (POS tagging etc)
APIs Integrate
20. Application Types
● Low-level: stemmers, PoS taggers, phrase chunkers,
morphological analysers
● Coverage: tools for 18 languages including BG and RU
● General Purpose IE: named entities, numbers,
measurements, language ID
● Domain-specific IE: News, TwitIE, Biomed
● LOD-based semantic annotation: DBpedia, GeoNames,
Freebase
● Sentiment analysis
● Summarisation
● Includes many 3rd party tools also
20
22. It's just like online shopping
● Click through to the online shop, browse products and add them to
your shopping basket.
● Create an account and then buy credit vouchers
● Put the vouchers in your account, and go to checkout.
● We'll email you the login or job creation details for your cloud servers.
● Monitor and control your cloud machines on your dashboard.
● Use our existing applications:
● Just upload your documents and sit down with a cup of tea
● Create your own pipeline:
● Upload your own customised application along with your
documents, and sit down with a cup of tea
23. 23
Summary
● SMA tools are crucial, but hard to find what's good
● Solutions are readily available in GATE
● Easy to test different versions and configurations
● Open source and easily customisable
● Big data and installation problems are solved with GATE Cloud:
● PaaS for text analytics
● Low barrier to entry
● Just pay for what you use
● State-of-the-art pipelines for news and social media
● More pipelines constantly being added
24. Acknowledgements and more information
● GATE: http://gate.ac.uk
● GATE Cloud: http://gatecloud.net
● Annomarket: http://www.annomarket.eu
● Research partially supported by the European Union/EU under
the Information and Communication Technologies (ICT) theme
of the 7th Framework Programme for R&D (FP7) DecarboNet
(610829) and AnnoMarket (296322)
● Original GATE Cloud development supported by JISC/EPSRC,
reference number EP/I034092/1
This document does not represent the opinion of the European Community, and the
European Community is not responsible for any use that might be made of its content