Description of a configurable, real-time system for automatic record, analysis and visualization of information from user interactions in Twitter. The system is designed to provide public bodies (government agencies) with a powerful tool to rapidly and easily understand what the citizen behavior trends are, what their opinion about city services, events, etc. is, and also may be used as a primary alert system to improve the efficiency of emergency systems. The citizen is here observed as a proactive city sensor capable of generating huge amounts of very rich, high-level and valuable data through social media platforms, which, after properly processed, summarized and annotated, allows city officers to better understand citizen needs. The architecture and component blocks are described and some key details of the design, implementation and scenarios of application are discussed. Textalytics APIS are used for the semantic analysis of relevant tweets.
Presentation by DAEDALUS, UPM and UC3M at PEGOV 2014, 2nd International Workshop on Personalization in eGovernment Services and Applications, Aalborg, Denmark, in conjunction with the 22nd Conference on User Modeling, Adaptation and Personalization - UMAP 2014.
Demo or Die: Where advertising meets product designChristine Outram
This presentation explores the role of rapid prototyping in the age of digital advertising and how it is transforming a "traditional creative process" into a lean, interactive, and multidisciplinary endeavor. Advertising is evolving; the best ads are not always ads; demo or die.
An overview of traditional spatial analysis tools, an intro to hadoop and other tools for analyzing terabytes or more of data, and then a primer with examples on combining the two with data pulled from the Twitter streaming API. Given at the O'Reilly Where 2.0 conference in March 2010.
Demo or Die: Where advertising meets product designChristine Outram
This presentation explores the role of rapid prototyping in the age of digital advertising and how it is transforming a "traditional creative process" into a lean, interactive, and multidisciplinary endeavor. Advertising is evolving; the best ads are not always ads; demo or die.
An overview of traditional spatial analysis tools, an intro to hadoop and other tools for analyzing terabytes or more of data, and then a primer with examples on combining the two with data pulled from the Twitter streaming API. Given at the O'Reilly Where 2.0 conference in March 2010.
Social media data for Social science researchDavide Bennato
This is the talk I gave at the Lipari Summer School on Computational social science 2013. What are relationship between social science and big data? With a focus on Twitter and its social media mining tools
http://www.tecnoetica.it/2013/08/07/lipari-summer-school-computational-social-science-big-data-e-twitter/
With the tremendous growth of social networks, there has been a growth in the amount of new data that is being created every minute on these networking sites. The notion of community in this social networking world has caught lots of attention. Studying Twitter is useful for understanding how people use new communication technologies to form social connections and maintain existing ones. We analysed how geo-tagged tweets in Twitter can be used to identify useful user features and behavior as well as identify landmarks/places of interests. We also analysed several clustering algorithms and proposed different similarity measures to detect communities.
Twitter Text Mining with Web scraping, R, Shiny and Hadoop - Richard Sheng Richard Sheng
Based on an Analytics Week article of the Top 200 Influencers in Big Data and Analytics, I used R and Hadoop to analyze the Twitter Feeds of these leaders with Text Mining, Web Scraping and Visualization techniques.
This thesis proposes to help analyzing the characteristics of the heterogeneous social networks that emerge from the use of web-based social applications, with an original contribution that leverages Social Network Analysis with Semantic Web frameworks. Social Network Analysis (SNA) proposes graph algorithms to characterize the structure of a social network and its strategic positions. Semantic Web frameworks allow representing and exchanging knowledge across web applications with a rich typed graph model (RDF), a query language (SPARQL) and schema definition frameworks (RDFS and OWL). In this thesis, we merge both models in order to go beyond the mining of the flat link structure of social graphs by integrating a semantic processing of the network typing and the emerging knowledge of online activities. In particular we investigate how (1) to bring online social data to ontology-based representations, (2) to conduct a social network analysis that takes advantage of the rich semantics of such representations, and (3) to semantically detect and label communities of online social networks and social tagging activities.
Hadoop, Pig, and Twitter (NoSQL East 2009)Kevin Weil
A talk on the use of Hadoop and Pig inside Twitter, focusing on the flexibility and simplicity of Pig, and the benefits of that for solving real-world big data problems.
Make a query regarding a topic of interest and come to know the sentiment for the day in pie-chart or for the week in form of line-chart for the tweets gathered from twitter.com
Using gamification to generate citizen input for public transport planningMarius Rohde Johannessen
Presentation at the 2016 ePart conference in Guimaraes, Portugal. Research in progress presenting a case study of a smart cities app, and discussing how the data can be used for increased citizen participation.
Presentation by Miguel Alvarez-Rodriguez, DG DIGIT, European Commission, at seminar 2, held on 18 March 2021, which addresses digital government principles and building blocks. This 2nd event takes place in the framework of a series of three webinars organised by the SIGMA Programme, a joint initiative of the OECD and EU, principally financed by the EU, on the role of life events in end-to-end public service delivery.
Smart Citizen - Sense Making - Óscar González, Fablab Barcelona Alex Gluhak
Talk at Urban Data Talks event #3. Fab Labs Barcelona's journey from Smart Cities to Smart Citizens. Tools and methodologies to empower smarter citizens
Presentation provided at "Geo Gebruikersfestival", okt 31, 2018, Amersfoort, The Netherlands. Sketches how from several Sensor projects the Smart Emission Platform emerged and migrated to the PDOK Platform. How a next step could be be a Dutch national Sensor SDI asa a federated/distributed architecture. Special attention is given to APIs, in particular the SensorThings API.
Social media data for Social science researchDavide Bennato
This is the talk I gave at the Lipari Summer School on Computational social science 2013. What are relationship between social science and big data? With a focus on Twitter and its social media mining tools
http://www.tecnoetica.it/2013/08/07/lipari-summer-school-computational-social-science-big-data-e-twitter/
With the tremendous growth of social networks, there has been a growth in the amount of new data that is being created every minute on these networking sites. The notion of community in this social networking world has caught lots of attention. Studying Twitter is useful for understanding how people use new communication technologies to form social connections and maintain existing ones. We analysed how geo-tagged tweets in Twitter can be used to identify useful user features and behavior as well as identify landmarks/places of interests. We also analysed several clustering algorithms and proposed different similarity measures to detect communities.
Twitter Text Mining with Web scraping, R, Shiny and Hadoop - Richard Sheng Richard Sheng
Based on an Analytics Week article of the Top 200 Influencers in Big Data and Analytics, I used R and Hadoop to analyze the Twitter Feeds of these leaders with Text Mining, Web Scraping and Visualization techniques.
This thesis proposes to help analyzing the characteristics of the heterogeneous social networks that emerge from the use of web-based social applications, with an original contribution that leverages Social Network Analysis with Semantic Web frameworks. Social Network Analysis (SNA) proposes graph algorithms to characterize the structure of a social network and its strategic positions. Semantic Web frameworks allow representing and exchanging knowledge across web applications with a rich typed graph model (RDF), a query language (SPARQL) and schema definition frameworks (RDFS and OWL). In this thesis, we merge both models in order to go beyond the mining of the flat link structure of social graphs by integrating a semantic processing of the network typing and the emerging knowledge of online activities. In particular we investigate how (1) to bring online social data to ontology-based representations, (2) to conduct a social network analysis that takes advantage of the rich semantics of such representations, and (3) to semantically detect and label communities of online social networks and social tagging activities.
Hadoop, Pig, and Twitter (NoSQL East 2009)Kevin Weil
A talk on the use of Hadoop and Pig inside Twitter, focusing on the flexibility and simplicity of Pig, and the benefits of that for solving real-world big data problems.
Make a query regarding a topic of interest and come to know the sentiment for the day in pie-chart or for the week in form of line-chart for the tweets gathered from twitter.com
Using gamification to generate citizen input for public transport planningMarius Rohde Johannessen
Presentation at the 2016 ePart conference in Guimaraes, Portugal. Research in progress presenting a case study of a smart cities app, and discussing how the data can be used for increased citizen participation.
Presentation by Miguel Alvarez-Rodriguez, DG DIGIT, European Commission, at seminar 2, held on 18 March 2021, which addresses digital government principles and building blocks. This 2nd event takes place in the framework of a series of three webinars organised by the SIGMA Programme, a joint initiative of the OECD and EU, principally financed by the EU, on the role of life events in end-to-end public service delivery.
Smart Citizen - Sense Making - Óscar González, Fablab Barcelona Alex Gluhak
Talk at Urban Data Talks event #3. Fab Labs Barcelona's journey from Smart Cities to Smart Citizens. Tools and methodologies to empower smarter citizens
Presentation provided at "Geo Gebruikersfestival", okt 31, 2018, Amersfoort, The Netherlands. Sketches how from several Sensor projects the Smart Emission Platform emerged and migrated to the PDOK Platform. How a next step could be be a Dutch national Sensor SDI asa a federated/distributed architecture. Special attention is given to APIs, in particular the SensorThings API.
Citiviz Corporate Presentation | Smart mobility for Citizen's Quality of LifeNicolas Lachance-Bernard
Innovaud Connect - Big Data: opportunities & challenges
EPFL, Lausanne, Switzerland
June 17th 2014
"Innovaud Connect" meetings aim at the following goals:
- Give actors in high tech and high potential innovations the opportunity to discover each other ;
- Understand the needs and expectations of all actors included in the value chain ;
- Initiate collaborations or partnerships between the actors ;
- Highlight the creativity potential of a center of competitiveness;
www.citiviz.com | www.twitter.com/Citiviz
www.innovaud.ch | www.twitter.com/Innovaud
How Open Culture Data and Digital Cultural Heritage Content can contribute si...Martin Elshout
Paul Manwaring shows EU member states how Open Culture Data and Digital Cultural Heritage Content can contribute significantly to the European App Economy in Vilnius, Lithuania Oct. 2 for the EU Presidency Conference Informal Meeting on Culture: Ready For Tomorrow? Culture as an Agent for Social and Economic Transformation. In his presentation Paul looks at the massive investment in European Cultural Heritage Digitization and shows how this investment can be optimized by creating Public Private Partnerships to create Apps that engage and inspire the public.
Knowledge Technologies group at CefrielIrene Celino
Main research and innovation interests of the Knowledge technologies groups at Cefriel: Semantic Interoperability and Human Computation. Summary of our research lines,our approach, our offer and our experience in cooperative R&D projects.
Customer Analytics; qué se necesita y cómo conseguirlo by Josep CurtoSngular Meaning
Josep Curto nos acerca al mundo del Big Data y cómo el estudio de la evidencia de los datos nos ayuda a tomar mejores decisiones de negocio. Escuchar al cliente, analizar los datos que nos proporcionan nos permite, como compañía, mejorar nuestros procesos, añadir valor a nuestra relación con los clientes y obtener mejores resultados.
Customer Analytics: de text analytics a Voice of CustomerSngular Meaning
El uso de datos desestructurados nos va a permitir tener una visión 360 del cliente. Solo atendiendo sus necesidades y escuchando lo que dicen, podemos ofrecer un valor diferencial a nuestros clientes. El big data al servicio de la satisfacción
Stilus corrector ortografico gramatical de estilo en espanolSngular Meaning
Presentación de Stilus - Una herramienta en línea para la corrección ortográfica, gramatical y de estilo en español.
http://www.mystilus.com
Stilus es un producto de Daedalus
http://www.daedalus.es/
Webinar Herramientas semánticas para sector Salud - Daedalus 4 noviembre 2014Sngular Meaning
Webinar sobre Tecnologías semánticas que entienden el lenguaje de la Salud, ofrecido por la empresa Daedalus el 4 noviembre 2014 -
Las tecnologías semánticas le ayudan a entender la “Voz de los Pacientes” y a gestionar la documentación clínica
Daedalus desarrolla tecnología para extraer significado de contenidos no estructurados. En el sector de e-Salud (e-Sanidad), la tecnología semántica permite explotar automáticamente la información de la Historia Clínica Electrónica (HCE).
Esta presentación cubre la experiencia de Daedalus en:
• Monitorización de contenidos online sobre salud
• Enriquecimiento semántico (etiquetado) de historia clínica
• Anonimización de historias clínicas
• Búsqueda multimedia en historias clínicas
• Detección de interacciones entre medicamentos
• Analítica de texto y de datos en el sector de salud
Daedalus develops technology to extract the meaning and structure all types of multimedia content. In the field of Healthcare or e-Health, Daedalus' semantic technology allows to exploit automatically the information featured in the Electronic Health Record (EHR).
This presentation covers Daedalus experience in:
• Online health content monitoring
• Semantic enrichment (tagging) of medical records
• Anonymization of medical records
• Multimedia search in medical records
• Detection of interactions between drugs
• Text analytics and data analytics in the health sector
Tracking Buzz and Sentiment for Second Screens - Daedalus - ACM TVX 2014Sngular Meaning
Presentation about "Numbat - Tracking Buzz and Sentiment for Second Screens" delivered by Textalytics/Daedalus at the ACM TVX 2014 conference (Newcastle, UK)
Textalytics is now MeaningCloud http://www.meaningcloud.com/
Mineria de informacion util en medios sociales - Daedalus - Big Data Week 201...Sngular Meaning
Ponencia "Terremotos, señales de compra y… #WTF: Minería de información útil en medios sociales" presentada por Antonio Matarranz, de Daedalus en el Big Data Week 2014 en Madrid
Textalytics es ahora MeaningCloud http://www.meaningcloud.com/
Presentación de Stilus sobre "Lingüística de Corpus aplicada a la corrección automática y profesional" en Lenguando 2014 (Madrid)
Regístrate gratis en mystilus.com
Stilus es una marca de Daedalus, S. A.
Textalytics - Voice of the Customer - Sentiment Analysis Symposium 2014Sngular Meaning
Presentation about "Voice of the Customer in the Financial Servce Industry" delivered by Textalytics/Daedalus at the Sentiment Analysis Symposium 2014 (NYC)
Textalytics is now MeaningCloud http://www.meaningcloud.com/
An Introduction to Textalytics API - Redradix WeekendSngular Meaning
Introduction to NLP and the Core API in Textalytics. Core API functionalities include Language Identification, Text Classification, Parsing , Topics and Entity Extraction, Sentment Analysis, Text Proofreading and even Speech Recognition. The presentation introduces Natural Language Processing tasks and how they help to build a semantic representation of texts. Linked Open Data (LOD) is also introduced as Topics Extraction API includes link to the most popular LOD repoositories as well as Wikipedia.
Textalytics is now MeaningCloud http://www.meaningcloud.com/
Real time semantic search engine for social tv streamsSngular Meaning
Social TV, the use of social networks to comment on TV programs is a growing phenomena. TV channels and brands are turning into social networks to look for real time insights about their programs. Understanding the global conversation about a program is useful to acquire insights for broadcasters and brands. For broadcasters, acquiring insights while a program is aired enable them to produce new content formats that include social conversation. For brands, it helps to prevent reputation crisis and increase the reach of their marketing efforts. For viewers, which increasingly use second screen devices, should benefit from tools that help to understand opinions around main content and connect with peers during TV programs or live events.
Textalytics is now MeaningCloud http://www.meaningcloud.com/
We present a system that combines natural language processing (Textalytics API) and a scalable semi-structured database/search engine (senseiDB) to provide semantic and faceted search, real time analytics and support visualizations for this kind of applications.
In the first part, we will present some of the useful NLP methods that we can use to tame unstructured big data like Twitter or Facebook comments. We will include description for tasks like text categorization, sentiment analysis, named entity recognition. We would also see how this data could be related to external data like Linked Data points. While the description would be general, examples would be illustrated using Textalytics API.
Then we would present how this data could be ingested and made available for search in real time using a semi-structured database like SenseiDB. We would present key features of SenseiDB including high performance real time indexing and simultaneous querying, distribution and support for full-text and faceted search. We would also discuss how facets may be overused to provide real time analytics and enable semantic search. Finally we will discuss advantages, problems and current limitations of SenseiDB.
Takeaway Points.
- Analyzing and searching text in social streams
- Integrating text analytics services (Textalytics) and a semi-structured database (SenseiDB)
- Key features of SenseiDB
Webinar Textalytics Meaning as a Service - Daedalus 8 octubre 2013Sngular Meaning
Webinar sobre Textalytics (Meaning as a Service) ofrecido por la empresa Daedalus el 8 de octubre de 2013 - La manera más sencilla de incorporar procesamiento semántico a sus aplicaciones.
Textalytics es ahora MeaningCloud http://www.meaningcloud.com/
Presentación para las V Jornadas Empresa de la Rede Galega de Procesamento da Linguaxe e Recuperación de Información.
Textalytics es ahora MeaningCloud http://www.meaningcloud.com/
A Tale of Two (Semantic) APIs - Daedalus - API Days MediterraneaSngular Meaning
Presentation delivered by Daedalus at API Days Mediterranea (Madrid, 1 June 2013).
The Textalytics product has been rebranded to MeaningCloud http://www.meaningcloud.com/
Webinar Análisis Semántico de Medios Sociales - Daedalus 21 may 2013Sngular Meaning
Webinar sobre Análisis Semántico de Medios Sociales ofrecido por la empresa Daedalus el 21 de mayo de 2013 - Explote al máximo lo que se dice en medios sociales usando tecnologías semánticas
Top 7 Unique WhatsApp API Benefits | Saudi ArabiaYara Milbes
Discover the transformative power of the WhatsApp API in our latest SlideShare presentation, "Top 7 Unique WhatsApp API Benefits." In today's fast-paced digital era, effective communication is crucial for both personal and professional success. Whether you're a small business looking to enhance customer interactions or an individual seeking seamless communication with loved ones, the WhatsApp API offers robust capabilities that can significantly elevate your experience.
In this presentation, we delve into the top 7 distinctive benefits of the WhatsApp API, provided by the leading WhatsApp API service provider in Saudi Arabia. Learn how to streamline customer support, automate notifications, leverage rich media messaging, run scalable marketing campaigns, integrate secure payments, synchronize with CRM systems, and ensure enhanced security and privacy.
Large Language Models and the End of ProgrammingMatt Welsh
Talk by Matt Welsh at Craft Conference 2024 on the impact that Large Language Models will have on the future of software development. In this talk, I discuss the ways in which LLMs will impact the software industry, from replacing human software developers with AI, to replacing conventional software with models that perform reasoning, computation, and problem-solving.
Unleash Unlimited Potential with One-Time Purchase
BoxLang is more than just a language; it's a community. By choosing a Visionary License, you're not just investing in your success, you're actively contributing to the ongoing development and support of BoxLang.
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamtakuyayamamoto1800
In this slide, we show the simulation example and the way to compile this solver.
In this solver, the Helmholtz equation can be solved by helmholtzFoam. Also, the Helmholtz equation with uniformly dispersed bubbles can be simulated by helmholtzBubbleFoam.
We describe the deployment and use of Globus Compute for remote computation. This content is aimed at researchers who wish to compute on remote resources using a unified programming interface, as well as system administrators who will deploy and operate Globus Compute services on their research computing infrastructure.
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeAftab Hussain
Understanding variable roles in code has been found to be helpful by students
in learning programming -- could variable roles help deep neural models in
performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Crescat
Crescat is industry-trusted event management software, built by event professionals for event professionals. Founded in 2017, we have three key products tailored for the live event industry.
Crescat Event for concert promoters and event agencies. Crescat Venue for music venues, conference centers, wedding venues, concert halls and more. And Crescat Festival for festivals, conferences and complex events.
With a wide range of popular features such as event scheduling, shift management, volunteer and crew coordination, artist booking and much more, Crescat is designed for customisation and ease-of-use.
Over 125,000 events have been planned in Crescat and with hundreds of customers of all shapes and sizes, from boutique event agencies through to international concert promoters, Crescat is rigged for success. What's more, we highly value feedback from our users and we are constantly improving our software with updates, new features and improvements.
If you plan events, run a venue or produce festivals and you're looking for ways to make your life easier, then we have a solution for you. Try our software for free or schedule a no-obligation demo with one of our product specialists today at crescat.io
Quarkus Hidden and Forbidden ExtensionsMax Andersen
Quarkus has a vast extension ecosystem and is known for its subsonic and subatomic feature set. Some of these features are not as well known, and some extensions are less talked about, but that does not make them less interesting - quite the opposite.
Come join this talk to see some tips and tricks for using Quarkus and some of the lesser known features, extensions and development techniques.
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Globus
The Earth System Grid Federation (ESGF) is a global network of data servers that archives and distributes the planet’s largest collection of Earth system model output for thousands of climate and environmental scientists worldwide. Many of these petabyte-scale data archives are located in proximity to large high-performance computing (HPC) or cloud computing resources, but the primary workflow for data users consists of transferring data, and applying computations on a different system. As a part of the ESGF 2.0 US project (funded by the United States Department of Energy Office of Science), we developed pre-defined data workflows, which can be run on-demand, capable of applying many data reduction and data analysis to the large ESGF data archives, transferring only the resultant analysis (ex. visualizations, smaller data files). In this talk, we will showcase a few of these workflows, highlighting how Globus Flows can be used for petabyte-scale climate analysis.
Code reviews are vital for ensuring good code quality. They serve as one of our last lines of defense against bugs and subpar code reaching production.
Yet, they often turn into annoying tasks riddled with frustration, hostility, unclear feedback and lack of standards. How can we improve this crucial process?
In this session we will cover:
- The Art of Effective Code Reviews
- Streamlining the Review Process
- Elevating Reviews with Automated Tools
By the end of this presentation, you'll have the knowledge on how to organize and improve your code review proces
Globus Connect Server Deep Dive - GlobusWorld 2024Globus
We explore the Globus Connect Server (GCS) architecture and experiment with advanced configuration options and use cases. This content is targeted at system administrators who are familiar with GCS and currently operate—or are planning to operate—broader deployments at their institution.
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Globus
Large Language Models (LLMs) are currently the center of attention in the tech world, particularly for their potential to advance research. In this presentation, we'll explore a straightforward and effective method for quickly initiating inference runs on supercomputers using the vLLM tool with Globus Compute, specifically on the Polaris system at ALCF. We'll begin by briefly discussing the popularity and applications of LLMs in various fields. Following this, we will introduce the vLLM tool, and explain how it integrates with Globus Compute to efficiently manage LLM operations on Polaris. Attendees will learn the practical aspects of setting up and remotely triggering LLMs from local machines, focusing on ease of use and efficiency. This talk is ideal for researchers and practitioners looking to leverage the power of LLMs in their work, offering a clear guide to harnessing supercomputing resources for quick and effective LLM inference.
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...Juraj Vysvader
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I didn't get rich from it but it did have 63K downloads (powered possible tens of thousands of websites).
Listen to the keynote address and hear about the latest developments from Rachana Ananthakrishnan and Ian Foster who review the updates to the Globus Platform and Service, and the relevance of Globus to the scientific community as an automation platform to accelerate scientific discovery.
May Marketo Masterclass, London MUG May 22 2024.pdfAdele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
May Marketo Masterclass, London MUG May 22 2024.pdf
Tweet alert - semantic analysis in social networks for citizen opinion mining
1. PeGOV 2014 – 2nd Workshop on Personalization in eGovernment Services and Applications
11 July 2014, Aalborg, Denmark
TweetAlert:
Semantic Analytics in Social Networks
for Citizen Opinion Mining
in the City of the Future
Julio Villena-Román1,2,
Adrián Luna-Cobos1,3, José Carlos González-Cristóbal3,1
1 DAEDALUS - Data, Decisions and Language, S.A.
2 Universidad Carlos III de Madrid
3 Universidad Politécnica de Madrid
jvillena@daedalus.es, aluna@daedalus.es, josecarlos.gonzalez@upm.es
2. PeGOV-2014
11 July 2014, Aalborg, Denmark 2
Agenda
! Framework
! Citizen Sensor
! System
! Business cases
! Future work
3. PeGOV-2014
11 July 2014, Aalborg, Denmark 3
Framework
! Ciudad 2020 aims to achieve significant improvements in areas of
energetic efficiency, Internet of the Future, Internet of Things, human
behaviour, environmental sustainability and mobility and transport, in
order to design the City of the Future: sustainable, efficient, smart.
! Spanish R&D project, INNPRONTA Programme, Center for Industrial
Technological Development (CDTI), Ministry of Economy and
Competitiveness
! 2011-2014
! 16,3 M€ budget
! 5 multinational corporations, 4 SMEs, 8 PRIs
! Daedalus focuses on the automatic extraction of meaning from all types
of multimedia content, using NLP technologies and data/text analytics to
help our customers solve any challenge in these areas.
4. leisure and free time
surveys
PeGOV-2014
11 July 2014, Aalborg, Denmark 4
Citizen Sensor
mobility
professional activities
opinions in
social media
relationship with
public administration
collaborative
sensing
relationship with
other people
Citizen 2020 = another city sensor
5. PeGOV-2014
11 July 2014, Aalborg, Denmark 5
Citizen Sensor
! Innovative way to capture a very descriptive high-level
heterogeneous information, bringing high added value
especially when considering aggregations
! More complex and richer information than other sensors
! “smells awful”, “there is a fire”, “I’m going to the sales”…
! Individual actions may show citizen trends
! validate a bus ticket " route density
! Opinion/sentiments of the citizen about the city
! follow social networks to assess the impact of new policies
! Collaborative sensing
! using smartphones to get data (pollution, energy consumption) with low
cost and new possibilities
6. Our approach
What: Build a system able to capture, store and analyze user
PeGOV-2014
11 July 2014, Aalborg, Denmark 6
messages
Where: In Twitter
For whom: City administrators
What for: To help them rapidly and easily understand citizen
behaviour trends and know their opinion about city
services, events, etc.
Why: To enable them to better understand citizen necessities,
generate hypotheses over urban behaviour models, in
order to improve municipal management policies,
bringing them closer to the actual reality of the citizens
How: Using NLP technologies
When: In real-time
8. Information Repository
! Stores the high volume of data and provides advanced search
functionality to exploit the information
! Based on Elasticsearch
! open source, distributed, real-time search and analytics engine
! complex search capabilities
! scalable high-performance solution
PeGOV-2014
11 July 2014, Aalborg, Denmark 8
http://www.elasticsearch.org
9. PeGOV-2014
11 July 2014, Aalborg, Denmark 9
Gatherer
! Set of concurrent processes that query the Twitter APIs to collect
tweets
! Search or Streaming API
! Filter by a list of user identifiers, a list of keywords to track (terms,
hashtags) and/or a set of geographical bounding boxes
! Returns tweet text, author, location, embedded media
https://dev.twitter.com/docs/api/1.1
10. Text
Classification
API
http://textalytics.com
PeGOV-2014
11 July 2014, Aalborg, Denmark 10
Inquirer
! Set of concurrent processes that annotate tweets using our
Textalytics Core APIs
! Entities
! Concepts
Topic Extraction API
! Hashtags
! Thematic area of the message (transport, economy, daily life…)
! Citizen Sensor model
! Alert situations (road accidents, fires, street violence…)
! Specific location of the user (building, means of transport...)
! Events to which the text refers (cultural events, sports...)
! Sentiment polarity : P+, P, NEU, N, N+, NONE
! Irony and subjectivity
! User demographics: gender, age, type of tweet author
Sentiment Analysis API
User Demographics API
11. Entities, concepts, hashtags
Advanced NLP to obtain POS, syntactic tree and semantic analyses of the
text and use it to identify different types of significant elements
PeGOV-2014
11 July 2014, Aalborg, Denmark 11
12. Text classification
State-of-the-art hybrid text classification model using a statistical
classification combined with a rule-based filtering
PeGOV-2014
11 July 2014, Aalborg, Denmark 12
Social Media
Citizen Sensor
16. Sentiment analysis
State-of-the-art lexicon-based model for sentiment analysis, using POS
and syntactic tree for detecting negation and controlling the scope of
modifiers + subjectivity classification + irony detection
PeGOV-2014
11 July 2014, Aalborg, Denmark 16
17. User Demographics
Text classification based on n-grams model to guess user type, gender and
age from his/her login, name and profile description
PeGOV-2014
11 July 2014, Aalborg, Denmark 17
18. PeGOV-2014
11 July 2014, Aalborg, Denmark 18
Example
{
"text":"el viento ha roto una rama y hay un atascazo increible en toda la gran vía...",
"tag_list":[
{"type":"sensor", "value":"011002 Ubicación - Exteriores - Vías públicas"},
{"type":"sensor", "value":"070700 Alertas meteorológicas - Viento"},
{"type":"sensor", "value":"080100 Incidencia - Congestión de tráfico"},
{"type":"topic", "value":"06 medio ambiente, meteorología y energía"},
{"type":"entity", "value":"Gran Vía"},
{"type":"concept", "value":"viento"},
{"type":"sentiment", "value":"N"},
{"type":"subjectivity", "value":"OBJ"},
{"type":"irony", "value":"NONIRONIC"},
{"type":"user_type", "value":"PERSON"},
{"type":"user_gender", "value":"FEMALE"},
{"type":"user_age", "value":"25-35"}
]
}
21. PeGOV-2014
11 July 2014, Aalborg, Denmark 21
Ongoing business cases
! City console for a local administration to analyze in real-time the
behaviour and topics of interest of the citizens, with two
components:
! a private console, internal for the city services, for analytics
! a public dashboard to engage citizens with their city, displaying
attractive, summarized, non-confidential information at selected
public locations (town hall, libraries, museums) or a LED video wall in
a populous square in downtown
! Social alert detection system
! For 112 emergency services, providing early detection of security-related
issues
22. For short/mid term future
! Trending topics geolocation clustering
PeGOV-2014
11 July 2014, Aalborg, Denmark 22
! Analysis at neighbourhood level
health
traffic
jam
air pollution
jellyfish
pollen
23. For short/mid term future
PeGOV-2014
11 July 2014, Aalborg, Denmark 23
! Analysis of city pace of life
24. For short/mid term future
PeGOV-2014
11 July 2014, Aalborg, Denmark 24
! Mobility analysis
! How, when, why people move through the city
! Route identification (home"work"free time"home)
! Route changes (due to weather)
25. For short/mid term future
! City reputation and brand personality
! Automated satisfaction surveys
PeGOV-2014
11 July 2014, Aalborg, Denmark 25
26. This work has been supported by several Spanish R&D projects: Ciudad2020: Hacia un nuevo modelo de ciudad inteligente
sostenible (INNPRONTA IPT-20111006), MA2VICMR: Improving the access, analysis and visibility of the multilingual and
multimedia information in web for the Region of Madrid (S2009/TIC-1542) and MULTIMEDICA: Multilingual Information
Extraction in Health domain and application to scientific and informative documents (TIN2010-20644-C03-01). Authors
would like to thank all partners for their knowledge and support.
PeGOV-2014
11 July 2014, Aalborg, Denmark 26