Web 2.0 applications for social networking provide data about users’ mood and opinions in almost real time. Many applications are taking advantage of these data to derive business intelligence. However, the volume of data makes it hard and error-prone to classify sentiments and opinions manually. The combination of data mining techniques and a pipeline to process data from Web 2.0 applications, such as Twitter, Facebook, and Wordpress, makes it possible to apply natural language processing and machine learning techniques to automate partially this task. Therefore, the amount of manual classification is reduced, as the incoming data has already a classification tag that can be easily changed, feeding back the classifier. There is room for improvements and a Brazilian Portuguese Corpus was created to do the initial training of the classifier. The code used for this testing was based on open source libraries and is available as a test bed for different corpora and new algorithms.
Este documento discute os benefícios de usar bancos de dados NoSQL e apresenta vários estudos de caso de empresas que migraram com sucesso para sistemas NoSQL como MongoDB e Redis. Ele também aborda desafios comuns como importação e exportação de dados, recuperação em caso de falha e questões de escalabilidade.
Architectural Anti Patterns - Notes on Data Distribution and Handling FailuresGleicon Moraes
The document discusses several architectural anti-patterns related to data distribution and handling failures when using relational database management systems (RDBMS). It describes anti-patterns such as using tables as caches, queues, log files, or for dynamic schema creation. It also discusses abusing RDBMS features like stored procedures and triggers for application logic as well as using tables for distributed locking. The document is presented as a slide deck covering these anti-patterns to avoid when designing distributed systems that use RDBMS for data storage and access.
QCon SP 2015 - Advogados do diabo: como a arquitetura emergente de sua aplica...Gleicon Moraes
Este documento discute os desafios de entrega contínua em sistemas complexos reais. Ele destaca como as pressões por novas funcionalidades e mudanças frequentes podem levar a arquiteturas frágeis e mal testadas, dificultando a automação e a entrega contínua. Também enfatiza a importância de métricas, logs, testes abrangentes e padrões como circuit breaker para tornar os sistemas mais robustos e previsíveis.
This document discusses Infrastructure as a Service (IaaS) and Software Defined Networking (SDN).
IaaS allows consumers to provision computing resources like servers, storage, and networking and deploy their own operating systems and applications. The consumer does not manage the underlying cloud infrastructure. SDN abstracts traditional network equipment by separating the control and data planes, using a centralized controller and open standards like OpenFlow. This allows network configuration through software instead of dedicated hardware. The document then provides examples of how IaaS resources and SDN architecture could be implemented in a cloud computing environment.
This document provides an overview of Locaweb's Infrastructure as a Service (IaaS) engineering team and capabilities. It discusses the team size and responsibilities, defines IaaS, describes Locaweb's IaaS architecture including virtual/physical servers, open source software used, provisioning processes, and resource monitoring. It also lists several open source projects developed by Locaweb to support their IaaS operations.
This document summarizes a webinar on the invisible dynamics of collaboration in high performing groups and teams. It discusses using a tool called Menti to anonymously score statements about group dynamics and map individual results on a chart. The webinar explored research findings on how group dynamics relate to organizational learning and environment factors. It aimed to help participants reflect on dynamics within their own groups and areas for improvement.
The document provides steps for requesting writing assistance from HelpWriting.net. It outlines the 5-step process: 1) Create an account with a password and email. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and choose one based on qualifications. 4) Review the completed paper and authorize payment or request revisions. 5) Request revisions to ensure satisfaction, with the option of a full refund for plagiarized work.
Este documento discute os benefícios de usar bancos de dados NoSQL e apresenta vários estudos de caso de empresas que migraram com sucesso para sistemas NoSQL como MongoDB e Redis. Ele também aborda desafios comuns como importação e exportação de dados, recuperação em caso de falha e questões de escalabilidade.
Architectural Anti Patterns - Notes on Data Distribution and Handling FailuresGleicon Moraes
The document discusses several architectural anti-patterns related to data distribution and handling failures when using relational database management systems (RDBMS). It describes anti-patterns such as using tables as caches, queues, log files, or for dynamic schema creation. It also discusses abusing RDBMS features like stored procedures and triggers for application logic as well as using tables for distributed locking. The document is presented as a slide deck covering these anti-patterns to avoid when designing distributed systems that use RDBMS for data storage and access.
QCon SP 2015 - Advogados do diabo: como a arquitetura emergente de sua aplica...Gleicon Moraes
Este documento discute os desafios de entrega contínua em sistemas complexos reais. Ele destaca como as pressões por novas funcionalidades e mudanças frequentes podem levar a arquiteturas frágeis e mal testadas, dificultando a automação e a entrega contínua. Também enfatiza a importância de métricas, logs, testes abrangentes e padrões como circuit breaker para tornar os sistemas mais robustos e previsíveis.
This document discusses Infrastructure as a Service (IaaS) and Software Defined Networking (SDN).
IaaS allows consumers to provision computing resources like servers, storage, and networking and deploy their own operating systems and applications. The consumer does not manage the underlying cloud infrastructure. SDN abstracts traditional network equipment by separating the control and data planes, using a centralized controller and open standards like OpenFlow. This allows network configuration through software instead of dedicated hardware. The document then provides examples of how IaaS resources and SDN architecture could be implemented in a cloud computing environment.
This document provides an overview of Locaweb's Infrastructure as a Service (IaaS) engineering team and capabilities. It discusses the team size and responsibilities, defines IaaS, describes Locaweb's IaaS architecture including virtual/physical servers, open source software used, provisioning processes, and resource monitoring. It also lists several open source projects developed by Locaweb to support their IaaS operations.
This document summarizes a webinar on the invisible dynamics of collaboration in high performing groups and teams. It discusses using a tool called Menti to anonymously score statements about group dynamics and map individual results on a chart. The webinar explored research findings on how group dynamics relate to organizational learning and environment factors. It aimed to help participants reflect on dynamics within their own groups and areas for improvement.
The document provides steps for requesting writing assistance from HelpWriting.net. It outlines the 5-step process: 1) Create an account with a password and email. 2) Complete a 10-minute order form providing instructions, sources, and deadline. 3) Review bids from writers and choose one based on qualifications. 4) Review the completed paper and authorize payment or request revisions. 5) Request revisions to ensure satisfaction, with the option of a full refund for plagiarized work.
Slides Joni Hoadley recently used in his discussion w/ mentees of The Product Mentor.
Synopsis: Every good product manager should listen to the voice of their customers. The key is using the right data to inform and influence your product decisions.
The Product Mentor is a program designed to pair Product Mentors and Mentees from around the World, across all industries, from start-up to enterprise, guided by the fundamental goals…Better Decisions. Better Products. Better Product People.
Throughout the program, each mentor leads a conversation in an area of their expertise that is live streamed and available to both mentee and the broader product community.
http://TheProductMentor.com
The document discusses online qualitative research tools and methods. It provides a history of the company Itracks and how online qualitative research has evolved since the 1990s with the growth of the internet. It then outlines several online qualitative research techniques like online focus groups, bulletin board focus groups, and additional services like iMarkIt for concept testing. It discusses the advantages of these online qualitative methods and how they can support quantitative research.
Netbase AMA Sentiment Analysis PresentationNetBase
Marketers can’t stop talking about social media, but how many understand how it can help them meet critical business objectives? Or what tools are available to analyze social media, how they compare, and which one is best suited for market research and brand managers?
This NetBase presentation will teach you:
How social media impacts your sales funnel.
Why understanding specific customer themes is important.
How to quantify conversations and get actionable insights that strengthen your brand.
How to use social analytics tools to efficiently get valuable competitive insights.
This document discusses social learning strategies and models. It begins with an introduction of the author, David Wilkins, who has over 15 years of experience in learning and is a national speaker. The rest of the document discusses the evolution of learning models from a top-down, formal structure to a more collaborative and emergent structure. It provides examples of how social learning has been implemented at companies like Cisco and Ace Hardware and discusses how learning interventions can incorporate different paradigms like formal, collaborative and emergent approaches. The document advocates that leaders think about whether their initiatives are more dependent on codified processes or emergent ideas and tailor their learning strategies accordingly.
Kellogg Video Essay Question List. Online assignment writing service.Ashley Opokuaa
Octavia Butler's short story "Speech Sounds" follows a woman named Rye in a dystopian future where a pandemic has severely limited human communication abilities. Through Rye's actions and dialogue, Butler challenges gender norms and expectations. Rye displays intelligence, bravery, and agency as she navigates dangerous situations, using violence if necessary to protect herself despite social expectations of female passiveness. Her initiation of a sexual relationship with another character also subverts expectations of female passivity. The story dismantles notions of gender stratification and female submission through Rye's defiance of societal roles.
Having Trouble Writing College Essay. How To Write An Exemplification ...Rebecca Bordes
The document provides instructions for how to request and complete an assignment writing request through the HelpWriting.net website. It outlines a 5-step process: 1) Create an account; 2) Complete an order form with instructions and deadline; 3) Review bids from writers and select one; 4) Receive the completed paper; 5) Request revisions if needed and authorize payment upon satisfaction. The process aims to match students with qualified writers to help complete assignments while allowing for revisions until the student is fully satisfied with the end product.
OAC is testing new product features using split testing and cohort analysis to better understand what members want. This revealed that members preferred solo study options over social tools, contrary to the team's assumptions. Regular reporting of experimental results in accessible formats helps the team learn and prioritize development based on what improves member outcomes. Relying only on vanity metrics like total users could mislead the team about the product's sustainability and effectiveness.
Using Open Badges to provide micro-credentials is a great way to track upskilling and career pathing for learners within organizations. This session will cover how micro-credentialing supports ongoing professional development and personalized learning pathways as it pertains to accomplishments and skills.
By attending this session, you will:
-Explore the open badge framework and how it can track progress to act as a workplace currency
-Analyze why most organizations that try badging don't succeed in their goals
-Learn how badges help solve the skills gap and how clients are using them to transform their learning culture.
-Discover strategies for badging effectively at your organization.
Logan Peterson provides information on three free online certifications: Google Analytics, HootSuite Social Media Management, and Temple Harrisburg International Nonprofit Training and Leadership. The certifications help expand knowledge, provide skills for the workplace, and validate online profiles. Google Analytics focuses on data analytics strategy, segmentation, and key performance indicators. HootSuite emphasizes using their dashboard. The Temple program includes six online nonprofit courses. Pursuing these certifications sets students up for success in technology, data analysis, social media, and nonprofit involvement.
Pam Morris is interviewed about her background in software measurement and process improvement. She discusses some of the root causes of high software project failure rates, including poor requirements specifications and inadequate scoping. She emphasizes that software process improvement is about rigor and measurement, not just "checking boxes". Measurement programs are most effective when they start small and focus on key metrics that support management's goals and decision-making. The role of management is also important to drive measurement from the top down.
This document discusses challenges with qualitative and quantitative data collection in school counseling and provides tips for effective use and presentation of data. It addresses:
1) Common types of qualitative data counselors collect and how to code it to find patterns and themes.
2) Potential pitfalls of quantitative data like non-response bias and issues with reliability/validity.
3) Strategies for presenting qualitative and quantitative data together using tools like Sign-Up Genius, Google Forms, Survey Monkey, and mandatory surveys. The goal is to use mixed methods and data to improve counseling programs and services.
Many companies seem to have fallen victim to the ominous skills gap – but only because they’ve found it’s easier to place blame somewhere than to look at what they can do to fix it. If you’re struggling to find qualified candidates, you might need to take a deeper look at your organization and ask some pointed questions. Are there areas of the recruiting process that are lending to skill gaps? Have you done a formal assessment to uncover skill gaps? Are you using modern learning methods to bridge gaps by upskilling your current workforce?
Join Katie Miller from BizLibrary as we re-evaluate potential root causes of your organization’s skill gaps and discuss what can be done to fill them efficiently.
In this webinar, you’ll learn:
The factors that could be contributing to your skill gaps, and how to find and address the root causes
How to re-evaluate what a “qualified candidate” is in order to create more relevant and realistic talent expectations
How to start uncovering your organization’s skill gaps with a formal assessment
How to use modern training methods to bridge skill gaps by upskilling your workforce
One of the most important part of talent and human capital management is engagement and one of the tool to measure engagement is by Gallup Q12 employee engagement survey.Hope this article will help you.
From sending 53 million emails at Conversio, here's five things I learned about how to make emails relevant to the recipients plus the quantitive results showing the performance improvements that relevancy gives you.
May 20, 2018: Colorado Coach Connection ICF Colorado
This document provides information on various coaching programs, workshops, and certifications. It includes the following opportunities:
- A monthly supervision program to help coaches reflect on their practice.
- A leadership coaching program to help participants add coaching to their leadership approach.
- A workshop exploring how women can overcome feeling silenced in professional settings.
- Various certifications in emotional intelligence assessment and developing social and emotional skills for individuals and teams.
- An event focusing on using purpose to grow a business.
The document discusses performance reviews and their flaws. It argues that performance reviews focus too much on evaluating past performance rather than developing future performance. Reviews are subjective and don't accurately capture an employee's full-year contributions. Instead, the document advocates for development programs that focus on coaching employees to achieve new goals and bring added value, with compensation increases tied to delivering more rather than just doing one's basic job. It also stresses the importance of ongoing feedback rather than annual reviews.
Gartner webinar social media analytics 23.10.2014Irene Ventayol
Virtually every modern marketer has a presence in social channels, and many use social listening tools to monitor what people say about their brands. Yet despite being a maturing discipline, social analytics remains stubbornly difficult and frustrating to apply. How much is a Facebook fan worth? Does it matter that your "net sentiment" is in the single digits? Your "share of voice" on Twitter is down this week – should you panic? This presentation focuses on the social analytics vendors, techniques, metrics and cases that can help you most.
Whether you are creating and leading strategy for your organization or have a role in driving adoption of a strategy there are three, easily avoidable mistakes that many organizations and leaders make: (1) not being clear on intent, (2) not understanding if all leaders are on-board with the strategy, and (3) not predicting and mitigating failure points.
In this webinar, Erica Tetuan, Change Management Practice Lead, will explore:
A formula to ensure the intent is clear and you know your role in bringing the strategy to life
A method for thinking about who needs to be engaged in leading the strategy and how to determine if their support of the strategy and aptitude for leading the change is where you need it to be and how to assess yourself
The power of being predictive to be preventative – how to see around corners to make better decisions and plan for things that might go awry
The document summarizes the key findings of a study conducted by iProspect and comScore on the branding implications of digital media like search engine marketing, search engine optimization, and online display advertising. Some of the main findings include:
- Exposure to combinations of digital media assets like paid search results, organic search results, and display ads significantly improved people's perceptions of brands and their likelihood to interact with brands, even if they didn't click on the ads.
- Paid search results had the greatest individual impact on brand lift and led to a 44% increase in purchase likelihood. The combination of paid and organic search results produced the strongest brand lift overall, including a 73% increase in purchase likelihood.
- Online
O documento discute as quebras de arquiteturas de dados, incluindo por design inadequado, falta de capacidade ou dono. Também aborda a gravidade dos dados e casos de arquiteturas que quebraram, como um cluster de banco de dados que cresceu muito rápido sem planejamento.
Slides Joni Hoadley recently used in his discussion w/ mentees of The Product Mentor.
Synopsis: Every good product manager should listen to the voice of their customers. The key is using the right data to inform and influence your product decisions.
The Product Mentor is a program designed to pair Product Mentors and Mentees from around the World, across all industries, from start-up to enterprise, guided by the fundamental goals…Better Decisions. Better Products. Better Product People.
Throughout the program, each mentor leads a conversation in an area of their expertise that is live streamed and available to both mentee and the broader product community.
http://TheProductMentor.com
The document discusses online qualitative research tools and methods. It provides a history of the company Itracks and how online qualitative research has evolved since the 1990s with the growth of the internet. It then outlines several online qualitative research techniques like online focus groups, bulletin board focus groups, and additional services like iMarkIt for concept testing. It discusses the advantages of these online qualitative methods and how they can support quantitative research.
Netbase AMA Sentiment Analysis PresentationNetBase
Marketers can’t stop talking about social media, but how many understand how it can help them meet critical business objectives? Or what tools are available to analyze social media, how they compare, and which one is best suited for market research and brand managers?
This NetBase presentation will teach you:
How social media impacts your sales funnel.
Why understanding specific customer themes is important.
How to quantify conversations and get actionable insights that strengthen your brand.
How to use social analytics tools to efficiently get valuable competitive insights.
This document discusses social learning strategies and models. It begins with an introduction of the author, David Wilkins, who has over 15 years of experience in learning and is a national speaker. The rest of the document discusses the evolution of learning models from a top-down, formal structure to a more collaborative and emergent structure. It provides examples of how social learning has been implemented at companies like Cisco and Ace Hardware and discusses how learning interventions can incorporate different paradigms like formal, collaborative and emergent approaches. The document advocates that leaders think about whether their initiatives are more dependent on codified processes or emergent ideas and tailor their learning strategies accordingly.
Kellogg Video Essay Question List. Online assignment writing service.Ashley Opokuaa
Octavia Butler's short story "Speech Sounds" follows a woman named Rye in a dystopian future where a pandemic has severely limited human communication abilities. Through Rye's actions and dialogue, Butler challenges gender norms and expectations. Rye displays intelligence, bravery, and agency as she navigates dangerous situations, using violence if necessary to protect herself despite social expectations of female passiveness. Her initiation of a sexual relationship with another character also subverts expectations of female passivity. The story dismantles notions of gender stratification and female submission through Rye's defiance of societal roles.
Having Trouble Writing College Essay. How To Write An Exemplification ...Rebecca Bordes
The document provides instructions for how to request and complete an assignment writing request through the HelpWriting.net website. It outlines a 5-step process: 1) Create an account; 2) Complete an order form with instructions and deadline; 3) Review bids from writers and select one; 4) Receive the completed paper; 5) Request revisions if needed and authorize payment upon satisfaction. The process aims to match students with qualified writers to help complete assignments while allowing for revisions until the student is fully satisfied with the end product.
OAC is testing new product features using split testing and cohort analysis to better understand what members want. This revealed that members preferred solo study options over social tools, contrary to the team's assumptions. Regular reporting of experimental results in accessible formats helps the team learn and prioritize development based on what improves member outcomes. Relying only on vanity metrics like total users could mislead the team about the product's sustainability and effectiveness.
Using Open Badges to provide micro-credentials is a great way to track upskilling and career pathing for learners within organizations. This session will cover how micro-credentialing supports ongoing professional development and personalized learning pathways as it pertains to accomplishments and skills.
By attending this session, you will:
-Explore the open badge framework and how it can track progress to act as a workplace currency
-Analyze why most organizations that try badging don't succeed in their goals
-Learn how badges help solve the skills gap and how clients are using them to transform their learning culture.
-Discover strategies for badging effectively at your organization.
Logan Peterson provides information on three free online certifications: Google Analytics, HootSuite Social Media Management, and Temple Harrisburg International Nonprofit Training and Leadership. The certifications help expand knowledge, provide skills for the workplace, and validate online profiles. Google Analytics focuses on data analytics strategy, segmentation, and key performance indicators. HootSuite emphasizes using their dashboard. The Temple program includes six online nonprofit courses. Pursuing these certifications sets students up for success in technology, data analysis, social media, and nonprofit involvement.
Pam Morris is interviewed about her background in software measurement and process improvement. She discusses some of the root causes of high software project failure rates, including poor requirements specifications and inadequate scoping. She emphasizes that software process improvement is about rigor and measurement, not just "checking boxes". Measurement programs are most effective when they start small and focus on key metrics that support management's goals and decision-making. The role of management is also important to drive measurement from the top down.
This document discusses challenges with qualitative and quantitative data collection in school counseling and provides tips for effective use and presentation of data. It addresses:
1) Common types of qualitative data counselors collect and how to code it to find patterns and themes.
2) Potential pitfalls of quantitative data like non-response bias and issues with reliability/validity.
3) Strategies for presenting qualitative and quantitative data together using tools like Sign-Up Genius, Google Forms, Survey Monkey, and mandatory surveys. The goal is to use mixed methods and data to improve counseling programs and services.
Many companies seem to have fallen victim to the ominous skills gap – but only because they’ve found it’s easier to place blame somewhere than to look at what they can do to fix it. If you’re struggling to find qualified candidates, you might need to take a deeper look at your organization and ask some pointed questions. Are there areas of the recruiting process that are lending to skill gaps? Have you done a formal assessment to uncover skill gaps? Are you using modern learning methods to bridge gaps by upskilling your current workforce?
Join Katie Miller from BizLibrary as we re-evaluate potential root causes of your organization’s skill gaps and discuss what can be done to fill them efficiently.
In this webinar, you’ll learn:
The factors that could be contributing to your skill gaps, and how to find and address the root causes
How to re-evaluate what a “qualified candidate” is in order to create more relevant and realistic talent expectations
How to start uncovering your organization’s skill gaps with a formal assessment
How to use modern training methods to bridge skill gaps by upskilling your workforce
One of the most important part of talent and human capital management is engagement and one of the tool to measure engagement is by Gallup Q12 employee engagement survey.Hope this article will help you.
From sending 53 million emails at Conversio, here's five things I learned about how to make emails relevant to the recipients plus the quantitive results showing the performance improvements that relevancy gives you.
May 20, 2018: Colorado Coach Connection ICF Colorado
This document provides information on various coaching programs, workshops, and certifications. It includes the following opportunities:
- A monthly supervision program to help coaches reflect on their practice.
- A leadership coaching program to help participants add coaching to their leadership approach.
- A workshop exploring how women can overcome feeling silenced in professional settings.
- Various certifications in emotional intelligence assessment and developing social and emotional skills for individuals and teams.
- An event focusing on using purpose to grow a business.
The document discusses performance reviews and their flaws. It argues that performance reviews focus too much on evaluating past performance rather than developing future performance. Reviews are subjective and don't accurately capture an employee's full-year contributions. Instead, the document advocates for development programs that focus on coaching employees to achieve new goals and bring added value, with compensation increases tied to delivering more rather than just doing one's basic job. It also stresses the importance of ongoing feedback rather than annual reviews.
Gartner webinar social media analytics 23.10.2014Irene Ventayol
Virtually every modern marketer has a presence in social channels, and many use social listening tools to monitor what people say about their brands. Yet despite being a maturing discipline, social analytics remains stubbornly difficult and frustrating to apply. How much is a Facebook fan worth? Does it matter that your "net sentiment" is in the single digits? Your "share of voice" on Twitter is down this week – should you panic? This presentation focuses on the social analytics vendors, techniques, metrics and cases that can help you most.
Whether you are creating and leading strategy for your organization or have a role in driving adoption of a strategy there are three, easily avoidable mistakes that many organizations and leaders make: (1) not being clear on intent, (2) not understanding if all leaders are on-board with the strategy, and (3) not predicting and mitigating failure points.
In this webinar, Erica Tetuan, Change Management Practice Lead, will explore:
A formula to ensure the intent is clear and you know your role in bringing the strategy to life
A method for thinking about who needs to be engaged in leading the strategy and how to determine if their support of the strategy and aptitude for leading the change is where you need it to be and how to assess yourself
The power of being predictive to be preventative – how to see around corners to make better decisions and plan for things that might go awry
The document summarizes the key findings of a study conducted by iProspect and comScore on the branding implications of digital media like search engine marketing, search engine optimization, and online display advertising. Some of the main findings include:
- Exposure to combinations of digital media assets like paid search results, organic search results, and display ads significantly improved people's perceptions of brands and their likelihood to interact with brands, even if they didn't click on the ads.
- Paid search results had the greatest individual impact on brand lift and led to a 44% increase in purchase likelihood. The combination of paid and organic search results produced the strongest brand lift overall, including a 73% increase in purchase likelihood.
- Online
O documento discute as quebras de arquiteturas de dados, incluindo por design inadequado, falta de capacidade ou dono. Também aborda a gravidade dos dados e casos de arquiteturas que quebraram, como um cluster de banco de dados que cresceu muito rápido sem planejamento.
This document provides an overview of API gateways. It discusses the API gateway pattern which includes separating client and server code, providing distinct API views from the same origin, and composing calls between APIs. It also includes an architecture diagram and discusses core features like uniform authentication, REST over HTTPS, horizontal scalability, payload rewrite, request composition, and backend as a service. Popular API gateway players like APIGEE, AWS API Gateway, and Kong are compared. Potential pitfalls around dependency, lock-in, scalability, and backend savings are also covered. The conclusion recommends using API gateways as accelerators but planning to reduce their footprint, using standard protocols, managing cache/data, and being able to migrate to an open
DNAD 2015 - Como a arquitetura emergente de sua aplicação pode jogar contra ...Gleicon Moraes
O documento discute os desafios da entrega contínua à medida que sistemas crescem em complexidade. Apresenta falácias comuns em sistemas distribuídos e como requisitos não funcionais são esquecidos, levando a problemas. Sugere focar em métricas, testes robustos, cache, banco de dados e introdução segura de novas features para evitar problemas.
Por trás da infraestrutura do Cloud - Campus Party 2014Gleicon Moraes
This document discusses infrastructure as a service (IaaS) cloud computing models. It defines IaaS according to NIST and Wikipedia as allowing consumers to provision processing, storage, networks and other computing resources to deploy and run arbitrary software. The consumer does not manage the underlying infrastructure directly but has control over operating systems and applications. Common IaaS resources include virtual machines, storage, networking and security services. The document also discusses common cloud computing building blocks like compute, storage, networking and management services.
This document discusses how architecture emerges even when not initially planned. It begins with an overview of databases, message queues, and caching as common architectural elements that emerge over time. The document then provides examples of how simple applications and data needs can evolve into more complex architectures with multiple servers, databases, caching, and services. It emphasizes that architectures tend to grow more elaborate with each new system as ideas are carried over from previous projects.
The document discusses various types of failures in organizations and projects, including failures due to trivial issues that receive disproportionate attention ("bikeshedding"), entrepreneurs who overpromise and underdeliver, and behavioral patterns among team members that can contribute to failure, such as passing blame and not taking responsibility for issues ("chain of irresponsibility"). Specific examples of failures are provided for projects, methodologies, and individuals to illustrate different kinds of failures and how they can be avoided or overcome through learning from mistakes.
Dlsecyx pgroammr (Dyslexic Programmer - cool stuff for scaling)Gleicon Moraes
This document discusses asynchronous and non-blocking I/O models in various programming languages like Ruby, Python, Node.js, and Erlang. It provides code examples for implementing URL fetching using blocking vs non-blocking I/O in Ruby and Python. It also summarizes concepts like generators, deferreds, callbacks and event loops in different asynchronous programming models.
Architectural anti-patterns for data handlingGleicon Moraes
This document discusses architectural anti-patterns related to data distribution and handling failures. It provides examples of anti-patterns when using SQL and NoSQL databases, including using tables as queues, logs, or caches instead of the proper tools. Alternatives are suggested such as using message queues, document databases, and key-value stores instead of forcing data models. The document advises to simplify data schemes, avoid over-engineering, and think about how to best structure data and applications.
Architectural anti patterns_for_data_handlingGleicon Moraes
This document discusses architectural anti-patterns related to data distribution and handling failures. It describes issues like using tables for queues, logs, or caches instead of the proper tools. Alternatives presented include document databases, key-value stores, message queues, and avoiding over-normalization. The document advocates simplifying data models and thinking about architecture and data flow rather than only databases.
RestMQ is a message queue system based on Redis that allows storing and retrieving messages through HTTP requests. It uses Redis' data structures like lists, sets, and hashes to maintain queues and messages. Messages can be added to and received from queues using RESTful endpoints. Additional features include status monitoring, queue control, and support for protocols like JSON, Comet, and WebSockets. The core functionality is language-agnostic but implementations exist in Python and Ruby.
This document provides examples of common SQL anti-patterns and related NoSQL alternatives. It discusses issues like using tables as trees, caches, queues, or logs. It also addresses dynamic schema/table creation, stored procedures, row padding, complex joins, and object-relational mismatches. The document recommends alternatives like document databases, key-value stores, message brokers, and denormalization. It includes examples of modeling book data in MongoDB and Redis.
Redis is a key-value store that provides common data structures like lists, sets, and ordered sets as building blocks. It offers features like asynchronous I/O, fast operations, active development, and different persistence strategies. RestMQ is a message queue built on Redis that uses HTTP, REST, and JSON. It provides COMET consumer functionality and integrates with an asynchronous Redis client and Twisted-based web framework called Cyclone. DOCDB is a prototype document database that indexes documents by tokenizing, removing stop words, stemming words, and grouping them into Redis sets to allow searching by keywords.
Unlock the Future of Search with MongoDB Atlas_ Vector Search Unleashed.pdfMalak Abu Hammad
Discover how MongoDB Atlas and vector search technology can revolutionize your application's search capabilities. This comprehensive presentation covers:
* What is Vector Search?
* Importance and benefits of vector search
* Practical use cases across various industries
* Step-by-step implementation guide
* Live demos with code snippets
* Enhancing LLM capabilities with vector search
* Best practices and optimization strategies
Perfect for developers, AI enthusiasts, and tech leaders. Learn how to leverage MongoDB Atlas to deliver highly relevant, context-aware search results, transforming your data retrieval process. Stay ahead in tech innovation and maximize the potential of your applications.
#MongoDB #VectorSearch #AI #SemanticSearch #TechInnovation #DataScience #LLM #MachineLearning #SearchTechnology
Cosa hanno in comune un mattoncino Lego e la backdoor XZ?Speck&Tech
ABSTRACT: A prima vista, un mattoncino Lego e la backdoor XZ potrebbero avere in comune il fatto di essere entrambi blocchi di costruzione, o dipendenze di progetti creativi e software. La realtà è che un mattoncino Lego e il caso della backdoor XZ hanno molto di più di tutto ciò in comune.
Partecipate alla presentazione per immergervi in una storia di interoperabilità, standard e formati aperti, per poi discutere del ruolo importante che i contributori hanno in una comunità open source sostenibile.
BIO: Sostenitrice del software libero e dei formati standard e aperti. È stata un membro attivo dei progetti Fedora e openSUSE e ha co-fondato l'Associazione LibreItalia dove è stata coinvolta in diversi eventi, migrazioni e formazione relativi a LibreOffice. In precedenza ha lavorato a migrazioni e corsi di formazione su LibreOffice per diverse amministrazioni pubbliche e privati. Da gennaio 2020 lavora in SUSE come Software Release Engineer per Uyuni e SUSE Manager e quando non segue la sua passione per i computer e per Geeko coltiva la sua curiosità per l'astronomia (da cui deriva il suo nickname deneb_alpha).
Introduction of Cybersecurity with OSS at Code Europe 2024Hiroshi SHIBATA
I develop the Ruby programming language, RubyGems, and Bundler, which are package managers for Ruby. Today, I will introduce how to enhance the security of your application using open-source software (OSS) examples from Ruby and RubyGems.
The first topic is CVE (Common Vulnerabilities and Exposures). I have published CVEs many times. But what exactly is a CVE? I'll provide a basic understanding of CVEs and explain how to detect and handle vulnerabilities in OSS.
Next, let's discuss package managers. Package managers play a critical role in the OSS ecosystem. I'll explain how to manage library dependencies in your application.
I'll share insights into how the Ruby and RubyGems core team works to keep our ecosystem safe. By the end of this talk, you'll have a better understanding of how to safeguard your code.
In the rapidly evolving landscape of technologies, XML continues to play a vital role in structuring, storing, and transporting data across diverse systems. The recent advancements in artificial intelligence (AI) present new methodologies for enhancing XML development workflows, introducing efficiency, automation, and intelligent capabilities. This presentation will outline the scope and perspective of utilizing AI in XML development. The potential benefits and the possible pitfalls will be highlighted, providing a balanced view of the subject.
We will explore the capabilities of AI in understanding XML markup languages and autonomously creating structured XML content. Additionally, we will examine the capacity of AI to enrich plain text with appropriate XML markup. Practical examples and methodological guidelines will be provided to elucidate how AI can be effectively prompted to interpret and generate accurate XML markup.
Further emphasis will be placed on the role of AI in developing XSLT, or schemas such as XSD and Schematron. We will address the techniques and strategies adopted to create prompts for generating code, explaining code, or refactoring the code, and the results achieved.
The discussion will extend to how AI can be used to transform XML content. In particular, the focus will be on the use of AI XPath extension functions in XSLT, Schematron, Schematron Quick Fixes, or for XML content refactoring.
The presentation aims to deliver a comprehensive overview of AI usage in XML development, providing attendees with the necessary knowledge to make informed decisions. Whether you’re at the early stages of adopting AI or considering integrating it in advanced XML development, this presentation will cover all levels of expertise.
By highlighting the potential advantages and challenges of integrating AI with XML development tools and languages, the presentation seeks to inspire thoughtful conversation around the future of XML development. We’ll not only delve into the technical aspects of AI-powered XML development but also discuss practical implications and possible future directions.
Taking AI to the Next Level in Manufacturing.pdfssuserfac0301
Read Taking AI to the Next Level in Manufacturing to gain insights on AI adoption in the manufacturing industry, such as:
1. How quickly AI is being implemented in manufacturing.
2. Which barriers stand in the way of AI adoption.
3. How data quality and governance form the backbone of AI.
4. Organizational processes and structures that may inhibit effective AI adoption.
6. Ideas and approaches to help build your organization's AI strategy.
Monitoring and Managing Anomaly Detection on OpenShift.pdfTosin Akinosho
Monitoring and Managing Anomaly Detection on OpenShift
Overview
Dive into the world of anomaly detection on edge devices with our comprehensive hands-on tutorial. This SlideShare presentation will guide you through the entire process, from data collection and model training to edge deployment and real-time monitoring. Perfect for those looking to implement robust anomaly detection systems on resource-constrained IoT/edge devices.
Key Topics Covered
1. Introduction to Anomaly Detection
- Understand the fundamentals of anomaly detection and its importance in identifying unusual behavior or failures in systems.
2. Understanding Edge (IoT)
- Learn about edge computing and IoT, and how they enable real-time data processing and decision-making at the source.
3. What is ArgoCD?
- Discover ArgoCD, a declarative, GitOps continuous delivery tool for Kubernetes, and its role in deploying applications on edge devices.
4. Deployment Using ArgoCD for Edge Devices
- Step-by-step guide on deploying anomaly detection models on edge devices using ArgoCD.
5. Introduction to Apache Kafka and S3
- Explore Apache Kafka for real-time data streaming and Amazon S3 for scalable storage solutions.
6. Viewing Kafka Messages in the Data Lake
- Learn how to view and analyze Kafka messages stored in a data lake for better insights.
7. What is Prometheus?
- Get to know Prometheus, an open-source monitoring and alerting toolkit, and its application in monitoring edge devices.
8. Monitoring Application Metrics with Prometheus
- Detailed instructions on setting up Prometheus to monitor the performance and health of your anomaly detection system.
9. What is Camel K?
- Introduction to Camel K, a lightweight integration framework built on Apache Camel, designed for Kubernetes.
10. Configuring Camel K Integrations for Data Pipelines
- Learn how to configure Camel K for seamless data pipeline integrations in your anomaly detection workflow.
11. What is a Jupyter Notebook?
- Overview of Jupyter Notebooks, an open-source web application for creating and sharing documents with live code, equations, visualizations, and narrative text.
12. Jupyter Notebooks with Code Examples
- Hands-on examples and code snippets in Jupyter Notebooks to help you implement and test anomaly detection models.
Skybuffer SAM4U tool for SAP license adoptionTatiana Kojar
Manage and optimize your license adoption and consumption with SAM4U, an SAP free customer software asset management tool.
SAM4U, an SAP complimentary software asset management tool for customers, delivers a detailed and well-structured overview of license inventory and usage with a user-friendly interface. We offer a hosted, cost-effective, and performance-optimized SAM4U setup in the Skybuffer Cloud environment. You retain ownership of the system and data, while we manage the ABAP 7.58 infrastructure, ensuring fixed Total Cost of Ownership (TCO) and exceptional services through the SAP Fiori interface.
Best 20 SEO Techniques To Improve Website Visibility In SERPPixlogix Infotech
Boost your website's visibility with proven SEO techniques! Our latest blog dives into essential strategies to enhance your online presence, increase traffic, and rank higher on search engines. From keyword optimization to quality content creation, learn how to make your site stand out in the crowded digital landscape. Discover actionable tips and expert insights to elevate your SEO game.
Webinar: Designing a schema for a Data WarehouseFederico Razzoli
Are you new to data warehouses (DWH)? Do you need to check whether your data warehouse follows the best practices for a good design? In both cases, this webinar is for you.
A data warehouse is a central relational database that contains all measurements about a business or an organisation. This data comes from a variety of heterogeneous data sources, which includes databases of any type that back the applications used by the company, data files exported by some applications, or APIs provided by internal or external services.
But designing a data warehouse correctly is a hard task, which requires gathering information about the business processes that need to be analysed in the first place. These processes must be translated into so-called star schemas, which means, denormalised databases where each table represents a dimension or facts.
We will discuss these topics:
- How to gather information about a business;
- Understanding dictionaries and how to identify business entities;
- Dimensions and facts;
- Setting a table granularity;
- Types of facts;
- Types of dimensions;
- Snowflakes and how to avoid them;
- Expanding existing dimensions and facts.
Generating privacy-protected synthetic data using Secludy and MilvusZilliz
During this demo, the founders of Secludy will demonstrate how their system utilizes Milvus to store and manipulate embeddings for generating privacy-protected synthetic data. Their approach not only maintains the confidentiality of the original data but also enhances the utility and scalability of LLMs under privacy constraints. Attendees, including machine learning engineers, data scientists, and data managers, will witness first-hand how Secludy's integration with Milvus empowers organizations to harness the power of LLMs securely and efficiently.
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUpanagenda
Webinar Recording: https://www.panagenda.com/webinars/hcl-notes-und-domino-lizenzkostenreduzierung-in-der-welt-von-dlau/
DLAU und die Lizenzen nach dem CCB- und CCX-Modell sind für viele in der HCL-Community seit letztem Jahr ein heißes Thema. Als Notes- oder Domino-Kunde haben Sie vielleicht mit unerwartet hohen Benutzerzahlen und Lizenzgebühren zu kämpfen. Sie fragen sich vielleicht, wie diese neue Art der Lizenzierung funktioniert und welchen Nutzen sie Ihnen bringt. Vor allem wollen Sie sicherlich Ihr Budget einhalten und Kosten sparen, wo immer möglich. Das verstehen wir und wir möchten Ihnen dabei helfen!
Wir erklären Ihnen, wie Sie häufige Konfigurationsprobleme lösen können, die dazu führen können, dass mehr Benutzer gezählt werden als nötig, und wie Sie überflüssige oder ungenutzte Konten identifizieren und entfernen können, um Geld zu sparen. Es gibt auch einige Ansätze, die zu unnötigen Ausgaben führen können, z. B. wenn ein Personendokument anstelle eines Mail-Ins für geteilte Mailboxen verwendet wird. Wir zeigen Ihnen solche Fälle und deren Lösungen. Und natürlich erklären wir Ihnen das neue Lizenzmodell.
Nehmen Sie an diesem Webinar teil, bei dem HCL-Ambassador Marc Thomas und Gastredner Franz Walder Ihnen diese neue Welt näherbringen. Es vermittelt Ihnen die Tools und das Know-how, um den Überblick zu bewahren. Sie werden in der Lage sein, Ihre Kosten durch eine optimierte Domino-Konfiguration zu reduzieren und auch in Zukunft gering zu halten.
Diese Themen werden behandelt
- Reduzierung der Lizenzkosten durch Auffinden und Beheben von Fehlkonfigurationen und überflüssigen Konten
- Wie funktionieren CCB- und CCX-Lizenzen wirklich?
- Verstehen des DLAU-Tools und wie man es am besten nutzt
- Tipps für häufige Problembereiche, wie z. B. Team-Postfächer, Funktions-/Testbenutzer usw.
- Praxisbeispiele und Best Practices zum sofortigen Umsetzen
Your One-Stop Shop for Python Success: Top 10 US Python Development Providersakankshawande
Simplify your search for a reliable Python development partner! This list presents the top 10 trusted US providers offering comprehensive Python development services, ensuring your project's success from conception to completion.
1. Semi Automatic Sentiment Analysis
Results from a case study in Brazilian Portuguese web 2.0 sites
Gleicon Moraes, Marco Aurélio Gerosa
gleicon@gmail.com, gerosa@ime.usp.br
2. Introduction
• Popular Web 2.0 applications are based on social
networking: Facebook, Twitter, Orkut, Flickr, LinkedIn
• Status messages, user information, wall posts, like/unline
votes, scraps, recommendations are created and exchanged
between users.
• Symetric and Assymetric relationships broadcast these
messages between friends (direct connections) and friends
of friends.
• Sentiment and opinions might be objective (up/down votes,
recommendations) or subjective (free text)
Gleicon Moraes, Marco Aurélio Gerosa 2/20
3. Sentiment Classification
• Find out what users in a social network think about
product, tendency or brand.
• Compute or help to compute the Return of Investment of a
marketing campaign
• Create or compose product and services recommendations
to other users
• To measure user satisfaction and experience about a
service.
Gleicon Moraes, Marco Aurélio Gerosa 3/20
4. Goals
• Opinion mining / subjectivity and sentiment analysis review
[1]
• Automate opinion classification (tweet, scrap, message, wall
post) using Machine Learning and Information Retrieval
techniques.
• To apply a Bayesian filter (and also try a SVM classifier) to
identify Positive and Negative sentiment on brazilian
Portuguese texts.
• To build a corpus to train and test the classifiers
• To find out how to measure the filter efficiency.
[1] Pang e Lee - Opinion Mining and Sentiment Analysis
Gleicon Moraes, Marco Aurélio Gerosa 4/20
5. Related work
• Thumbs Up? Sentiment Classification using Machine Learning
Techniques: Bayesian filter, Maximum Entropy filter and SVM filter. Training
was made with Movielens dataset, splitting between 70% corpus to training and
30% to test. This corpus is already marked as positive and negative. Conclusion
was that sarcasm on opinions made it difficult to classify the sentiments. There
was no smaller text classification (e.g. A tweet/140 chars) and feedback with
outside text to the classifiers. [1]
[1] Pang, B. Lee L., Cornell University, Vaithyanathan S, IBM: Thumbs Up? Sentiment Classification using Machine Learning
Techniques
Gleicon Moraes, Marco Aurélio Gerosa 5/20
6. Related work
• Content-based book Recommendation Using Learning for Text
Categorization and information extracted from the internet to train a classifier,
with a database per user. The combination between collaborative filtering and
content filtering complete each other and help improve the results. [1][2]
[1] Mooney R. J., Roy L., “Content-Based Book Recommendation Using Learning for Text Categorization” (Proceedings of
ACM Conference on Digital Libraries, 2000)
[2] Dˇzeroski S., Zenko B. “Is Combining Classifiers Better than Selecting the Best One?”
Gleicon Moraes, Marco Aurélio Gerosa 6/20
7. Semi-Automatic Sentiment Classification
• Trained Bayesian Filter on two categories: “positive” and “negative”
• Feedback feature so false positives and false negatives could be
trained back to improve the filter
• Problem: There is not a brazilian portuguese data matching text to
sentiment to do the initial classificator training.
• Problem: Text composition varies between social networks and
groups within these networks. Feeding back data to keep the
classificator database updated is fundamental
Gleicon Moraes, Marco Aurélio Gerosa 7/20
8. Semi-Automatic Sentiment Classification
• English language training corpus uses movie reviews in
most papers, associated with ratings to tell what that text
block express [1]
• An initial training corpus was made using consumer review
data from Brazilian websites like iVox, ReclameAqui,
opiniões do MercadoLivre
• After scrapping each opinion and its rating (stars, rating, or
positive/negative indication), stored it on folders ranging
from 0.0 to 5.0, each opinion a file inside the proper folder
[1] MovieLens dataset: http://www.grouplens.org/node/73
Gleicon Moraes, Marco Aurélio Gerosa 8/20
9. Training composition findings
• Number of words in negative opinions is bigger than on
positive opinions: 67.575 words in 712 positive opinions
versus 81.747 words in 507 negative opinions.
• Distribution of reviews between minimum and maximum
ratings: more opinions on the extremes (0.0 to 0.5 and 4.5
to 5.0).
Gleicon Moraes, Marco Aurélio Gerosa 9/20
10. Composição da base de treinamento - iVox
Gleicon Moraes, Marco Aurélio Gerosa 10/22
11. Domain
• Language domain varies between communities/sites
Gleicon Moraes, Marco Aurélio Gerosa 11/20
12. Opinion Sample (Mercadolivre)
positivo (rating 5):
"Este alto-falante faz o baile tremer... comprei para montar uma mini-
saveiro”
negativo (rating 1):
"Apesar de custar muito barato recomendo economizar e comprar
falantes de marcas conhecidas. Bravox, Selenium.
O produto parece recondicionado, e não tem 90Wats nem na china,
meu triaxial Pionner de 60Wats aquenta muito mais grave que esse
Unlike.
Não faça besteira economize mais R$60,00 e compre um Kit 2 vias
Selenium ou até Sony ou Bomber que custa quase o mesmo aqui no
Mercado Livre"
Gleicon Moraes, Marco Aurélio Gerosa 12/20
13. Opinion Sample (iVox)
positivo (rating 5.0):
"Economica não tem Adquiri uma web.evo Sundown,à moto é bonita,gostei tanto
da Sundown que adquiri mais uma moto Sundow a hunter 90cc. estou com 2
motos e estou muito satisfeito. Quanto ao pessoal da grappa, todos sem exceção
sempre bem atenciósos comigo; só tenho a agradecer. "
negativo (rating 1):
"Contra Todas Não sei o motivo de sua defesa a esta empresa, pois fui
enganado a pouco tempo e o engraçado é que liguei para reclamar,
bem na hora que o vendedor estava enganando outra pessoa, por um
deslize do mesmo o cliente verificou o numero e me ligou dizendo que
também havia sido enganado. Entramos com denúncia conjunta na
DECON do DF. Razoável Muito Ruim Razoável Muito Ruim"
Gleicon Moraes, Marco Aurélio Gerosa 13/20
14. Opinion Sample (Reclame Aqui)
positivo:
"Olá, estou passando apenas para parabenizar ao ótimo e sério trabalho da
equipe do site reclameaqui.com.br, pois já fui atendido em duas ocasiões
reclamadas no site e foi algo bem melhor e mais rápido do que partir para outras
atitudes. Parabéns e que cada vez mais possamos ter meios iguais para
podermos agilizar o processo de negociação.
Obrigado,"
negativo:
"Fiz 2 reclamações contra a MOTOROLA DO BRASIL por propaganda
enganosa em seu site www.motorola.com.br sobre o aparelho V3m que
no site diz ACOPMPANHA cartão enquanto no meu aparelho nao veio
NADA !!! Eles me ligaram e tiram o deles da reta dizendo que a culpa é
da VIVO ! MAis perai quem faz o aparelho nao é eles ??? A VIVO so
revende !!!! Ah MOTOROLA POR FAVOR NE !!!!! QUERO MEU
CARTAO !!!" Gleicon Moraes, Marco Aurélio Gerosa 14/20
15. Domain
• Language Domain [1]: "go read a book” has different meaning related
to each social network. In a book related network might be a positive
meaning. In others might mean a negative sentiment.
• Feeding back data also helps to keep the database updated with new
slangs and combinations that also might cover sarcasm expressions.
• Events like world cup and television shows might introduce new words
and expressions.
[1] Pang e Lee - Opinion Mining and Sentiment Analysis
Gleicon Moraes, Marco Aurélio Gerosa 15/20
16. Training
• Split the database between negative (rating: 0.0) and positive (rating
5.0). Later steps added ratings 4.5, 4.0 to positive while negative rating
kept the same.
• Training/Classifying applied on raw data and on data processed a
pipeline of taking out stop words and extracting the stem of remaining
words
• Raw data biased towards negative sentiment, processed data biased
towards positive sentiments.
Gleicon Moraes, Marco Aurélio Gerosa 16/20
17. Results – raw data
iVox ReclameAqui False results
Ratings Negative/Positive Negative Positive Negative Positive
No training No messages 1635 268 0 0
0.0 e 5.0 506/720 1634 6 262 1
0.0 e 4.5 + 5.0 506/873 1587 99 169 48
0.0 e 4.0 + 4.5 + 5.0 506/973 1365 165 105 270
Gleicon Moraes, Marco Aurélio Gerosa 17/20
18. Results – filtered data
iVox ReclameAqui False results
Ratings Negative/Positive Negative Positive Negative Positive
No training No messages 1635 268 0 0
0.0 and 5.0 506/720 1635 0 268 0
0.0 and 4.5 + 5.0 506/873 0 261 0 1627
0.0 and 4.0 + 4.5 + 5.0 506/973 0 268 0 1635
Gleicon Moraes, Marco Aurélio Gerosa 18/20
19. Measuring efficiency
• Metrics: Accuracy, Precision Recall
• Token extraction: words (bag of words) and bigrams.
• Test between languages and domain: trained and tested the same
classifiers and extractors with the Movielens dataset [1]
[1] The MovieLens dataset: http://www.grouplens.org/node/73
Gleicon Moraes, Marco Aurélio Gerosa 19/20
20. Efficiency
Movie Review (en)
Feature Extractor Accuracy Positive Negative Positive Negative
Precision Precision Recall Recall
Bag of Words 0.7280 0.6516 0.9597 0.9800 0.4760
Bigrams 0.8240 0.7613 0.9263 0.9440 0.7040
Consumer Opinion (pt_br)
Feature Extractor Accuracy Positive Negative Positive Negative
Precision Precision Recall Recall
Bag of Words 0.5984 1.0000 0.5100 0.3099 1.000
Bigrams 0.7049 1.0000 0.5862 0.4930 1.000
Gleicon Moraes, Marco Aurélio Gerosa 20/20
21. Conclusion
• Consumer review database helped on initial training.
• O keep the messages as is helps makes the database richer with
different forms of the same expression
• Token extraction influences the end result
• Feeding back helps to keep the database up to date
• To combine classifiers helps the end results and the precision
• Contribution: Brazilian portuguese database and scripts used to extract
data and to reproduce the experiment at: https://github.com/gleicon/
sentiment_analysis
Gleicon Moraes, Marco Aurélio Gerosa 21/20