The document provides an overview of Stephen Dann's methodology for analyzing Twitter data and conversations. It discusses acquiring Twitter data through personal timelines, hashtag captures, or timeline captures. The data is then processed by extracting tweets into Excel for manual coding into categories. Additional analysis includes LIWC for word counts and Leximancer for concept mapping. Metrics like tweet count, character density, network density, and average/unique word counts are calculated and normalized for comparison across categories. The analysis aims to provide insights into research questions about changes in tweeting patterns, hashtag conversations over time, or account engagement.
The Data Science of Social Good Fellows (dssg.io) collaborated with Ushahidi (ushahidi.com) for the Summer 2013.
Presented August 20, 2013
Video - https://www.youtube.com/watch?v=4eK8HjVG2m0
Tool - http://dssg.ushahididev.com/
In recent times, research activities in the areas of Opinion and Sentiment analysis in natural language texts and other media are gaining ground under the umbrella of subjectivity analysis. The reason may be the huge amount of available text data in the Social Web in the forms of news, reviews, blogs, chats and even twitter. Though Sentiment analysis from natural lan-guage text is a multifaceted and multidisciplinary problem, in general, the term “sentiment” is used in reference to the automatic analysis of evaluative text.
The Data Science of Social Good Fellows (dssg.io) collaborated with Ushahidi (ushahidi.com) for the Summer 2013.
Presented August 20, 2013
Video - https://www.youtube.com/watch?v=4eK8HjVG2m0
Tool - http://dssg.ushahididev.com/
In recent times, research activities in the areas of Opinion and Sentiment analysis in natural language texts and other media are gaining ground under the umbrella of subjectivity analysis. The reason may be the huge amount of available text data in the Social Web in the forms of news, reviews, blogs, chats and even twitter. Though Sentiment analysis from natural lan-guage text is a multifaceted and multidisciplinary problem, in general, the term “sentiment” is used in reference to the automatic analysis of evaluative text.
Twitter analysis - Data as factor for designing the right communication star...Pere Claver Llimona
Short presentation of Twitter analysis interactive scorecard. An on-line application that downloads and analyses tweets related to an Organisation to monitor Twitter activity, in order to help defining a suitable Twitter communication strategy (working demo).
Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...Domino Data Lab
Watch talk ➟ http://bit.ly/1SGlQqj
Interested in what your customers (or their customers) are talking about, and how that’s changing over time? The most popular 'topic trending' analyses depend on the investigation of how keyword (or term) usage change over time. Using keyword trends to index topic trends is highly suitable for short-form text, such as search terms, hashtags, or tweets. However, exploring trends in topic prevalence across longer, more free-form texts, such as call center telephone transcripts, is better served by grouping together topically-related words that co-occur. In this talk, we'll show an example of using Latent Dirichlet Allocation (LDA; Blei, Ng, & Jordan, 2003) in R to group words in call center transcripts into multiword topics over which we explore trends. We’ll also demonstrate the how the use of Structural Topic Modeling (Roberts, Stewart, Tingley, & Airoldi, 2013) can aid in further investigation of how document-level covariates (in this case, additional call- or caller-based characteristics) can affect topic prevalence and topic trends.
A User Modeling Oriented Analysis of Cultural Backgrounds in MicrobloggingElena Daehnhardt
This is my article on culture oriented user modeling
presented at the ASE International Conference on Social Informatics in Washington D.C. US on 14 December 2012. This work received the Best Paper Award.
This slide presents an application for the automatic identification of the important moments that might occur during students’ collaborative chats. The moments are detected based on the input received from the user, who may choose to perform an analysis on the topics that interest him/her. More-over, the application offers various types of suggestive and intuitive graphics that aid the user in identification of such moments. There are two main aspects that are considered when identifying important moments: the concepts' frequency and distribution throughout the conversation and the chat tempo, which is analyzed for identifying intensively debated concepts. By the tempo of the chat we understand the rate at which the ideas are input by the chat participants, expressed by the utterances' timestamps
Laboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insightsCarla Marini
Laoratorio svolto al Master in Business Intelligence & Big Dat Analytic, nel modulo Web Data Analytics
Analisi degli argomenti che trattano temi relativi alla moda in Reddit. Data Scraping, Data Cleaning, Data Clustering, Text Mining and Sentiment Analysis.
Sentiment analysis involves the process of automatically detecting the polarity of a text and extracting the author's reviews on the subject, and finally, classifying the text. In many research approaches, the textual data classification is done using deep learning models. This is due to the ability of deep learning models to classify a text with a high accuracy and the ability to model the sequence of textual data with word dependencies throughout the sentence. One of these deep learning models is RNN (Recurrent Neural Network). In order to use these models, the textual data and words must be converted into numerical vectors, for which various algorithms and methods have been proposed [10]. Today's pretrained word embedding libraries such as FastText have a high accuracy and quality in vector representations for words. Accordingly, in most current systems and research approaches, these libraries are used to convert the textual data to numerical vectors
Researchers have long known that the words of a text have always contained more information than on the surface. As such, texts have been studied for subtexts and other latent or hidden information. One approach has involved the machine-enabled analysis of human sentiment, usually mapped out on a positive-negative polarity. NVivo 11 Plus (a qualitative research tool released in late 2015) enables the automated sentiment analysis of texts (coded research, formal articles, text corpora, Tweetstream datasets, Facebook wall posts, websites, and other sources) based on four categories: very positive, moderately positive, moderately negative, and very negative. The tool feature compares the target text set against a sentiment dictionary and enables coding at different units of analysis: sentence, paragraph, or cell. Further, the sentiment capability extracts the coded text into respective text sets which may be further analyzed using text frequency counts, text searches, automated theme and sub-theme extractions (topic modeling), and data visualizations.
Slides from the ccis-ce 2018 cocreation event. A discussion of how the ideas of co-creation from marketing can inform and support the deployment of Lego Serious Play, and areas in which people need to engage to help us further understand the process in the LSP setting
More Related Content
Similar to Insights into the Twitterverse: Benchmarking and analysis twitter content
Twitter analysis - Data as factor for designing the right communication star...Pere Claver Llimona
Short presentation of Twitter analysis interactive scorecard. An on-line application that downloads and analyses tweets related to an Organisation to monitor Twitter activity, in order to help defining a suitable Twitter communication strategy (working demo).
Data Science Popup Austin: Using lda and Structural Topic Modeling to Explore...Domino Data Lab
Watch talk ➟ http://bit.ly/1SGlQqj
Interested in what your customers (or their customers) are talking about, and how that’s changing over time? The most popular 'topic trending' analyses depend on the investigation of how keyword (or term) usage change over time. Using keyword trends to index topic trends is highly suitable for short-form text, such as search terms, hashtags, or tweets. However, exploring trends in topic prevalence across longer, more free-form texts, such as call center telephone transcripts, is better served by grouping together topically-related words that co-occur. In this talk, we'll show an example of using Latent Dirichlet Allocation (LDA; Blei, Ng, & Jordan, 2003) in R to group words in call center transcripts into multiword topics over which we explore trends. We’ll also demonstrate the how the use of Structural Topic Modeling (Roberts, Stewart, Tingley, & Airoldi, 2013) can aid in further investigation of how document-level covariates (in this case, additional call- or caller-based characteristics) can affect topic prevalence and topic trends.
A User Modeling Oriented Analysis of Cultural Backgrounds in MicrobloggingElena Daehnhardt
This is my article on culture oriented user modeling
presented at the ASE International Conference on Social Informatics in Washington D.C. US on 14 December 2012. This work received the Best Paper Award.
This slide presents an application for the automatic identification of the important moments that might occur during students’ collaborative chats. The moments are detected based on the input received from the user, who may choose to perform an analysis on the topics that interest him/her. More-over, the application offers various types of suggestive and intuitive graphics that aid the user in identification of such moments. There are two main aspects that are considered when identifying important moments: the concepts' frequency and distribution throughout the conversation and the chat tempo, which is analyzed for identifying intensively debated concepts. By the tempo of the chat we understand the rate at which the ideas are input by the chat participants, expressed by the utterances' timestamps
Laboratorio Master BI&BDA (Modulo Web Data Analytics) : Reddit fashion insightsCarla Marini
Laoratorio svolto al Master in Business Intelligence & Big Dat Analytic, nel modulo Web Data Analytics
Analisi degli argomenti che trattano temi relativi alla moda in Reddit. Data Scraping, Data Cleaning, Data Clustering, Text Mining and Sentiment Analysis.
Sentiment analysis involves the process of automatically detecting the polarity of a text and extracting the author's reviews on the subject, and finally, classifying the text. In many research approaches, the textual data classification is done using deep learning models. This is due to the ability of deep learning models to classify a text with a high accuracy and the ability to model the sequence of textual data with word dependencies throughout the sentence. One of these deep learning models is RNN (Recurrent Neural Network). In order to use these models, the textual data and words must be converted into numerical vectors, for which various algorithms and methods have been proposed [10]. Today's pretrained word embedding libraries such as FastText have a high accuracy and quality in vector representations for words. Accordingly, in most current systems and research approaches, these libraries are used to convert the textual data to numerical vectors
Researchers have long known that the words of a text have always contained more information than on the surface. As such, texts have been studied for subtexts and other latent or hidden information. One approach has involved the machine-enabled analysis of human sentiment, usually mapped out on a positive-negative polarity. NVivo 11 Plus (a qualitative research tool released in late 2015) enables the automated sentiment analysis of texts (coded research, formal articles, text corpora, Tweetstream datasets, Facebook wall posts, websites, and other sources) based on four categories: very positive, moderately positive, moderately negative, and very negative. The tool feature compares the target text set against a sentiment dictionary and enables coding at different units of analysis: sentence, paragraph, or cell. Further, the sentiment capability extracts the coded text into respective text sets which may be further analyzed using text frequency counts, text searches, automated theme and sub-theme extractions (topic modeling), and data visualizations.
Slides from the ccis-ce 2018 cocreation event. A discussion of how the ideas of co-creation from marketing can inform and support the deployment of Lego Serious Play, and areas in which people need to engage to help us further understand the process in the LSP setting
Questioning the lego cocreation talismanStephen Dann
What happens when one of the fastest moving innovators in cocreation in business gets timecapsuled into a 2004 paper citation? This presentation discusses three research agenda items as a "Where to next?" for marketing researchers interested in taking a critical eye to how Lego is used as a talismanic invocation of cocreation, and what more there is to learn from the complexity of the Lego consumption experience.
AMSRS instagram Presentation (words on a screen mix)Stephen Dann
Instagram is a straightforward image, video and story sharing platform that provides a sufficiently robust framework to create a diverse range of outcomes for individual, group and corporation alike. In this presentation, we outline a process to examine how a single timeline (preferably the one you run or control) can be extracted, examined, and measured against a consistent set of five image measures. Whilst the method can inform competitor analysis, environment scans and a range of other techniques, it is presented here as an internal marketing metric system – a measurement of the “telling” as the corporate Instagram attempts to support and enable the selling of a product.
A more wordy version of the Instagram analysis framework paper.
Measuring the “Telling” in the Selling of the Story presentation versionStephen Dann
Measuring the “Telling” in the Selling of the Story. Classifying Instagram Imagery for Internal Benchmarks
Instagram is a straightforward image, video and story sharing platform that provides a sufficiently robust framework to create a diverse range of outcomes for individual, group and corporation alike. In this presentation, we outline a process to examine how a single timeline (preferably the one you run or control) can be extracted, examined, and measured against a consistent set of five image measures. Whilst the method can inform competitor analysis, environment scans and a range of other techniques, it is presented here as an internal marketing metric system – a measurement of the “telling” as the corporate Instagram attempts to support and enable the selling of a product.
The Legoland™ branded translation of the real world. Operation legoland goes to Unconference Canberra to tell the story of encountering the Legolish translation of the real world.
Learning environment optimisation: Doing less with more for better outcomesStephen Dann
A maxi-edition of my guide to Learning Management Environment optimisation, whereby we hack the workload model to our favour, make marking easier (and more fun) for the lecturer, and put the students second in the pursuit of "How can we use this teaching technology to deliver something superior to our own work days?". Because quite often, the better systems for improving academic workplace effectiveness never get sold on the basis of "Want to do less work with more resources for better personal outcomes?"
Keeping corners and cutting time in assessment, marking and learning management by using Turnitin to give better feedback to students, quicker, faster and with more fun for you as the lecturer.
Brand on the Run: Political affiliations and Twitter social media presenceStephen Dann
What happened in the QLD Election to frighten off the branded political twitter accounts? This presentation examines just how few and far between branded activity was on the Twitter accounts of sitting political party members defending their office at the election.
Contrasting the Branded Experience of Legoland from social media and social proximity.
This is what happens when you let a marketing academic have free reign to examine the Legoland social media presence, official and user created, versus the parkland experience (official and self-generated). #operationlegoland goes legit.
Audio: https://soundcloud.com/drstephendann/social-marketing-research-agenda
Research agenda papers have been a staple of marketing discipline areas approaching intellectual turbulence – for example, stakeholder theory during the Service Dominant Logic era (Hult et al. 2011). This paper contributes to the process of research as an intersection of opportunity and demand – topics are acquired through interest, and agendas driven through identified need. The focus of this work is the assessment and cataloguing research demands – areas and agendas as yet unresolved, and as such, open as an invitation to scholars to pursue. Previously, Dann et al. (2007) outlined a series of research agendas for social, political, non-profit and sustainable marketing. The Social Marketing Agenda identified four items – the need for adapting contemporary marketing; visible boundaries to the social marketing discipline; social marketing specific theory; and, consumer-centric research on the understanding of benefits and value. As part of the contemporary study, the question of whether prior agenda items, have retained currency, proven intractable or met with relative resolution.
Presented to the #colconf10 College Conference in Canberra at the University of Canberra on 29/1/2015. An overview of Twitter, how it can be analysed, and things you can do in terms of looking at how political figures use their twitter accounts to engage (or not) with the public and other stakeholders.
Australian Political Parties and social media: uses and attitudesStephen Dann
Australian Political Parties and social media talks about how the Twitter accounts of political candidates from the W.A. senate re-election fared under analysis from the Twitter Content Classification framework (Plus a brief overview of Day 1 of the #cmpm2014 conference)
RMD24 | Debunking the non-endemic revenue myth Marvin Vacquier Droop | First ...BBPMedia1
Marvin neemt je in deze presentatie mee in de voordelen van non-endemic advertising op retail media netwerken. Hij brengt ook de uitdagingen in beeld die de markt op dit moment heeft op het gebied van retail media voor niet-leveranciers.
Retail media wordt gezien als het nieuwe advertising-medium en ook mediabureaus richten massaal retail media-afdelingen op. Merken die niet in de betreffende winkel liggen staan ook nog niet in de rij om op de retail media netwerken te adverteren. Marvin belicht de uitdagingen die er zijn om echt aansluiting te vinden op die markt van non-endemic advertising.
Memorandum Of Association Constitution of Company.pptseri bangash
www.seribangash.com
A Memorandum of Association (MOA) is a legal document that outlines the fundamental principles and objectives upon which a company operates. It serves as the company's charter or constitution and defines the scope of its activities. Here's a detailed note on the MOA:
Contents of Memorandum of Association:
Name Clause: This clause states the name of the company, which should end with words like "Limited" or "Ltd." for a public limited company and "Private Limited" or "Pvt. Ltd." for a private limited company.
https://seribangash.com/article-of-association-is-legal-doc-of-company/
Registered Office Clause: It specifies the location where the company's registered office is situated. This office is where all official communications and notices are sent.
Objective Clause: This clause delineates the main objectives for which the company is formed. It's important to define these objectives clearly, as the company cannot undertake activities beyond those mentioned in this clause.
www.seribangash.com
Liability Clause: It outlines the extent of liability of the company's members. In the case of companies limited by shares, the liability of members is limited to the amount unpaid on their shares. For companies limited by guarantee, members' liability is limited to the amount they undertake to contribute if the company is wound up.
https://seribangash.com/promotors-is-person-conceived-formation-company/
Capital Clause: This clause specifies the authorized capital of the company, i.e., the maximum amount of share capital the company is authorized to issue. It also mentions the division of this capital into shares and their respective nominal value.
Association Clause: It simply states that the subscribers wish to form a company and agree to become members of it, in accordance with the terms of the MOA.
Importance of Memorandum of Association:
Legal Requirement: The MOA is a legal requirement for the formation of a company. It must be filed with the Registrar of Companies during the incorporation process.
Constitutional Document: It serves as the company's constitutional document, defining its scope, powers, and limitations.
Protection of Members: It protects the interests of the company's members by clearly defining the objectives and limiting their liability.
External Communication: It provides clarity to external parties, such as investors, creditors, and regulatory authorities, regarding the company's objectives and powers.
https://seribangash.com/difference-public-and-private-company-law/
Binding Authority: The company and its members are bound by the provisions of the MOA. Any action taken beyond its scope may be considered ultra vires (beyond the powers) of the company and therefore void.
Amendment of MOA:
While the MOA lays down the company's fundamental principles, it is not entirely immutable. It can be amended, but only under specific circumstances and in compliance with legal procedures. Amendments typically require shareholder
Unveiling the Secrets How Does Generative AI Work.pdfSam H
At its core, generative artificial intelligence relies on the concept of generative models, which serve as engines that churn out entirely new data resembling their training data. It is like a sculptor who has studied so many forms found in nature and then uses this knowledge to create sculptures from his imagination that have never been seen before anywhere else. If taken to cyberspace, gans work almost the same way.
Business Valuation Principles for EntrepreneursBen Wann
This insightful presentation is designed to equip entrepreneurs with the essential knowledge and tools needed to accurately value their businesses. Understanding business valuation is crucial for making informed decisions, whether you're seeking investment, planning to sell, or simply want to gauge your company's worth.
Explore our most comprehensive guide on lookback analysis at SafePaaS, covering access governance and how it can transform modern ERP audits. Browse now!
Falcon stands out as a top-tier P2P Invoice Discounting platform in India, bridging esteemed blue-chip companies and eager investors. Our goal is to transform the investment landscape in India by establishing a comprehensive destination for borrowers and investors with diverse profiles and needs, all while minimizing risk. What sets Falcon apart is the elimination of intermediaries such as commercial banks and depository institutions, allowing investors to enjoy higher yields.
Affordable Stationery Printing Services in Jaipur | Navpack n PrintNavpack & Print
Looking for professional printing services in Jaipur? Navpack n Print offers high-quality and affordable stationery printing for all your business needs. Stand out with custom stationery designs and fast turnaround times. Contact us today for a quote!
3.0 Project 2_ Developing My Brand Identity Kit.pptxtanyjahb
A personal brand exploration presentation summarizes an individual's unique qualities and goals, covering strengths, values, passions, and target audience. It helps individuals understand what makes them stand out, their desired image, and how they aim to achieve it.
RMD24 | Retail media: hoe zet je dit in als je geen AH of Unilever bent? Heid...BBPMedia1
Grote partijen zijn al een tijdje onderweg met retail media. Ondertussen worden in dit domein ook de kansen zichtbaar voor andere spelers in de markt. Maar met die kansen ontstaan ook vragen: Zelf retail media worden of erop adverteren? In welke fase van de funnel past het en hoe integreer je het in een mediaplan? Wat is nu precies het verschil met marketplaces en Programmatic ads? In dit half uur beslechten we de dilemma's en krijg je antwoorden op wanneer het voor jou tijd is om de volgende stap te zetten.
Taurus Zodiac Sign_ Personality Traits and Sign Dates.pptxmy Pandit
Explore the world of the Taurus zodiac sign. Learn about their stability, determination, and appreciation for beauty. Discover how Taureans' grounded nature and hardworking mindset define their unique personality.
Enterprise Excellence is Inclusive Excellence.pdfKaiNexus
Enterprise excellence and inclusive excellence are closely linked, and real-world challenges have shown that both are essential to the success of any organization. To achieve enterprise excellence, organizations must focus on improving their operations and processes while creating an inclusive environment that engages everyone. In this interactive session, the facilitator will highlight commonly established business practices and how they limit our ability to engage everyone every day. More importantly, though, participants will likely gain increased awareness of what we can do differently to maximize enterprise excellence through deliberate inclusion.
What is Enterprise Excellence?
Enterprise Excellence is a holistic approach that's aimed at achieving world-class performance across all aspects of the organization.
What might I learn?
A way to engage all in creating Inclusive Excellence. Lessons from the US military and their parallels to the story of Harry Potter. How belt systems and CI teams can destroy inclusive practices. How leadership language invites people to the party. There are three things leaders can do to engage everyone every day: maximizing psychological safety to create environments where folks learn, contribute, and challenge the status quo.
Who might benefit? Anyone and everyone leading folks from the shop floor to top floor.
Dr. William Harvey is a seasoned Operations Leader with extensive experience in chemical processing, manufacturing, and operations management. At Michelman, he currently oversees multiple sites, leading teams in strategic planning and coaching/practicing continuous improvement. William is set to start his eighth year of teaching at the University of Cincinnati where he teaches marketing, finance, and management. William holds various certifications in change management, quality, leadership, operational excellence, team building, and DiSC, among others.
Discover the innovative and creative projects that highlight my journey throu...dylandmeas
Discover the innovative and creative projects that highlight my journey through Full Sail University. Below, you’ll find a collection of my work showcasing my skills and expertise in digital marketing, event planning, and media production.
3. A little context
The Past
• Dann (2010)
– Six top level twitter
categories
– 23 sub domains
• Dann (2011)
– Six top level
– 28 sub domain
The Present
• Dann (Today)
– Six Top Level Categories
• No sub domain analysis
– Secondary Processing
• Leximancer
• Linguistic Inquiry Word
Count
5. Acquire Research Question
• Does Event X change the tweeting patterns of Account @Y?
• Do responses to the #hashtag event change over time?
– #EventTags in Time Period A will have more Status than in Time Period D
– Time Period D will have more Pass Along than Status
• What were they thinking?
– Dominant Categories of tweets over time within a selected account
• Do comments change by platform for account @X?
– mobile versus web versus desktop
• Does @BrandX engage with the community?
– Conversational over all other types over capture time period
6. Acquire your data
• Personal timelines
– Download from Twitter
• #Hashtag captures
– Hootsuite
• Time line captures
– Choose your own adventure
– Getting worse, harder and
Twitter’s API is less available.
• Try to avoid big data
7. Big Data
• If you are Axel Bruns, fine, continue
– http://mappingonlinepublics.net/
• For everyone else, what are you looking for?
– What sample suits your research question?
8. Process your data
• Stand by for ugliness and manual coding*
– Extract data into Excel
• Excel allows for additional data inputs as you progress the
analysis
– Keep tweet visible
• Only keep a column visible if it fits your research question
– Eg date, time, @user, platform
– Add column for Tweet ID, category, cat_n
• Sub category, sub_cat_n for the detailed version
*Automated coding? People are working on it. It’s a terrible idea that’ll happen anyway
9. Manual Coding
• Use the Dann (2010) or Dann (2011) top level
domains
– Dann (201X) is under development
• I broke something important earlier this year
• Manual coding is superior
– Nuance and interpretation counts.
10. Pick a box
1 Conversational Uses an @statement to address another user
2 News Events Identifiable news content
3 Pass along Tweets of endorsement of content
4 Phatic Content independent connected presence
5 Status
Tweets which address the statement "What are you doing?"
and "What's happening?" in terms of an account holder's
experiences
6 Spam Unsolicited content
11. Keep it on manual
Conversational Uses an @statement to address another user
1.1 Action
Activities involving other Twitter users, or tweets which
describe the presence of other Twitter users.
1.2 Query
Any statement style tweet that ends with a question mark, as it
represents an active attempt to engage responses from the
community
1.3 Referral
An @response which contains URLs or recommendation of
other Twitter users. (Excludes RT @user)
1.4 Response
Classification for tweets which commence with another user’s
name and which do not meet the requirements of the referral
category
1.5 Rhetoric Question
Asked and answered within the same tweet (distinct from
Conversational - Query) which may not require (but may elicit)
audience response
12. Upgrades
Pass along Tweets of endorsement of content
3.1 Automated
Endorsement Status announcements triggered by third party applications which publish URLs
3.2
Endorsement Links to web content not created by the sender
3.3 Retweet Any statement reproducing another Twitter status using the via @ or RT protocol
3.4 Secondary
Social Media Links to Facebook (fb.me) or similar social media platform
3.5 User
generated
content Links to own content created by the user
3.6 Quote
Comment marked with “ “ to represent a direct quote, paraphrase of a statement
without a source URL, including reference to offline speaker or overheard (OH)
3.7 Cite
Any tweet which contains a reference in a recognised Harvard, Oxford or similar
format
3.8 Modified
ReTweet Acknowledgement of the use of MT protocol to allow for an edited RT.
18. Tweet Math Dude
• Tweet Count
– N per category
• Calculate the Tweet Ratio
– Tweet ratio is a normalized rank order of the highest
volume of tweets, where the most common category is
scored as 1
• Calculating the Tweet Ratio
– Highest number of tweets in a single category = TTMax
– Tweets per category = TCat
– Ratio is Tcat / TTMax
I’m only mildly mocking statistical analysis here
19. Maximum Character Density
• Max Density = 140 x TCat [number of tweets in
each category]
• Theoretical range for a tweet is between 1 and 140
characters
• Maximum tweet is 140 characters
• More characters used, more information density
• Calculate Character Density
– (Actual Character / Max Density)
• Divide each CharDensity score by the highest Char density
• Normalise CharDensity score to rank order
23. LIWC
• http://www.liwc.net/
– text analysis software
– calculates the degree to which people use
different categories of words in texts
• 70 other language dimensions.
– positive or negative emotions,
– self-references,
– causal words,
24. A giant bucket of data
• 70 variables
– So have a hypothesis and a purpose for the
analysis
• Differences in tweet construction
– Word Counts
– Unique Words
25. Results
Average Word Count (AWC) Unique Word Count (UWC)
Category AWC AWC_Ratio
Conversational
12.82 0.78
News 13.56 0.82
Pass Along
16.35 1
Phatic 15.42 0.94
Status 12.94 0.79
Category UWC UWC_Ratio
Conversational
93 0.97
News 93 0.97
Pass Along
92 0.96
Phatic 93 0.97
Status 96 1
26. Results
Word Count Unique Word
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
Conversational
News
Pass AlongPhatic
Status
0.93
0.94
0.95
0.96
0.97
0.98
0.99
1
Conversational
News
Pass AlongPhatic
Status
Chart Title
28. Leximancer
• Import into Leximancer as an individual
analysis (individual project)
– Edit Pre processing options: Sentence per block 1
– Run to Generate Outputs
– Generate Concept Map
30. Four sample maps
Entirely because quadrants fit on screens better than hexes. No other reason
conversational
news
pass along
phatic
31. Tweet Network Density
• Calculate Network Density
– Count Nodes (n)
– Count Actual Connections (e) Edges (paths
between nodes)
– Calculate Network density based on 2e / n(n-1)
• Network Density Notes
– Calculate potential connections
33. Network Density Results
Category Nodes Edges
Network
Density
Conversational 13 12 0.15
News 18 17 0.11
Pass Along 15 15 0.14
Phatic 3 2 0.67
Status 4 3 0.50
n 19 17 0.10
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Conversational
News
Pass AlongPhatic
Status
34. One Bucket of Data
• This is why a research question is important
– You can map a range of information
– None of it is useful without the RQ / hypothesis
– It’s pretty, but not valuable
Category Tweet Density Network Ave.WC
Unique
Words
Conversatio
nal 0.081081 0.819075 0.814598 0.830959 0.96875
News 0.085239 0.83315 0.828595 0.878952 0.96875
Pass Along 1 1.005496 1 1.059722 0.958333
Phatic 0.043659 0.938173 0.933044 1 0.96875
Status 0.037422 0.775065 0.770829 0.838992 1
0
0.2
0.4
0.6
0.8
1
1.2
Tweet Density Network Ave.WC Unique Words
Chart Title
Conversational News Pass Along Phatic Status