Facebook uses extensive data mining techniques to gather and analyze user data:
1. Facebook tracks users across the web through cookies and can identify users through facial recognition in photos to learn more about users' interests, habits, and relationships.
2. Analyzing patterns in users' likes, posts, and comments allows Facebook to accurately predict personal attributes such as sexual orientation, political views, and life satisfaction.
3. Facebook suggests tags for photos using image processing and uses "flashbacks" and "friendiversary" videos created from users' past posts and likes to demonstrate its ability to analyze large datasets through techniques like Hadoop clusters.
AN INTEGRATED RANKING ALGORITHM FOR EFFICIENT INFORMATION COMPUTING IN SOCIAL...ijwscjournal
Social networks have ensured the expanding disproportion between the face of WWW stored traditionally in search engine repositories and the actual ever changing face of Web. Exponential growth of web users and the ease with which they can upload contents on web highlights the need of content controls on material published on the web. As definition of search is changing, socially-enhanced interactive search methodologies are the need of the hour. Ranking is pivotal for efficient web search as the search performance mainly depends upon the ranking results. In this paper new integrated ranking model based on fused rank of web object based on popularity factor earned over only valid interlinks from multiple social forums is proposed. This model identifies relationships between web objects in separate social networks based on the object inheritance graph. Experimental study indicates the effectiveness of proposed Fusion based ranking algorithm in terms of better search results.
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...inventionjournals
The problem of web search time complexity and accuracy has been visited in many research papers, and the authors discussed many approaches to improve the search performance. Still the approaches does not produce any noticeable improvement and struggles with more time complexity as well. To overcome the issues identified, an efficient multi mode conceptual clustering algorithm has been discussed in this paper, which identifies the similar interested user groups by clustering their search context according to different conceptual queries. Identified user groups are shared with the related conceptual queries and their results to reduce the time complexity. The multi mode conceptual clustering, performs grouping of search queries and users according to number of users and their search pattern. The concept of search is identified by using Natural language processing methods and the web logs produced by the default web search engines. The author designed a dedicated web interface to collect the web log about the user search and the same data has been used to cluster the social groups according to number of conceptual queries. The search results has been shared between the users of identified social groups which reduces the search time complexity and improves the efficiency of web search in better manner
A glimpse into what social media is all about and how the researchers in the world are using social media. Social media is not a mere hype and not a platform to leverage word-of-the-mouth practices as is the common perception of it in Pakistan: it is much more than that and this is what this talk presented.
AN INTEGRATED RANKING ALGORITHM FOR EFFICIENT INFORMATION COMPUTING IN SOCIAL...ijwscjournal
Social networks have ensured the expanding disproportion between the face of WWW stored traditionally in search engine repositories and the actual ever changing face of Web. Exponential growth of web users and the ease with which they can upload contents on web highlights the need of content controls on material published on the web. As definition of search is changing, socially-enhanced interactive search methodologies are the need of the hour. Ranking is pivotal for efficient web search as the search performance mainly depends upon the ranking results. In this paper new integrated ranking model based on fused rank of web object based on popularity factor earned over only valid interlinks from multiple social forums is proposed. This model identifies relationships between web objects in separate social networks based on the object inheritance graph. Experimental study indicates the effectiveness of proposed Fusion based ranking algorithm in terms of better search results.
Multi-Mode Conceptual Clustering Algorithm Based Social Group Identification ...inventionjournals
The problem of web search time complexity and accuracy has been visited in many research papers, and the authors discussed many approaches to improve the search performance. Still the approaches does not produce any noticeable improvement and struggles with more time complexity as well. To overcome the issues identified, an efficient multi mode conceptual clustering algorithm has been discussed in this paper, which identifies the similar interested user groups by clustering their search context according to different conceptual queries. Identified user groups are shared with the related conceptual queries and their results to reduce the time complexity. The multi mode conceptual clustering, performs grouping of search queries and users according to number of users and their search pattern. The concept of search is identified by using Natural language processing methods and the web logs produced by the default web search engines. The author designed a dedicated web interface to collect the web log about the user search and the same data has been used to cluster the social groups according to number of conceptual queries. The search results has been shared between the users of identified social groups which reduces the search time complexity and improves the efficiency of web search in better manner
A glimpse into what social media is all about and how the researchers in the world are using social media. Social media is not a mere hype and not a platform to leverage word-of-the-mouth practices as is the common perception of it in Pakistan: it is much more than that and this is what this talk presented.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
Evolving social data mining and affective analysis Athena Vakali
Evolving social data mining and affective analysis methodologies, framework and applications - Web 2.0 facts and social data
Social associations and all kinds of graphs
Evolving social data mining
Emotion-aware social data analysis
Frameworks and Applications
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
International Journal of Engineering Research and DevelopmentIJERD Editor
Electrical, Electronics and Computer Engineering,
Information Engineering and Technology,
Mechanical, Industrial and Manufacturing Engineering,
Automation and Mechatronics Engineering,
Material and Chemical Engineering,
Civil and Architecture Engineering,
Biotechnology and Bio Engineering,
Environmental Engineering,
Petroleum and Mining Engineering,
Marine and Agriculture engineering,
Aerospace Engineering.
Sbs facebook data privacy dilemma case studysmumbahelp
Dear students get fully solved assignments
Send your semester & Specialization name to our mail id :
help.mbaassignments@gmail.com
or
call us at : 08263069601
Social Media Mining - Chapter 10 (Behavior Analytics)SocialMediaMining
R. Zafarani, M. A. Abbasi, and H. Liu, Social Media Mining: An Introduction, Cambridge University Press, 2014.
Free book and slides at http://socialmediamining.info/
RUNNING HEAD: BIG DATA IN SOCIAL MEDIA 1
BIG DATA IN SOCIAL MEDIA 3
Big Data in Social Media
By definition, Big Data can simply be termed as voluminous data. In more specific definitions, it can be termed as that which is large, complex and fast and a s a result, is not in a position to be processed using the typical traditional methods of data processing. The volume, variety, velocity, variability and veracity are used in the categorizing of data as big data. With the development in technology, and the continued incorporation of these technological sources into our day to day lives, the collected data through the Internet of Things among other information systems has resulted in big data (Ivanov, 2018). One such areas where Big data is found is in the social media platforms. As opposed to the olden days, currently, more and more people and companies are using social media daily to achieve their specific objectives and goals, it is estimated that social media platforms like Facebook produce data as big as 500+ terabytes in a s ingle day!
Most of these data in the social media are as a result of the videos, photos, messages and comments being shared across the media platforms. Not only do individuals use social media to keep in touch, but companies also use it in a concept called social media marketing. Through the media, and using big data analytics, companies are able to map out consumer behavior through what they like and what they share (Nicora, 2019). They use these platforms to reach their target audiences and at the same time use them to get feedback from their clients. As a result, the amount of data from social media platforms is not only voluminous, it is also heterogenous in the sense that it contains both nominal and numeral values from different places, it is variable in that it has unpredictable flow, it is fast because it is collected in real time. This qualifies the data to be Big Data and requires big data analytics to process.
References
Ivanov I. (2018) What is Big Data Analytics on Social Media? Iocowise. Retrieved from https://locowise.com/blog/what-is-big-data-analytics-on-social-media
Nicora R. (2019) How is big data impacting social media? Medium. Retrieved from https://medium.com/dative-io/how-is-big-data-impacting-social-media-df31aa3f66f6
1
4
Title
Student Name
Ashford University
GEN103: Information Literacy
Instructor Name
Month Day, Year
Title
Research Question:
Replace these instructions with your research question. Incorporate any feedback your instructor provided on your week 1 assignment. To learn how to view the comments on your papers watch the Waypoint: Accessing Assignment Feedback video.
Thesis Statement:
Replace these instructions with your thesis statement. Refer to the Writing Center’.
IJERA (International journal of Engineering Research and Applications) is International online, ... peer reviewed journal. For more detail or submit your article, please visit www.ijera.com
Evolving social data mining and affective analysis Athena Vakali
Evolving social data mining and affective analysis methodologies, framework and applications - Web 2.0 facts and social data
Social associations and all kinds of graphs
Evolving social data mining
Emotion-aware social data analysis
Frameworks and Applications
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
International Journal of Engineering Research and DevelopmentIJERD Editor
Electrical, Electronics and Computer Engineering,
Information Engineering and Technology,
Mechanical, Industrial and Manufacturing Engineering,
Automation and Mechatronics Engineering,
Material and Chemical Engineering,
Civil and Architecture Engineering,
Biotechnology and Bio Engineering,
Environmental Engineering,
Petroleum and Mining Engineering,
Marine and Agriculture engineering,
Aerospace Engineering.
Sbs facebook data privacy dilemma case studysmumbahelp
Dear students get fully solved assignments
Send your semester & Specialization name to our mail id :
help.mbaassignments@gmail.com
or
call us at : 08263069601
Social Media Mining - Chapter 10 (Behavior Analytics)SocialMediaMining
R. Zafarani, M. A. Abbasi, and H. Liu, Social Media Mining: An Introduction, Cambridge University Press, 2014.
Free book and slides at http://socialmediamining.info/
RUNNING HEAD: BIG DATA IN SOCIAL MEDIA 1
BIG DATA IN SOCIAL MEDIA 3
Big Data in Social Media
By definition, Big Data can simply be termed as voluminous data. In more specific definitions, it can be termed as that which is large, complex and fast and a s a result, is not in a position to be processed using the typical traditional methods of data processing. The volume, variety, velocity, variability and veracity are used in the categorizing of data as big data. With the development in technology, and the continued incorporation of these technological sources into our day to day lives, the collected data through the Internet of Things among other information systems has resulted in big data (Ivanov, 2018). One such areas where Big data is found is in the social media platforms. As opposed to the olden days, currently, more and more people and companies are using social media daily to achieve their specific objectives and goals, it is estimated that social media platforms like Facebook produce data as big as 500+ terabytes in a s ingle day!
Most of these data in the social media are as a result of the videos, photos, messages and comments being shared across the media platforms. Not only do individuals use social media to keep in touch, but companies also use it in a concept called social media marketing. Through the media, and using big data analytics, companies are able to map out consumer behavior through what they like and what they share (Nicora, 2019). They use these platforms to reach their target audiences and at the same time use them to get feedback from their clients. As a result, the amount of data from social media platforms is not only voluminous, it is also heterogenous in the sense that it contains both nominal and numeral values from different places, it is variable in that it has unpredictable flow, it is fast because it is collected in real time. This qualifies the data to be Big Data and requires big data analytics to process.
References
Ivanov I. (2018) What is Big Data Analytics on Social Media? Iocowise. Retrieved from https://locowise.com/blog/what-is-big-data-analytics-on-social-media
Nicora R. (2019) How is big data impacting social media? Medium. Retrieved from https://medium.com/dative-io/how-is-big-data-impacting-social-media-df31aa3f66f6
1
4
Title
Student Name
Ashford University
GEN103: Information Literacy
Instructor Name
Month Day, Year
Title
Research Question:
Replace these instructions with your research question. Incorporate any feedback your instructor provided on your week 1 assignment. To learn how to view the comments on your papers watch the Waypoint: Accessing Assignment Feedback video.
Thesis Statement:
Replace these instructions with your thesis statement. Refer to the Writing Center’.
NOVEL MACHINE LEARNING ALGORITHMS FOR CENTRALITY AND CLIQUES DETECTION IN YOU...ijaia
The goal of this research project is to analyze the dynamics of social networks using machine learning techniques to locate maximal cliques and to find clusters for the purpose of identifying a target demographic. Unsupervised machine learning techniques are designed and implemented in this project to analyze a dataset from YouTube to discover communities in the social network and find central nodes. Different clustering algorithms are implemented and applied to the YouTube dataset. The well-known Bron-Kerbosch algorithm is used effectively in this research to find maximal cliques. The results obtained from this research could be used for advertising purposes and for building smart recommendation systems. All algorithms were implemented using Python programming language. The experimental results show that we were able to successfully find central nodes through clique-centrality and degree centrality. By utilizing clique detection algorithms, the research shown how machine learning algorithms can detect close knit groups within a larger network.
NOVEL MACHINE LEARNING ALGORITHMS FOR CENTRALITY AND CLIQUES DETECTION IN YOU...gerogepatton
The goal of this research project is to analyze the dynamics of social networks using machine learning techniques to locate maximal cliques and to find clusters for the purpose of identifying a target demographic. Unsupervised machine learning techniques are designed and implemented in this project to analyze a dataset from YouTube to discover communities in the social network and find central nodes. Different clustering algorithms are implemented and applied to the YouTube dataset. The well-known Bron-Kerbosch algorithm is used effectively in this research to find maximal cliques. The results obtained from this research could be used for advertising purposes and for building smart recommendation systems. All algorithms were implemented using Python programming language. The experimental results show that we were able to successfully find central nodes through clique-centrality and degree centrality. By utilizing clique detection algorithms, the research shown how machine learning algorithms can detect close knit groups within a larger network.
Novel Machine Learning Algorithms for Centrality and Cliques Detection in You...gerogepatton
The goal of this research project is to analyze the dynamics of social networks using machine learning techniques to locate maximal cliques and to find clusters for the purpose of dentifying a target
demographic. Unsupervised machine learning techniques are designed and implemented in this project to analyze a dataset from YouTube to discover communities in the social network and find central nodes. Different clustering algorithms are implemented and applied to the YouTube dataset. The well-known Bron-Kerbosch algorithm is used effectively in this research to find maximal cliques. The results obtained
from this research could be used for advertising purposes and for building smart recommendation systems.
All algorithms were implemented using Python programming language. The experimental results show that
we were able to successfully find central nodes through clique-centrality and degree centrality. By utilizing
clique detection algorithms, the research shown how machine learning algorithms can detect close knit
groups within a larger network.
Researching Social Media – Big Data and Social Media AnalysisFarida Vis
Researching Social Media – Big Data and Social Media Analysis, presentation for the Social Media for Researchers: A Sheffield Universities Social Media Symposium, 23 September 2014
The impact of sentiment analysis from user on Facebook to enhanced the servic...IJECEIAES
Facebook's influence on the modern social media platform is undoubtedly enormous. While it has gotten a backlash for its inability to control its influence over important affairs, there are still many questions regarding people's perception of Facebook and their sentiment over Facebook. This paper's role in this ongoing debate is to give a glimpse of people's sentiment and perception of Facebook in recent times. By collecting samples data from Facebook's Top Page, this paper hopes to represent a significant amount of people's aspirations towards this company. By processing the data with a processing tool to construct and model out the data and a sentiment analyzer tool helps determine the sentiment, this paper can deduce a 600-comment worth of processed data. The results from the 600 sampled comments concluded that the sentiments towards Facebook are 41.50% negative comments, 22.83% neutral comments, and 35.67% positive comments.
Presentation to WARC and ESOMAR digital conference on Web2.0. Processes for research innovation, technology and collective intelligence.
Dr Mariann Hardey, Lecturer in Social Media Marketing, Durham Business School, University of Durham.
Social networking sites are a significant source of information to know the behavior of users and to know
what is occupying society of all ages and accordingly helpful information can be provided to specialists
and decision-makers. According to official sources, 98.43% of Saudi youth use social networking sites. The
study and analysis of social media data are done to provide the necessary information to increase
investment opportunities within the Kingdom of Saudi Arabia, by studying and analyzing what people
occupy on the communication sites through their tweets about the labor market and investment. Given the
huge volume of data and also its randomness, a survey of the data will be done and collected from through
keywords, the priority of arranging the data, and recording it as (positive - negative - mixed). The study
analysis and conclusion will be based on data-mining and its techniques of analysis and deduction
.
INCREASING THE INVESTMENT’S OPPORTUNITIES IN KINGDOM OF SAUDI ARABIA BY STUDY...ijcsit
Social networking sites are a significant source of information to know the behavior of users and to know
what is occupying society of all ages and accordingly helpful information can be provided to specialists
and decision-makers. According to official sources, 98.43% of Saudi youth use social networking sites. The
study and analysis of social media data are done to provide the necessary information to increase
investment opportunities within the Kingdom of Saudi Arabia, by studying and analyzing what people
occupy on the communication sites through their tweets about the labor market and investment. Given the
huge volume of data and also its randomness, a survey of the data will be done and collected from through
keywords, the priority of arranging the data, and recording it as (positive - negative - mixed). The study
analysis and conclusion will be based on data-mining and its techniques of analysis and deduction.
Macroeconomics- Movie Location
This will be used as part of your Personal Professional Portfolio once graded.
Objective:
Prepare a presentation or a paper using research, basic comparative analysis, data organization and application of economic information. You will make an informed assessment of an economic climate outside of the United States to accomplish an entertainment industry objective.
Model Attribute Check Company Auto PropertyCeline George
In Odoo, the multi-company feature allows you to manage multiple companies within a single Odoo database instance. Each company can have its own configurations while still sharing common resources such as products, customers, and suppliers.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
This slides describes the basic concepts of ICT, basics of Email, Emerging Technology and Digital Initiatives in Education. This presentations aligns with the UGC Paper I syllabus.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdfTechSoup
In this webinar you will learn how your organization can access TechSoup's wide variety of product discount and donation programs. From hardware to software, we'll give you a tour of the tools available to help your nonprofit with productivity, collaboration, financial management, donor tracking, security, and more.
How to Make a Field invisible in Odoo 17Celine George
It is possible to hide or invisible some fields in odoo. Commonly using “invisible” attribute in the field definition to invisible the fields. This slide will show how to make a field invisible in odoo 17.
BÀI TẬP BỔ TRỢ TIẾNG ANH GLOBAL SUCCESS LỚP 3 - CẢ NĂM (CÓ FILE NGHE VÀ ĐÁP Á...
Research on how_facebook_industry_is_using_data_mining_by_shafiu_umar_abubakar_nov-2019[1]
1. RESEARCH ON HOW FACEBOOK INDUSTRY IS USING DATA MINING
BY
SHAFIU UMAR ABUBAKAR
shafiuumarabubakar@gmail.com
2. 1
INTRODUCTION
DATA MINING
Data mining is the process of discovering patterns in large data sets involving
methods at the intersection of machine learning, statistics, and database systems. Data
mining is an interdisciplinary subfield of computer science and statistics with an
overall goal to extract information (with intelligent methods) from a data set and
transform the information into a comprehensible structure for further use. Data mining
is the analysis step of the "knowledge discovery in databases" process or KDD. Aside
from the raw analysis step, it also involves database and data management aspects,
data pre-processing, model and inference considerations, interestingness metrics,
complexity considerations, post-processing of discovered structures, visualization, and
online updating.
SOME OF THE TECHNIQUES OF DATA MINING
1. CHARACTERIZATION: is used to generalize, summarize and possibly different
data characteristics.
2. CLASSIFICATION: Data classification is a process in which the given data is
classified into different classes according to a classification model.
3. REGRESSION: This process is similar to classification the major difference is that
the object to be predicted is continuous rather than discrete.
4. ASSOCIATION: In this process the association between the objects is found. It
discovers the association between various data bases and the association between
the attributes of single database.
5. CLUSTERING: Involves grouping of data into several new classes such that it
describes the data. It breaks large data set into smaller groups to make the
designing and implementation process to be simple. The task of clustering is to
maximize the similarity between the objects of classes and to reduce the similarity
between the classes.
6. CHANGE DETECTION: This method identifies the significant changes in the data
from the previously measured values.
3. 2
7. DEVIATION DETECTION: Deviation detection focuses on the major deviations
between the actual values of the objects and its expected values. This method finds
out the deviation according to the time as well the deviation among different
subsets of data.
8. LINK ANALYSIS: It traces the connections between the objects to develop models
based on the patterns in the relationships by applying graph theory techniques.
9. SEQUENTIAL PATTERN MINING: This method involves the discovery of the
frequently occurring patterns in the data. Social networks are important sources of
online interactions and contents sharing ,subjectivity , assessments , approaches,
evaluation , influences , observations, feelings, opinions and sentiments
expressions borne out in text, reviews, blogs, discussions, news, remarks,
reactions, or some other documents .
4. 3
FACEBOOK
Facebook, Inc. Is an American online social media and social networking service
company based in Menlo Park, California. It was founded by Mark Zuckerberg, then a
Harvard computer science student on February 4, 2004, in Cambridge, Massachusetts
along with fellow Harvard College students and roommates Eduardo Saverin, Andrew
McCollum, Dustin Moskovitz and Chris Hughes. It is considered one of the Big Four
technology companies along with Amazon, Apple, and Google.
The founders initially limited the website's membership to Harvard students and
subsequently Columbia, Stanford, and Yale students. Membership was eventually
expanded to the remaining Ivy League schools, MIT, and higher education institutions
in the Boston area, then various other universities, and lastly high school students.
Since 2006, anyone who claims to be at least 13 years old has been allowed to become
a registered user of Facebook, though this may vary depending on local laws. The
name comes from the face book directories often given to American university
students. Facebook held its initial public offering (IPO) in February 2012, valuing the
company at $104 billion, the largest valuation to date for a newly listed public
company. Facebook makes most of its revenue from advertisements that appear
onscreen and in users' News Feeds.
The Facebook service can be accessed from devices with Internet connectivity, such
as personal computers, tablets and smartphones. After registering, users can create
customized profile revealing information about themselves. They can post text, photos
and multimedia which is shared with any other users that have agreed to be their
"friend", or, with a different privacy setting, with any reader. Users can also use
various embedded apps, join common-interest groups, and receive notifications of
their friends' activities. Facebook claimed that had more than 2.3 billion monthly
active users as of December 2018. However, it faces a big problem of fake accounts. It
caught 3 billion fake accounts in the last quarter of 2018 and the first quarter of 2019.
Many critics questioned whether Facebook knows how many actual users it has.
Facebook is now one of the world's most valuable companies.
6. 5
HOW FACEBOOK IS USING DATA MINING
PREAMBLE: Data mining techniques play a fundamental role in extracting
correlation patterns between personality and variety of user‟s data captured from
multiple sources. Generally, two approaches were adopted for studying personality
traits of social network users. The first approach uses a variety of machine learning
algorithms to build models based on social network activities only. The second one
extends the personality-related features with linguistic cues (Mairesse et al. 2007;
Oberlander and Nowson 2006). Several classification and regression techniques were
used to build predictive personality models along the five personality dimensions
using the linguistic features of a dataset comprised of few thousand essays solicited
from introductory psychology students (Mairesse et al. 2007).
Social media accelerates innovation, drives cost savings, and strengthens brands
through mass collaboration. Across every industry, companies are using social media
platforms to market and hype up their services and products, along with monitoring
what the audience is saying about their brand.
Possibly Facebook is the world‟s most popular social media network with more
than two billion monthly active users worldwide, Facebook stores enormous amounts
of user data, making it a massive data wonderland. It‟s estimated that there are more
than 169 million Facebook users in the United States alone by 2018. Facebook is the
fifth most valuable public company in the world, with a market value of
approximately $321 billion.
Every day, we feed Facebook‟s data beast with mounds of information. Every 60
seconds, 136,000 photos are uploaded, 510,000 comments are posted, and 293,000
status updates are posted. That is a LOT of data. At first, this information may not
seem to mean very much. But with data like this, Facebook knows who our friends
are, what we look like, where we are, what we are doing, our likes, our dislikes, and
so much more. Some researchers even say Facebook has enough data to know us
better than our therapists!
Apart from Google, Facebook is probably the only company that possesses this
high level of detailed customer information. The more users who use Facebook, the
more information they amass. Heavily investing in their ability to collect, store, and
analyze data, Facebook does not stop there. Apart from analyzing user data, Facebook
has other ways of determining user behavior.
7. 6
SUMMARY OF HOW FACEBOOK IS USING DATA MINING
TRACKING COOKIES: Facebook tracks its users across the web by using tracking
cookies. If a user is logged into Facebook and simultaneously browses other
websites, Facebook can track the sites they are visiting. This allows Facebook to
know more about the user‟s likes and dislikes.
FACIAL RECOGNITION: One of Facebook‟s latest investments has been in facial
recognition and image processing capabilities. Facebook can track its users across
the internet and other Facebook profiles with image data provided through user
sharing.
ANALYZING THE LIKES: A recent study conducted showed that is practicable to
predict data accurately on a range of personal attributes that are highly sensitive
just by analyzing a user‟s Facebook Likes. Work conducted by researchers at
Cambridge University and Microsoft Research show how the patterns of Facebook
Likes can very accurately predict your sexual orientation, satisfaction with life,
intelligence, emotional stability, religion, alcohol use and drug use, relationship
status, age, gender, race, and political views among many others.
TAG SUGGESTIONS: Facebook suggests who to tag in user photos through image
processing and facial recognition. And this is typical use of data mining.
USING BIG DATA: Videos on Facebook that shows a “flashback” of posts, likes,
or images, like the ones you might see on your birthday, or on the Friendversary
that is anniversary of becoming friends with someone is example of big data uses
by Facebook
Facebook Inc. analytics Chief Ken Rudin says, “Big Data is crucial to the
company‟s very being.” He goes on to say that, “Facebook relies on a massive
installation of Hadoop, a highly scalable open-source framework that uses clusters
of low-cost servers to solve problems. Facebook even designs its own hardware for
this purpose. Hadoop is just one of many Big Data technologies employed at
Facebook.”
THE FLASHBACK: Honoring its 10th anniversary, Facebook offered its users the
option of viewing and sharing a video that traces the course of their social network
activity from the date of registration till the present. Called the “Flashback,”
8. 7
This video is a collection of photos and posts that received the most comments and
likes and set to nostalgic background music. Other videos have been created since
then, including those you can view and share to celebrate a “Friendversary,” the
anniversary of two people becoming friends on Facebook. You‟ll also be able to
see a special video on your birthday.
TOPIC DATA: Is a Facebook technology that displays to marketers the responses
of the audience with regard to brands, events, activities, and subjects, in a way that
keeps their personal information private. Marketers use the information from topic
data to selectively change the way they market on the platform as well as other
channels.
SELLING DATA TO MARKETERS: Facebook is a free social media; data is the
currency of free social media. Facebook sell the behavior that a user shares or
expresses to marketers. In 2016 around 98 data points are used to be sold to
interested companies such as user„s gender, age, political position, relationship
status and many more (Glum, 2018). Facebook also uses the user„s profile and
internet activity for advertisement purposes and it has generated profits at a rate of
about US$20.21 per user in 2017 (Glum, 2018).
MARKETING AND ADVATISEMENT: Facebook provides users with a free social
networking site. In return, users can provide basic demographic information
through their likes, interests, and activities shared in their accounts. All of this
information is very valuable to marketers who want to pay Facebook to place their
ads on the target consumer„s accounts. Many companies/organizations even
individuals specifically advertisers uses Facebook services for their advantages.
Due to this massive gold mine of data, advertisers wait like hungry vultures. In
fact, the 2015 Social Media Marketing Industry Report stated that Facebook is the
No one 1 social platform for marketers.
COLLECTING DATA: Everything a user does in Facebook is collected, processed
and shared to friends and user„s network. It is also shared to a certain level to
Facebook third-party partners, sometimes are taken advantage by them. This is
characteristic using of data mining. The reason why Facebook‟s marketing
solutions are so profitable is because of the size of the reach Facebook has on
personal data. Not only does Facebook collect data from their own social media
platform, but also across Instagram, Messenger, and WhatsApp, which are some of
the biggest social media platforms that are around today . Along with the data that
these platforms provide, Facebook also collects data from developers using the
9. 8
Facebook Pixel (which is a code website. It collect data that help you track
conversation from Facebook ads, build targeted audiences for future ads, remarket
to people who have already taken some kind of action on your website). You place
on your on their website and the developers using the Facebook SDK in their app.
These two products provide Facebook with data that was previously unattainable
with their social media platforms. The Facebook SDK allows any app developer to
utilize the Facebook login system and ad delivery system, and in exchange,
Facebook gets to expand their scope on people‟s personal data. To put the power of
the Facebook Pixel and SDK in perspective, let‟s say a person was browsing an
online shopping website that was running the Facebook Pixel. The person winds up
looking at a specific product but does not end up buying the product. The
Facebook Pixel keeps track of this person‟s potential interest in buying the product
for later ad delivery. The person then goes on their phone to play a game that is
running the Facebook SDK. While playing their game, it is very likely that an ad
will appear showing the same product that this person was contemplating on
purchasing previously.
11. 10
LITERATURE REVIEW
The society has benefited so much from the advances in information and
communications technologies. In the past, people need to visit libraries in best
universities or monasteries where precious manuscripts were hoarded (Hean, 2000).
The development of digital computers opens its way to networked communications
which has revolutionized information sharing. In 1969, the first computer-to-computer
link was established on ARPANET or the Advanced Research Projects Agency
Network.
Social media refers to websites and applications that are designed to allow people to
share content quickly, efficiently, and in real-time. Many people define social media
as apps on their smartphone or tablet, but the truth is, this communication tool started
with computers. Social media serves as a great location for data mining because there
is an immense quantity of data, a wide variety of structured and unstructured data, and
great speed at which new data is added. In an article in Yale Law, the writer describes
social media mining as a company or organization collecting data from a large amount
of social media users in order to draw conclusions about the populations of users
In this report, we describe and evaluate Facebook‟s revolutionary data mining
algorithms that allow advertisers to target consumers through location, demographics,
interests, behavior, and connections. In addition, we discuss how their use of data
mining gives them the power to sway and change people‟s opinions on companies and
political persons.
Facebook has become more powerful now more than ever. As it expands its
network, more and more information becomes available for the company to use for
various purposes for an improved experience of a Facebook user and benefit of the
company. The social capital to which people have invested in the social media
platform made it more difficult for people to quit. The review of the literature revealed
how Facebook is using data mining and making reports from that. It is however very
evident that even if people who are aware of the risks involved still patronize
Facebook. Some may have tinkered with their privacy setting to improve privacy
control but their actions/activities on the net make that extra precaution useless. As the
internet becomes more of a necessity than choice, more and more information is
readily available for data mining through Facebook or other platforms capable of
phishing valuable information such as passwords.
12. 11
Facebook and Google are just two of many websites that keeps information about
the users. However, most of the user is not aware of the volume and value of
information they are providing. Ideally, software companies must share responsibility
about the products and services they offer in a manner that both helps protect
customers from potential harm (Kutterer, nd.).
PERSONALITY DATA: The study of personality reflected in user‟s Facebook
activities includes a wide range of features. Some of the most intuitively predicted
indices are the statistical data for user‟s activities (e.g., number of likes, statuses,
groups, tags, events). Demographic characteristics such as: age and gender, were
accounted for since their effect is known to manifest in the context under investigation
(Golbeck, Robles, and Turner 2011). Self-centered network parameters representing
the number of friends, and measures such as density, brokerage, and betweenness,
provide additional insight into user‟s social behavior instrumental in assessing
personality along several dimensions (Celli et al. 2013).
BASIC LINGUISTIC FEATURES: Several studies have pointed out to the significant
correlation between personality and the spoken or written linguistic cues (Pennebaker
and King 1999; Mairesse et al. 2007). A selected set of categories, which correspond
to the LIWC linguistic process variables (e.g., word count, words per sentence/status,
sentences per status, punctuation count and lexical diversity) have been accounted for.
In addition, a list of words related to online jargon, chat acronyms emoticons,
profanities, and slang words were included to account for the specific language use
exhibited by social network users.
13. 12
CONCLUSION AND RECOMMENDATION
Social media data is clearly the largest, richest and most dynamic evidence base
of human behavior, bringing new opportunities to understand individuals,
groups and society. Innovative scientists and industry professionals are
increasingly finding novel ways of automatically collecting, combining and
analyzing this wealth of data.
Possibly Facebook is the world‟s most popular social media network with more
than two billion monthly active users worldwide, Facebook stores enormous
amounts of user data, making it a massive data wonderland. And Facebook is
using data mining to extract useful data for advertisement, marketers, and ads.
It is important that researchers have access to computational environments and
especially „big‟ social media data for experimentation. Otherwise,
computational social science could become the exclusive domain of major
companies, government agencies and a privileged set of academic researchers
presiding over private data from which they produce papers that cannot be
critiqued or replicated. Because, the biggest concern is that companies are
increasingly restricting access to their data to monetize their content.
Facebook should have more data centers and at least one in each continents in
the world, so that they will improve the usage of data mining.