Presenting a tool for identifying political popularity over Twitter. AthPPA (which stands for Athena Political Popularity Analysis) is a tool for identifying how popular a political leader is over Twitter. For the purposes of this dissertation the Twitter accounts of the most prominent Greek political leaders have been identified. Structured data such as likes, re-tweets, text-length per tweet as well as the number of subscribers per account have been visualized. Furthermore, sentiment analysis is calculated and visualized using spaCy module and a sentiment lexicon which contains a set of emotion based labeled words.
The document provides a detailed history and timeline of the development of the Internet from 1836 to 1997. Some of the key events and developments include:
- The origins of packet switching networks developed in the 1960s for the U.S. military which helped enable the future development of the Internet.
- The birth of the Internet in 1969 with the creation of ARPANET and the connection of four computer nodes at universities in California and Utah.
- The development of electronic mail in 1971 which allowed people to communicate over the network and is still a primary form of communication today.
- The creation of the World Wide Web in 1991 by Tim Berners-Lee which revolutionized access to information and communications.
1) Joseph Licklider at DARPA developed the idea of a universal network and wanted different computer systems used by the military to be able to communicate.
2) Researchers at BBN and MIT created the first Interface Message Processors (IMPs) and connected the first two nodes, helping establish the foundations of the internet.
3) Key internet innovations included TCP/IP, packet switching, routers, fiber optics, Ethernet, HTML, and multimedia standards which were developed in the 1970s and 1980s.
1. Belgian information expert Paul Otlet imagined a "Radiated Library" in the 1930s that used technology of the time like telephone and radio to create something similar to the internet.
2. In the 1960s, researchers at universities and technology companies in the US began experimenting with and developing early versions of packet switching and inter-connected computer networks that would later become known as the ARPANET and lay the foundation for the internet.
3. By the late 1970s, TCP/IP emerged as the main protocol for ARPANET and helped connect different networks, establishing the internet. This allowed for greater connectivity and use of the internet throughout the 1980s.
1) Joseph Licklider at DARPA developed the idea of a universal network and wanted different computer systems used by the military to be able to communicate.
2) Researchers at BBN and MIT created the first Interface Message Processors (IMPs) and connected the first two nodes, helping establish the foundations of the internet.
3) Key internet innovations included TCP/IP, packet switching, routers, fiber optics, Ethernet, HTML, and multimedia standards which were developed in the 1970s and 1980s.
CATComputer Applied Training...the history and basic concepts provides an overview of the history of the internet from 1969 to 1997. It discusses key milestones such as the development of ARPANET, TCP/IP, email, the world wide web, and commercialization. The document also covers pros and cons of internet use and training as well as basics of netiquette and social networking.
The Links that Became a Web: The 45-year Old Internet and the 25-Year Old WebJohan Koren
The document provides a history of the development of the Internet and World Wide Web over several decades. It describes how ARPANET, the earliest precursor to the Internet, was developed in the 1960s as a military network and then expanded. Key developments included the creation of email in the early 1970s, and connections between different networks in 1975 which marked the beginning of the Internet. In the 1980s, uses expanded beyond research and the military with the rise of bulletin board systems and search tools like Gopher, Veronica, and Archie. The World Wide Web was invented in 1989-1991, using HTML and HTTP. The first graphical web browser, Mosaic, helped popularize the Web in the early 1990s.
The document provides a timeline of key events in the development of the Internet from 1957-1990. It describes the launch of Sputnik which led to the creation of ARPA and early computer networking research. Important developments include the ARPANET being built in the late 1960s, the transmission of the first message between networked computers in 1969, and the publication of the TCP/IP protocol in 1974. The timeline also covers the expansion of networking internationally and the creation of technologies like email, DNS, and the World Wide Web.
The document provides a detailed timeline and history of the development of the Internet from the late 1950s to present day. It traces the early research on packet switching networks funded by ARPA/DARPA and the development of TCP/IP in the 1970s. Key events include the creation of the ARPANET in the 1960s, the World Wide Web in the early 1990s, and the commercialization of the Internet in the mid-1990s. The document also describes common Internet protocols like HTTP, DNS, and how IP addresses and domain names work.
The document provides a detailed history and timeline of the development of the Internet from 1836 to 1997. Some of the key events and developments include:
- The origins of packet switching networks developed in the 1960s for the U.S. military which helped enable the future development of the Internet.
- The birth of the Internet in 1969 with the creation of ARPANET and the connection of four computer nodes at universities in California and Utah.
- The development of electronic mail in 1971 which allowed people to communicate over the network and is still a primary form of communication today.
- The creation of the World Wide Web in 1991 by Tim Berners-Lee which revolutionized access to information and communications.
1) Joseph Licklider at DARPA developed the idea of a universal network and wanted different computer systems used by the military to be able to communicate.
2) Researchers at BBN and MIT created the first Interface Message Processors (IMPs) and connected the first two nodes, helping establish the foundations of the internet.
3) Key internet innovations included TCP/IP, packet switching, routers, fiber optics, Ethernet, HTML, and multimedia standards which were developed in the 1970s and 1980s.
1. Belgian information expert Paul Otlet imagined a "Radiated Library" in the 1930s that used technology of the time like telephone and radio to create something similar to the internet.
2. In the 1960s, researchers at universities and technology companies in the US began experimenting with and developing early versions of packet switching and inter-connected computer networks that would later become known as the ARPANET and lay the foundation for the internet.
3. By the late 1970s, TCP/IP emerged as the main protocol for ARPANET and helped connect different networks, establishing the internet. This allowed for greater connectivity and use of the internet throughout the 1980s.
1) Joseph Licklider at DARPA developed the idea of a universal network and wanted different computer systems used by the military to be able to communicate.
2) Researchers at BBN and MIT created the first Interface Message Processors (IMPs) and connected the first two nodes, helping establish the foundations of the internet.
3) Key internet innovations included TCP/IP, packet switching, routers, fiber optics, Ethernet, HTML, and multimedia standards which were developed in the 1970s and 1980s.
CATComputer Applied Training...the history and basic concepts provides an overview of the history of the internet from 1969 to 1997. It discusses key milestones such as the development of ARPANET, TCP/IP, email, the world wide web, and commercialization. The document also covers pros and cons of internet use and training as well as basics of netiquette and social networking.
The Links that Became a Web: The 45-year Old Internet and the 25-Year Old WebJohan Koren
The document provides a history of the development of the Internet and World Wide Web over several decades. It describes how ARPANET, the earliest precursor to the Internet, was developed in the 1960s as a military network and then expanded. Key developments included the creation of email in the early 1970s, and connections between different networks in 1975 which marked the beginning of the Internet. In the 1980s, uses expanded beyond research and the military with the rise of bulletin board systems and search tools like Gopher, Veronica, and Archie. The World Wide Web was invented in 1989-1991, using HTML and HTTP. The first graphical web browser, Mosaic, helped popularize the Web in the early 1990s.
The document provides a timeline of key events in the development of the Internet from 1957-1990. It describes the launch of Sputnik which led to the creation of ARPA and early computer networking research. Important developments include the ARPANET being built in the late 1960s, the transmission of the first message between networked computers in 1969, and the publication of the TCP/IP protocol in 1974. The timeline also covers the expansion of networking internationally and the creation of technologies like email, DNS, and the World Wide Web.
The document provides a detailed timeline and history of the development of the Internet from the late 1950s to present day. It traces the early research on packet switching networks funded by ARPA/DARPA and the development of TCP/IP in the 1970s. Key events include the creation of the ARPANET in the 1960s, the World Wide Web in the early 1990s, and the commercialization of the Internet in the mid-1990s. The document also describes common Internet protocols like HTTP, DNS, and how IP addresses and domain names work.
The document provides a history of the development of the Internet from the 1960s to present day. It begins with the conception of early computer networks by the US Department of Defense and military researchers. These early networks grew throughout the 1960s and 1970s, eventually connecting many universities. Standards and protocols like TCP/IP were established in the 1970s and 1980s, allowing for the Internet as we know it today. Commercial use expanded in the 1990s with technologies like the World Wide Web, and today the Internet connects billions of people and sites globally.
The document provides a history of the development of the Internet from its origins in the 1960s to the present day. It describes how the Internet began as a project of the US military called ARPANET to create a decentralized network that could withstand attacks. Key developments included the creation of email in 1972, the introduction of TCP/IP in the 1970s which allowed different networks to connect, the launch of the World Wide Web in the early 1990s, and the rise of popular services and technologies like search engines, browsers, social media, and ecommerce in the late 1990s. The document also discusses some of the ongoing issues surrounding the Internet like privacy, security, and the digital divide.
The Internet is a global system of interconnected computer networks that use TCP/IP protocols. It originated in the 1960s from US military research networks and has since grown exponentially to include networks run by universities, businesses, governments, and individuals worldwide. The Internet allows for communication via applications like the World Wide Web, email, file transfers, online gaming, and voice/video calls. It is estimated that a quarter of the world's population now uses the Internet.
Short presentation made at attempting to demonstrate the fast growth of the Internet.
Includes pictures of early ArpaNet diagrams, reproduced without permission but found openly on the Internet.
Most of the other material (host files as well as screen captures of early browser activity) is mine.
The document provides a brief history of the development of the Internet from 1969 to 2001. It describes key events and inventions such as ARPANET in 1969, the development of Ethernet in 1976, the creation of TCP/IP and other internet protocols in 1983, the release of the World Wide Web in 1992, the commercialization of the internet in the mid-1990s, and the internet boom of 1999-2001. The summary also mentions some of the major players involved including Al Gore, Steve Jobs, Bill Gates, and companies like AOL.
The history of the Internet began in 1957 with the creation of ARPANET by the US Department of Defense. ARPANET connected four university sites in 1969 and was the precursor to the Internet. By 1980, 200 host computers were connecting over 20,000 users across university, military, and government networks. Major developments in the 1980s included the domain name system and the first widespread virus. Growth continued exponentially, and by the mid-2000s the Internet included over 90 million websites and popular services like Google and YouTube.
Activity 10 - Timeline History of InternetCrystal Gozon
The document outlines major events in the history of the internet from 1957 to 2010, including the launch of Sputnik which led to the creation of ARPA and funding for early computer networking research, the development of packet switching and protocols like TCP/IP which enabled interconnected computer networks, the creation of technologies like email, the World Wide Web, and browsers, and key figures who helped develop and expand access to the internet globally.
The document provides information on the history and development of the Internet. It discusses how early research networks like ARPANET were developed in the 1960s and 1970s and the development of key protocols like TCP/IP. The document then summarizes the role of several pioneers in the development of early computer networks and the Internet, including Paul Baran who developed the concept of packet switching, Vint Cerf and Robert Kahn who created TCP/IP, and Tim Berners-Lee who invented the World Wide Web.
The document provides an overview of the history and evolution of the Internet from its earliest origins in the 19th century to modern times. It discusses pioneering figures like Vannevar Bush who established the US military/university research partnership that developed ARPANET, and Claude Shannon who established information theory and the concept of bits. It also summarizes the work of Paul Baran who developed the concept of packet switching networks, and J.C.R. Licklider who developed the idea of a universal network and inspired the creation of ARPANET.
1) The US government established ARPA (Advanced Research Projects Agency) in response to the Soviet Union launching the first satellite, and ARPA developed ARPANET, the precursor to the Internet, to enable easier communication between researchers.
2) In the 1980s, networks started connecting internationally through NSFNET, allowing over 250 networks to connect outside of the US by 1990.
3) Tim Berners-Lee invented the World Wide Web at CERN in 1989 to allow scientists to easily share research over the newly developing Internet. He created HTML, URLs, and the first web browser.
The document provides an overview of the history and development of the Internet. It discusses that the ARPANET, developed by ARPA (Advanced Research Projects Agency), was the first network to use packet switching and laid the foundation for the Internet. The ARPANET connected 4 computers at UCLA, SRI, UCSB, and the University of Utah in 1969. It then expanded to connect 18 sites by 1971 and nearly 100 sites by 1980. The ARPANET was shut down in 1990 but commercial backbones were established, becoming the current Internet.
- The creation of ARPA in 1958 by the United States in response to the USSR's launch of Sputnik led to funding for early computer network research.
- ARPA's Information Processing Technology Office (IPTO) funded the development of the ARPANET, connecting four computers at UCLA and SRI in 1969, which demonstrated that packet switching networks could work.
- Standardization of TCP/IP protocols in the 1970s allowed different networks to interconnect, and by the 1980s commercial internet providers emerged, connecting networks like Usenet and BITNET and allowing widespread growth of the internet.
The document provides a history of the internet and its evolution from early concepts in the 1940s-1950s to widespread adoption in the 1990s. It discusses pioneering concepts such as packet switching, TCP/IP, and the ARPANET. Key individuals who contributed ideas and research that led to the development of the internet are also profiled, including Vannevar Bush, Claude Shannon, J.C.R. Licklider, Paul Baran, Vinton Cerf, and Tim Berners-Lee.
The history of the Internet began in the 1960s with the creation of ARPANET by the Department of Defense. Key developments included the adoption of TCP/IP protocols in the 1980s and the launch of the NSFNET in the mid-1980s. In the 1990s, the introduction of the World Wide Web and graphical web browsers like Mosaic and Netscape made the Internet easily accessible to the general public and sparked its rapid growth.
The document provides an overview of Epix, an Internet service provider established in 1994. It summarizes Epix's services such as dial-up and high-speed Internet access, email accounts, web hosting, and technical support hours. The document also outlines Epix's various Internet access plans and premium features included with each plan.
The document discusses the origins and development of the Internet. It describes how ARPANET, developed by DARPA, was the first wide area network and used packet switching, laying the foundation for the Internet. It then explains how NSFNET, created by the National Science Foundation, helped spread network usage by connecting to ARPANET and regional networks after ARPANET was discontinued. Finally, it discusses how the World Wide Web, invented by Tim Berners-Lee in 1989, made the Internet useful for human communication through technologies like HTML, URIs, and HTTP.
Leonard Kleinrock established the Network Measurement Center at UCLA which connected to the first Interface Message Processor, becoming the first node on the ARPANET and Internet. Lawrence Roberts led the overall ARPANET system design as the program manager and held a design session in 1967 that outlined standards for user identification, character transmission, and error checking which helped launch the ARPANET project. Paul Baran developed the concept of packet switching networks while at RAND which influenced Roberts and Kleinrock to adopt that technology for the ARPANET.
The document provides a timeline history of the internet from 1957 to 2012. It describes several key events and innovations such as the creation of ARPA which led to the development of packet switching and the ARPANET, the first email program, the introduction of TCP/IP, the creation of the World Wide Web by Tim Berners-Lee, and the launch of popular websites and technologies like Google, Facebook, YouTube and Wikipedia. The timeline traces the evolution of the internet from its origins as a US military network to the ubiquitous technology and tool it is today.
The internet began in the 1950s as a US government network called ARPANET to enable communication in case of nuclear war. In the 1960s, scientists developed packet switching and email, allowing computers to communicate. In the 1970s, protocols like TCP/IP were developed to connect different networks, and the term "internet" was coined. The 1980s saw widespread adoption among researchers, and domains and browsers launched in the 1990s, making the internet publicly accessible. The web, e-commerce, social media, and mobile use transformed the internet in the 2000s into a global communication platform.
This document provides an overview of the history and evolution of the Internet. It discusses how the Internet originated from early computer networks developed by ARPA and DARPA in the 1960s-1970s to connect government and university research computers. It then covers the creation of TCP/IP in the 1970s which established the fundamental communication protocols of the Internet. The document also summarizes the commercial opening of the Internet in the 1990s and the creation of the World Wide Web in 1989-1991 by Tim Berners-Lee, which allowed for easy access to hyperlinked documents and multimedia over the Internet.
The Internet began in the 1960s as a research project funded by the U.S. Defense Department to connect computers at universities and research labs. Early pioneers like Leonard Kleinrock, Vint Cerf, and Robert Kahn developed the basic networking and communication protocols that allowed different networks to interconnect. In the 1980s and 1990s, the Internet grew exponentially as it became commercially available and the World Wide Web brought hypertext and multimedia to online content. Key challenges going forward include expanding high-speed access, transitioning to a new Internet protocol, and enhancing security against cyber threats.
The document provides a history of the development of the Internet from the 1960s to present day. It begins with the conception of early computer networks by the US Department of Defense and military researchers. These early networks grew throughout the 1960s and 1970s, eventually connecting many universities. Standards and protocols like TCP/IP were established in the 1970s and 1980s, allowing for the Internet as we know it today. Commercial use expanded in the 1990s with technologies like the World Wide Web, and today the Internet connects billions of people and sites globally.
The document provides a history of the development of the Internet from its origins in the 1960s to the present day. It describes how the Internet began as a project of the US military called ARPANET to create a decentralized network that could withstand attacks. Key developments included the creation of email in 1972, the introduction of TCP/IP in the 1970s which allowed different networks to connect, the launch of the World Wide Web in the early 1990s, and the rise of popular services and technologies like search engines, browsers, social media, and ecommerce in the late 1990s. The document also discusses some of the ongoing issues surrounding the Internet like privacy, security, and the digital divide.
The Internet is a global system of interconnected computer networks that use TCP/IP protocols. It originated in the 1960s from US military research networks and has since grown exponentially to include networks run by universities, businesses, governments, and individuals worldwide. The Internet allows for communication via applications like the World Wide Web, email, file transfers, online gaming, and voice/video calls. It is estimated that a quarter of the world's population now uses the Internet.
Short presentation made at attempting to demonstrate the fast growth of the Internet.
Includes pictures of early ArpaNet diagrams, reproduced without permission but found openly on the Internet.
Most of the other material (host files as well as screen captures of early browser activity) is mine.
The document provides a brief history of the development of the Internet from 1969 to 2001. It describes key events and inventions such as ARPANET in 1969, the development of Ethernet in 1976, the creation of TCP/IP and other internet protocols in 1983, the release of the World Wide Web in 1992, the commercialization of the internet in the mid-1990s, and the internet boom of 1999-2001. The summary also mentions some of the major players involved including Al Gore, Steve Jobs, Bill Gates, and companies like AOL.
The history of the Internet began in 1957 with the creation of ARPANET by the US Department of Defense. ARPANET connected four university sites in 1969 and was the precursor to the Internet. By 1980, 200 host computers were connecting over 20,000 users across university, military, and government networks. Major developments in the 1980s included the domain name system and the first widespread virus. Growth continued exponentially, and by the mid-2000s the Internet included over 90 million websites and popular services like Google and YouTube.
Activity 10 - Timeline History of InternetCrystal Gozon
The document outlines major events in the history of the internet from 1957 to 2010, including the launch of Sputnik which led to the creation of ARPA and funding for early computer networking research, the development of packet switching and protocols like TCP/IP which enabled interconnected computer networks, the creation of technologies like email, the World Wide Web, and browsers, and key figures who helped develop and expand access to the internet globally.
The document provides information on the history and development of the Internet. It discusses how early research networks like ARPANET were developed in the 1960s and 1970s and the development of key protocols like TCP/IP. The document then summarizes the role of several pioneers in the development of early computer networks and the Internet, including Paul Baran who developed the concept of packet switching, Vint Cerf and Robert Kahn who created TCP/IP, and Tim Berners-Lee who invented the World Wide Web.
The document provides an overview of the history and evolution of the Internet from its earliest origins in the 19th century to modern times. It discusses pioneering figures like Vannevar Bush who established the US military/university research partnership that developed ARPANET, and Claude Shannon who established information theory and the concept of bits. It also summarizes the work of Paul Baran who developed the concept of packet switching networks, and J.C.R. Licklider who developed the idea of a universal network and inspired the creation of ARPANET.
1) The US government established ARPA (Advanced Research Projects Agency) in response to the Soviet Union launching the first satellite, and ARPA developed ARPANET, the precursor to the Internet, to enable easier communication between researchers.
2) In the 1980s, networks started connecting internationally through NSFNET, allowing over 250 networks to connect outside of the US by 1990.
3) Tim Berners-Lee invented the World Wide Web at CERN in 1989 to allow scientists to easily share research over the newly developing Internet. He created HTML, URLs, and the first web browser.
The document provides an overview of the history and development of the Internet. It discusses that the ARPANET, developed by ARPA (Advanced Research Projects Agency), was the first network to use packet switching and laid the foundation for the Internet. The ARPANET connected 4 computers at UCLA, SRI, UCSB, and the University of Utah in 1969. It then expanded to connect 18 sites by 1971 and nearly 100 sites by 1980. The ARPANET was shut down in 1990 but commercial backbones were established, becoming the current Internet.
- The creation of ARPA in 1958 by the United States in response to the USSR's launch of Sputnik led to funding for early computer network research.
- ARPA's Information Processing Technology Office (IPTO) funded the development of the ARPANET, connecting four computers at UCLA and SRI in 1969, which demonstrated that packet switching networks could work.
- Standardization of TCP/IP protocols in the 1970s allowed different networks to interconnect, and by the 1980s commercial internet providers emerged, connecting networks like Usenet and BITNET and allowing widespread growth of the internet.
The document provides a history of the internet and its evolution from early concepts in the 1940s-1950s to widespread adoption in the 1990s. It discusses pioneering concepts such as packet switching, TCP/IP, and the ARPANET. Key individuals who contributed ideas and research that led to the development of the internet are also profiled, including Vannevar Bush, Claude Shannon, J.C.R. Licklider, Paul Baran, Vinton Cerf, and Tim Berners-Lee.
The history of the Internet began in the 1960s with the creation of ARPANET by the Department of Defense. Key developments included the adoption of TCP/IP protocols in the 1980s and the launch of the NSFNET in the mid-1980s. In the 1990s, the introduction of the World Wide Web and graphical web browsers like Mosaic and Netscape made the Internet easily accessible to the general public and sparked its rapid growth.
The document provides an overview of Epix, an Internet service provider established in 1994. It summarizes Epix's services such as dial-up and high-speed Internet access, email accounts, web hosting, and technical support hours. The document also outlines Epix's various Internet access plans and premium features included with each plan.
The document discusses the origins and development of the Internet. It describes how ARPANET, developed by DARPA, was the first wide area network and used packet switching, laying the foundation for the Internet. It then explains how NSFNET, created by the National Science Foundation, helped spread network usage by connecting to ARPANET and regional networks after ARPANET was discontinued. Finally, it discusses how the World Wide Web, invented by Tim Berners-Lee in 1989, made the Internet useful for human communication through technologies like HTML, URIs, and HTTP.
Leonard Kleinrock established the Network Measurement Center at UCLA which connected to the first Interface Message Processor, becoming the first node on the ARPANET and Internet. Lawrence Roberts led the overall ARPANET system design as the program manager and held a design session in 1967 that outlined standards for user identification, character transmission, and error checking which helped launch the ARPANET project. Paul Baran developed the concept of packet switching networks while at RAND which influenced Roberts and Kleinrock to adopt that technology for the ARPANET.
The document provides a timeline history of the internet from 1957 to 2012. It describes several key events and innovations such as the creation of ARPA which led to the development of packet switching and the ARPANET, the first email program, the introduction of TCP/IP, the creation of the World Wide Web by Tim Berners-Lee, and the launch of popular websites and technologies like Google, Facebook, YouTube and Wikipedia. The timeline traces the evolution of the internet from its origins as a US military network to the ubiquitous technology and tool it is today.
Similar to Design and development of a web-based data visualization software for political tendency identification of Twitter’s users using Python Dash Framework
The internet began in the 1950s as a US government network called ARPANET to enable communication in case of nuclear war. In the 1960s, scientists developed packet switching and email, allowing computers to communicate. In the 1970s, protocols like TCP/IP were developed to connect different networks, and the term "internet" was coined. The 1980s saw widespread adoption among researchers, and domains and browsers launched in the 1990s, making the internet publicly accessible. The web, e-commerce, social media, and mobile use transformed the internet in the 2000s into a global communication platform.
This document provides an overview of the history and evolution of the Internet. It discusses how the Internet originated from early computer networks developed by ARPA and DARPA in the 1960s-1970s to connect government and university research computers. It then covers the creation of TCP/IP in the 1970s which established the fundamental communication protocols of the Internet. The document also summarizes the commercial opening of the Internet in the 1990s and the creation of the World Wide Web in 1989-1991 by Tim Berners-Lee, which allowed for easy access to hyperlinked documents and multimedia over the Internet.
The Internet began in the 1960s as a research project funded by the U.S. Defense Department to connect computers at universities and research labs. Early pioneers like Leonard Kleinrock, Vint Cerf, and Robert Kahn developed the basic networking and communication protocols that allowed different networks to interconnect. In the 1980s and 1990s, the Internet grew exponentially as it became commercially available and the World Wide Web brought hypertext and multimedia to online content. Key challenges going forward include expanding high-speed access, transitioning to a new Internet protocol, and enhancing security against cyber threats.
The document provides a brief history of the development of the Internet from the 1950s to the present. It describes how early computer networks developed in the US, Britain, and France led to the creation of ARPANET in 1969. Key events included the introduction of email in 1972, the development of TCP/IP allowing network interconnection in the 1970s, and the creation of the World Wide Web in 1991 using hypertext transfer protocol. The commercialization of the Internet in the 1990s led to widespread adoption and access.
The Internet began in the 1960s as a research project funded by the U.S. Defense Department to explore connecting computers together using packet switching networks. Key figures in the development of the Internet include Leonard Kleinrock, who did early work in packet switching, Vint Cerf and Robert Kahn who defined the Internet Protocol (IP), and Tim Berners-Lee who developed the World Wide Web. Through the 1970s and 1980s, protocols like TCP/IP were developed to connect different networks together and by the 1990s the Internet had exploded in popularity with the widespread adoption of web browsers and commercial internet providers.
The Internet began in the 1960s as a research project funded by the U.S. Department of Defense to connect computers at universities and research labs. Early pioneers like Leonard Kleinrock, Vint Cerf, and Robert Kahn developed the basic networking and communication protocols that allowed different networks to interconnect. In the 1980s and 1990s, the Internet grew exponentially as it transitioned from a research project to a publicly used network, fueled by the development of the World Wide Web in the early 1990s. Today, challenges remain around increasing access speeds, transitioning to a new Internet protocol, and improving security against cyber threats.
This document provides an overview of the history and development of computer networks and the internet. It discusses the early development of packet switching in the 1960s by researchers at MIT, RAND, and the UK. It also describes the creation of ARPANET in the late 1960s and early 1970s and its growth. Subsequent sections discuss the proliferation of networks in the 1980s and 1990s driven by NSFNET and the development of the World Wide Web. The document concludes by outlining some of the key hardware components of networks and benefits and disadvantages of computer networks.
The document provides an overview of the history and development of the Internet. It discusses how the Internet started as a small network called ARPANET connecting only a few computers and has grown exponentially over time. Key developments include the creation of TCP/IP in the 1970s allowing different networks to connect, the naming of the interconnected networks as "the Internet" in the 1980s, and the creation of the World Wide Web in the 1990s. The document pays tribute to many of the pioneers and technologies that made the modern Internet possible. Statistics show the Internet has grown from around 100 hosts in the late 1970s to over 200 million hosts by 2002.
In this presentation, we will identify and pay tribute to several of the people who .... been having grand ideas but has never seen them through to completed projects. ... But possess the technical ... the first node on the ARPANET, and the first computer ever on the Internet.
The document provides information about the history and development of the Internet. It discusses how the Internet originated from the ARPANET network developed by the US Department of Defense in the 1960s for research purposes. It then explains how ARPANET evolved into the Internet over time as networking technology advanced and more computers and networks became connected. The document also outlines some of the key events and innovations that led to the widespread adoption of the Internet, such as the introduction of TCP/IP protocols and the creation of the World Wide Web.
Internet its history by pankaj singh chandelPankaj Chandel
The Internet began in 1957 with the creation of ARPANET by the U.S. Department of Defense to compete against the USSR in technology development. In 1969, the first connection was made between computers at UCLA and SRI, marking the beginning of ARPANET which became the first operational packet switching network and the core of what would become the global Internet. Major developments included the introduction of TCP/IP in 1977 allowing for seamless transmission, the creation of the World Wide Web in 1990 at CERN, and the launch of popular websites and apps in the 1990s-2000s which drove widespread commercial and public use of the Internet.
This document provides a timeline of key events in the development of the Internet from 1972 to 2010. Some of the major milestones include: Ray Tomlinson sending the first email in 1972; the establishment of domain naming standards like .com and .edu in 1983; the creation of the World Wide Web in 1990 by Tim Berners-Lee; the launch of popular websites and technologies like Google in 1998, Facebook in 2004, and Firefox in 2004; and the Internet reaching 400 million Facebook users in 2010. The timeline traces the evolution of the Internet from its origins in research networks to the ubiquitous technology it is today.
[GE207] Session01: Introduction to Digital TechnologySukanya Ben
The document provides an introduction and history of digital technology and the internet. It defines digital technology and traces the evolution of computers from early machines to personal computers and mobile devices. It also outlines the history of the internet from its origins as ARPANET to the development of technologies like TCP/IP and the world wide web. Key events and innovations in digital storage, processors, and influential technologies are highlighted in timeline formats.
The document discusses the history and development of the Internet. It began as a US military network called ARPANET in the 1960s and expanded to connect universities. The first email program and domain name system were introduced in the 1970s. Tim Berners-Lee invented the World Wide Web in 1991 using HTTP and browsers. The number of Internet users exploded from the mid-1990s onward, reaching billions by the 2010s. Web 2.0 democratized the Internet through user-generated content and social media. The document outlines future challenges like cloud computing, big data, and the Internet of Things.
The document provides a history of the internet, beginning with its origins in the 1950s with the development of computers. It describes how the US Department of Defense funded research in the 1960s to create a network (ARPANET) that would continue functioning even if parts were damaged, laying the foundation for today's internet. Key developments include the creation of TCP/IP in the 1970s and 1980s to standardize network communication and the introduction of the World Wide Web in the 1990s, which led internet usage to grow tremendously, reaching over 3 billion users globally by 2014.
Each month we select a topic then trawl the trove of Slideshare to find the best presentations and explanation of the topics we advocate for. Sometimes the material will be dry, heavy and other times the presentation might be a pretty dam fine bit of art. This month we have concentrated on Privacy.
COMPED9 Module 2 The Internet and the WebJeremy Flores
The document summarizes the history and development of the Internet and World Wide Web. It describes how JCR Licklider first proposed a global network of computers in the 1960s which led to the development of ARPANET and packet switching. The ARPANET connected four universities in 1969 which is seen as the beginning of the Internet. Key developments included the introduction of email, FTP, and TCP/IP. Tim Berners-Lee later introduced the World Wide Web in 1989 as a way to share information over the Internet using hyperlinks. The graphical web browser Mosaic popularized the Web in the early 1990s. The document distinguishes that the Internet is the global network itself while the Web is an interface that runs on
Chapter 4 data communication fundamentalN. A. Sutisna
This document provides a history of data communication and the development of the Internet. It discusses key events and innovations such as the development of packet switching networks in the 1960s, the creation of ARPANET in 1969, the specification of TCP and IP in 1982 which defined the Internet, and the commercialization of the Internet in the late 1980s. It outlines the rapid growth of the Internet from just a few nodes in 1969 to over 100,000 hosts by 1989.
The document provides an overview of the history and development of the Internet. It discusses how the Internet originated from the ARPANET network created by the U.S. Department of Defense in the 1960s to enable communication between researchers even if parts of the network failed. It describes how standards like TCP/IP were developed in the 1970s and 1980s, allowing the Internet to grow rapidly from around 1,000 hosts in 1984 to over 200 million hosts by 2002. The document also summarizes how the Internet works, including topics like browsers, URLs, domain names, and different ways users can access the Internet through connections like LAN servers, dial-up, or online services.
This presentation, designed for a high school communications skills class, outlines the development of the internet.
Similar to Design and development of a web-based data visualization software for political tendency identification of Twitter’s users using Python Dash Framework (20)
Learn SQL from basic queries to Advance queriesmanishkhaire30
Dive into the world of data analysis with our comprehensive guide on mastering SQL! This presentation offers a practical approach to learning SQL, focusing on real-world applications and hands-on practice. Whether you're a beginner or looking to sharpen your skills, this guide provides the tools you need to extract, analyze, and interpret data effectively.
Key Highlights:
Foundations of SQL: Understand the basics of SQL, including data retrieval, filtering, and aggregation.
Advanced Queries: Learn to craft complex queries to uncover deep insights from your data.
Data Trends and Patterns: Discover how to identify and interpret trends and patterns in your datasets.
Practical Examples: Follow step-by-step examples to apply SQL techniques in real-world scenarios.
Actionable Insights: Gain the skills to derive actionable insights that drive informed decision-making.
Join us on this journey to enhance your data analysis capabilities and unlock the full potential of SQL. Perfect for data enthusiasts, analysts, and anyone eager to harness the power of data!
#DataAnalysis #SQL #LearningSQL #DataInsights #DataScience #Analytics
Global Situational Awareness of A.I. and where its headedvikram sood
You can see the future first in San Francisco.
Over the past year, the talk of the town has shifted from $10 billion compute clusters to $100 billion clusters to trillion-dollar clusters. Every six months another zero is added to the boardroom plans. Behind the scenes, there’s a fierce scramble to secure every power contract still available for the rest of the decade, every voltage transformer that can possibly be procured. American big business is gearing up to pour trillions of dollars into a long-unseen mobilization of American industrial might. By the end of the decade, American electricity production will have grown tens of percent; from the shale fields of Pennsylvania to the solar farms of Nevada, hundreds of millions of GPUs will hum.
The AGI race has begun. We are building machines that can think and reason. By 2025/26, these machines will outpace college graduates. By the end of the decade, they will be smarter than you or I; we will have superintelligence, in the true sense of the word. Along the way, national security forces not seen in half a century will be un-leashed, and before long, The Project will be on. If we’re lucky, we’ll be in an all-out race with the CCP; if we’re unlucky, an all-out war.
Everyone is now talking about AI, but few have the faintest glimmer of what is about to hit them. Nvidia analysts still think 2024 might be close to the peak. Mainstream pundits are stuck on the wilful blindness of “it’s just predicting the next word”. They see only hype and business-as-usual; at most they entertain another internet-scale technological change.
Before long, the world will wake up. But right now, there are perhaps a few hundred people, most of them in San Francisco and the AI labs, that have situational awareness. Through whatever peculiar forces of fate, I have found myself amongst them. A few years ago, these people were derided as crazy—but they trusted the trendlines, which allowed them to correctly predict the AI advances of the past few years. Whether these people are also right about the next few years remains to be seen. But these are very smart people—the smartest people I have ever met—and they are the ones building this technology. Perhaps they will be an odd footnote in history, or perhaps they will go down in history like Szilard and Oppenheimer and Teller. If they are seeing the future even close to correctly, we are in for a wild ride.
Let me tell you what we see.
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataKiwi Creative
Harness the power of AI-backed reports, benchmarking and data analysis to predict trends and detect anomalies in your marketing efforts.
Peter Caputa, CEO at Databox, reveals how you can discover the strategies and tools to increase your growth rate (and margins!).
From metrics to track to data habits to pick up, enhance your reporting for powerful insights to improve your B2B tech company's marketing.
- - -
This is the webinar recording from the June 2024 HubSpot User Group (HUG) for B2B Technology USA.
Watch the video recording at https://youtu.be/5vjwGfPN9lw
Sign up for future HUG events at https://events.hubspot.com/b2b-technology-usa/
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data LakeWalaa Eldin Moustafa
Dynamic policy enforcement is becoming an increasingly important topic in today’s world where data privacy and compliance is a top priority for companies, individuals, and regulators alike. In these slides, we discuss how LinkedIn implements a powerful dynamic policy enforcement engine, called ViewShift, and integrates it within its data lake. We show the query engine architecture and how catalog implementations can automatically route table resolutions to compliance-enforcing SQL views. Such views have a set of very interesting properties: (1) They are auto-generated from declarative data annotations. (2) They respect user-level consent and preferences (3) They are context-aware, encoding a different set of transformations for different use cases (4) They are portable; while the SQL logic is only implemented in one SQL dialect, it is accessible in all engines.
#SQL #Views #Privacy #Compliance #DataLake
ViewShift: Hassle-free Dynamic Policy Enforcement for Every Data Lake
Design and development of a web-based data visualization software for political tendency identification of Twitter’s users using Python Dash Framework
1. Hellenic Mediterranean University
School of Engineering
Department of Electrical and Computer Engineering
“Design and development of a web-based data visualization software for political
tendency identification of Twitter’s users using Python Dash Framework”
M.Sc. Candidate: Alexandros Britzolakis | Student’s ID: mtp186
Committee approvals:
Supervisor: Dr. Nikolaos Papadakis | Examiner A: Dr. Charidimos Kondylakis | Examiner B: Dr. Stylianos Sfakianakis
Postgraduate course “Informatics Engineering”
A thesis presentation for the partial fulfillment of the requirements for the degree of Master of Science
Thesis Subject:
Estavromenos University Campus, 714 10 Heraklion – Crete, Greece
2. Acknowledgements
• To Dr. Haridimos Kondilakis and Dr. Stelios Sfakianakis senior researchers at Computational BioMedicine
Laboratory (C.BM.L.) of Foundation for Research & Technology – Hellas (FO.R.T.H.)
• To the entire team of the C.BM.L for giving me the chance to work with them and be a part of their team.
• To my thesis supervisor Dr. Nikolaos Papadakis, associate professor at the Hellenic Mediterranean University, for
his constant guidance throughout my studies.
• And to my family for their constant affection and support.
3. A brief Introduction
Section Slides
1. A milestone From Abacus to the first Computer 5 – 13
2. Computer Networks: From ARPANET to the Internet 14 – 18
3. The evolution of the Web until nowadays 19 – 30
4. Text mining methodologies over Twitter 31 – 35
5. Lexicon-Based Sentiment Analysis 36 – 45
6. Identifying the issues of Sentiment Lexicons 46 – 48
7. Sentiment Analysis with Machine Learning 49 – 51
8. AthPPA a tool for analyzing political popularity over Twitter 52 – 68
9. AthPPA graph results 69 – 98
10. Conclusion 99 – 101
11. Useful Links 102
12. References 103 - 105
4. A brief Introduction
“We must look beneath every
stone, lest it conceal some
politician ready to sting us”
Aristophanes, 445-386 BC,
Ancient Greek comic playwright
Thesmophoriazusae
5. A milestone: From Abacus to the first computer
• Before we proceed in analyzing programming stuff lets make a small overview on how computing systems
emerged. Humans always strive for progress and this can be seen even from the ancient times were the idea of
computation was on the table.
• Although the history of modern computers begins in mid 40s, there is an older recorded historical background of
computing machines.
• The first recorded computing machine begins with the discovery of Abacus by Sumerians (2500 BCE), the Scytale
(700 BCE) and Antikythera mechanism (150 BCE) as well as Heron's Programmable Robot (10 AD-85 AD),
discovered by the Ancient Greeks.
The Sumerian abacus system, (b) the Scytale (c) and the Antikythera mechanism
Slide: 5 to 105
6. A milestone: From Abacus to the first computer
Emerged computational systems from Bronze age until the end of 19th century
Slide: 6 to 105
7. A milestone: From Abacus to the first computer
Emerged computing systems at the beginning of World War I and at the end of World War II
Slide: 7 to 105
8. A milestone: From Abacus to the first computer
Emerged computing systems from 1947 until 1973 (Post–World War II era)
Slide: 8 to 105
9. A milestone: From Abacus to the first computer
Emerged computing systems from 1974 until 1991 (End of the cold war)
Slide: 9 to 105
10. A milestone: From Abacus to the first computer
(a) the bell 101 modem, (b) FORTRAN documentation, (c) the IBM System/360, (d) C programming language logo, (e)
first personal computer, (f) first laptop, (g) IBM 5100 one of the first IBM PCs
Slide: 10 to 105
11. A milestone: From Abacus to the first computer
Technological advancements in computer science and engineering from 1992 until present
Slide: 11 to 105
12. A milestone: From Abacus to the first computer
(a) Logo of Boston Dynamics, (b) Apple newton, (c) first DVD player, (d) Universal Serial Bus, (e) Diamond Rio MP3
player (f) Napster logo, (g)Wikipedia logo, (h) Trek ThumbDrive one of the first USBs, (i) Google logo in 1998
Slide: 12 to 105
13. A milestone: From Abacus to the first computer
Social media platforms and their additional logos organized by the date of their foundation
Slide: 13 to 105
14. Computer Networks: From ARPANET to the Internet
• 1965: At MIT Lincoln Lab, two computers successfully communicate with each other using packet-switching
technology.
• 1968: Beranek and Newman, Inc. (BBN) unveils the final version of the Interface Message Processor (IMP)
specifications. BBN wins ARPANET contract.
• 1969: On Oct. 29, UCLA’s Network Measurement Center, Stanford Research Institute (SRI), University of California-
Santa Barbara and University of Utah install nodes. The first message is "LO," which was an attempt by student
Charles Kline to "LOGIN" to the SRI computer from the university. However, the message was unable to be
completed because the SRI system crashed.
• 1972: BBN’s Ray Tomlinson introduces network email. The Internetworking Working Group (INWG) forms to
address need for establishing standard protocols.
• 1973: Global networking becomes a reality as the University College of London (England) and Royal Radar
Establishment (Norway) connect to ARPANET.
Slide: 14 to 105
15. Computer Networks: From ARPANET to the Internet
An Interface Message Processor – IMP and its control panel
Slide: 15 to 105
16. Computer Networks: From ARPANET to the Internet
The expansion of ARPANET across the United States in 1970
Slide: 16 to 105
17. Computer Networks: From ARPANET to the Internet
• 1974: A commercial version of ARPANET was introduced namely as “Telenet” which can be considered as a the
first effort for public commercial Internet Service Provider – ISP.
• 1974: Peter T. Kirstein, Vint Cerf and Bob Kahn publish "A Protocol for Packet Network Interconnection," which
details the design of TCP.
• 1979: USENET forms to host news and discussion groups.
• 1981: The National Science Foundation (NSF) provided a grant to establish the Computer Science Network
(CSNET) to provide networking services to university computer scientists.
• 1982: TCP/IP was introduced after TCP (Transmission Control Protocol) and IP (Internet Protocol (IP) were joined
together in order to form a protocol suite for the ARPANET. This improvement led to the definition of what is
known nowadays as Internet as it still remains the active communication protocol suite.
Slide: 17 to 105
18. Computer Networks: From ARPANET to the Internet
• 1983: The Domain Name System (DNS) establishes the familiar .edu, .gov, .com, .mil, .org, .net, and .int system for
naming websites. This is easier to remember than the previous designation for websites, such as 123.456.789.10.
• 1985: Symbolics.com, the website for Symbolics Computer Corp. in Massachusetts, becomes the first registered
domain.
• 1986: The National Science Foundation’s NSFNET goes online to connected supercomputer centers at 56,000 bits
per second — the speed of a typical dial-up computer modem. Over time the network speeds up and regional
research and education networks, supported in part by NSF, are connected to the NSFNET backbone — effectively
expanding the Internet throughout the United States. The NSFNET was essentially a network of networks that
connected academic users along with the ARPANET.
• 1987: More than 20.000 hosts around the globe have joined the NSFNET. Meanwhile Cisco introduces its first
router.
• 1989: World.std.com becomes the first commercial provider of dial-up access to the NSFNET.
Slide: 18 to 105
19. The evolution of the Web until nowadays
• 1990: Tim Berners-Lee, a scientist at CERN, the European Organization for Nuclear Research, develops HyperText
Markup Language (HTML). This technology continues to have a large impact on how we navigate and view the
Internet nowadays.
• 1991: CERN introduces the World Wide Web to the public.
• 1992: The first audio and video are distributed over the Internet. The phrase "surfing the Internet" is popularized.
• 1993: The number of websites reaches 600 and the White House and United Nations go online. Marc Andreesen
develops the Mosaic Web browser at the University of Illinois, Champaign-Urbana. The number of computers
connected to NSFNET grows from 2,000 in 1985 to more than 2 million in 1993. The National Science Foundation
leads an effort to outline a new Internet architecture that would support the burgeoning commercial use of the
network.
• 1994: Netscape Communications is born. Microsoft creates a Web browser for Windows 95.
Slide: 19 to 105
20. The evolution of the Web until nowadays
The “Vision” of Tim Berners-Lee for the Semantic Web
Slide: 20 to 105
21. The evolution of the Web until nowadays
• 1998: Google search engine was introduced which shaped a new easy way of searching things around the
Internet.
• 1998: The Internet Protocol version 6 introduced, to allow for future growth of Internet Addresses. The current
most widely used protocol is version 4. IPv4 uses 32-bit addresses allowing for 4.3 billion unique addresses; IPv6,
with 128-bit addresses, will allow 3.4 x 1038 unique addresses, or 340 trillion trillion trillion.
• 1999: AOL (a US web portal and online service provider based in New York City) buys Netscape. Peer-to-peer file
sharing becomes a reality as Napster arrives on the Internet, much to the displeasure of the music industry.
• 2000: The dot-com bubble bursts. Web sites such as Yahoo! and eBay are hit by a large-scale denial of service
attack, highlighting the vulnerability of the Internet. AOL merges with Time Warner
• 2001: A federal judge shuts down Napster, ruling that it must find a way to stop users from sharing copyrighted
material before it can go back online.
Slide: 21 to 105
22. The evolution of the Web until nowadays
• 2003: The SQL Slammer worm spread worldwide in just 10 minutes. Myspace, Skype and the Safari Web browser
debut. The blog publishing platform WordPress is launched.
• 2004: Facebook goes online and the era of social networking begins. Mozilla unveils the Mozilla Firefox browser.
• 2005: YouTube a video blogging platform as well as Reddit social news site were launched.
• 2006: AOL (a US web portal and online service provider based in New York City) changes its growth strategy, by
offering many of its services free of charge, focusing on ads in order to generated revenue and for the very first
time the Internet Governance Forum meets up. Meanwhile, Twitter micro-blogging platform was launched by
Jack Dorsey CEO of the company who sends out the first Tweet: “just setting up my twttr.“ .
• 2009: The Internet marks its 40th anniversary.
Slide: 22 to 105
23. The evolution of the Web until nowadays
Web 1.0 and Web 2.0 architectures
Slide: 23 to 105
24. The evolution of the Web until nowadays
First version of Blogger platformFirst version of Twitter platform
Slide: 24 to 105
25. The evolution of the Web until nowadays
First version of YouTube platformFirst version of Facebook platform
Slide: 25 to 105
26. The evolution of the Web until nowadays
Number of users using social media platforms from 2004 until present
(source: Statista and The Next Web and ourworldindata.org)
Slide: 26 to 105
27. The evolution of the Web until nowadays
• 2010: Facebook reaches 400 million active users.
• 2010: The social media sites Pinterest and Instagram are launched.
• 2011: Social networking platforms such as Twitter and Facebook play a prominent role on the “Arab Spring”.
• 2012: The Obama administration begins an effort to ban online piracy by issuing an appropriate act with its
purpose to protect the intellectual property and to stop illegal activities that were against the federal laws of the
U.S.A. (copyrights violations etc). This was an important act as many companies such as Google or social media
platforms such as YouTube depend on the “fair use” of the Internet.
• 2013: Edward Snowden, a former CIA employee and contractor with the National Security Agency (NSA), reports
that NSA has a surveillance system in operation that can hack into the interactions of thousands of individuals,
including U.S. civilians.
Slide: 27 to 105
28. The evolution of the Web until nowadays
• 2013: According to a Pew Research Center poll, 51% of U.S. people disclose e-banking.
• 2015: Instagram, the picture media platform, crosses 400 million subscribers, outpacing Facebook, which by the
middle of the same year will move on to hit 316 million people.
• 2016: Google introduces Google Assistant, a voice-activated personal assistant system, which signals the Internet
giant's foray into the automated "smart" assistant industry. Google joins Amazon's Alexa, Apple's Siri and
Microsoft's Cortana.
Slide: 28 to 105
29. The evolution of the Web until nowadays
Use of social media platforms by age group in the US in 2019
(Source: Pew Research Center 2019)
Slide: 29 to 105
30. The evolution of the Web until nowadays
Social networking
platforms
Structure Description
Facebook Friends, Groups, Pages
A chat-based platform where users can create their friend list by adding other
users. Chatting can be publicly through a post/comment or privately through
message.
Twitter Followers
A micro-blogging platform where its users can follow other users and can be
informed about the latest news using hashtag trends or post a tweet which
can be public or private (depending on user settings). It is a platform which is
used frequently by politicians.
LinkedIn Connections
A job seeking platform where its users can apply for jobs add other users. It is
used mostly by unemployed individuals and recruiters. A user can add a post
which is shared publicly or send a message to one or many of his/her
connections.
Hi5 Friends
A chat-based platform where its users can create friend lists. It is similar with
Facebook service.
Slide: 30 to 105
31. Text mining methodologies over Twitter
• Twitter is a micro-blogging platform and social networking service where its users can submit their opinion
publicly in a form of a limited text or video known as "tweets".
• Its registered users can interact to the tweets of other users with likes and retweets.
Beyond likes and re-tweets there are also other forms of engagement which Twitter's
users can use such as replies, mentions and favorites.
• This form of interaction makes Twitter an interesting platform for online marketers as
this form of engagement can "tell" if a user would likely buy their commercial products
due the reason that it incorporates one-to-one conversations as well as promotion to
their circles or influence.
• The table of the next slide describes the engagements that can be used by Twitter's users
Twitter’s official logo
Slide: 31 to 105
32. Text mining methodologies over Twitter
Engagement
Icon
Engagement
type
Description
Reply
Users can reply to posted tweets by other users and say their opinion about the context of the tweet. It is a feature
which is used frequently. Online marketers rely a lot to this feature as from the replies we can extract the opinion of a
user for a commercial product.
Retweet
Users can share a posted Tweet by re-sharing it to their circle of followers in their accounts. Companies rely a lot in
this feature as retweets enable the free promotion of their commercial product as it reaches to other users.
Mentions
Users can mention the account name of another user. In politics, voters who use twitter tend to post a tweet and
mention the politician that relates with it. For example, “@RealDonaldTrump Nice campaign in back in 2016” or
“@RealDonaldTrump Is the wall ready?”.
Likes
Users can use this feature when they agree or like the context of the tweet. It is a useful as from it we can detect if
the users like a politician or not. The same thing goes for commercial products as well.
Hashtags
It is a feature that can be used by users that marks the tweet as a public one. Users can use a hashtag regarding to
the context that the tweet contains. For example, “Donald Trump has successfully built the wall in order to protect
our country #Trump2016 #OurPresident” or “Donald Trump lied about the wall #TrumpDictator #NotMyPresident”.
Hashtags may indicate negative or positive opinion for a political leader and it is used by data analysts who analyze
the political landscape of a country.
Slide: 32 to 105
33. An abstract concept of how Twitter’s API data exchange procedure occurs
Text mining methodologies over Twitter
Slide: 33 to 105
34. Text mining methodologies over Twitter
Key Type Description
Consumer Key
It is the API key which associates with the Twitter micro-blogging platform and identifies uniquely
the client. With the term client we refer to a user which uses a script (Python or Java) or a web
service in order to access Twitter's resources.
Consumer Secret It is the unique ID of the client which is used for the authentication process of the authorized
Twitter’s server.
Access Token
It is issued once the client is successfully authenticated (using the consumer key and secret) with
the Twitter's server. Typically, it defines the set of privileges that the client has and the amount of
data that can be collected from Twitter’s data center.
Access Token Secret It is the ID of the issued access token.
Slide: 34 to 105
36. • With the term sentiment analysis (or opinion mining) we refer to the automatic extraction process of subjective
information from a text (unstructured data) using natural language processing techniques.
• The extracted information might be the opinions as well as the sentiment of a user regarding a subject through
text analysis. Sentiment analysis has been an active research area on the field of natural language processing
since the early 2000s [33]. Pozzi et al. [32] mention that there has been unsurprising confusion among
researchers of the field by debating whether if the field should be called sentiment analysis or opinion mining and
the difference between those two terms.
• Merriam-Webster's Collegiate Dictionary defines the word sentiment as a thought, attitude or judgment
motivated by the feelings of a crowd while the word opinion is defined as the judgement, view or appraisal that
someone has over a subject.
• The latter word is more egocentric as it defines the thoughts of one individual while the other one defines the
thoughts of a group of individuals or a crowd. Let’s compare to phrases the phrase is “The current political
situation concerns me” and the phrase “I think that politics is not going well”, the first phrase expresses a
sentiment while the second one expresses an opinion.
Lexicon-Based Sentiment Analysis
Slide: 36 to 105
37. Lexicon-Based Sentiment Analysis
• When analyzing a text to extract sentiment analysis, we
need to make a distinction between subjective and
objective sentences. If we have a sentence that has been
classified as objective then there are no other required
fundamental tasks.
• Ιf a sentence has been classified as subjective then we
have to calculate its polarity meaning if it implies positive,
negative or neutral sentiment. Figure at right depicts a
visualized example of this distinction based on Pozzi et al.
[32] example.
Distinction between subjectivity
and polarity classifications
Slide: 37 to 105
38. Lexicon-Based Sentiment Analysis
• Subjectivity classification is the process that distinguishes sentences that have objective information (objective
sentences) with sentences which express subjective views and opinions (subjective sentences).
• To understand this let's give an example with two sentences the first one is “Nokia 7 Plus is a smartphone” and the
second is “Nokia 7 Plus is awesome”.
• The first sentence indicates a fact or something which exists while the second sentence indicates something
positive, in the case of our second sentence indicates something positive for a particular smartphone. The first
sentence can be classified as an objective one while the second sentence can be classified as a subjective one.
• As it is mentioned above, the purpose of sentiment analysis is “the definition of automatic tools which can be
capable of extracting subjective information from texts in natural language”.
• When we want to apply sentiment analysis our first task is to define what textual information will be analyzed.
Sentiment analysis which is applied in social networks can be separated into three layers as figure of the next slide
depicts.
Slide: 38 to 105
40. Lexicon-Based Sentiment Analysis
Level Description
Message or
document
The purpose at this level is the polarity classification of the entire message that indicates an opinion. For
example, let's suppose that we have a tweet text from an individual, the system will determine if the entire
text of the tweet expresses a positive or a negative opinion about something e.g. a politician, an event, a
commercial product etc. Therefore, the whole message indicates one opinion over a single entity. Liu et al.
[32] suggest that this level is not applicable for documents which evaluate or compare multiple entities.
Sentence
At this level it is determined the polarity of each sentence that a text message contains. Therefore, each
sentence depicts a message which is classified accordingly on the entire text of the document.
Entity and
aspect
A fine-grained analysis is performed comparing to the previous levels. At this level the opinion is consisted by
a sentiment and a target (of option). For example, the sentence “Samsung phones are good but do not have
efficient battery” indicates to opinions one that Samsung phones are good which is positive and a negative
which “do not have efficient battery”.
Slide: 40 to 105
41. Lexicon-Based Sentiment Analysis
• Opinions are classified as positive, neutral or negative but there are cases where an opinion has different meaning
from what it seems to have in the first place. There are two category groups of sentences the explicit and the
implicit ones.
• An explicit opinion can be separated into two different types the direct opinion which refers directly into a single
entity and the indirect opinion which refers indirectly onto a single entity based on its impact on other entities. For
example the sentence:
“After I switched to iPhone, I had problems with battery performance”
• The sentence implies that there is battery inefficiency over a product that the user switched with.
• The explicit comparative opinion expresses a relation of similarities/differences between two or more entities
and/or a preference of the opinion holder is based on some shared aspects of the entities [38]. For example, the
sentences bellow are examples of two comparative opinions:
“Linux is better OS than Windows” and “Linux is the best OS”
Slide: 41 to 105
42. Lexicon-Based Sentiment Analysis
• The second category which is the implicit opinion defines an expressed opinion over a desirable or undesirable fact.
Lets see two examples of these sentences:
Sentence 2: “Better Call Saul new season series starts tomorrow at 20:00 on Netflix I can’t wait until it starts”
• The 2nd sentence implies a good expectation for "Better Call Saul" TV series but it is not explicated with words while
the 1st sentence doesn’t appear to have a defined sentiment indicator. Some people might find violence as a good
characteristic for the entire plot of a series while others might not.
• It is obvious that from the two given types of opinions the implicit ones are easier in order to detect the underlying
sentiment of an entire sentence.
Sentence 1: “Breaking bad series was more violent than Better call Saul.”
Slide: 42 to 105
43. Lexicon-Based Sentiment Analysis
• To extract user expressions through social media it is important to analyze the semantics of the language. Therefore,
the content of the textual expression is evident to achieve successful sentiment analysis and thus discover the
underlying sentiment of the text.
• If we analyze ourselves a sentence, we might lead to an assumption that it is positive or a negative one based on our
own perspective whereas from the perspective of sentiment analysis this might be the completely opposite. Let's
see two examples of sentences:
I just watched the most terrifying horror movie. It was really spooky! GEETTT PANNIIICCCC"
• Initially would be interpreted as a negative sentence but taking into consideration the sense in which these views
are articulated (i.e. community of horror film fans) as well as other lexical views characteristic of the vocabulary of
the social network, we can draw a (actual) optimistic assessment.
• Lexica, corpora and ontology need to be correctly developed and then used to grasp the meaning of natural
language in social networks.
Slide: 43 to 105
44. Lexicon-Based Sentiment Analysis
• Dealing with figures of speech is also an important aspect in order to perform accurate sentiment analysis. For the
term figure of speech, we refer to an artful etymology of the normal style of expression or writing.
• According to Aristotle's tradition, figures of speech can be separated into two parts the schemes and the tropes.
Schemes and tropes have the role of doing a sort of transference; schemes are defined by a transition in time,
whereas tropes are defined by a transference of context.
• For starters, irony and sarcasm, which are grouped under the tropes section, are the most controversial figures of
speech in natural language processing.
• Although irony is sometimes used to illustrate events that deviate from the intended, such as twists of nature,
sarcasm is often used to express subtle critique with a particular entity/person as its goal [32].
• To get a better understanding of this let’s see some examples of ironic and sarcastic sentences.
Slide: 44 to 105
45. Lexicon-Based Sentiment Analysis
Sarcasm: (Note: Nick dislikes Kate's set of squirrel dolls)
- Nick: What a huge collection of squirrel dolls. Remind me how old are you yet?
- Kate: Ha! Who cares?
Irony: (Note: Nick and Kate watch a movie although it wasn't as good as they expected)
- Nick: Thank God it has finished! What an investment of our times.
- Kate: Couldn't agree more on that.
• On the irony example there was no sarcasm due to the reason that Nick doesn't had intensions to hurt Kate with his
sayings. Nick was ironic to express his disappointment for the movie and for the time that he spends watching it. In
the example of sarcasm Nick used sarcasm to show to Kate that he disliked the set of squirrel dolls that she has [32].
Slide: 45 to 105
46. Identifying the issues of sentiment lexicons
• On the previous section we referred some basic aspects of sentiment analysis, the types of opinions and the issues
that exists for the creation of automated sentiment analysis mechanisms [42]. Although there are other crucial
aspects that automated sentiment analysis mechanisms use in order to identify the polarity of each word.
• Perhaps one of the most usual forms of research on the field of sentiment analysis is the creation of resource
dictionaries which include terms, expressed opinions, words with polarity (marked as positive, negative or neutral)
emotion expression [43].
• The last method is identical for political sentiment analysis due to the reason that an automated sentiment analysis
mechanism will advise a resource dictionary in order to identify if the words contained on the text are positive,
negative or neutral based on the indication [33].
• Each word contained on a lexicon has a rating which indicates how positive or negative is. The negative type can be
indicated from -2 (very negative) to 2 (very positive).
• Based on that there are a variety of different types of sentiment analysis as it is depicted on the table of the next
slide.
Slide: 46 to 105
47. Identifying the issues of sentiment lexicons
Type of
sentiment analysis
Description
Standard This type is used regularly as it classifies the emotional tone of the overall expression
as positive (1), negative (0), or neutral (-1).
Fine-grained This type uses a scale from very positive (-3) to very negative (3) and thus it
provides a more fine-grained polarity of the overall sentiment.
Emotion detection This type identifies specific emotions such as happiness, anger, frustration, sadness
etc.
Aspect-based
This type of sentiment analysis classifies the text according to its related topic. It
identifies if the content of a text and guesses its expressed topic. For example, the
sentence “I like Olympic Games and specifically Pentathlon” it is related to the sports
topic.
Intent detection This type attempts to identify the actions of the person who wrote a text. It is best
to be used for real-time sentiment analysis.
Slide: 47 to 105
48. Sentiment Analysis with Machine Learning
• The term Machine Learning reflects the capability of a system to acquire and integrate knowledge by using an
automatic way. To classify the emotion underlying from a text several techniques can be applied. There are three
machine learning categories the supervised, unsupervised and semi-supervised.
• Supervised learning is the process of learning a function that maps an input to an output based on the example of
input-output pairs [44].
• A machine learning system which uses supervised techniques when it begins the learning process it will require to
have an input namely as training data in order for the classifier to recognize and obtain knowledge for more
representative differences between texts belonging to different categories [42].
• Basically, in supervised machine learning we have an input variable X then this input is transferred to an algorithm in
order to obtain knowledge (learn the mapping function from the input) and thus produce an output variable Y. This
can be expressed by the following equation.
( )y f x=
Slide: 48 to 105
49. Sentiment Analysis with Machine Learning
• Supervised learning has two types of problems the regression and classification. The classification problem is when
an output variable is a category e.g. “green” and “yellow” or “decision” and “no decision” while the regression
problem is when the output has a real value e.g. “euros” or “weight”.
• Several specific categories of issues developed on top of classification and regression require estimation of the
recommendation and time series, respectively.
• Some profound examples of supervised machine learning algorithms are the Linear Regression, the Random Forest,
Support Vector Machines – SVMs, Naïve Bayes – NB as well as Maximum Entropy [45].
• Unsupervised learning systems are driven only by arriving inputs (data) in order to uncover potential secret
structures among them with main purpose to classify them into groups that reflect a certain similarity. In more
simple terms, in unsupervised learning we have input data x but non any produced output.
• The concept idea in unsupervised learning is to get the data and model their underlying structure in order to learn
as much as possible information’s about them [42].
Slide: 49 to 105
50. Sentiment Analysis with Machine Learning
• As its name implies (“unsupervised”) these systems do not provide any guarantee for correct answers and there are
none training models. Algorithms decide themselves to present the interesting structure within the data.
• An unsupervised system is in the position to know the existence of numbers and properties of groups and the need
for a training set for extraction of feature vectors is not a prerequisite.
• In addition to that, unsupervised systems use pre-built emotion dictionaries for various and thus the various terms
contained in a text are labeled by the overall polarity arises [42]. There are two types of unsupervised learning
problems [45].
• The first type is namely as clustering occurs when we want to discover the inherent groupings in the data such as
grouping customers by purchasing behavior and the second type namely as association rule occurs when we want
to discover rules that describe a large amount of our data, for example when people are buying x will likely have the
tendency to buy y item as well.
• Some profound examples of unsupervised algorithms are k-means used in clustering problems and Apriori
algorithms for association rule learning problems. In semi-supervised learning the system tries to learn with limited
text data which are already labeled with their corresponding emotion content.
Slide: 50 to 105
51. Sentiment Analysis with Machine Learning
• Spatiotis et al. [42] mention that this method is time consuming as it needs the creation of a dictionary but it
considered as the most secure [45].
• A variety of machine learning problems exist on this area as it is time consuming to label data as it requires domain
experts as well as time consuming or expensive.
• Generally semi-supervised leaning algorithms are characterized by a large amount of data X and a small portion Y
labeled data. The table below summarizes the above-mentioned types of machine learning algorithm categories.
Types of
machine learning
Description
Supervised Every data is labeled and the algorithm tries to learn in order to predict
an output from the given data
Unsupervised Every data is unlabeled and the algorithm tries to learn to inherent
structure from the input data
Semi-Supervised A small portion of data is labeled while most of them are unlabeled and
a combination of supervised and unsupervised techniques can be used
Slide: 51 to 105
52. AthPPA a tool for analyzing political popularity over Twitter
• AthPPA (which stands for Athena Political Popularity Analysis) is a tool for identifying the popularity among different
political leaders as well as of the latest trends that occur in social media and especially in Twitter.
• The scope of AthPPA is the design of a web application capable of presenting Twitter statistics for the top three well
known political leaders in Greece. With that said we will choose structured and not structured data as both of them
indicate the sentiment of the user.
• For the purposes of our analysis three Twitter accounts have been identified which are @PrimeMinisterGR,
@kmitsotakis, @atsipras and @FofiGennimata as well as the respective official political party accounts in Twitter
which these politicians represent.
• The tweet sample is by default 200 and we analyze the number of likes, re-tweets and characters per posted tweet
as well as the total number of subscribers from the above-mentioned Twitter accounts.
Slide: 52 to 105
53. • It also identifies negative hashtags for these political parties which are currently trends on Twitter by its users. We
compare those structured data and present them into graphs using Python Dash framework.
• Furthermore, spaCy module is used for identifying Greek language and the open source sentiment analyzer of NLP
Buddy. Sentiment analyzer of NLP buddy reaches to 90% accuracy for Greek language and does text subjectivity
analysis.
• On the next section we will analyze the methodology that was followed for the creation of this application as well as
its overall structure presenting its file structure, and source code.
• There is also a GitHub repository with the source code as well as a website version of this web application. Both
links are provided on the Links section at the final slides of this presentation.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 53 to 105
54. • For the creation of this application Python 3.8 was used in accordance with Tweepy a Python module which allows
the application to communicate with Twitter and fetch data from the platform. Furthermore, Python Dash is used
which is a Python web framework with many capabilities and features. Figure 3.1 depicts the file structure of AthPPA
web application.
Political sentiment lexicon in
Greek and spaCy module
File structure of AthPPA
AthPPA a tool for analyzing political popularity over Twitter
Slide: 54 to 105
55. Python Class Description
TwitterCredentials.py It contains the four-key set of the created Twitter’s API
TwitterAuthenticator.py It authenticates the application using the four key set and to do this it imports the twitter
credentials.
TwitterClient.py
It defines what the application will “ask” from Twitter’s Servers and it imports
TwitterAuthenticator in order to authenticate the application against the platform
TwitterListener.py
It is a basic interface which “listens” towards the communication socket. If data are
streamed properly, if data error occurred during the streaming or if TCP handshake error
occurred and thus the overall connection failed.
TweetAnalyzer.py It adds all structured data into a data frame in order to be used by the input class
TwitterStreamer.py It does basic streaming of tweets using TwitterAuthenticator and TwitterListener.
Inputs.py It contains the basic data which graphs use in main class of the program
Sentiment.py
This class searches for tweets of a twitter account where it calculates and prints the
overall sentiment per posted tweet. To achieve this, it uses a Greek sentiment lexicon
designed especially for political sentiment analysis as well as spacy-nightly module and its
additional el_core_news_md corpus.
Main.py
This class contains accepts the data from input class and contains the basic structure of
the webpage. Typically, it is the place where the graphs are called and visualized.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 55 to 105
56. • The Python classes TwitterAuthenticator.py, TwitterClient.py, TwitterCredentials.py, TwitterListener,py and
TwitterStreamer.py have the responsibility of guarantying the data exchange between AthPPA web application and
Twitter’s Servers.
• The Python classes Inputs,py and TweetAnalyzer.py have the responsibility of gathering the tweets and process
them in order to extract useful structured information. The structured data taken from Twitter are the total number
of likes, retweets and text length per posted tweet as well as the total number of subscribers per account for a set
sample of 200 tweets.
• Furthermore, negative hashtag counter is also a functionality which these classes perform. We have identified three
negative hashtags for New Democracy and SYRIZA parties as well as one for the KINAL party. The Sentiment.py
Python class it gathers a set sample of 200 tweets from the Twitter accounts of the most well-known Greek political
party leaders and from these ones it extracts only the textual information of each tweet.
• Speaking of textual information, we refer to the entire text of each tweet which means that this tweet might contain
special characters such as emojis and other unnecessary elements such as exclamation marks. For this reason,
AthPPA implements a text format parsing of each obtained tweet using regex where it removes all these unwanted
characters.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 56 to 105
57. • The reason for doing this is that emojis and special characters such exclamation marks and punctuations do not
indicate a value for sentiment analysis and SpaCy module will simply do not process the entire text of the obtained
tweet properly in order to extract its overall sentiment.
• When the tweet is processed and formatted properly it is stored on a data frame in order to be analyzed by SpaCy
module and the implemented sentiment analyzer. Note that the implementation for sentiment analyzer was
inspired by an additional open source project namely as NLPBuddy which uses Greek SpaCy module.
• A few changes have been made through the source code of this sentiment analyzer in order to analyze the set of
Tweets stored on the data array.
• A loop checks all the elements of the data frame and each sentence is processed by the sentiment analyzer in order
to extract a proper value.
• The Table of the next slide depicts the sentiment values that the sentiment analyzer assigns on tweets based on
their mood expression.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 57 to 105
58. Sentiment Value Emotion Score Type
3 Happiness
2 Surprise
1 Sadness
0 Neutral
-1 Fear
-2 Disgust
-3 Anger
Sentiment values used for the labelling process by
the sentiment analyzer used in AthPPA
AthPPA a tool for analyzing political popularity over Twitter
Slide: 58 to 105
59. • The sentiment analyzer analyzes the sentence and assigns on it the proper emotion (as presented on the table 3.2)
based on the indicative expression of the sentence.
• Neutral value has been added to the sentiment analyzer and as neutral it counts every sentence which has not any
indicative expressed opinion. The following example presents a sentence in Greek language with the produced
output from the sentiment analyzer.
Sentence:
Έχω μείνει έκπληκτος! Πώς γίνεται αυτό; Η έκπληξη είναι τόσο μεγάλη! Α, τώρα εξηγούνται όλα.
Produced Output:
Subjectivity: 16.6%
Main emotion: Surprise.
Emotion score: 33.3%
AthPPA a tool for analyzing political popularity over Twitter
Slide: 59 to 105
60. • To achieve this, it advises a lexicon which contains a set of labelled words. SpaCy is used for processing the text
using a variety of NLP techniques.
• The following figure presents the structure by which SpaCy module works where a text is given as input and it
produces a doc object. This doc object can be used in order to apply several Natural Language Processing
techniques that SpaCy provides.
How SpaCy module produces a Natural Language Processing linguistic object
AthPPA a tool for analyzing political popularity over Twitter
Slide: 60 to 105
61. • Specifically, when a text is given to spaCy it tokenizes the text in order to produce a Natural Language Processing
linguistic object namely as doc.
• Afterwards, the doc can be processed with several steps which are called as pipeline processing.
• This pipeline processing includes a tagger, parser and an entity recognizer.
• Each pipeline component returns the processed result from the doc which can also be forwarded onto the next
components.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 61 to 105
62. AthPPA a tool for analyzing political popularity over Twitter
• SpaCy is comprised by two main data structures which are the Doc and the Vocab. The Doc object is constructed by
a tokenizer. It can be modified by a pipeline and owns tokens and annotations.
• The vocab object owns lookup tables for information common to all documents. With this way we have centralized
strings, lexical attributes and word vectors.
• This eliminates the need for storing multiple copies of this data which saves memory and ensures that there is a
single source of truth.
• Text annotations have been designed in order to allow a single source of truth where the Doc object owns the data
and Span and Token are views that point into it.
• The doc object is designed by the tokenizer and can be modified with the use of the pipeline components.
• The Language object defines the coordination of these components where a raw text is given as input through the
pipeline and the produced output is an annotated document. It also sets the procedures of training and serialization
[53].
Slide: 62 to 105
63. Architecture of spaCy module
AthPPA a tool for analyzing political popularity over Twitter
Slide: 63 to 105
64. • The following code examples present the basic Natural Language Processing capabilities of spaCy module. Beginning
with the Linguistic annotations which provide insights into a text and present its grammatical structure. This process
includes the identification of parts of speech and related words within the text.
import spacy
nlp = spacy.load("en_core_web_sm")
doc = nlp("Apple is looking at buying U.K. startup for $1 billion")
for token in doc:
print(token.text, token.pos_, token.dep_)
Apple PROPN nsubj
is AUX aux
looking VERB ROOT
at ADP prep
buying VERB pcomp
U.K. PROPN compound
startup NOUN dobj
for ADP prep
$ SYM quantmod
1 NUM compound
billion NUM pobj
Example
Result
Identifying linguistic annotations with spaCy module
AthPPA a tool for analyzing political popularity over Twitter
Slide: 64 to 105
65. • The first step when a text is being processed by spaCy is to tokenize it. With the term tokenization we refer to the
process word segmentation from the sentence. In more simple terms words are identified from the text and are
separated independently.
Tokenizing a text into words using spaCy module
import spacy
nlp = spacy.load("en_core_web_sm")
doc = nlp("Apple is looking at buying U.K. startup for $1 billion")
for token in doc:
print(token.text, token.pos_, token.dep_)
Apple
is
looking
at
buying
U.K.
startup
for
$
1
billion
Example
Result
AthPPA a tool for analyzing political popularity over Twitter
Slide: 65 to 105
66. • The text processing begins from left to the right and on every substring, it performs two checks. The first check is if
the substring matches to the exception rule of the tokenizer. Let’s imagine that we have the following sentence and
we want to provide as input to the Doc.
Sentence: “I don’t want to move on the U.K.”
• Highlight two words “don’t” and “U.K.”. On the first highlight word there is not any whitespace although it is not one
word and can be separated into two words the “do” and “n’t”.
• On the second highlight word there is no need to be separated into two words and should remain one single word.
These words are expressed as tokens.
• The second check is the removal od prefixes and suffixes for e.g. commas, hyphens, quotes or periods. There are
other functionalities that spaCy provides such as named entity detection but for the sake of this thesis we will skip
them.
AthPPA a tool for analyzing political popularity over Twitter
Slide: 66 to 105
67. AthPPA a tool for analyzing political popularity over Twitter
• In AthPPA’s sentiment script identifier, regex expressions have been deployed in order to remove punctuations,
hashtags, emoticons or any other non-processing character from the obtained tweets.
• Once the text is processed the analyzer advises a Greek lexicon designed specifically for political sentiment analysis
in order to locate words within the text [54].
• The overall sentiment is extracted from the average of sentiment words. For example, if there is a sentence with 5
words where the 3 of them indicate Anger then the overall sentiment of the sentence will be anger by a 70%.
Emotion lexicons and word matching are a well-known form of sources for creating knowledge when it comes for
sentiment analysis over social networking.
• While their efficiency has been analyzed and proven on the field of sentiment analysis, and a plethora of them has
been created for the English language there is not availability for the Greek language.
• In addition to that, Spatiotis et al. [42] suggest that the most common way of research approach for sentiment
analysis is by implementing resource dictionaries which include a set of terms as well as expressed opinions which
indicate positive or negative sentiment commonly known as opinion words.
Slide: 67 to 105
68. AthPPA a tool for analyzing political popularity over Twitter
Slide: 68 to 105
• Tsakalidis et al. [52] in their study are trying to resolve this issue by implementing a vast collection of such tools,
spanning from manually annotated lexicon to semi-supervised word matching vectors and annotated data sets for
various activities.
• It is worth to mention that all of them can be accessed publicly. Tsakalidis et al. [54] mention that experiments have
been performed using a variety of algorithms and parameters on their implemented resources to present promising
results over standard baselines.
• Specifically, Tsakalidis et al. [54] achieved a 24.9% relative improvement in F-score on the cross-domain sentiment
analysis task when training the same algorithms with their resources, compared to training them on more
traditional feature sources, such as n-grams.
• Furthermore, Tsakalidis et al. [54] mention that their resources were built with the primary focus on the cross-
domain sentiment analysis task but they also show promising results in related tasks, such as emotion analysis and
sarcasm detection.
69. AthPPA graphs results
• The last legislative elections in Greece was held on 7th of July 2019 where the New Democracy Centre-right
conservative party of Kyriakos Mitsotakis won with 158 from the overall 300 seats of the Greek parliament leading
to an outright majority. Figure 3.4 depicts the election results from the last Greek legislative elections
Results of the last Greek legislative election, showing the vote strength
of the party winning a plurality in each electoral district.
Slide: 69 to 105
70. • The figure of the previous slide presents the Greek parliament seat ratio among the different political parties.
SYRIZA (Coalition of the Radical Left) won 86 seats making it the official opposition party in Greece, Movement for
Change (or KINAL) which is a Centre-democratic socialist party won 22 seats and afterwards follow the rest political
parties which have a small share of seats in the Greek parliament (34 seats in total).
• For the purposes of this research the three most prominent political parties and their representing leaders have
been included.
• These ones are the current Greek Prime Minister Kyriakos Mitsotakis which is leading the New Democracy party,
afterwards is Alexis Tsipras which is leading the Coalition for the Radical Left making it the official opposition party
and finally Fofi Gennimata which is leading the Movement of Change party.
• The table of the next slide depicts the identified Twitter accounts which the data are obtained.
AthPPA graphs results
Slide: 70 to 105
71. Twitter Account Type Person/Entity Representation
@PrimeministerGR Politician Kyriakos Mitsotakis ND (Majority)
@kmitsotakis Politician Kyriakos Mitsotakis ND (Majority)
@neademokratia Political Party New Democracy ND (Majority)
@atsipras Politician Alexis Tsipras SYRIZA (2nd Opposition)
@syriza_gr Political Party SYRIZA SYRIZA (2nd Opposition)
@FofiGennimata Politician Fofi Gennimata KINAL (3rd Opposition)
@kinimallagis Political Party KINAL KINAL (3rd Opposition)
Person: Entity:
AthPPA graphs results
Slide: 71 to 105
72. • From those accounts we obtain for each one of them a dynamic sample of 200 posted tweets using Tweepy Python
library. The data have been visualized are the number of likes, re-tweets and text character count for each tweet of
the obtained sample of 200 tweets. We have also identified negative hashtags for the two prominent political
parties that Twitter’s users tend to use in their posted tweets. The following table depicts those identified hashtags.
Identified hashtag Hashatag relation Tweet sample Data visualized
#ΝΔ_θελατε Negative for ND 200 Date posted frequency
#ΝΔ_ξεφτίλες Negative for ND 200 Date posted frequency
#ΝΔ_ρομπες Negative for ND 200 Date posted frequency
#ΣΥΡΙΖΑ_ξεφτίλες Negative for SYRIZA 200 Date posted frequency
#συριζωα Negative for SYRIZA 200 Date posted frequency
#Συριζα_απατεώνες Negative for SYRIZA 200 Date posted frequency
Identified negative hashtags per political party
AthPPA graphs results
Slide: 72 to 105
73. Users likes and retweets per posted tweet mined from @kmitsotakis account (200 tweet sample)
AthPPA graphs results
Slide: 73 to 105
74. Users likes and retweets per posted tweet mined from @PrimeministerGR account (200 tweet sample)
AthPPA graphs results
Slide: 74 to 105
75. Users likes and retweets per posted tweet mined from @atsipras account (200 tweet sample)
AthPPA graphs results
Slide: 75 to 105
76. Users likes and retweets per posted tweet mined from @FofiGennimata account (200 tweet sample)
AthPPA graphs results
Slide: 76 to 105
77. Text length per posted tweet mined from @kmitsotakis account (200 tweet sample)
AthPPA graphs results
Slide: 77 to 105
78. Text length per posted tweet mined from @atsipras account (200 tweet sample
AthPPA graphs results
Slide: 78 to 105
79. Text length per posted tweet mined from @FofiGennimata account (200 tweet sample)
AthPPA graphs results
Slide: 79 to 105
80. Users likes and retweets per posted tweet mined from @neademokratia account (200 tweet sample)
AthPPA graphs results
Slide: 80 to 105
81. Users likes and retweets per posted tweet mined from @syriza_gr account (200 tweet sample)
AthPPA graphs results
Slide: 81 to 105
82. Mined tweets which include negative hashtag (#ΝΔ_Θελατε) for New Democracy party
AthPPA graphs results
Slide: 82 to 105
83. Mined tweets which include negative hashtag (#ΝΔ_ξεφτίλες) for New Democracy party
AthPPA graphs results
Slide: 83 to 105
84. Mined tweets which include negative hashtag (#ΝΔ_ρομπες) for New Democracy party
AthPPA graphs results
Slide: 84 to 105
85. Mined tweets which include negative hashtag (#ΣΥΡΙΖΑ_ξεφτιλες) for SYRIZA party
AthPPA graphs results
Slide: 85 to 105
86. Mined tweets which include negative hashtag (#συριζωα) for SYRIZA party
AthPPA graphs results
Slide: 86 to 105
87. Mined tweets which include negative hashtag (#Συριζα_απατεωνες) for SYRIZA party
AthPPA graphs results
Slide: 87 to 105
88. Number of registered subscribers per twitter account for the top three Greek political leaders (actual numbers)
AthPPA graphs results
Slide: 88 to 105
89. AthPPA graphs results
Number of registered subscribers per twitter account
for the top three Greek political leaders (in percentage)
Slide: 89 to 105
90. Comparison of users likes per posted tweet for the top three Greek political leaders (600 tweet sample)
AthPPA graphs results
Slide: 90 to 105
91. AthPPA graphs results
Comparison of re-tweets per posted tweet for the top three Greek political leaders (600 tweet sample)
Slide: 91 to 105
92. Total comparison of Sentiment tweet for the top three Greek political leaders (600 tweet sample)
AthPPA graphs results
Slide: 92 to 105
94. Comparison of identified positive tweets (600 tweet sample)
AthPPA graphs results
Slide: 94 to 105
95. Comparison of identified neutral tweets (600 tweet sample)
AthPPA graphs results
Slide: 95 to 105
96. Comparison of identified negative tweets (600 tweet sample)
AthPPA graphs results
Slide: 96 to 105
97. Comparison of identified negative tweets (600 tweet sample)
AthPPA graphs results
Slide: 97 to 105
98. Comparison positive, negative and neutral tweets per political leader (600 tweet sample
AthPPA graphs results
Slide: 98 to 105
99. Conclusion (1/3)
• We presented a web-based data visualization application for political tendency identification of Twitter’s users.
Twitter is a useful tool to extract the sentiment of users and to predict a political result.
• We have identified crucial structured data such as the number of likes, re-tweets, text length, number of subscribers
per account as well as the frequency of negative hashtags that users include in their posted tweets.
• The number of likes and retweets allows us to observe how popular is a political leader and how many active users a
politician has on Twitter. This can help us of identifying the fan base group of a politician which is a strong indicator
of political popularity.
• The same occurs for political parties that have their official accounts in Twitter where likes and re-tweets per posted
tweet are an indicator of a fan base pool, voters or supporters. In addition to that, political parties that have official
accounts in Twitter are the first source of advertising a political leader which represents it.
• For example, on the Greek political standards if a political leader let’s say leader A posts something on his/her
account it is likely that this post will be re-tweeted by the official Twitter account of the political party that leader A
represents.
Slide: 99 to 105
100. • Also, the number of subscribers that political leaders have on their accounts is another crucial indicator of political
popularity as it indicates the intertest of users to stay tuned to the posts and news that a political leader tweeted.
• On the other hand, some of these users might be fake accounts but that would certainly be a small number,
although AthPPA presents unstructured data as well such as sentiment analysis.
• This includes also the negative hashtags as users might use them as trend rather than a true source of opinion
indicator. For this reason, we are focusing mostly on how frequently a tweet with negative hashtag about a political
party is posted on the Twitter rather than the emotion that these tweets imply.
• Finalizing with the structured data that we've obtained from Twitter, the text length of a posted tweet is an indicator
to what extend a politician uses Twitter for example small texts are an indicator for expressing an announcement or
something very crucial that wants to reach onto many people and that will ensure that many will read it while long
texts indicate an effort to express an opinion or to affect the thoughts of a certain group of people e.g. invocation of
emotion etc. although they have high risk of being unnoticed by users as they might be bored of reading it.
• That’s why the character limitation that Twitter provides is a crucial restriction in order to make sure that the
message of the tweet will reach to everyone (or at least everyone will read it).
Conclusion (2/3)
Slide: 100 to 105
101. • As we've mentioned previously, structured data are a fairly good source of obtaining the popularity of a politician or
a political party but there are also fake accounts that might lead us onto false results.
• That's why we are also analyzing the text of the tweet that a politician posts on Twitter in order to identify its
expressed emotion or its impact that this tweet might have on users.
• SpaCy is an efficient Natural Language Processing tool that helps us to deploy Natural Language Processing
techniques on a text and it is also an efficient commercial tool with strong community with a good documentation.
• Also, the lexicon made by Tsakalidis et al. [54] is a good option on or case as we want to extract the emotion that a
text implies rather than simply identifying how negative or positive that text is based on a simple scale of positive or
negative words.
• Emotion based lexicons are efficient when it comes on political sentiment analysis as the emotions are the main
indicators that affect the opinion of a voter (e.g. anger about a tweet that a politician tweeted etc.).
• Furthermore, Python as programming language is an outstanding tool when it comes on data visualization as Dash
framework provides many capabilities and features for creating graphs and charts on a web-based environment.
Conclusion (3/3)
Slide: 101 to 105
102. Useful Links
• Website for AthPPA: https://athppa.cs.hmu.gr/
• Github link for AthPPA: https://github.com/CodeBrakes/AthPPA
• Github link spaCy (Greek Version) : https://github.com/eellak/gsoc2018-spacy
• Github link Greek sentiment lexicon: https://github.com/MKLab-ITI/greek-sentiment-lexicon
Slide: 102 to 105
103. 1. Garfinkel, Simson L., and Rachel H. Grunspan. The Computer Book: From the Abacus to Artificial Intelligence, 250 Milestones in the History of Computer Science. Sterling Swift
Pub Co, 2018.
2. Ellison, Carl. "Cryptography Timeline." (2001) [http://world.std.com/~cme/html/timeline.html].
3. Kelly, Thomas. "The myth of the skytale." Cryptologia 22.3 (1998): 244-260.
4. Sharkey, N., and A. Sharkey. "Electro-mechanical robots before the computer." Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical
Engineering Science 223.1 (2009): 235-241.
5. Al-Kadit, Ibrahim A. "Origins of cryptology: The Arab contributions." Cryptologia 16.2 (1992): 97-126.
6. Kahn, David. The Codebreakers: The comprehensive history of secret communication from ancient times to the internet. Simon and Schuster, 1996.
7. Gray, Jonathan. "“Let us Calculate!”: Leibniz, Llull, and the Computational Imagination." (2016).
8. ulstad, Mark, and Laurence Carlin. “Leibniz’s Philosophy of Mind.” The Stanford Encyclopedia of Philosophy. Last modified November 11, 20.
9. illiams, John B. The electronics revolution: inventing the future. Springer, 2017.
10. Campbell-Kelly, Martin. "Data communications at the national physical laboratory (1965-1975)." Annals of the History of Computing 9.3/4 (1987): 221-247.
11. Pelkey, James. "A History of Computer Communications 1968–1988." (2014).
12. Katie Hafner, and Matthew Lyon. Where wizards stay up late: The origins of the Internet. Simon and Schuster, 1998.
13. David Roessner, et al. "The Role of NSF Support of Engineering in Enabling Technological Innovation." Final Report to the National Science Foundation. SRI International,
Washington, DC (1997).
14. Davies, Donald W. "An historical study of the beginnings of packet switching." The Computer Journal 44.3 (2001): 152-162.
15. Hjorth, Larissa, and Sam Hinton. Understanding social media. SAGE Publications Limited, 2019.
16. Wall, Aaron. "History of search engines: From 1945 to Google today." Search engine history (2015): 2006-2017.
17. Anthony JG Hey and Gyuri Pápay. The computing universe: a journey through a revolution. Cambridge University Press, 2014.
18. Darcy DiNucci, "Fragmented future." (1999): 32-33.
19. José Van Dijck. The culture of connectivity: A critical history of social media. Oxford University Press, 2013.
20. José Van Dijck, Thomas Poell, and Martijn De Waal. The platform society: Public values in a connective world. Oxford University Press, 2018.
21. Sponder, Marshall, and Gohar F. Khan. Digital analytics for marketing. Routledge, 2017.
22. Berners-Lee, Tim, and Mark Fischetti. "Weaving the Web. HarperSanFrancisco. chapter 12." (1999).
References (1/3)
Slide: 103 to 105
104. 23. Berners-Lee, Tim, and Mark Fischetti. "Weaving the Web. HarperSanFrancisco. chapter 12." (1999).
24. Ravi Gupta and Hugh Brooks. Using social media for global security. John Wiley & Sons, 2013.
25. The Rise of Social Media, Esteban Ortiz-Ospina - https://ourworldindata.org/rise-of-social-media
26. Chatterjee, Siddhartha, and Michal Krystyanczuk. Python Social Media Analytics. Packt Publishing Ltd, 2017.
27. Matthew Ganis and Avinash Kohirkar. Social media analytics: Techniques and insights for extracting business value out of social media. IBM Press, 2015.
28. Stikeleather, Jim. "The Three Elements of Successful Data Visualizations." Harvard Business Review 19 (2013).
29. Measured, Simply. "The complete guide to Twitter Analytics: How to analyze the metrics that matter." (2014).
30. Shah, Chirag. A Hands-On Introduction to Data Science. Cambridge University Press, 2020.
31. Kumar, Shamanth, Fred Morstatter, and Huan Liu. Twitter data analytics. New York: Springer, 2014.
32. Roesslein, Joshua. "tweepy Documentation." Online] http://tweepy. readthedocs. io/en/v3 5 (2009).
33. Pozzi, Federico Alberto, et al. Sentiment analysis in social networks. Morgan Kaufmann, 2016.
34. Liu, Bing. "Sentiment analysis and opinion mining." Synthesis lectures on human language technologies 5.1 (2012): 1-167.
35. Liu, Yang, et al. "ARSA: a sentiment-aware model for predicting sales performance using blogs." Proceedings of the 30th annual international ACM SIGIR conference on
Research and development in information retrieval. 2007.
36. O'Connor, Brendan, et al. "From tweets to polls: Linking text sentiment to public opinion time series." Fourth international AAAI conference on weblogs and social media. 2010.
37. Tumasjan, Andranik, et al. "Predicting elections with twitter: What 140 characters reveal about political sentiment." Fourth international AAAI conference on weblogs and
social media. 2010.
38. Amolik, Akshay, et al. "Twitter sentiment analysis of movie reviews using machine learning techniques." international Journal of Engineering and Technology 7.6 (2016): 1-7.
39. Jindal, Nitin, and Bing Liu. "Identifying comparative sentences in text documents." Proceedings of the 29th annual international ACM SIGIR conference on Research and
development in information retrieval. 2006.
40. Lazarsfeld, Paul F., and Robert K. Merton. "Friendship as a social process: A substantive and methodological analysis." Freedom and control in modern society 18.1 (1954): 18-
66.
41. Ferguson, Niall. "The False Prophecy of Hyperconnection: How to Survive the Networked Age." Foreign Aff. 96 (2017): 68.
42. Edward, P. J., and J. C. Robert. "Classical rhetoric for the modern student." (1971): 40.
43. Spatiotis, Nikolaos, et al. "Sentiment Analysis for the Greek Language." Proceedings of the 20th Pan-Hellenic Conference on Informatics. 2016.
44. Qazi, Atika, et al. "A systematic literature review on opinion types and sentiment analysis techniques." Internet Research (2017).
References (2/3)
Slide: 104 to 105
105. 45. Stuart, Russell, and Norvig Peter. "Artificial intelligence: a modern approach." (2003).
46. Müller, Andreas C., and Sarah Guido. Introduction to machine learning with Python: a guide for data scientists. " O'Reilly Media, Inc.", 2016.
47. Deshwal, Ajay, and Sudhir Kumar Sharma. "Twitter sentiment analysis using various classification algorithms." 2016 5th International Conference on Reliability, Infocom
Technologies and Optimization (Trends and Future Directions)(ICRITO). IEEE, 2016.
48. Witten, Ian H., Eibe Frank, and Mark A. Hall. "Practical machine learning tools and techniques." Morgan Kaufmann (2005): 578.
49. Desai, Mitali, and Mayuri A. Mehta. "Techniques for sentiment analysis of Twitter data: A comprehensive survey." 2016 International Conference on Computing,
Communication and Automation (ICCCA). IEEE, 2016.
50. O'Reilly, Tim, and Sarah Milstein. The twitter book. " O'Reilly Media, Inc.", 2011.
51. Gokulakrishnan, Balakrishnan, et al. "Opinion mining and sentiment analysis on a twitter data stream." International Conference on Advances in ICT for Emerging Regions
(ICTer2012). IEEE, 2012.
52. "Eellak/Nlpbuddy". Github, 2020, https://github.com/eellak/nlpbuddy.
53. SpaCy 101: Everything You Need to Know · SpaCy Usage Documentation.” SpaCy 101: Everything You Need to Know, spacy.io/usage/spacy-101
54. Architecture · SpaCy API Documentation.” Architecture, spacy.io/api.
55. Tsakalidis, Adam, et al. "Building and evaluating resources for sentiment analysis in the Greek language." Language resources and evaluation 52.4 (2018): 1021-1044.
56. Ribeiro, Filipe N., et al. "Sentibench-a benchmark comparison of state-of-the-practice sentiment analysis methods." EPJ Data Science 5.1 (2016): 1-29.
References (3/3)
Slide: 105 to 105