1. INTRODUCTION
The era of Big Data has arrived. Every day, 2.5 quintillion bytes
of data are created, and 90 percent of the data in the world today were produced within the past
two years. Our capability for data generation has never been so powerful and so enormous
since the invention of information technology in the early 20th century.

As another example,
on 4 October 2012, the first presidential debate between President Barack Obama and Governor
Mitt Romney triggered more than 10 million tweets within 2 hours. Among all these tweets, the
specific moments that generated the most discussion revealed public interests, such
as the discussions about Medicare and vouchers. Such online discussions provide a new means to
sense public interest and gather feedback in real time, and are far more appealing than
traditional media, such as radio or TV broadcasting.

Another example is Flickr, a public picture
sharing site, which received an average of 1.8 million photos per day from February to March
2012. Assuming each photo is 2 megabytes (MB) in size, this requires 3.6 terabytes (TB)
of storage every single day. Indeed, as the old saying goes, “a picture is worth a thousand words,”
and the billions of pictures on Flickr are a treasure trove for exploring human society, social
events, public affairs, disasters, and so on, if only we have the power to harness this enormous
amount of data.

The above examples demonstrate the rise of Big Data applications, where data
collection has grown tremendously and is beyond the ability of commonly used software tools to
capture, manage, and process within a “tolerable elapsed time.”

The most fundamental challenge
for Big Data applications is to explore the large volumes of data and extract useful information
or knowledge for future actions. In many situations, the knowledge extraction process has to be
very efficient and close to real time because storing all observed data is nearly infeasible. For
example, the Square Kilometre Array (SKA) in radio astronomy consists of 1,000 to 1,500 15-
meter dishes in a central 5-km area. It provides vision 100 times more sensitive than any existing
radio telescope, helping to answer fundamental questions about the Universe. However, at a data
rate of 40 gigabytes (GB) per second, the data generated by the SKA are exceptionally large.
Although researchers have confirmed that interesting patterns, such as transient radio anomalies,
can be discovered from the SKA data, existing methods work only in an offline fashion and
are incapable of handling this Big Data scenario in real time. As a result, the unprecedented data
volumes require an effective data analysis and prediction platform to achieve fast response and
real-time classification for such Big Data.
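The back-of-envelope volume figures quoted above can be checked directly. The following sketch (illustrative only; the default parameters are simply the numbers cited in the text, and decimal units are assumed, i.e., 1 TB = 1,000 GB = 10^6 MB) reproduces the Flickr storage estimate and extends the same arithmetic to the SKA data rate:

```python
# Back-of-envelope data-volume checks for the examples in this introduction.

def flickr_daily_storage_tb(photos_per_day=1_800_000, photo_size_mb=2):
    """Daily storage in terabytes, assuming decimal units (1 TB = 10^6 MB)."""
    return photos_per_day * photo_size_mb / 1_000_000

SECONDS_PER_DAY = 86_400

def ska_daily_volume_tb(rate_gb_per_s=40):
    """Daily data volume in terabytes at a sustained rate (1 TB = 1,000 GB)."""
    return rate_gb_per_s * SECONDS_PER_DAY / 1_000

print(flickr_daily_storage_tb())  # 3.6 TB/day, matching the figure in the text
print(ska_daily_volume_tb())      # 3456.0 TB (about 3.5 petabytes) per day
```

The SKA figure illustrates why offline processing is impractical: at 40 GB/s, a single day of observation yields on the order of petabytes, so data must be analyzed as it streams rather than stored in full.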