SlideShare a Scribd company logo
1 of 11
Download to read offline
A HISTORY OF BIG DATA
What is Big Data?
In essence, Big Data is a term for data sets that are so large or
complex that traditional data processing applications are
inadequate. It usually includes data sets with sizes beyond the
ability of commonly used software tools to capture, curate, manage
and process data within a tolerable elapsed time1
. The “size” of Big
Data is a constantly moving target, which doesn’t remain stable at
any given point of time. As per a recent report, its size ranges from
a few dozen terabytes to many petabytes of data.
The story of how data became big starts many years before the
current buzz around big data. About seventy years ago we
encountered the first attempts to quantify the growth rate in the
volume of data or what has popularly been known as the
Information Explosion (a term first used in 1941). The history of
Big Data as a term may be brief – but many of the foundations it is
built on were laid many years ago2
. Long before computers (as we
know today) were commonplace, the idea that we were creating an
ever-expanding body of knowledge ripe for analysis was popular
in academia.
Now, let’s look at a detailed account of the major milestones in the
history of sizing data volumes in the evolution of the idea of “big
data” and observations pertaining to data or information explosion:
1932 Skipping the important milestone of the population boom
would not do justice to the history of Big Data. Information
1
Source: Wikipedia
2
Link: https://www.linkedin.com/pulse/brief-history-big-data-everyone-should-read-bernard-marr
overload continued with the boom in the US population, the
issuing of social security numbers, and the general growth of
knowledge (research) which demanded more thorough and
organized record-keeping.
1941 Scholars began referring to this incredible expansion of
information as the “Information Explosion”. First referenced by
the Lawton Constitution (newspaper) in 1941, the term was
expanded upon in a New Statesman article in March 1964, which
referred to the difficulty of managing the amount of information
available.
1944 The first flag of warning on the growth of knowledge
storage and the retrieval problem came in 1944, when Fremont
Rider, a Wesleyan University Librarian estimated that American
university libraries were doubling in size every sixteen years. At
this growth rate, Rider speculated that the Yale Library in 2040
would have “approximately 200,000,000 volumes, which will
occupy over 6,000 miles of shelves… [requiring] a cataloging staff
of over six thousand persons.”
Schematic showing a general communication system3
.
3
Link: http://www.winshuttle.com/big-data-timeline/
1948 Claude Shannon published “Shannon’s Information
Theory” which established a framework for determining the
minimal data requirements to transmit information over a noisy
(imperfect) channel. This was a landmark work that enabled much
of today’s infrastructure. Without this understanding, data would
be “bigger” than it is today.
1956 The concept of virtual memory was developed by German
physicist Fritz-Rudolf Guntsch as an idea that treated finite storage
as infinite. Storage, managed by integrated hardware and software
to hide the details from the user, permitted us to process data
without the hardware memory constraints that previously forced
the problem to be partitioned.
Information Overload4
4
Image source: Google images
1961 Information Scientist, Derek Price, generalized Rider’s
findings to include almost the entire range of scientific knowledge.
The scientific revolution, as he called it, was responsible for the
rapid communication of new ideas as scientific information. This
rapid growth was in the form of new journals doubling every 15
years.
1963 In the early 1960’s, Price observed that the vast amount of
scientific research was too much for humans to keep abreast of.
Abstract journals, which were created in the late 1800’s as a way
to manage the increasing knowledge-base, were also growing at
the same trajectory and had already reached a “critical magnitude”.
They were no longer a storage or organization solution for
information.
1966 At around this time, the Centralized Computing Systems
entered the scene. Not only was information booming in the
science sector, it was booming in the business sector as well. Due
to the information influx in the 1960’s, most organizations began
to design, develop and implement centralized computing systems
that allowed them to automate their inventory systems.
1970 Edgar F. Codd, an Oxford-educated mathematician
working at the IBM Research Lab, published a paper showing how
information stored in large databases could be accessed without
knowing how the information was structures or where it resided on
the database. Until then, retrieving information required relatively
sophisticated computer knowledge, or even the services of
specialists —a time-consuming and expensive task. Today, most
routine data transactions—accessing bank accounts, using credit
cards, trading stocks, making travel reservations, buying things
online—all use structures based on relational database theory.
A relational database system5
1976 In the mid-1970’s, Materials Requirements Planning
(MRP) systems were designed as a tool to help manufacturing
firms to organize and schedule their information. Around the same
time, PC’s were gaining huge popularity gradually which marked a
shift in focus toward business processes and accounting
capabilities. Companies like Oracle and SAP were founded around
the same time.
5
Image source: IBM.com
1983 As advancements in technology continued further, every
industry began to benefit from new ways to organize, store and
produce data.
Information Explosion6
1996 Digital storage became more cost-effective for storing
data than paper. Also, the boom in data brought more challenges to
ERP vendors. The need to redesign ERP products, including
breaking the barrier of proprietorship and customization, forced
vendors to embrace the collaborative business over the internet in a
seamless manner.
1997 The term “Big Data” was used for the first time in an
article by NASA researchers Michael Cox and David Ellsworth.
6
Image source: IBM.com
The pair claimed that the rise of data was becoming an issue for
current computer systems. This was also known as the “problem of
big data”.
The 4 V’s of Big Data7
.
1998 By the end of 90’s, many businesses began to believe that
their data mining systems were not up to the mark and still needed
improvements. Business workers were unable to get access to or
answer the data they needed from searches. Also, IT resources
were not so easily available at their disposal. So, whenever the
employees needed access, they had to call the IT department due to
lack of easily accessible information.
2001 The acronym SaaS (Software as a Service) first appeared
around this time. It basically means an “on-demand software”
7
Image source: IBM.com
delivery model which is licensed on a subscription basis and is
centrally hosted.
Software as a Service8
2005 SaaS companies began appearing on the scene to offer an
alternative to Oracle and SAP that was more focused on the
usability of the end user. Adding to this was the creation of a new
programming language named Hadoop. Free to download, use,
enhance and improve, Hadoop is 100% open source way pf storing
and processing data that enables distributed parallel procession of
huge amounts of data across inexpensive, industry-standard servers
that both store and process the data with extreme scalability.
2009 Business Intelligence became a top priority for Chief
Information Officers in 2009. Tim Berners, director of the World
Wide Web Consortium (W3C) was the first to use the term “linked
8
Image source: Google images
data” during a presentation on the subject at the TED 2009
conference. A set of best practices for using the Web to create
links between structured data is known as Linked Data.
2011 By this time, nearly all sectors in the US economy had at
least an average of 200 terabytes of stored data per company with
more than 1000 employees. The writers also estimated the
securities and investment industries led in terms of stored data per
organization. The scientists calculated that 7.4 exabytes of original
data were saved by enterprises and 6.8 exabytes by consumers in
2010 alone.
2012 After the launch of IPv6, identification and location
system for computers on the networks and traffic routes across the
internet became much faster. Technologically advanced features
such as ability to generate reports from in-memory databases
which provide faster and more predictable performance were also
on the rise. Businesses began to implement new in-memory
technology such as SAP HANA to analyze and optimize mass
quantities of data. Companies became ever more reliant on
utilizing data as a business asset to gain a competitive advantage,
with big data leading the charge as arguably the most important
new technology to understand and make use of in day-to-day
business.
How does Hexanika make use of Big Data?
Hexanika is a FinTech big data software company which has
developed an end-to-end solution for financial institutions to
address data sourcing and reporting challenges for regulatory
compliance. Hexanika’s innovative solution improves data quality,
keeps regulatory reporting in harmony with the dynamic regulatory
requirements and keeps pace with the new developments and latest
regulatory updates.
Hexanika’s unique Big Data deployment approach by experienced
professionals will simplify, optimize and reduce costs of
deployment. It strives to achieve this by following the process as
shown below:
Hexanika addresses Big Data using its unique product and
solutions. To know more about us,
see: http://hexanika.com/company-profile/
Feel free to get in touch with our experts to know more
at: http://hexanika.com/contact-us-big-data-company/
CONTACT US
USA
249 East 48 Street,
New York, NY 10017
Tel: +1 646.733.6636
INDIA
Krupa Bungalow 1187/10,
Shivaji Nagar, Pune 411005
Tel: +91 9850686861
Email: info@hexanika.com
Follow Us

More Related Content

What's hot

Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its ChallengesKathirvel Ayyaswamy
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysisPoonam Kshirsagar
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunishaShivlal Mewada
 
The Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataThe Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataReadWrite
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop IntroductionJayant Mukherjee
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabatinabati
 
Data: A Timeline - How Data Came To Rule The World
Data: A Timeline - How Data Came To Rule The WorldData: A Timeline - How Data Came To Rule The World
Data: A Timeline - How Data Came To Rule The WorldRibbonfish
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 finalAmjid Ali
 
Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019Edwin S. Garcia
 

What's hot (20)

The big story (BIG DATA)
The big story (BIG DATA)The big story (BIG DATA)
The big story (BIG DATA)
 
A Brief History Of Data
A Brief History Of DataA Brief History Of Data
A Brief History Of Data
 
Research issues in the big data and its Challenges
Research issues in the big data and its ChallengesResearch issues in the big data and its Challenges
Research issues in the big data and its Challenges
 
Big data
Big dataBig data
Big data
 
Data minig with Big data analysis
Data minig with Big data analysisData minig with Big data analysis
Data minig with Big data analysis
 
Big data
Big dataBig data
Big data
 
Chapter 1 big data
Chapter 1 big dataChapter 1 big data
Chapter 1 big data
 
Data and science
Data and scienceData and science
Data and science
 
23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha23 ijcse-01238-1indhunisha
23 ijcse-01238-1indhunisha
 
The Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big DataThe Age of Exabytes: Tools & Approaches for Managing Big Data
The Age of Exabytes: Tools & Approaches for Managing Big Data
 
Big Data & Hadoop Introduction
Big Data & Hadoop IntroductionBig Data & Hadoop Introduction
Big Data & Hadoop Introduction
 
Big data analytics, survey r.nabati
Big data analytics, survey r.nabatiBig data analytics, survey r.nabati
Big data analytics, survey r.nabati
 
BigData
BigDataBigData
BigData
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
Data: A Timeline - How Data Came To Rule The World
Data: A Timeline - How Data Came To Rule The WorldData: A Timeline - How Data Came To Rule The World
Data: A Timeline - How Data Came To Rule The World
 
Research paper on big data and hadoop
Research paper on big data and hadoopResearch paper on big data and hadoop
Research paper on big data and hadoop
 
Big data 2017 final
Big data 2017   finalBig data 2017   final
Big data 2017 final
 
Big data survey
Big data surveyBig data survey
Big data survey
 
ANALYTICS OF DATA USING HADOOP-A REVIEW
ANALYTICS OF DATA USING HADOOP-A REVIEWANALYTICS OF DATA USING HADOOP-A REVIEW
ANALYTICS OF DATA USING HADOOP-A REVIEW
 
Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019Data Mining @ BSU Malolos 2019
Data Mining @ BSU Malolos 2019
 

Viewers also liked

Serrano artwork sample 2
Serrano artwork sample 2Serrano artwork sample 2
Serrano artwork sample 2Desiree Serrano
 
15207870 pss7-ans
15207870 pss7-ans15207870 pss7-ans
15207870 pss7-ansmichaelkw
 
Serrano artwork sample 4
Serrano artwork sample 4Serrano artwork sample 4
Serrano artwork sample 4Desiree Serrano
 
Huzefa_UpdatedR (1)
Huzefa_UpdatedR (1)Huzefa_UpdatedR (1)
Huzefa_UpdatedR (1)Hozefa Jamil
 
Servers are like snowmen - and virtualization is cool!
Servers are like snowmen - and virtualization is cool!Servers are like snowmen - and virtualization is cool!
Servers are like snowmen - and virtualization is cool!Jason Samuels
 
Mixx2016_Krasulya_Dmirty
Mixx2016_Krasulya_DmirtyMixx2016_Krasulya_Dmirty
Mixx2016_Krasulya_Dmirtyiabrussiaprez
 
SK1 / U.2 - Movies & Entertainment
SK1 / U.2 - Movies & EntertainmentSK1 / U.2 - Movies & Entertainment
SK1 / U.2 - Movies & EntertainmentLee Gonz
 
Infogix Automated Information Controls
Infogix Automated Information ControlsInfogix Automated Information Controls
Infogix Automated Information Controlskoleksy
 
Infogix BCBS 239 Implementation Challenges
Infogix BCBS 239 Implementation ChallengesInfogix BCBS 239 Implementation Challenges
Infogix BCBS 239 Implementation ChallengesMichelle Genser
 
Back arrow.pngs campaign recomendations
Back arrow.pngs campaign recomendationsBack arrow.pngs campaign recomendations
Back arrow.pngs campaign recomendationsJoshua Baca
 
How To Make Display Ads That Work
How To Make Display Ads That WorkHow To Make Display Ads That Work
How To Make Display Ads That WorkPraveen Rajaretnam
 

Viewers also liked (12)

Serrano artwork sample 2
Serrano artwork sample 2Serrano artwork sample 2
Serrano artwork sample 2
 
15207870 pss7-ans
15207870 pss7-ans15207870 pss7-ans
15207870 pss7-ans
 
Mission Ref
Mission RefMission Ref
Mission Ref
 
Serrano artwork sample 4
Serrano artwork sample 4Serrano artwork sample 4
Serrano artwork sample 4
 
Huzefa_UpdatedR (1)
Huzefa_UpdatedR (1)Huzefa_UpdatedR (1)
Huzefa_UpdatedR (1)
 
Servers are like snowmen - and virtualization is cool!
Servers are like snowmen - and virtualization is cool!Servers are like snowmen - and virtualization is cool!
Servers are like snowmen - and virtualization is cool!
 
Mixx2016_Krasulya_Dmirty
Mixx2016_Krasulya_DmirtyMixx2016_Krasulya_Dmirty
Mixx2016_Krasulya_Dmirty
 
SK1 / U.2 - Movies & Entertainment
SK1 / U.2 - Movies & EntertainmentSK1 / U.2 - Movies & Entertainment
SK1 / U.2 - Movies & Entertainment
 
Infogix Automated Information Controls
Infogix Automated Information ControlsInfogix Automated Information Controls
Infogix Automated Information Controls
 
Infogix BCBS 239 Implementation Challenges
Infogix BCBS 239 Implementation ChallengesInfogix BCBS 239 Implementation Challenges
Infogix BCBS 239 Implementation Challenges
 
Back arrow.pngs campaign recomendations
Back arrow.pngs campaign recomendationsBack arrow.pngs campaign recomendations
Back arrow.pngs campaign recomendations
 
How To Make Display Ads That Work
How To Make Display Ads That WorkHow To Make Display Ads That Work
How To Make Display Ads That Work
 

Similar to History of Big Data

Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis Conference Papers
 
Moving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesMoving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesIJRESJOURNAL
 
Big data a rescue plan
Big data a rescue planBig data a rescue plan
Big data a rescue planTapasya123
 
SWOT of Bigdata Security Using Machine Learning Techniques
SWOT of Bigdata Security Using Machine Learning TechniquesSWOT of Bigdata Security Using Machine Learning Techniques
SWOT of Bigdata Security Using Machine Learning Techniquesijistjournal
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October IssueJIMS Rohini Sector 5
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsHappiest Minds Technologies
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessAjay Ohri
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilindEMC
 
Big data introduction by quontra solutions
Big data introduction by quontra solutionsBig data introduction by quontra solutions
Big data introduction by quontra solutionsQUONTRASOLUTIONS
 

Similar to History of Big Data (20)

Review of big data analytics (bda) architecture trends and analysis
Review of big data analytics (bda) architecture   trends and analysis Review of big data analytics (bda) architecture   trends and analysis
Review of big data analytics (bda) architecture trends and analysis
 
Moving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and PerspectivesMoving Toward Big Data: Challenges, Trends and Perspectives
Moving Toward Big Data: Challenges, Trends and Perspectives
 
lecture-1-1487765601.pptx
lecture-1-1487765601.pptxlecture-1-1487765601.pptx
lecture-1-1487765601.pptx
 
What is big data.pdf
What is big data.pdfWhat is big data.pdf
What is big data.pdf
 
Big data a rescue plan
Big data a rescue planBig data a rescue plan
Big data a rescue plan
 
Big Data: A Rescue Plan
Big Data: A Rescue PlanBig Data: A Rescue Plan
Big Data: A Rescue Plan
 
Database Essay
Database EssayDatabase Essay
Database Essay
 
SWOT of Bigdata Security Using Machine Learning Techniques
SWOT of Bigdata Security Using Machine Learning TechniquesSWOT of Bigdata Security Using Machine Learning Techniques
SWOT of Bigdata Security Using Machine Learning Techniques
 
Data mining with big data
Data mining with big dataData mining with big data
Data mining with big data
 
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUDLITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
LITERATURE SURVEY ON BIG DATA AND PRESERVING PRIVACY FOR THE BIG DATA IN CLOUD
 
Intro dm
Intro dmIntro dm
Intro dm
 
The-Information-Age.pptx
The-Information-Age.pptxThe-Information-Age.pptx
The-Information-Age.pptx
 
From Big Data to Fast Data
From Big Data to Fast DataFrom Big Data to Fast Data
From Big Data to Fast Data
 
JIMS Rohini IT Flash Monthly Newsletter - October Issue
JIMS Rohini IT Flash Monthly Newsletter  - October IssueJIMS Rohini IT Flash Monthly Newsletter  - October Issue
JIMS Rohini IT Flash Monthly Newsletter - October Issue
 
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest MindsWhitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
 
How Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help businessHow Big Data ,Cloud Computing ,Data Science can help business
How Big Data ,Cloud Computing ,Data Science can help business
 
Big data
Big dataBig data
Big data
 
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
Big Data Systems: Past, Present &  (Possibly) Future with @techmilindBig Data Systems: Past, Present &  (Possibly) Future with @techmilind
Big Data Systems: Past, Present & (Possibly) Future with @techmilind
 
Big data introduction by quontra solutions
Big data introduction by quontra solutionsBig data introduction by quontra solutions
Big data introduction by quontra solutions
 
Big data
Big dataBig data
Big data
 

More from HEXANIKA

Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?HEXANIKA
 
Scope of Data Integration
Scope of Data IntegrationScope of Data Integration
Scope of Data IntegrationHEXANIKA
 
How Big Data helps banks know their customers better
How Big Data helps banks know their customers betterHow Big Data helps banks know their customers better
How Big Data helps banks know their customers betterHEXANIKA
 
Sandbox in Financial Services
Sandbox in Financial ServicesSandbox in Financial Services
Sandbox in Financial ServicesHEXANIKA
 
High regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banksHigh regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banksHEXANIKA
 
Automation in Banking
Automation in BankingAutomation in Banking
Automation in BankingHEXANIKA
 
Regulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized BanksRegulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized BanksHEXANIKA
 
Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)HEXANIKA
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?HEXANIKA
 
FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?HEXANIKA
 
The Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and AftereffectsThe Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and AftereffectsHEXANIKA
 
A summary of Solvency II Directives
A summary of Solvency II DirectivesA summary of Solvency II Directives
A summary of Solvency II DirectivesHEXANIKA
 
A Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliantA Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliantHEXANIKA
 
Dodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory ReportingDodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory ReportingHEXANIKA
 
Regulatory impact on small and midsize banks
Regulatory impact on small and midsize banksRegulatory impact on small and midsize banks
Regulatory impact on small and midsize banksHEXANIKA
 

More from HEXANIKA (15)

Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?Why is Regulatory Reporting tough?
Why is Regulatory Reporting tough?
 
Scope of Data Integration
Scope of Data IntegrationScope of Data Integration
Scope of Data Integration
 
How Big Data helps banks know their customers better
How Big Data helps banks know their customers betterHow Big Data helps banks know their customers better
How Big Data helps banks know their customers better
 
Sandbox in Financial Services
Sandbox in Financial ServicesSandbox in Financial Services
Sandbox in Financial Services
 
High regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banksHigh regulatory costs for small and mid sized banks
High regulatory costs for small and mid sized banks
 
Automation in Banking
Automation in BankingAutomation in Banking
Automation in Banking
 
Regulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized BanksRegulatory Pain Points For Small And Medium Sized Banks
Regulatory Pain Points For Small And Medium Sized Banks
 
Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)Understanding SAR (Suspicious Activity Reporting)
Understanding SAR (Suspicious Activity Reporting)
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?
 
FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?FATCA: why is it so difficult even after so many years?
FATCA: why is it so difficult even after so many years?
 
The Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and AftereffectsThe Volcker Rule: Its Implications and Aftereffects
The Volcker Rule: Its Implications and Aftereffects
 
A summary of Solvency II Directives
A summary of Solvency II DirectivesA summary of Solvency II Directives
A summary of Solvency II Directives
 
A Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliantA Review of BCBS 239: Helping banks stay compliant
A Review of BCBS 239: Helping banks stay compliant
 
Dodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory ReportingDodd-Frank's Impact on Regulatory Reporting
Dodd-Frank's Impact on Regulatory Reporting
 
Regulatory impact on small and midsize banks
Regulatory impact on small and midsize banksRegulatory impact on small and midsize banks
Regulatory impact on small and midsize banks
 

Recently uploaded

Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyInterimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyTyöeläkeyhtiö Elo
 
Call Girls Near Me WhatsApp:+91-9833363713
Call Girls Near Me WhatsApp:+91-9833363713Call Girls Near Me WhatsApp:+91-9833363713
Call Girls Near Me WhatsApp:+91-9833363713Sonam Pathan
 
Ch 4 investment Intermediate financial Accounting
Ch 4 investment Intermediate financial AccountingCh 4 investment Intermediate financial Accounting
Ch 4 investment Intermediate financial AccountingAbdi118682
 
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证rjrjkk
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfAdnet Communications
 
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...Henry Tapper
 
Tenets of Physiocracy History of Economic
Tenets of Physiocracy History of EconomicTenets of Physiocracy History of Economic
Tenets of Physiocracy History of Economiccinemoviesu
 
(中央兰开夏大学毕业证学位证成绩单-案例)
(中央兰开夏大学毕业证学位证成绩单-案例)(中央兰开夏大学毕业证学位证成绩单-案例)
(中央兰开夏大学毕业证学位证成绩单-案例)twfkn8xj
 
Economics, Commerce and Trade Management: An International Journal (ECTIJ)
Economics, Commerce and Trade Management: An International Journal (ECTIJ)Economics, Commerce and Trade Management: An International Journal (ECTIJ)
Economics, Commerce and Trade Management: An International Journal (ECTIJ)ECTIJ
 
Stock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfStock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfMichael Silva
 
SBP-Market-Operations and market managment
SBP-Market-Operations and market managmentSBP-Market-Operations and market managment
SBP-Market-Operations and market managmentfactical
 
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》rnrncn29
 
House of Commons ; CDC schemes overview document
House of Commons ; CDC schemes overview documentHouse of Commons ; CDC schemes overview document
House of Commons ; CDC schemes overview documentHenry Tapper
 
Current Economic situation of Pakistan .pptx
Current Economic situation of Pakistan .pptxCurrent Economic situation of Pakistan .pptx
Current Economic situation of Pakistan .pptxuzma244191
 
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...Amil Baba Dawood bangali
 
Bladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex
 
Stock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfStock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfMichael Silva
 
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证jdkhjh
 
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Sapana Sha
 

Recently uploaded (20)

Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance CompanyInterimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
Interimreport1 January–31 March2024 Elo Mutual Pension Insurance Company
 
Call Girls Near Me WhatsApp:+91-9833363713
Call Girls Near Me WhatsApp:+91-9833363713Call Girls Near Me WhatsApp:+91-9833363713
Call Girls Near Me WhatsApp:+91-9833363713
 
Ch 4 investment Intermediate financial Accounting
Ch 4 investment Intermediate financial AccountingCh 4 investment Intermediate financial Accounting
Ch 4 investment Intermediate financial Accounting
 
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
原版1:1复刻温哥华岛大学毕业证Vancouver毕业证留信学历认证
 
Lundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdfLundin Gold April 2024 Corporate Presentation v4.pdf
Lundin Gold April 2024 Corporate Presentation v4.pdf
 
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
letter-from-the-chair-to-the-fca-relating-to-british-steel-pensions-scheme-15...
 
Tenets of Physiocracy History of Economic
Tenets of Physiocracy History of EconomicTenets of Physiocracy History of Economic
Tenets of Physiocracy History of Economic
 
(中央兰开夏大学毕业证学位证成绩单-案例)
(中央兰开夏大学毕业证学位证成绩单-案例)(中央兰开夏大学毕业证学位证成绩单-案例)
(中央兰开夏大学毕业证学位证成绩单-案例)
 
Economics, Commerce and Trade Management: An International Journal (ECTIJ)
Economics, Commerce and Trade Management: An International Journal (ECTIJ)Economics, Commerce and Trade Management: An International Journal (ECTIJ)
Economics, Commerce and Trade Management: An International Journal (ECTIJ)
 
Stock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdfStock Market Brief Deck for 4/24/24 .pdf
Stock Market Brief Deck for 4/24/24 .pdf
 
SBP-Market-Operations and market managment
SBP-Market-Operations and market managmentSBP-Market-Operations and market managment
SBP-Market-Operations and market managment
 
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》
《加拿大本地办假证-寻找办理Dalhousie毕业证和达尔豪斯大学毕业证书的中介代理》
 
House of Commons ; CDC schemes overview document
House of Commons ; CDC schemes overview documentHouse of Commons ; CDC schemes overview document
House of Commons ; CDC schemes overview document
 
Current Economic situation of Pakistan .pptx
Current Economic situation of Pakistan .pptxCurrent Economic situation of Pakistan .pptx
Current Economic situation of Pakistan .pptx
 
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...
NO1 Certified Ilam kala Jadu Specialist Expert In Bahawalpur, Sargodha, Sialk...
 
Bladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results PresentationBladex 1Q24 Earning Results Presentation
Bladex 1Q24 Earning Results Presentation
 
Stock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdfStock Market Brief Deck for "this does not happen often".pdf
Stock Market Brief Deck for "this does not happen often".pdf
 
🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road🔝+919953056974 🔝young Delhi Escort service Pusa Road
🔝+919953056974 🔝young Delhi Escort service Pusa Road
 
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证
原版1:1复刻堪萨斯大学毕业证KU毕业证留信学历认证
 
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111Call Girls In Yusuf Sarai Women Seeking Men 9654467111
Call Girls In Yusuf Sarai Women Seeking Men 9654467111
 

History of Big Data

  • 1. A HISTORY OF BIG DATA What is Big Data? In essence, Big Data is a term for data sets that are so large or complex that traditional data processing applications are inadequate. It usually includes data sets with sizes beyond the ability of commonly used software tools to capture, curate, manage and process data within a tolerable elapsed time1 . The “size” of Big Data is a constantly moving target, which doesn’t remain stable at any given point of time. As per a recent report, its size ranges from a few dozen terabytes to many petabytes of data. The story of how data became big starts many years before the current buzz around big data. About seventy years ago we encountered the first attempts to quantify the growth rate in the volume of data or what has popularly been known as the Information Explosion (a term first used in 1941). The history of Big Data as a term may be brief – but many of the foundations it is built on were laid many years ago2 . Long before computers (as we know today) were commonplace, the idea that we were creating an ever-expanding body of knowledge ripe for analysis was popular in academia. Now, let’s look at a detailed account of the major milestones in the history of sizing data volumes in the evolution of the idea of “big data” and observations pertaining to data or information explosion: 1932 Skipping the important milestone of the population boom would not do justice to the history of Big Data. Information 1 Source: Wikipedia 2 Link: https://www.linkedin.com/pulse/brief-history-big-data-everyone-should-read-bernard-marr
  • 2. overload continued with the boom in the US population, the issuing of social security numbers, and the general growth of knowledge (research) which demanded more thorough and organized record-keeping. 1941 Scholars began referring to this incredible expansion of information as the “Information Explosion”. First referenced by the Lawton Constitution (newspaper) in 1941, the term was expanded upon in a New Statesman article in March 1964, which referred to the difficulty of managing the amount of information available. 1944 The first flag of warning on the growth of knowledge storage and the retrieval problem came in 1944, when Fremont Rider, a Wesleyan University Librarian estimated that American university libraries were doubling in size every sixteen years. At this growth rate, Rider speculated that the Yale Library in 2040 would have “approximately 200,000,000 volumes, which will occupy over 6,000 miles of shelves… [requiring] a cataloging staff of over six thousand persons.” Schematic showing a general communication system3 . 3 Link: http://www.winshuttle.com/big-data-timeline/
  • 3. 1948 Claude Shannon published “Shannon’s Information Theory” which established a framework for determining the minimal data requirements to transmit information over a noisy (imperfect) channel. This was a landmark work that enabled much of today’s infrastructure. Without this understanding, data would be “bigger” than it is today. 1956 The concept of virtual memory was developed by German physicist Fritz-Rudolf Guntsch as an idea that treated finite storage as infinite. Storage, managed by integrated hardware and software to hide the details from the user, permitted us to process data without the hardware memory constraints that previously forced the problem to be partitioned. Information Overload4 4 Image source: Google images
  • 4. 1961 Information Scientist, Derek Price, generalized Rider’s findings to include almost the entire range of scientific knowledge. The scientific revolution, as he called it, was responsible for the rapid communication of new ideas as scientific information. This rapid growth was in the form of new journals doubling every 15 years. 1963 In the early 1960’s, Price observed that the vast amount of scientific research was too much for humans to keep abreast of. Abstract journals, which were created in the late 1800’s as a way to manage the increasing knowledge-base, were also growing at the same trajectory and had already reached a “critical magnitude”. They were no longer a storage or organization solution for information. 1966 At around this time, the Centralized Computing Systems entered the scene. Not only was information booming in the science sector, it was booming in the business sector as well. Due to the information influx in the 1960’s, most organizations began to design, develop and implement centralized computing systems that allowed them to automate their inventory systems. 1970 Edgar F. Codd, an Oxford-educated mathematician working at the IBM Research Lab, published a paper showing how information stored in large databases could be accessed without knowing how the information was structures or where it resided on the database. Until then, retrieving information required relatively sophisticated computer knowledge, or even the services of specialists —a time-consuming and expensive task. Today, most routine data transactions—accessing bank accounts, using credit
  • 5. cards, trading stocks, making travel reservations, buying things online—all use structures based on relational database theory. A relational database system5 1976 In the mid-1970’s, Materials Requirements Planning (MRP) systems were designed as a tool to help manufacturing firms to organize and schedule their information. Around the same time, PC’s were gaining huge popularity gradually which marked a shift in focus toward business processes and accounting capabilities. Companies like Oracle and SAP were founded around the same time. 5 Image source: IBM.com
  • 6. 1983 As advancements in technology continued further, every industry began to benefit from new ways to organize, store and produce data. Information Explosion6 1996 Digital storage became more cost-effective for storing data than paper. Also, the boom in data brought more challenges to ERP vendors. The need to redesign ERP products, including breaking the barrier of proprietorship and customization, forced vendors to embrace the collaborative business over the internet in a seamless manner. 1997 The term “Big Data” was used for the first time in an article by NASA researchers Michael Cox and David Ellsworth. 6 Image source: IBM.com
  • 7. The pair claimed that the rise of data was becoming an issue for current computer systems. This was also known as the “problem of big data”. The 4 V’s of Big Data7 . 1998 By the end of 90’s, many businesses began to believe that their data mining systems were not up to the mark and still needed improvements. Business workers were unable to get access to or answer the data they needed from searches. Also, IT resources were not so easily available at their disposal. So, whenever the employees needed access, they had to call the IT department due to lack of easily accessible information. 2001 The acronym SaaS (Software as a Service) first appeared around this time. It basically means an “on-demand software” 7 Image source: IBM.com
  • 8. delivery model which is licensed on a subscription basis and is centrally hosted. Software as a Service8 2005 SaaS companies began appearing on the scene to offer an alternative to Oracle and SAP that was more focused on the usability of the end user. Adding to this was the creation of a new programming language named Hadoop. Free to download, use, enhance and improve, Hadoop is 100% open source way pf storing and processing data that enables distributed parallel procession of huge amounts of data across inexpensive, industry-standard servers that both store and process the data with extreme scalability. 2009 Business Intelligence became a top priority for Chief Information Officers in 2009. Tim Berners, director of the World Wide Web Consortium (W3C) was the first to use the term “linked 8 Image source: Google images
  • 9. data” during a presentation on the subject at the TED 2009 conference. A set of best practices for using the Web to create links between structured data is known as Linked Data. 2011 By this time, nearly all sectors in the US economy had at least an average of 200 terabytes of stored data per company with more than 1000 employees. The writers also estimated the securities and investment industries led in terms of stored data per organization. The scientists calculated that 7.4 exabytes of original data were saved by enterprises and 6.8 exabytes by consumers in 2010 alone. 2012 After the launch of IPv6, identification and location system for computers on the networks and traffic routes across the internet became much faster. Technologically advanced features such as ability to generate reports from in-memory databases which provide faster and more predictable performance were also on the rise. Businesses began to implement new in-memory technology such as SAP HANA to analyze and optimize mass quantities of data. Companies became ever more reliant on utilizing data as a business asset to gain a competitive advantage, with big data leading the charge as arguably the most important new technology to understand and make use of in day-to-day business. How does Hexanika make use of Big Data? Hexanika is a FinTech big data software company which has developed an end-to-end solution for financial institutions to address data sourcing and reporting challenges for regulatory compliance. Hexanika’s innovative solution improves data quality, keeps regulatory reporting in harmony with the dynamic regulatory
  • 10. requirements and keeps pace with the new developments and latest regulatory updates. Hexanika’s unique Big Data deployment approach by experienced professionals will simplify, optimize and reduce costs of deployment. It strives to achieve this by following the process as shown below: Hexanika addresses Big Data using its unique product and solutions. To know more about us, see: http://hexanika.com/company-profile/ Feel free to get in touch with our experts to know more at: http://hexanika.com/contact-us-big-data-company/
  • 11. CONTACT US USA 249 East 48 Street, New York, NY 10017 Tel: +1 646.733.6636 INDIA Krupa Bungalow 1187/10, Shivaji Nagar, Pune 411005 Tel: +91 9850686861 Email: info@hexanika.com Follow Us