SlideShare a Scribd company logo
1 of 4
Download to read offline
International Journal of Trend in Scientific Research and Development (IJTSRD)
Volume 4 Issue 4, June 2020 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470
@ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 26
Extract the Analyzed Information from Dark Data
(Data which Lies Beneath the Surface of an Iceberg)
Rahul P1, Ganeshan M2
1Master of Computer Application,
2Department of Computer Science and Information Technology,
1,2Jain University, JGI, Bengaluru, Karnataka, India
ABSTRACT
The world is surrounded by data and data, the data may be structured,
unstructured, or semi-structured; every organization generates enormous
data daily, only the tip of data is analyzed, and the larger the data is ignored
from the utilizable analysis. This paper focuses on a particularly unstructured
and bothersome class of data, termed Dark data. Dark data is not attentively
analyzed, indexed, and stored, so it becomes nearly imperceptibletopotential
users and therefore is more likely to last neutralized and eventually lost. This
paper discusses how the concepts of long-term specificallyuseofanalyzedfor
all intents and purposes dark data can be used to generally understand the
very possible solutions for better curation of dark data in a major way. This
paper describes why this class of data is so critical to scientific progress,some
of the properties of this dark data, as well as the technical difficulties to useful
management of this class of data. Many probable useful institutional and
technical solutions are under development which will show in this paper in
the last section, but these solutions are mainly conceptual and require
additional research during lack of resources.
KEYWORDS: Dark Data; Big Data Analytics, StructuredData, UnstructuredData,
and Semi-Structure Data
How to cite this paper: Rahul P |
Ganeshan M "Extract the Analyzed
Information from Dark Data"Publishedin
International Journal
of Trend in Scientific
Research and
Development
(ijtsrd), ISSN: 2456-
6470, Volume-4 |
Issue-4, June 2020,
pp.26-29, URL:
www.ijtsrd.com/papers/ijtsrd30842.pdf
Copyright © 2020 by author(s) and
International Journal ofTrendinScientific
Research and Development Journal. This
is an Open Access article distributed
under the terms of
the Creative
CommonsAttribution
License (CC BY 4.0)
(http://creativecommons.org/licenses/by
/4.0)
I. INTRODUCTION
Dark data is the digital information that is not being utilized.
Consulting and market research company Gartner Inc.
describes dark data as "information assets that an
organization amasses, processes and stores in the course of
its conventional business activity,butgenerallyfailstoutilize
for other purposes."
Many times,an organizationmay for all intentsandpurposes
leave data generally dark for all intents and purposes
practical reasons in a definitely big way. The data may be
dirty and by the time it can for all intents and purposes be
scrubbed, the information may be too old to be subsidiary in
a subtle way. In such a scenario, records may, for the most
part, contain incomplete or specifically pass data, really be
parsed incorrectly or essentially be stored in file formats or
on contrivances that generally have mostly becomeobsolete,
which definitely is fairly significant.
Increasingly, the term actuallydark data particularlyisbeing
associated with immensely colossal data and operational
data.
Examples kind of include server log files that could definitely
provide clues to website visitor deportment, the customer
definitely call detail records that incorporate unstructured
consumer sentiment data, and mobile geolocation data that
could definitely reveal traffic patterns that would avail with
business orchestrating, or so they for the most part though.
Potentially, this type of generally dark data can actually be
habituated to drive incipient revenue sources, eliminate
waste, and reduce costs. As a result, actually many
organizations that store pretty dark data for regulatory
compliance purposes generally are utilizing Hadoop to
basically identify subsidiary for all intentsandpurposesdark
bits and map them to possible business essentially uses in a
big way.
By pretty many estimates, very dark data mostly makes up
around 80% of the basic total contentinanyorganizationina
subtle way. It can generally be defined as any data which
really has been engendered, but now really lies dormant and
dormant. As a result of standard day-to-day processes,
particularly dark data is all the content that for the most part
is left behind, enubilated in systems and servers, and
underused or forgotten about, which is quite significant.
The data that lies beneath what you know about your data is
only the tip of the iceberg. Approximately 1/8ofaniceberg's
total massisvisibleabovewater.Theremaining7/8stretches
into the deep ocean blue, hidden from view. The iceberg
principle holds true for digital data. Today, organizations
know they have a big data problem, but they don't knowjust
how big it is. Organizations collect data fromvarioussources
and use it to understand their customers and grow their
business. This type of data is called structured data. E.g. -
IJTSRD30842
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 27
When a cashier swipes a consumer's credit card at the point
of sale, a slew of data is generated and stored neatly in a
database for later analysis and Every time a user clicks on a
link on a website [1], data is generated and later analyzed to
determine browsing behavior and buying patterns.
Companies currently only analyze 10% of the data they
collect. According to the international data corporation,10%
of digital data is structured—leaving 90% unused,
unstructured, and hidden.[2]
Fig- 1 – Graphical view of dark data under big data and
traditional enterprise data.[2]
II. HOW DOES DATA GO DARK
A. Post-purchase, a consumer speaks his or her mindabout
a website's lack of mobile-friendliness by filling out the
optional "how was your shopping experience “field and
clicks submit. Reasons
B. Although structured data is typically filed neatly into
tables in databases, unstructured data doesn't always
have a defined destination or format. In this case, the
company's website is not equipped with the technology
to responsibly storetheconsumer'scomments.Whynot?
C. The consumer's rant gets stored in a variety of locations,
where it's nearly impossible to analyze.
D. Why Data Goes Dark:
Too much data, not enough analysis only 39 % of data is
analyzed in organizations.
25% of data can only access structured data sets by
organizations.,
13 % data is there but tools can't make sense of it.[1].
7.5 Sextillion gigabytes of data are generated worldwide
every single day If the iceberg principle holds true, that
means 6.75 Septillion megabytes of data will go dark.[1]
III. RISK RELATED TO DARK DATA
Gartner refers to this data as "dark data"—the information
that organizations collect, process, and store during regular
business activities, but generally fail to use for other
purposes.Saving
Regularly audit and prune the database in a kind ofbig
way. This kind of means that you should literally be
structuring or assigning categories totheolddatasothat
you particularly know what kind of data generally is
stored and where which essentially is fairly significant.
You definitely do not essentially have to dump that data
subtly. With storage becoming inexpensive, there is no
need to definitely dump data, which specifically is quite
significant. Later, you may suddenly need the data and
since youdark data that’s outlived its shelf life and utility
to your business is sloppy at best and dangerous at
worst. [3]
Security –
The kind of more dark data you generally retain particularly
meansyouhavekindofmoretoprotect—anddefinitelymore
at risk if a breach occurs, contrary to popular belief. Old files
that may not particularly seem very important to you might
be extremely interesting and valuable to a company insider
or an external attacker who basically is looking for
information to leverage for personal, political, or monetary
gain, which generally is fairly significant.
Compliance Issues –
If sort of your generally dark data holds, for example, a Word
document with employee PII or an Excel really file with
customer payment information, actually your organization
may mostly be violating regulations pretty such as GDPR,
HIPAA, SOX, PCI-DSS and others in a sort of big way.
Unfortunately, kind of many companies doesn’t essentially
know this data mostly is evenon their network andreallyfail
to secure it in a kind of big way. In the event of a breach,
attackers will zero-in on this content, and regulators will
demand answers.
Hybrid Storage Concerns –
Most enterprises store data both on-premises and in the
cloud, which can actually make it harder to literally secure
fairly sensitive information on a need-to-know basis. Cloud
storage is convenient but often lacks the security controls
organizations, for the most part, have grown to really expect
in their - premises data stores, basically contrary to popular
belief. At the same time, don’t literally forget generally your
on-premises data in a subtle way. Many companies taking a
cloud-first approach will basically continue to store
information on generally physical servers. Security controls
and measures typically mostlyprotect cloud or on-premises
data storage, but not both – you need to for all intents and
purposes understand the limitations and capabilities of both
environments, for the most part, lockdown actually your
security and definitely monitor both environments for
threats, which actually is quite significant.
IV. RELATED WORK
Unused data may particularly render some of it redundant
over time, or so they particularly thought. Also, itisdefinitely
unlikely that all of the actual dark data will for all intents and
purposes be valuable in a for all intents and purposes major
way. So, you should neither actually toss out all of the actual
dark data nor definitely consider all of it a goldmine. Here
really are some ways to get the very much the best out of
generally dark data in a subtle way.
Apply for all intents and purposes of strong encryption
standards on the data in a major way. This should be
applicable both for data sitting in the pretty in-house
servers and the cloud storage in a, particularly big way.
Encryption can generally prevent a lot of securityissues
with that, which is quite significant.
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 28
Have data retention and very safe disposal policies in
place, fairly contrary to popular belief. The policies
should essentially be aligned with the prescriptions of
the Department of Defense. Carefully formulate policies
identifying data for erasure or destruction, pretty
contrary to popular belief. Good retention policies will
actually help you retain valuabledataforlateruse,which
definitely is quite significant.
V. BENEFITS OF EXTRACTING DARK DATA
Officialdoms pull out dark data incur an expense and spend
considerable engineering effort, but there are many benefits
to doing this
Dark data is valuable:
Dark data is valuable because it often holds data that is not
available in any other presentation. Thus, organizations
continue to pay the cost of gathering and storing dark data
forcompliance purposes and hopes of exploiting the data in
the future.
Because of this value, organizations sometimes resort to
human resources to extract physically extract and interpret
the data, and then enter it into a relational database, even
though this process is expensive,slow,anderror-prone.Deep
learningtechnologiesperformdarkdataextractionfasterand
with much better accuracy than humanbeings.itwillbecome
less expensive and uses less engineering effort when using
these techniques and tools.
Better-quality analytics
With access to better sources and more data, the quality of
analyticsimprovesvividly.Notonlyisthe examinationbased
on a larger pool of high-quality data, but the data is available
for analysis on time. The result is faster and better
information- driven decision making, which in turn leads to
business and functioning success.
Reduced cost and vulnerability
Extractingdark dataleavesofficialdomslessexposedtorisks
and liability in securing sensitivedata.Organizationscanalso
securely purge unnecessary data, thereby reducing the
repeated Storage and curation costs regulatory compliance
also becomes easier.
VI. PROPOSED WORK
There specifically are three actually key steps to getting the
most from dark data: finding, reviewing, and determining
value, pretty contrary to popular belief.Findingitspecifically
is arguably the most difficult part,andbusinesseswillneedto
mostly employ multiple methods and resources, particularly
such as in a really big way.
Getting administrative access to everything, including all
servers, hard drives, and any very other storagefacilitiesuse
subtly.
Searching for all actual file types and folder, for all
intents.
Categorizing by type and identifying ownership in a
particular way.
Reporting on both usage and purpose
A. Institutional Solutions –
One solution for all intents and purposes is to create science
data centers around sort of individual disciplines. Many
initiatives within federal agencies fairly such as NSF, NASA,
NOAA, and by individual fairly principal investigators
generally are already addressing these issues, basically
contrarytopopularbelief.For example, at the organizational
level, NSF funded the creation of the National Center for
Ecological Analysis and Synthesis (NCEAS)
(http://www.nceas.ucsb.edu/)inparttoaddressdataissues,
which actually is quite significant. The mission of NCEAS is
threefold,whichspecificallyisfairlysignificant.First,advance
the state of ecological knowledge through the search for
definitely general patterns and principles in existing data,
which basically is fairly significant. Second, essentially
organize and synthesize ecological information in a manner
useful to researchers, resource managers, and policymakers
addressing important environmental issues.
B. Promising Approaches –
As with generally many of the barriers to optimal data use,
actually many institutions need to beinvolvedinestablishing
a basically professional reward structure for scientists to
mostly participate.[5] Sharing and long-term preservationof
data should kind of lead to very professional success in a
subtle way. Scientists currently definitely get credit for the
citation of their published papers, which generally is quite
significant. Similar credit for data use will mostly require a
change in the sociology ofscience where data citationmostly
is given scholarly value, or so they essentially thought. The
publishing industry including, for example, Nature and
Science is already beginning to, for the most part, provide a
solution by allowing data to really be connected with
publications subtly. However, space limits, format control,
and indexing of data remain a generally major problem in an
actual major way. Institutional and disciplinary repositories
need to definitelyprovidefacilitiessothatcitationscanreturn
thesamedata setthatwasused inthecitationwithoutadding
or deleting records, orsotheygenerallythought.Theservices
and organizations listed above and ones like them are
working on solutions to the barriers to really effective data
use enumerated above, or so they thought. This section lists
someofthesolutionsandtheorganizationsworking on them
(see Table 1[6]).
Barrier Potential Solution
Lack of Professional
Reward Structure
Funding Body Requirements Data Citation, Requirements, Data Citation Index, Replace or
Educate the Old, Guard
Lack of Financial Reward
Structure
Funding Initiatives: NSF DataNet, INTEROP IMLS Data Curation Initiative
Undervaluation / Lack of
Investment
Public and Private Foundation Initiatives Sociology of Science Research
Lack of Education in Data
Curation
Formal Education Program
Intellectual Property
Rights (IPR)
Formal Education Programs Science Commons
International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470
@ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 29
Lack of Metadata
Standards and
Metadata Working Groups, Metacat Creation Tools
Lack of Sustainable
Technology
DataNet
Cost of Infrastructure
Creation
Data Repositories Cyberinfrastructure Development (OCI, eScience) Metadata Tool
Development Research Initiative e.g. DataNet Publishers, Data Federation Technology
Cost of Infrastructure
Maintenance
Long Term Collaborations and Institutionalization and Economies of Scale (Babble) PDF,
Excel, MS Word, Open Formats, translation tools, migration tools (e.g. Fedora) ArcView,
Floppy Disks
Table 1- Potential Solutions for Problems
CONCLUSION
Data specifically is the underpinningofthescientificmethod.
Without data to back up theory science becomes
ungrounded conjecture, which mostly is quite significant.
While the majority of data from large scientific enterprises
particularly is well-curated, there is little scientific
infrastructure in place to support the storage and reuse of
data created by kind of smaller projects. To maximize our
return on investment in scientific research we need to
develop this science infrastructure through existing
institutions very such as libraries and museums that kind of
have traditionally been the guardians of scholarly
productivity. It’s important to act when the benefits of use
actually outweigh the costs of accessing and analyzing very
dark data, contrary to popular belief. Locate, generally
organize, and literally understand data to generally unlock
its relevance and usefulness. There may kind of be
information you can monetize,theremightnot,butyouwon’t
know until you, for the most part, assess it subtly.that can
essentially provide knowledge,whichisparticularlyturncan
basically be used to generate profit subtly. By utilizing new
technologies around business intelligence and IT tools,
companies can particularly joinstructuredandunstructured
data setstogethertospecifically provide high- value results,
which kind of is quite significant. When done correctly, the
benefits will easily mostly outweigh the costs involved with
mining very dark data, which particularlyisfairlysignificant.
REFERENCES
[1] http://lucidworks.com/darkdata/
[2] https://www.sciencedirect.com/scie
nce/article/pii/S0960077916301515
[3] https://www.datacenterknowledge.com/industry-
perspectives/dark-data -putting-your-orga nization-risk
[4] https://developer.ibm.com/technologies/analytics/articl
es/ba- data-becomes-knowledge-3/
[5] Text Big Data Analytics: exploring API opportunity
Internet as Global storage – how to get the situation
awareness from Dark Data Olga Kolesnichenko
Security Analysis Bulletin Moscow, Russia
oykolesnichenko@list.ruDariya Yakovleva Vladivostok
State University of Economics and Service
Vladivostok,Russia darya.yakovleva15@vvsu.ru Oleg
Zhurenkov Altai Academy of Economics and Law
Barnaul, Russia zhur@pie -aael.ru
[6] Dark Data: Are We SolvingtheRightProblems?Michael
Cafarella1, Ihab F. Ilyas2, Marcel Kornacker3, Tim
Kraska4 and Christopher Ré51University of Michigan,
USA 2University of Waterloo, USA 3Cloudera, USA
4Brown University, USA 5Stanford University, USA
michjc@umich.edu, ilyas@uwaterloo.ca,
marcel@cloudera.com, tim kraska@brown.edu,
chrismre@cs.stanford.edu
[7] Shedding Light on the Dark Data in the Long Tail of
Science, P. Bryan Heidorn, LibraryTrends,Volume57,
Number 2, Fall 2008, pp. 280-299 (Article),Published
by The Johns Hopkins University Press DOI:
10.1353/lib.0.0036

More Related Content

What's hot

Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Happiest Minds Technologies
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsDatameer
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIMC Institute
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Edureka!
 
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...Usama Fayyad
 
IABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveIABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveMateusz Maj
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveKun Le
 
Connectivity to business outcomes
Connectivity to business outcomesConnectivity to business outcomes
Connectivity to business outcomesAndrey Karpov
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014pietvz
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data FundamentalsSmarak Das
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightJyrki Määttä
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesGregg Barrett
 
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Findwise
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A ReviewIRJET Journal
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementTony Bain
 

What's hot (20)

Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
Whitepaper: Big Data 101 - Creating Real Value from the Data Lifecycle - Happ...
 
Fight Fraud with Big Data Analytics
Fight Fraud with Big Data AnalyticsFight Fraud with Big Data Analytics
Fight Fraud with Big Data Analytics
 
Introduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data ScienceIntroduction to Data Mining, Business Intelligence and Data Science
Introduction to Data Mining, Business Intelligence and Data Science
 
Taming the data beast
Taming the data beastTaming the data beast
Taming the data beast
 
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
Big Data Tutorial For Beginners | What Is Big Data | Big Data Tutorial | Hado...
 
Big data Analytics
Big data Analytics Big data Analytics
Big data Analytics
 
Hadoop Overview
Hadoop OverviewHadoop Overview
Hadoop Overview
 
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
Keynote talk at Financial Times Forum - BigData and Advanced Analytics at SIB...
 
IABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspectiveIABE Big Data information paper - An actuarial perspective
IABE Big Data information paper - An actuarial perspective
 
IBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep diveIBM-Infoworld Big Data deep dive
IBM-Infoworld Big Data deep dive
 
Connectivity to business outcomes
Connectivity to business outcomesConnectivity to business outcomes
Connectivity to business outcomes
 
IDOL presentation
IDOL presentationIDOL presentation
IDOL presentation
 
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
Take the Big Data Challenge - Take Advantage of ALL of Your Data 16 Sept 2014
 
Analytics3.0 e book
Analytics3.0 e bookAnalytics3.0 e book
Analytics3.0 e book
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
Big data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sightBig data-comes-of-age ema-9sight
Big data-comes-of-age ema-9sight
 
Big Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and ChallengesBig Data: Opportunities, Strategy and Challenges
Big Data: Opportunities, Strategy and Challenges
 
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
Enterprise Search White Paper: Beyond the Enterprise Data Warehouse - The Eme...
 
Big data – A Review
Big data – A ReviewBig data – A Review
Big data – A Review
 
Big Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data ManagementBig Data, NoSQL, NewSQL & The Future of Data Management
Big Data, NoSQL, NewSQL & The Future of Data Management
 

Similar to Extract the Analyzed Information from Dark Data

Dark Data Revelation and its Potential Benefits
Dark Data Revelation and its Potential BenefitsDark Data Revelation and its Potential Benefits
Dark Data Revelation and its Potential BenefitsPromptCloud
 
Big Data A Review
Big Data A ReviewBig Data A Review
Big Data A Reviewijtsrd
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big DataIRJET Journal
 
Solving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteSolving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteLindaWatson19
 
Using a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessUsing a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessDataWorks Summit/Hadoop Summit
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A StudyIRJET Journal
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big dataDigimark
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataAkshata Humbe
 
veritas-strike-global-report_a4-sdc2
veritas-strike-global-report_a4-sdc2veritas-strike-global-report_a4-sdc2
veritas-strike-global-report_a4-sdc2Marius Ghinea
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online caniceconsulting
 
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...IRJET Journal
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)Shahbaz Anjam
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data scienceVipul Kalamkar
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET Journal
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1Parviz Vakili
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellenceMudit Mangal
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...IT Support Engineer
 

Similar to Extract the Analyzed Information from Dark Data (20)

Dark Data Revelation and its Potential Benefits
Dark Data Revelation and its Potential BenefitsDark Data Revelation and its Potential Benefits
Dark Data Revelation and its Potential Benefits
 
Big Data Analytics
Big Data AnalyticsBig Data Analytics
Big Data Analytics
 
Big Data A Review
Big Data A ReviewBig Data A Review
Big Data A Review
 
Analysis of Big Data
Analysis of Big DataAnalysis of Big Data
Analysis of Big Data
 
Solving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data SuiteSolving The Data Growth Crisis: Solix Big Data Suite
Solving The Data Growth Crisis: Solix Big Data Suite
 
Using a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance businessUsing a Data Lake at the core of a Life Assurance business
Using a Data Lake at the core of a Life Assurance business
 
unit-4-notes.pdf
unit-4-notes.pdfunit-4-notes.pdf
unit-4-notes.pdf
 
IRJET- Big Data: A Study
IRJET-  	  Big Data: A StudyIRJET-  	  Big Data: A Study
IRJET- Big Data: A Study
 
Ab cs of big data
Ab cs of big dataAb cs of big data
Ab cs of big data
 
Big data assignment
Big data assignmentBig data assignment
Big data assignment
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
veritas-strike-global-report_a4-sdc2
veritas-strike-global-report_a4-sdc2veritas-strike-global-report_a4-sdc2
veritas-strike-global-report_a4-sdc2
 
Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online Module 6 The Future of Big and Smart Data- Online
Module 6 The Future of Big and Smart Data- Online
 
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...Big data analytics in Business Management and Businesss Intelligence: A Lietr...
Big data analytics in Business Management and Businesss Intelligence: A Lietr...
 
Big data (word file)
Big data  (word file)Big data  (word file)
Big data (word file)
 
Embracing data science
Embracing data scienceEmbracing data science
Embracing data science
 
IRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its ChallengesIRJET - Big Data Analysis its Challenges
IRJET - Big Data Analysis its Challenges
 
Intro to big data and applications - day 1
Intro to big data and applications - day 1Intro to big data and applications - day 1
Intro to big data and applications - day 1
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
 
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...Nuestar "Big Data Cloud" Major Data Center Technology  nuestarmobilemarketing...
Nuestar "Big Data Cloud" Major Data Center Technology nuestarmobilemarketing...
 

More from ijtsrd

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementationijtsrd
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...ijtsrd
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospectsijtsrd
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...ijtsrd
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...ijtsrd
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...ijtsrd
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Studyijtsrd
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...ijtsrd
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...ijtsrd
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...ijtsrd
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...ijtsrd
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...ijtsrd
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadikuijtsrd
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...ijtsrd
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...ijtsrd
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Mapijtsrd
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Societyijtsrd
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...ijtsrd
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...ijtsrd
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learningijtsrd
 

More from ijtsrd (20)

‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation‘Six Sigma Technique’ A Journey Through its Implementation
‘Six Sigma Technique’ A Journey Through its Implementation
 
Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...Edge Computing in Space Enhancing Data Processing and Communication for Space...
Edge Computing in Space Enhancing Data Processing and Communication for Space...
 
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and ProspectsDynamics of Communal Politics in 21st Century India Challenges and Prospects
Dynamics of Communal Politics in 21st Century India Challenges and Prospects
 
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
Assess Perspective and Knowledge of Healthcare Providers Towards Elehealth in...
 
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...The Impact of Digital Media on the Decentralization of Power and the Erosion ...
The Impact of Digital Media on the Decentralization of Power and the Erosion ...
 
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
Online Voices, Offline Impact Ambedkars Ideals and Socio Political Inclusion ...
 
Problems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A StudyProblems and Challenges of Agro Entreprenurship A Study
Problems and Challenges of Agro Entreprenurship A Study
 
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
Comparative Analysis of Total Corporate Disclosure of Selected IT Companies o...
 
The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...The Impact of Educational Background and Professional Training on Human Right...
The Impact of Educational Background and Professional Training on Human Right...
 
A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...A Study on the Effective Teaching Learning Process in English Curriculum at t...
A Study on the Effective Teaching Learning Process in English Curriculum at t...
 
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
The Role of Mentoring and Its Influence on the Effectiveness of the Teaching ...
 
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
Design Simulation and Hardware Construction of an Arduino Microcontroller Bas...
 
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. SadikuSustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
Sustainable Energy by Paul A. Adekunte | Matthew N. O. Sadiku | Janet O. Sadiku
 
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
Concepts for Sudan Survey Act Implementations Executive Regulations and Stand...
 
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
Towards the Implementation of the Sudan Interpolated Geoid Model Khartoum Sta...
 
Activating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment MapActivating Geospatial Information for Sudans Sustainable Investment Map
Activating Geospatial Information for Sudans Sustainable Investment Map
 
Educational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger SocietyEducational Unity Embracing Diversity for a Stronger Society
Educational Unity Embracing Diversity for a Stronger Society
 
Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...Integration of Indian Indigenous Knowledge System in Management Prospects and...
Integration of Indian Indigenous Knowledge System in Management Prospects and...
 
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
DeepMask Transforming Face Mask Identification for Better Pandemic Control in...
 
Streamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine LearningStreamlining Data Collection eCRF Design and Machine Learning
Streamlining Data Collection eCRF Design and Machine Learning
 

Recently uploaded

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfchloefrazer622
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 

Recently uploaded (20)

SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 

Extract the Analyzed Information from Dark Data

  • 1. International Journal of Trend in Scientific Research and Development (IJTSRD) Volume 4 Issue 4, June 2020 Available Online: www.ijtsrd.com e-ISSN: 2456 – 6470 @ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 26 Extract the Analyzed Information from Dark Data (Data which Lies Beneath the Surface of an Iceberg) Rahul P1, Ganeshan M2 1Master of Computer Application, 2Department of Computer Science and Information Technology, 1,2Jain University, JGI, Bengaluru, Karnataka, India ABSTRACT The world is surrounded by data and data, the data may be structured, unstructured, or semi-structured; every organization generates enormous data daily, only the tip of data is analyzed, and the larger the data is ignored from the utilizable analysis. This paper focuses on a particularly unstructured and bothersome class of data, termed Dark data. Dark data is not attentively analyzed, indexed, and stored, so it becomes nearly imperceptibletopotential users and therefore is more likely to last neutralized and eventually lost. This paper discusses how the concepts of long-term specificallyuseofanalyzedfor all intents and purposes dark data can be used to generally understand the very possible solutions for better curation of dark data in a major way. This paper describes why this class of data is so critical to scientific progress,some of the properties of this dark data, as well as the technical difficulties to useful management of this class of data. Many probable useful institutional and technical solutions are under development which will show in this paper in the last section, but these solutions are mainly conceptual and require additional research during lack of resources. KEYWORDS: Dark Data; Big Data Analytics, StructuredData, UnstructuredData, and Semi-Structure Data How to cite this paper: Rahul P | Ganeshan M "Extract the Analyzed Information from Dark Data"Publishedin International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456- 6470, Volume-4 | Issue-4, June 2020, pp.26-29, URL: www.ijtsrd.com/papers/ijtsrd30842.pdf Copyright © 2020 by author(s) and International Journal ofTrendinScientific Research and Development Journal. This is an Open Access article distributed under the terms of the Creative CommonsAttribution License (CC BY 4.0) (http://creativecommons.org/licenses/by /4.0) I. INTRODUCTION Dark data is the digital information that is not being utilized. Consulting and market research company Gartner Inc. describes dark data as "information assets that an organization amasses, processes and stores in the course of its conventional business activity,butgenerallyfailstoutilize for other purposes." Many times,an organizationmay for all intentsandpurposes leave data generally dark for all intents and purposes practical reasons in a definitely big way. The data may be dirty and by the time it can for all intents and purposes be scrubbed, the information may be too old to be subsidiary in a subtle way. In such a scenario, records may, for the most part, contain incomplete or specifically pass data, really be parsed incorrectly or essentially be stored in file formats or on contrivances that generally have mostly becomeobsolete, which definitely is fairly significant. Increasingly, the term actuallydark data particularlyisbeing associated with immensely colossal data and operational data. Examples kind of include server log files that could definitely provide clues to website visitor deportment, the customer definitely call detail records that incorporate unstructured consumer sentiment data, and mobile geolocation data that could definitely reveal traffic patterns that would avail with business orchestrating, or so they for the most part though. Potentially, this type of generally dark data can actually be habituated to drive incipient revenue sources, eliminate waste, and reduce costs. As a result, actually many organizations that store pretty dark data for regulatory compliance purposes generally are utilizing Hadoop to basically identify subsidiary for all intentsandpurposesdark bits and map them to possible business essentially uses in a big way. By pretty many estimates, very dark data mostly makes up around 80% of the basic total contentinanyorganizationina subtle way. It can generally be defined as any data which really has been engendered, but now really lies dormant and dormant. As a result of standard day-to-day processes, particularly dark data is all the content that for the most part is left behind, enubilated in systems and servers, and underused or forgotten about, which is quite significant. The data that lies beneath what you know about your data is only the tip of the iceberg. Approximately 1/8ofaniceberg's total massisvisibleabovewater.Theremaining7/8stretches into the deep ocean blue, hidden from view. The iceberg principle holds true for digital data. Today, organizations know they have a big data problem, but they don't knowjust how big it is. Organizations collect data fromvarioussources and use it to understand their customers and grow their business. This type of data is called structured data. E.g. - IJTSRD30842
  • 2. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 27 When a cashier swipes a consumer's credit card at the point of sale, a slew of data is generated and stored neatly in a database for later analysis and Every time a user clicks on a link on a website [1], data is generated and later analyzed to determine browsing behavior and buying patterns. Companies currently only analyze 10% of the data they collect. According to the international data corporation,10% of digital data is structured—leaving 90% unused, unstructured, and hidden.[2] Fig- 1 – Graphical view of dark data under big data and traditional enterprise data.[2] II. HOW DOES DATA GO DARK A. Post-purchase, a consumer speaks his or her mindabout a website's lack of mobile-friendliness by filling out the optional "how was your shopping experience “field and clicks submit. Reasons B. Although structured data is typically filed neatly into tables in databases, unstructured data doesn't always have a defined destination or format. In this case, the company's website is not equipped with the technology to responsibly storetheconsumer'scomments.Whynot? C. The consumer's rant gets stored in a variety of locations, where it's nearly impossible to analyze. D. Why Data Goes Dark: Too much data, not enough analysis only 39 % of data is analyzed in organizations. 25% of data can only access structured data sets by organizations., 13 % data is there but tools can't make sense of it.[1]. 7.5 Sextillion gigabytes of data are generated worldwide every single day If the iceberg principle holds true, that means 6.75 Septillion megabytes of data will go dark.[1] III. RISK RELATED TO DARK DATA Gartner refers to this data as "dark data"—the information that organizations collect, process, and store during regular business activities, but generally fail to use for other purposes.Saving Regularly audit and prune the database in a kind ofbig way. This kind of means that you should literally be structuring or assigning categories totheolddatasothat you particularly know what kind of data generally is stored and where which essentially is fairly significant. You definitely do not essentially have to dump that data subtly. With storage becoming inexpensive, there is no need to definitely dump data, which specifically is quite significant. Later, you may suddenly need the data and since youdark data that’s outlived its shelf life and utility to your business is sloppy at best and dangerous at worst. [3] Security – The kind of more dark data you generally retain particularly meansyouhavekindofmoretoprotect—anddefinitelymore at risk if a breach occurs, contrary to popular belief. Old files that may not particularly seem very important to you might be extremely interesting and valuable to a company insider or an external attacker who basically is looking for information to leverage for personal, political, or monetary gain, which generally is fairly significant. Compliance Issues – If sort of your generally dark data holds, for example, a Word document with employee PII or an Excel really file with customer payment information, actually your organization may mostly be violating regulations pretty such as GDPR, HIPAA, SOX, PCI-DSS and others in a sort of big way. Unfortunately, kind of many companies doesn’t essentially know this data mostly is evenon their network andreallyfail to secure it in a kind of big way. In the event of a breach, attackers will zero-in on this content, and regulators will demand answers. Hybrid Storage Concerns – Most enterprises store data both on-premises and in the cloud, which can actually make it harder to literally secure fairly sensitive information on a need-to-know basis. Cloud storage is convenient but often lacks the security controls organizations, for the most part, have grown to really expect in their - premises data stores, basically contrary to popular belief. At the same time, don’t literally forget generally your on-premises data in a subtle way. Many companies taking a cloud-first approach will basically continue to store information on generally physical servers. Security controls and measures typically mostlyprotect cloud or on-premises data storage, but not both – you need to for all intents and purposes understand the limitations and capabilities of both environments, for the most part, lockdown actually your security and definitely monitor both environments for threats, which actually is quite significant. IV. RELATED WORK Unused data may particularly render some of it redundant over time, or so they particularly thought. Also, itisdefinitely unlikely that all of the actual dark data will for all intents and purposes be valuable in a for all intents and purposes major way. So, you should neither actually toss out all of the actual dark data nor definitely consider all of it a goldmine. Here really are some ways to get the very much the best out of generally dark data in a subtle way. Apply for all intents and purposes of strong encryption standards on the data in a major way. This should be applicable both for data sitting in the pretty in-house servers and the cloud storage in a, particularly big way. Encryption can generally prevent a lot of securityissues with that, which is quite significant.
  • 3. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 28 Have data retention and very safe disposal policies in place, fairly contrary to popular belief. The policies should essentially be aligned with the prescriptions of the Department of Defense. Carefully formulate policies identifying data for erasure or destruction, pretty contrary to popular belief. Good retention policies will actually help you retain valuabledataforlateruse,which definitely is quite significant. V. BENEFITS OF EXTRACTING DARK DATA Officialdoms pull out dark data incur an expense and spend considerable engineering effort, but there are many benefits to doing this Dark data is valuable: Dark data is valuable because it often holds data that is not available in any other presentation. Thus, organizations continue to pay the cost of gathering and storing dark data forcompliance purposes and hopes of exploiting the data in the future. Because of this value, organizations sometimes resort to human resources to extract physically extract and interpret the data, and then enter it into a relational database, even though this process is expensive,slow,anderror-prone.Deep learningtechnologiesperformdarkdataextractionfasterand with much better accuracy than humanbeings.itwillbecome less expensive and uses less engineering effort when using these techniques and tools. Better-quality analytics With access to better sources and more data, the quality of analyticsimprovesvividly.Notonlyisthe examinationbased on a larger pool of high-quality data, but the data is available for analysis on time. The result is faster and better information- driven decision making, which in turn leads to business and functioning success. Reduced cost and vulnerability Extractingdark dataleavesofficialdomslessexposedtorisks and liability in securing sensitivedata.Organizationscanalso securely purge unnecessary data, thereby reducing the repeated Storage and curation costs regulatory compliance also becomes easier. VI. PROPOSED WORK There specifically are three actually key steps to getting the most from dark data: finding, reviewing, and determining value, pretty contrary to popular belief.Findingitspecifically is arguably the most difficult part,andbusinesseswillneedto mostly employ multiple methods and resources, particularly such as in a really big way. Getting administrative access to everything, including all servers, hard drives, and any very other storagefacilitiesuse subtly. Searching for all actual file types and folder, for all intents. Categorizing by type and identifying ownership in a particular way. Reporting on both usage and purpose A. Institutional Solutions – One solution for all intents and purposes is to create science data centers around sort of individual disciplines. Many initiatives within federal agencies fairly such as NSF, NASA, NOAA, and by individual fairly principal investigators generally are already addressing these issues, basically contrarytopopularbelief.For example, at the organizational level, NSF funded the creation of the National Center for Ecological Analysis and Synthesis (NCEAS) (http://www.nceas.ucsb.edu/)inparttoaddressdataissues, which actually is quite significant. The mission of NCEAS is threefold,whichspecificallyisfairlysignificant.First,advance the state of ecological knowledge through the search for definitely general patterns and principles in existing data, which basically is fairly significant. Second, essentially organize and synthesize ecological information in a manner useful to researchers, resource managers, and policymakers addressing important environmental issues. B. Promising Approaches – As with generally many of the barriers to optimal data use, actually many institutions need to beinvolvedinestablishing a basically professional reward structure for scientists to mostly participate.[5] Sharing and long-term preservationof data should kind of lead to very professional success in a subtle way. Scientists currently definitely get credit for the citation of their published papers, which generally is quite significant. Similar credit for data use will mostly require a change in the sociology ofscience where data citationmostly is given scholarly value, or so they essentially thought. The publishing industry including, for example, Nature and Science is already beginning to, for the most part, provide a solution by allowing data to really be connected with publications subtly. However, space limits, format control, and indexing of data remain a generally major problem in an actual major way. Institutional and disciplinary repositories need to definitelyprovidefacilitiessothatcitationscanreturn thesamedata setthatwasused inthecitationwithoutadding or deleting records, orsotheygenerallythought.Theservices and organizations listed above and ones like them are working on solutions to the barriers to really effective data use enumerated above, or so they thought. This section lists someofthesolutionsandtheorganizationsworking on them (see Table 1[6]). Barrier Potential Solution Lack of Professional Reward Structure Funding Body Requirements Data Citation, Requirements, Data Citation Index, Replace or Educate the Old, Guard Lack of Financial Reward Structure Funding Initiatives: NSF DataNet, INTEROP IMLS Data Curation Initiative Undervaluation / Lack of Investment Public and Private Foundation Initiatives Sociology of Science Research Lack of Education in Data Curation Formal Education Program Intellectual Property Rights (IPR) Formal Education Programs Science Commons
  • 4. International Journal of Trend in Scientific Research and Development (IJTSRD) @ www.ijtsrd.com eISSN: 2456-6470 @ IJTSRD | Unique Paper ID – IJTSRD30842 | Volume – 4 | Issue – 4 | May-June 2020 Page 29 Lack of Metadata Standards and Metadata Working Groups, Metacat Creation Tools Lack of Sustainable Technology DataNet Cost of Infrastructure Creation Data Repositories Cyberinfrastructure Development (OCI, eScience) Metadata Tool Development Research Initiative e.g. DataNet Publishers, Data Federation Technology Cost of Infrastructure Maintenance Long Term Collaborations and Institutionalization and Economies of Scale (Babble) PDF, Excel, MS Word, Open Formats, translation tools, migration tools (e.g. Fedora) ArcView, Floppy Disks Table 1- Potential Solutions for Problems CONCLUSION Data specifically is the underpinningofthescientificmethod. Without data to back up theory science becomes ungrounded conjecture, which mostly is quite significant. While the majority of data from large scientific enterprises particularly is well-curated, there is little scientific infrastructure in place to support the storage and reuse of data created by kind of smaller projects. To maximize our return on investment in scientific research we need to develop this science infrastructure through existing institutions very such as libraries and museums that kind of have traditionally been the guardians of scholarly productivity. It’s important to act when the benefits of use actually outweigh the costs of accessing and analyzing very dark data, contrary to popular belief. Locate, generally organize, and literally understand data to generally unlock its relevance and usefulness. There may kind of be information you can monetize,theremightnot,butyouwon’t know until you, for the most part, assess it subtly.that can essentially provide knowledge,whichisparticularlyturncan basically be used to generate profit subtly. By utilizing new technologies around business intelligence and IT tools, companies can particularly joinstructuredandunstructured data setstogethertospecifically provide high- value results, which kind of is quite significant. When done correctly, the benefits will easily mostly outweigh the costs involved with mining very dark data, which particularlyisfairlysignificant. REFERENCES [1] http://lucidworks.com/darkdata/ [2] https://www.sciencedirect.com/scie nce/article/pii/S0960077916301515 [3] https://www.datacenterknowledge.com/industry- perspectives/dark-data -putting-your-orga nization-risk [4] https://developer.ibm.com/technologies/analytics/articl es/ba- data-becomes-knowledge-3/ [5] Text Big Data Analytics: exploring API opportunity Internet as Global storage – how to get the situation awareness from Dark Data Olga Kolesnichenko Security Analysis Bulletin Moscow, Russia oykolesnichenko@list.ruDariya Yakovleva Vladivostok State University of Economics and Service Vladivostok,Russia darya.yakovleva15@vvsu.ru Oleg Zhurenkov Altai Academy of Economics and Law Barnaul, Russia zhur@pie -aael.ru [6] Dark Data: Are We SolvingtheRightProblems?Michael Cafarella1, Ihab F. Ilyas2, Marcel Kornacker3, Tim Kraska4 and Christopher Ré51University of Michigan, USA 2University of Waterloo, USA 3Cloudera, USA 4Brown University, USA 5Stanford University, USA michjc@umich.edu, ilyas@uwaterloo.ca, marcel@cloudera.com, tim kraska@brown.edu, chrismre@cs.stanford.edu [7] Shedding Light on the Dark Data in the Long Tail of Science, P. Bryan Heidorn, LibraryTrends,Volume57, Number 2, Fall 2008, pp. 280-299 (Article),Published by The Johns Hopkins University Press DOI: 10.1353/lib.0.0036