The macro trends in healthcare and the associated career
Big_Data_In_Health
1. BIG DATA IN HEALTH
Divya Juneja (Author)
CSE, CTIEMT
Jalandhar
juneja.divya21@gmail.com
Amita Sharma(Author)
CSE, CTIEMT
Jalandhar
amitasharma706@gmail.com
Navya Beri (Author)
CSE, CTIEMT
Jalandhar
sweet.sour18@gmail.com
Abstract—One of the biggest issue in India is healthcare.
Those people who are living in big cities or metro cities have
obtained the opportunity to use the benefits of efficient health
care as compared to those living in rural parts of the country
and are facing more difficulties of insufficient provisions in
healthcare industry. Although some parts of the country have
more access to the sufficient facilities but still, we are very slow to
delve into this advancement as compared to other countries .
According to some experts and policy makers, there is a huge
chink in skills and knowledge about t he innovations in public
and private health financing and delivery. The incompetent and
inequities in the health care access in India have pushed ahead
the necessity for productive and innovative solutions to
strengthen the same. The troubles prevailing in the industry of
health care yields an important and apparent calls for the need to
alterthe existing structure of the present health care services by
applying big data analytics. This paper identifies the huge
shortage of sufficient health care facilities and addresses how to
use big data in providing greater access to primary health care
services in India . Further, it also addresses the critical
computing and analytical ability of Big Data in processing huge
volumes of transactional data in real time situations . The intent
of this paper is to provide a definition of big data and to present
the transforms in the health care sector and boosts the
discussions on how government can hitch up the innovations in
the big data analytics to upgrade the health care system.
Keywords— Big Data Analytics, HealthCare, EHRs, Tele
Medicine,Electronic Medical Report.
I. INTRODUCTION
India is an innovative nation having more than billion
population, one of the fastest growing country , economically
as well as in the terms of number of population .This increase
in the population saddle the health care industry of India.
Health care in our country is capitalized by government. But,
for those living in rural areas accessing primary health care is
still a challenge. For the developing countries like India, one
of the most important issue to emphasis is the healthcare to
provide better health care access to the priceless human
resources, which in turn can make the India healthier. The
exponential rise of data over the last ten years has brought a
new domain in the era of information technology and data
science called Big Data. The term Big data is often used to
define a large amount of data (both structured and
unstructured) that is massive in volume and very hard to
process using traditional database management techniques .
As the health care production is engulfed with massive
volumes of data which needs authentication and investigation,
Big Data Analytics can be applied. Big data has the ability to
execute critical computing and analytical ability towards the
processing of the massive volumes of transactional data.
Data is rising rapidly than healthcare organizations can digest
it, approximately 85% of health data is unstructured and
clinically relevant. This data is located in multiple places like
individual EMR's, lab ,genomics and imaging data, claims,
CRM systems, PRO's and finance. In addition, the conversion
from paper medical records to EHR systems has led to an
exponential growth of data. The health industry is on the peak
of its most exciting period to date and entering into the new
era where technology is starting to tackle big data , fetching
about unlimited potential for statistical extension. Big Data
analytics are helping to realize the goal of diagnosing ,treating,
helping and healing all patients in need of healthcare, with the
terminating goal of this domain being improved HealthCare
Output(HCO), or the quality of care that healthcare can
provide to end users(i.e. patients).However, the scope of this
study will be a research that uses data mining in order to
answer questions throughout the various levels of health. In
addition to aggregate the huge quantity of data, there’s the
challenge of maintaining the records of the patient's privacy.
Healthcare organizations are leveraging big data technology to
capture all of the information about a patient to get a more
complete view for insight into care coordination and
outcomes-based reimbursement models, population health
management, and patient engagement and outreach.
Successfully harnessing big data unleashes the potential to
2. achieve the three critical objectives for healthcare
transformation:
A. Build sustainable healthcare systems
B. Collaborate to improve care and outcomes
C. Increase access to healthcare
With the adoption of big data in healthcare ,security and
primary concerns of the patients is increasing significantly the
detail information of the patients is recommended to be
stored in data centre’s with different levels of security.
According to the health insurance acts, EHR security must be
taken to the highest priority to ensure safety of the patients.
This paper is structured as follows: in section I, we explain
the existing health care in India, section II discusses about big
data and its characteristics , section III focuses on the
problems in the existing healthcare system, section IV
provides the innovative ideas to provide better health care to
people of India, section V presents the big data analytics in
health care systemand section VI concludes the work.
I. EXISTING HEALTHCARE IN INDIA
Healthcare is the right of each and every person but absence
of standard infrastructure, lack of qualified medical
functionalities, and non- access to basic medical aid and
facilities thwarts its reach to 65% of population in India.
Those parts of the country where the condition of medical
facilities is disgraceful, a shocking number of 700 million
people reside there. Although a lot of strategies and programs
are being controlled by the administration but the prosperity
and productiveness of these programs is controversial due to
vent in the implementation. The main problem in the Indian
health care system is inequity in Healthcare. To overcome this
, the Government of India has introduced the National Rural
Health Mission (NHRM) in 2005. The objective of this
mission is to provide productive healthcare to population of
India. The significance of this mission is to establish an
effective community which is decentralized by the health
delivery system with inter-sectoral convergence at all levels,
to ensure simultaneous action on a wide range of determinants
of health such as water, hygiene, education, nourishment,
social and gender equality. Institutional integration within the
fragmented health sector was expected to provide a focus on
outcomes, measured against Indian Public Health Standards
for all health facilities. Though it is hard to believe, the fact is,
compared to government health sectorthe private health sector
has highly skilled doctors and the facilities are world-class
using the latest laboratory equipment’s and innovative
technology. Though, the aim of National Rural Health Mission
(NRHM) is to provide effective health care to rural
population, the hospitals located in remote villages are not
equipped with well qualified doctors and sophisticated
equipment due to several unknown reasons like the doctors
may not be interested to reside in village to serve the rural
population, lack of funds to procure and supply required
infrastructure to all the remote hospitals. To access the high-
end private sector facilities, people are spending large amounts
of money on treatment which in-turn affects their livelihoods
and slowly poor people are becoming poorer. In such cases,
the concept of Telemedicine along with big data analytics is
the better alternative to treat the patients in rural areas. The
Rural public health care system in India has three different
levels of health care access. At the lowest level, we have Sub
Centre (SC) and on top of that there will be a Primary Health
Centre (PHC). The Health workers at the sub centre first take
note of the vitals and symptoms of the patient, record them
using the Electronic Health Records (EHR’s) in the proposed
system are sorted first by criticality and then by timestamp and
also the proposed system should not allow two doctors at the
same level to handle the case simultaneously.
II. WHAT IS BIG DATA?
Definition: Big Data is a collection of extremely large data
which is hard to process using common database management
tools or traditional database. Big data is arriving from multiple
sources at an alarming velocity, volume and variety.
Big data is changing the way people work together within
the companies. It is a policy of importing, storing and
analyzing data to uncover information that is not known
before. This eruption of the data, changing the way people
think . The huge Indian health care industry needs to harness
healthcare’s “big data” and analyze a complex set of data,
including electronic medical records and sensor data. This
permits clinicians to access and examine healthcare big data
to ascertain quality, resolute best practice, assess aid strategies
and discover patients at risk .Earlier, the type of information
available was confined. A well-defined set of technology was
there to tackle the managing information. But nowadays , the
quantity of data in our world has been expanding . It has
increased from terabytes and petabytes. Big data is distinct
from large existing data stored in various relational databases.
It refers to a collection of large data sets which are very
complex.
The objective of this paper is to impart high quality health
care without any insularity on the foundation of gender, case,
social status and economic status and also points to ensure
better health care to rural people and defeat the problems in
the health care like expired drugs not administered to patients,
deceptive management in the health care. The suggested
concept allows doctors, patients and staff to have role-based
access to information on electronic health records.
A. CHARACTERISTICS OF BIG DATA
Big data is not only carried by the exponential
development of data but also by editing user conduct and
globalization. Globalization creates competitiveness among
the participants in the market. To increase the competitive
advantage, organizations are regularly gazing for
opportunities.
The typical characteristics of the Big data are:
3. Volume: There is an exponential increase in data volume
irrespective of the organization’s size. Vast amount of
data is generated by organizations or individuals in the
form of terabytes, exabytes and zetabytes. Some small
sized organizations may have gigabytes or terabytes of
data storage. Nowadays, the datasets of many of these
companies are within the range of terabytes but, soon
they could reach petabytes or even exabytes. Data which
is generated by machines is massive in volume than the
traditional data.
Velocity: . Velocity is defined as the rate of change in
the data and how quickly it must be used to create real
value . It is the speed at which the data is being generated
like streamed data from different smart gadgets into social
media and also image streamed data which stores the data
in motion from huge number of closed circuit cameras.
Big data expands quickly, which generates unprecedented
quantities that need to be stored, transferred, and
processed quickly .
Variety: Different variety of data are captured. It may be
structured,semi structured or unstructured. Data collection
can be from different sources and forms , it can be sourced
out from e-mails audio and videos. In big data, the variety
of data has risen up, boosted by the use of cloud
computing. Variety makes big data really big. Big data
comes from a great variety of sources and generally it
consists of three types: structured, semi structured and
unstructured. Structured data interpolates data warehouse
that is already tagged and easily sorted but unstructured
data is random and hard to handle. Semi structured data
does not obey to fixed era but includes tags to isolate data
elements .
Veracity: Veracity refers to the biases, noise and
abnormality in data. Is the data that is being stored, and
mined meaningful to the problem being examined. It is the
great challenge in analytics of big data when compared to
other attributes like volume and velocity It can create a
problem , if the data coming in large volume is not
correct, and is of no use. So, it should be correct.
III. PROBLEMS IN EXISTING HEALTHCARE SYSTEM
The health system of India includes public and private
hospitals as well as specialized Ayurvedic hospitals. Health
insurance only covers hospitalization and emergency costs.
India is one of the most populouated country in the world, the
healthcare infrastructure that is over-burdened with this ever
increasing population, a set of the following challenges that
are unique to India arise.
Economic lack in the bigger segment of population
outcomes in poor access to health care and poor
educational status in India, leads to lack of utilization
of primary health services and results in the growth
of risk factors.
The twin epidemic of transpiring infectious diseases
as well as chronic degenerative diseases is faced by
India.
.
Deficiency of education, gender discrimination and
explosive rise in population contributes to rising
burden of diseases resulting in the challenge to the
health sector.
The most important from the government side is,
Expenditure on health by the Government continues
to be low. It is not viewed as an investment but rather
as a dead loss.
. The various types of data can be anticipated from the health
care system from different health science data sources include
data from drug research, social media, patient records, gene
sequencing, test results, claims, home monitoring mobile apps
etc.,
Clinical data including unstructured documents,
prescriptions and images.
Day to day research publications and medical
references
Huge amounts of genomic data for analyzing the
behavior of various .
Web and social networking data on healthcare issues
Streamed data from home monitoring, Telemedicine,
hand held and sensor based wireless device data.
Thus the need for BDA (Big Data Analytics) arises, which
provides clinical decision support through large amounts of
data, personalized care by early detection and diagnosis before
a patient develops disease symptoms, clinical operations with
great accuracy, fraud management in the health sector.
IV. INNOVATIVE IDEAS TO FIX HEALTHCARE IN
INDIA
Now ,we are going to suggest some ideas to fix rural health
care in India and bridge the gap between quality and
affordability in government hospitals. These ideas will enable
4. us to access the services on par with the private super specialty
hospitals. Further, the implementation of these ideas will
provide cheaper, better and easier health care facilities to the
citizens of India.
1. e-Health File: The creation of a e-Health care file for
each patient, where all health care providers and
patients themselves were able to submit information
(with the consent of the patient). To overcome the
information overload from the massive amounts of
data, Big Data Analytics could be employed for the
processing of the data and obtain the desired results
with great accuracy in reasonable time.
2. Creating awareness with chronic diseases: The
system must identify and create awareness among the
people with the common chronic diseases at
particular areas, through which we can prevent
diseases. These chronic diseases are responsible for
the 75% of health care spending due to lack of
awareness and prior care.
3. e-Prescribe: Paper based prescriptions are archaic
and lead to several miseries each year due to errors in
But if every doctor is provided with prescription. an
electronic prescription system, it would improve
safety by making prescriptions easier to read and
providing instant checks on drug interactions,
dosages, and a patient’s medication history.
4. Stop Unnecessary Treatments: Doctors should
avoid trial and error type of medication. The problem
must be examined thoroughly by performing the
required diagnostic tests during the preliminary days
of disease. The right treatment should be suggested at
the first visit only which avoids the disease to
become more critical. Most of the issues are arising
with the misdiagnosis and wrong treatment during the
early stages.
5. Electronic Medical Records: Medical Experts
agree that electronic medical records (EMRs) are a
must for the better health care in India.
V. APPLYING BIG DATA ANALYTICS IN
HEALTH CARE
We reside in the age of big data. The quantity of data
generated in the world upto and including 2005 is now
generated every two days. Big data is a platform for
importing, storing and analyzing data to uncover data not
which is known earlier. This outburst of the data alter the
thought process of the people about everything. However, the
health care has not kept pace with big data. Big Data
Healthcare is the force to capitalize on growing patient and
health system data availability to give rise to healthcare
innovation. By forging smart use of the ever-increasing
quantity of data available, we discover new awareness by
reviewing the data or merging it with other information. In
healthcare this means not just mining patient records, medical
imagery diagnostic reports etc., for insights, diagnoses and
resolution support machine, but also continuous analysis of the
data streams generated for and by every patient in a hospital,
at home and even while on the move via mobile devices .
Even today , most of the health care analytics is executing by
doing monthly data refreshes in relational databases that
generate pre-processed reports. A fair gap is often absent lab
test is often 45 days old, as the data flow proceed from
batched data sector to real time sectors from transactional
systems and streaming data from analytical modeling devices.
This old model of analytics will not succeed. Analysis will
require to be done on that spot moment not in the pre-
processed structure. Data revives need to be done in real-time
not once in a month. The data analysis tools of nowadays are
likely yellow pages phone book in the era of Internet Search
Engine. They are becoming more outdated with each passing
day. The traditional health care analytic devices are built on
devices enhanced by IBM in 1970, more than 40 years ago. If
all the three parties (payer, provider, pharmaceutical company)
work collectively and distribute data/insight, disease
management programs will become cost-effective and bring
improved patient results at a scale that will furthermore
optimize overall health-care cost structure. The term “e-
health” defined by WHO: “ a new term used to describe the
integrated use of electronic communication and information
technology in the health field”. e-health is the main driver for
three significant changes within the health care environment:
1. Patients to become better informed
2. Patients to become more active and empowered in their
health care
3. Healthcare to become more efficient.
Big data solutions attempt to cost effectively solve the
challenges of large and fast-growing data volumes realize its
potential analytical value. For instance, trend analytics permits
you to find out what happened,while root cause and predictive
analytics enable understanding of why it occurred and what it
is likely to occur next. All healthcare constituents – patients,
payers, providers, groups, researchers, governments etc. – will
be impacted by big data, which can predict how these payers
are likely to perform, encourage desirable performance. These
applications of big data can be tested, refined and examined
rapidly and inexpensively and will radically alter healthcare
delivery and research. The healthcare domain has not been a
difficult aim for people who pursue easy money by using
fraud means . Healthcare fraud is expected to proceed to rise
as people live longer. A big data platform has potential to
filter through a large quantity of historical records in
relatively lesser extent of time, so that the production
transactions can use fraud discovery on real time. Though, the
big data analytics in healthcare plays a crucial role to provide
better health care services, generate analysis on the historical
data to expose hidden data, the big data analytics has the
challenges like Heterogeneity and Incompleteness of data,
scale, timeliness, confidentiality and Human Collaboration.
The later research is all about to defeat the challenges and use
big data analytics in healthcare to expose the understanding
from the raw unstructured data.
5. VII. CONCLUSIONS
Big data analytics in healthcare is yielding into a promising
sector for providing awareness from very large data sets and
upgrading results for minimizing costs. In this paper, an
outline of the problems encountered by the rural community
living in remote areas of the country in obtaining the primary
health care is described specifically. It is also explored how
the big data analytics are valuable to change rural healthcare
by gathering awareness from their clinical and other data
repositories and make informed conclusions. This paper
discusses about the big data, its characteristics, methods
challenges and suggests howto get rid of the underlying issues
being encountered by the health care industry. It also
introduce the big ideas to rectify the healthcare system in
India. The implementation part of this paper can be done using
HDFS (Hadoop File System) for the storage of large data and
Hadoop Map Reduce with Amazon Web Services. The usage
of big data analytics over the healthcare organization and
healthcare industry will mine the doctor’s lab transcript’s
using text mining and co-relation to patient results and
location aware application analytics for increasing customer
experience. Attaining better results at inferior costs has
become very essential for health care which can be achieved
through the implementation of this paper using Hadoop HDFS
and Map Reduce to uncover the data lying in big health data
sets.
REFERENCES
[1] D. Peter Augustine,”Leveraging Big Data Analytics and
Hadoop in Developing India’s Healthcare Services”,
International Journal of Computer Applications, vol. 89-No
16, pp. 44-50, Match 2014.
[2]http://www.who.int/gho/publications/world_health_statistic
s/2014/e n/
[3] Prof. Arnab Mukherji, “ Indian Healthcare system:
Challenges and Opportunities”, available online at
http://tejas.iimb.ac.in/interviews/41.php
[4] National Rural Health Mission (NRHM) website,
http://nrhm.gov.in/
[5] K. Priyanka and Kulennavar Nagarathna, “ A Survey on
Big Data Analytics in Health Care”, International Journal of
Computer Science and Information Technologies, Vol.
5(4),2014, pp. 5685- 5688
[6] W. Liu and E.K. Park, “ Big Data as an e-Health Service” ,
International Conference on Computing, Networking and
Communications, 3-6 Feb-2014, pp. 982-988
[7] Patil, Harsh Kupwade, and Ravi Seshadri. "Big data
security and privacy issues in healthcare."
[8] Sagiroglu, S.; Sinanc, D., "Big data: A review,"
Collaboration Technologies and Systems (CTS), 2013
International Conference on , vol., no., pp.42,47, 20-24 May
2013
[9] http://www.rd.com/health/healthcare/18-ideas-to-reform-
healthcare-now/
[10] Tekiner, F.; Keane, J.A., "Big Data Framework,"
Systems, Man, and Cybernetics (SMC), 2013 IEEE
International Conference on , vol., no., pp.1494,1499, 13-16
Oct. 2013
[11] Zaiying Liu; Ping Yang; Lixiao Zhang, "A Sketch of Big
Data Technologies," Internet Computing for Engineering and
Science (ICICSE), 2013 Seventh International Conference on ,
vol., no., pp.26,29, 20-22 Sept. 2013
[12] http://inside-bigdata.com/2013/09/12/beyond-volume-
varietyvelocity-issue-big-data-veracity/
[13] Soumendra Mohanty, Madhu Jagadeesh, and Harsha
Srivatsa,”Big Data Imperatives”, Apress publications. 2013
[14] Edwin Morley-Fletcher, “Big Data Healthcare, An
overview of the challenges in data intensive health care”,
Discussion Paper.
[15] Manuel Zwicker, Juergen Seitz and Nilmini
Wickramasinghe, “Identifying Critical Issues for Developing
Successful E-Health Solutions”, Proceedings of the Pacific
Asia Conference on Information Systems (PACIS 2012).
Paper 33. http://aisel.aisnet.org/pacis2012/33
[16] Bill Hamilton , “Big Data is the Future of Healthcare”,
Cognizant 20-20 Insights, September 2012
[17] Venkatareddy Konasani, Mukul Biswas and Praveen
Krishnan Koleth, “ Healthcare Fraud Management using Big
Data Analytics”, A Whitepaper by Trendwise Analytics.
[18]http://india.angloinfo.com/healthcare/health-system/ Web
References:
[19]http://www.ibm.com/big-data/us/en/
[20] http://www.bigdatauniversity.com
[21] http:// www.ibmbigdatahub.com