A review of the six societal issues related to big data and privacy, including:
- Perception
- The necessity of data sharing
- Cost reduction
- Public mistrust
- Hubris & Hyperbole
David B. Resnik MedicReS World Congress 2015MedicReS
Protecting Privacy and Confidentiality in the Age of Big Data Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York - David B. Resnik, JD, PhD, Bioethicist, NIH/NIEHS
Attivio Customer Success Story - Durkheim Project Search & DiscoveryAttivio
The statistics are alarming: suicide rates among U.S. veterans are almost double those of the general U.S. adult population. Reducing the incidence of suicide
among U.S. veterans has proven to be a complex and challenging battle; no initiative or program to date has worked to reverse this trend. Fortunately, there is
a new ally in veterans’ suicide prevention: predictive analytics technology.
Attivio Customer Success Story - Durkheim Project Attivio
Attivio plays a key role in powering Patterns and Predictions’ real-time predictive analytics solution to identify mental health risk factors among U.S. veterans,including suicide.
The Ethical Responsibilities of Academic Research Instutions and Funder to Safeguard the Integrity of Research Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York by Zubin Master, PhD
Presentation given at the Decentralised Data Infrastructure for Science meeting at the British Library, 21st April 2018. I talk about issues of reproducibility and poor incentives in science and the blockchain-enabled solutions we are working on at Frankl Open Science.
David B. Resnik MedicReS World Congress 2015MedicReS
Protecting Privacy and Confidentiality in the Age of Big Data Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York - David B. Resnik, JD, PhD, Bioethicist, NIH/NIEHS
Attivio Customer Success Story - Durkheim Project Search & DiscoveryAttivio
The statistics are alarming: suicide rates among U.S. veterans are almost double those of the general U.S. adult population. Reducing the incidence of suicide
among U.S. veterans has proven to be a complex and challenging battle; no initiative or program to date has worked to reverse this trend. Fortunately, there is
a new ally in veterans’ suicide prevention: predictive analytics technology.
Attivio Customer Success Story - Durkheim Project Attivio
Attivio plays a key role in powering Patterns and Predictions’ real-time predictive analytics solution to identify mental health risk factors among U.S. veterans,including suicide.
The Ethical Responsibilities of Academic Research Instutions and Funder to Safeguard the Integrity of Research Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York by Zubin Master, PhD
Presentation given at the Decentralised Data Infrastructure for Science meeting at the British Library, 21st April 2018. I talk about issues of reproducibility and poor incentives in science and the blockchain-enabled solutions we are working on at Frankl Open Science.
The Role and Responsibilities of Statisticians in Clinical Trials Presentation to MedicReS 5th World Congress on October 19-25,2015 in New York by Shing Lee, PhD
Cemal H. Guvercin MedicReS 5th World Congress MedicReS
Ethical Issues in Artifical Intelligence Applied to Medicine Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York by Cemal H. Guvercin, MD, PhD
Space Situational Awareness Forum
Following another very successful conference in London in November 2014, Space Situational Awareness 2015 took place in Hyattsville, Maryland in May 2015, with over 60 SSA experts from all over the globe coming together to discuss the most pressing SSA challenges.
With increasing dependence on space-based services, the ability to protect space infrastructure has become essential to our society. Any shutdown of even a part of space infrastructures could have significant consequences for the well-functioning of economic activities and our citizens’ safety, and would impair the provision of emergency services.
However, space infrastructures are increasingly threatened by the risk of collision between spacecraft and more importantly, between spacecraft and space debris. As a matter of fact, space debris has become the most serious threat to the sustainability of certain space activities.
In order to mitigate the risk of collision it is necessary to identify and monitor satellites and space debris, catalogue their positions, and track their movements (trajectory) when a potential risk of collision has been identified, so that satellite operators can be alerted to move their satellites. This activity is known as space surveillance and tracking (SST), and is today mostly based on ground-based sensors such as telescopes and radars.
With a focus on solving the political issues but not ignoring the technical, Space Situational Awareness 2015 the leading gathering of dedicated SSA experts from the USA, Europe and beyond, to discuss and debate the business, political and technical challenges that lie ahead.
Take a look at our previous Space Situation Awareness event…
Who should attend Space Situational Awareness?
Space Situational Awareness 2015 is a community of experts from Government, Space Agencies, Satellite/Spacecraft Operators, Space Lawyers, Space Insurance providers and Defense who are looking to understand and predict the physical location of natural and manmade objects in orbit around the Earth, with the objective of avoiding collisions.
How can you get involved in Space Situational Awareness?
If you feel that you could add to the debate and discussion at Space Situational Awareness, we’d be delighted to hear from you. Please drop us a line on +44(0)7769157787 or email me at adam.plom@coriniumintelligence.com.
OSFair2017 | Barriers to Open Science for junior researchersOpen Science Fair
Jon Tennant talks about barriers to open science for junior researchers
Plenary: Open for all? Diversity & disparity in Open Science.
Presentation Abstract:
What are the barriers to Open Science, and how do they impact upon different demographics? Open Science is supposed to be about inclusivity, equality, and rigour. But is the way it is implemented meeting these ideals, or simply creating a new set of barriers to scholarship? Younger researchers are basically trying to survive within a hyper-competitive academic system. They are beset on all sides by systemic control and inertia, power dynamics, and fear. What they want to do for science is not always what is best for their career. This creates a system of social barriers that cannot be overcome by mandates and policies that do little to address these structural biases.
Paywalls inflict a huge level of disparity on younger researchers. They may feel they cannot afford the exorbitant fees charged by some publishers for Open Access, even if their research funders provide support for it. The way OA is currently implemented has switched the barrier from the reader side to the author side, due in part to political broadsiding from commercial publishers. Unfunded or self-funded grad students, as well as those from emerging economies, are basically doomed when it comes to such high author-facing charges.
More: http://opensciencefair.eu/speakers/jon-tennant
DAY 2 - PLENARY
Brief remarks on big data trends and responsible data science at the Workshop on Science and Technology for Washington State: Advising the Legislature, October 4th 2017 in Seattle.
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017Sandra Gesing
The importance of Big Data and Open Data to achieve scientific advancements in precision medicine is beyond doubt and evident in many different projects and initiatives such as the Precision Medicine Initiative (All of Us), ICTBioMed, NCIP Hub, 100K Genomics England Project, NIH Cancer Moonshot, and the Million Veterans Program. In April 2013, McKinsey & Company proclaimed that Big Data has the ability to revolutionize pharmaceutical research and development within clinical environments, by using data for better informed decision making and targeting the diverse user roles including physicians, consumers, insurers, and regulators. Companies from a wide spectrum such as Oracle Health Sciences, Google, and Data4Cure build solutions that help address efficient and secure data sharing with the patient or clinician in mind. Open data can be maintained and shared by patient communities such as PatientsLikeMe.com and build an invaluable resource for further data mining.
Even with all these advances there are still challenges to address including a recent Precision Medicine World Conference announcement in November 2016: “We are missing easy-to-use solutions to share patient data.” Science gateways are a solution to fill the gap and help form by definition end-to-end solutions – web-based, mobile or desktop applications - that provide intuitive access to advanced resources and allow researchers to focus on tackling today’s challenging science questions. Science Gateways abstract the complex underlying computing and data infrastructure as far as feasible and desired by the stakeholder and can be tailored to different target groups with diverse backgrounds, demands, and technical knowledge.
Science Gateways have existed for over a decade and a wide variety of frameworks and APIs have been developed to support the efficient creation of science gateways and ease the implementation of connections to Cloud infrastructures and distributed data on a large scale. The importance of science gateways has been recognized by NSF by funding the creation of a Science Gateways Community Institute (SGCI) to serve the community with free resources, services, experts, and ideas for creating and sustaining science gateways. To achieve this goal, the SGCI serves the community with five areas that have diverse foci and which also closely interact: Incubator, Extended Developer Support, Scientific Software Collaborative, Community Engagement and Exchange and Workforce Development.
The Institute is technology-agnostic and serves the community by offering a wide variety of services and using technologies that are the best fitting solution for the use case. Gateways allow for precision medicine to be more efficiently developed or adapted by lowering the barriers to data sharing and Big Data analysis.
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Hellmuth Broda
While we bemoan the ever increasing data tsunami new technologies allow to harvest the gold nuggets in the hay stack.
Using the example of the Pharmaceutical Industry some of the possible business uses for Big Data Analitics are outlined.
Facilitating good research data management practice as part of scholarly publ...Varsha Khodiyar
Presentation given to the SciDataCon #IDW2018 session: Democratising Data Publishing: A Global Perspective, on Tuesday 6th November 2018, Gaborone, Botswana
Big Data: Big Opportunities or Big Trouble?Shea Swauger
Big data is changing how research is being conducted and allowing new kinds of questions to be asked. Meanwhile, data management has enabled a rapid increase in the dissemination and preservation of research products and many funding agencies like the National Science Foundation and National Institute of Health now require data management plans in their grant applications. The combination of big data applications and data management processes has created new opportunities and pitfalls for researchers. In the past year, prominent scientists including the Director of the NIH have suggested that inappropriate methodology for data acquisition, analysis and storage has led to a gap in the translation of basic research findings to clinical cures. In this session we will track data through all research stages, describe best practices and university resources available to faculty grappling with these important issues.
Rebecca E. Cooney MedicReS World Congress 2015MedicReS
The New Roles and Responsibilities of Medical Editors in the Age of Big Data Presentation to MedicReS 5th World Congress on October 19-25,2015 in New York by Rebecca E. Conney, PhD North American Editor The Lancet
Keynote Analytics Week, Boston, MA November 7, 2014
Big Data is in its infancy and is opening the door to profound change - Grand Opportunities (Accelerating Scientific Discovery) and Grand Challenges to be addressed over the next decade. We explore the premise that Data Science is to data-intensive discovery as the Scientific Method is to scientific discovery, leading us to potential Laws and Limits of Data Science, and then to Best Practices.
Investing in innovation for genomic medicine - the journey of RepositiveFiona Nielsen
by Fiona Nielsen
Presented for UK Pharmacogenetics and Stratified Medicine Network (UK PGx Network) - Entrepreneurship, Disruptive Innovation and Personalised Medicine University of Liverpool London Campus, Finsbury Square, Wednesday 7th June 2017
USTUN_ Digital Health Assembly Open Innovation Conference: Sharing Global Da...Bedirhan Ustun
An inquiry about the use of Big Data in Health Information Systems as a new way of gathering new data. Inquiring ethical questions on ownership and orientation; analytic approaches and political implications for the society and decision making.
The Role and Responsibilities of Statisticians in Clinical Trials Presentation to MedicReS 5th World Congress on October 19-25,2015 in New York by Shing Lee, PhD
Cemal H. Guvercin MedicReS 5th World Congress MedicReS
Ethical Issues in Artifical Intelligence Applied to Medicine Presentation to MedicReS 5th World Congress on October 19,25,2015 in New York by Cemal H. Guvercin, MD, PhD
Space Situational Awareness Forum
Following another very successful conference in London in November 2014, Space Situational Awareness 2015 took place in Hyattsville, Maryland in May 2015, with over 60 SSA experts from all over the globe coming together to discuss the most pressing SSA challenges.
With increasing dependence on space-based services, the ability to protect space infrastructure has become essential to our society. Any shutdown of even a part of space infrastructures could have significant consequences for the well-functioning of economic activities and our citizens’ safety, and would impair the provision of emergency services.
However, space infrastructures are increasingly threatened by the risk of collision between spacecraft and more importantly, between spacecraft and space debris. As a matter of fact, space debris has become the most serious threat to the sustainability of certain space activities.
In order to mitigate the risk of collision it is necessary to identify and monitor satellites and space debris, catalogue their positions, and track their movements (trajectory) when a potential risk of collision has been identified, so that satellite operators can be alerted to move their satellites. This activity is known as space surveillance and tracking (SST), and is today mostly based on ground-based sensors such as telescopes and radars.
With a focus on solving the political issues but not ignoring the technical, Space Situational Awareness 2015 the leading gathering of dedicated SSA experts from the USA, Europe and beyond, to discuss and debate the business, political and technical challenges that lie ahead.
Take a look at our previous Space Situation Awareness event…
Who should attend Space Situational Awareness?
Space Situational Awareness 2015 is a community of experts from Government, Space Agencies, Satellite/Spacecraft Operators, Space Lawyers, Space Insurance providers and Defense who are looking to understand and predict the physical location of natural and manmade objects in orbit around the Earth, with the objective of avoiding collisions.
How can you get involved in Space Situational Awareness?
If you feel that you could add to the debate and discussion at Space Situational Awareness, we’d be delighted to hear from you. Please drop us a line on +44(0)7769157787 or email me at adam.plom@coriniumintelligence.com.
OSFair2017 | Barriers to Open Science for junior researchersOpen Science Fair
Jon Tennant talks about barriers to open science for junior researchers
Plenary: Open for all? Diversity & disparity in Open Science.
Presentation Abstract:
What are the barriers to Open Science, and how do they impact upon different demographics? Open Science is supposed to be about inclusivity, equality, and rigour. But is the way it is implemented meeting these ideals, or simply creating a new set of barriers to scholarship? Younger researchers are basically trying to survive within a hyper-competitive academic system. They are beset on all sides by systemic control and inertia, power dynamics, and fear. What they want to do for science is not always what is best for their career. This creates a system of social barriers that cannot be overcome by mandates and policies that do little to address these structural biases.
Paywalls inflict a huge level of disparity on younger researchers. They may feel they cannot afford the exorbitant fees charged by some publishers for Open Access, even if their research funders provide support for it. The way OA is currently implemented has switched the barrier from the reader side to the author side, due in part to political broadsiding from commercial publishers. Unfunded or self-funded grad students, as well as those from emerging economies, are basically doomed when it comes to such high author-facing charges.
More: http://opensciencefair.eu/speakers/jon-tennant
DAY 2 - PLENARY
Brief remarks on big data trends and responsible data science at the Workshop on Science and Technology for Washington State: Advising the Legislature, October 4th 2017 in Seattle.
SGCI Science Gateways: Harnessing Big Data and Open Data 03-19-2017Sandra Gesing
The importance of Big Data and Open Data to achieve scientific advancements in precision medicine is beyond doubt and evident in many different projects and initiatives such as the Precision Medicine Initiative (All of Us), ICTBioMed, NCIP Hub, 100K Genomics England Project, NIH Cancer Moonshot, and the Million Veterans Program. In April 2013, McKinsey & Company proclaimed that Big Data has the ability to revolutionize pharmaceutical research and development within clinical environments, by using data for better informed decision making and targeting the diverse user roles including physicians, consumers, insurers, and regulators. Companies from a wide spectrum such as Oracle Health Sciences, Google, and Data4Cure build solutions that help address efficient and secure data sharing with the patient or clinician in mind. Open data can be maintained and shared by patient communities such as PatientsLikeMe.com and build an invaluable resource for further data mining.
Even with all these advances there are still challenges to address including a recent Precision Medicine World Conference announcement in November 2016: “We are missing easy-to-use solutions to share patient data.” Science gateways are a solution to fill the gap and help form by definition end-to-end solutions – web-based, mobile or desktop applications - that provide intuitive access to advanced resources and allow researchers to focus on tackling today’s challenging science questions. Science Gateways abstract the complex underlying computing and data infrastructure as far as feasible and desired by the stakeholder and can be tailored to different target groups with diverse backgrounds, demands, and technical knowledge.
Science Gateways have existed for over a decade and a wide variety of frameworks and APIs have been developed to support the efficient creation of science gateways and ease the implementation of connections to Cloud infrastructures and distributed data on a large scale. The importance of science gateways has been recognized by NSF by funding the creation of a Science Gateways Community Institute (SGCI) to serve the community with free resources, services, experts, and ideas for creating and sustaining science gateways. To achieve this goal, the SGCI serves the community with five areas that have diverse foci and which also closely interact: Incubator, Extended Developer Support, Scientific Software Collaborative, Community Engagement and Exchange and Workforce Development.
The Institute is technology-agnostic and serves the community by offering a wide variety of services and using technologies that are the best fitting solution for the use case. Gateways allow for precision medicine to be more efficiently developed or adapted by lowering the barriers to data sharing and Big Data analysis.
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)Hellmuth Broda
While we bemoan the ever increasing data tsunami new technologies allow to harvest the gold nuggets in the hay stack.
Using the example of the Pharmaceutical Industry some of the possible business uses for Big Data Analitics are outlined.
Facilitating good research data management practice as part of scholarly publ...Varsha Khodiyar
Presentation given to the SciDataCon #IDW2018 session: Democratising Data Publishing: A Global Perspective, on Tuesday 6th November 2018, Gaborone, Botswana
Big Data: Big Opportunities or Big Trouble?Shea Swauger
Big data is changing how research is being conducted and allowing new kinds of questions to be asked. Meanwhile, data management has enabled a rapid increase in the dissemination and preservation of research products and many funding agencies like the National Science Foundation and National Institute of Health now require data management plans in their grant applications. The combination of big data applications and data management processes has created new opportunities and pitfalls for researchers. In the past year, prominent scientists including the Director of the NIH have suggested that inappropriate methodology for data acquisition, analysis and storage has led to a gap in the translation of basic research findings to clinical cures. In this session we will track data through all research stages, describe best practices and university resources available to faculty grappling with these important issues.
Rebecca E. Cooney MedicReS World Congress 2015MedicReS
The New Roles and Responsibilities of Medical Editors in the Age of Big Data Presentation to MedicReS 5th World Congress on October 19-25,2015 in New York by Rebecca E. Conney, PhD North American Editor The Lancet
Keynote Analytics Week, Boston, MA November 7, 2014
Big Data is in its infancy and is opening the door to profound change - Grand Opportunities (Accelerating Scientific Discovery) and Grand Challenges to be addressed over the next decade. We explore the premise that Data Science is to data-intensive discovery as the Scientific Method is to scientific discovery, leading us to potential Laws and Limits of Data Science, and then to Best Practices.
Investing in innovation for genomic medicine - the journey of RepositiveFiona Nielsen
by Fiona Nielsen
Presented for UK Pharmacogenetics and Stratified Medicine Network (UK PGx Network) - Entrepreneurship, Disruptive Innovation and Personalised Medicine University of Liverpool London Campus, Finsbury Square, Wednesday 7th June 2017
USTUN_ Digital Health Assembly Open Innovation Conference: Sharing Global Da...Bedirhan Ustun
An inquiry about the use of Big Data in Health Information Systems as a new way of gathering new data. Inquiring ethical questions on ownership and orientation; analytic approaches and political implications for the society and decision making.
Science as an Open Enterprise – Geoffrey BoultonOpenAIRE
Science as an Open Enterprise – Geoffrey Boulton, University of Edinburgh.
University of Minho Open Access Seminar & OpenAIRE Interoperability Workshop (7 Feb. 2013) - Session: Open Science, Open Data and Repositorie.
Invited presentation at Presenting Data: How to Convey Information Most Effectively Seminar, Centre of Research Excellence in Patient Safety, School of Public Health and Preventive Medicine, Monash University, February 2015.
From personal health data to a personalized adviceWessel Kraaij
Invited talk at the health track of ICT.OPEN 2018, 20-3-2018
1. Related Data science challenges to Digital Health trends
2. Designing an infrastructure to support secure learning from distributed health data repositories, for personalized health advice
3. Supporting patients with rare diseases with patient driven research and the generation of new hypotheses based on patient experiences.
Big Data Provides Opportunities, Challenges and a Better Future in Health and...Cirdan
This presentation was delivered by Ashraf Mina, NSW Pathology at the Pathology Horizons 2017 Conference in Cairns, Australia.
Pathology Horizons 2017 is an annual CPD conference organised by Cirdan on the future of pathology. You can access more information about the event at www.pathologyhorizons.com
The company was founded in 2010 and is headquartered in Lisburn, Northern Ireland and has additional offices in Canada and Australia.
Cirdan is also responsible for organising Pathology Horizons, an annual and open CPD conference on the future of pathology. For more information visit - www.pathologyhorizons.com
Sdal air health and social development (jan. 27, 2014) finalkimlyman
The American Institutes for Research (AIR) and Virginia Tech are collaborating to explore and develop new approaches to combining, manipulating and understanding big data. The two are also looking at how big data analytics can help answer questions critical to solving issues in education, workforce, health, and human and social development. They held two workshops on January 7 and 27, 2014- the first on Education and Workforce Analytics and the second on Health and Social Development Analytics.
Shelley Hurwitz MedicReS World Congress 2014MedicReS
Biostatistics and Ethics Shelley Hurwitz, PhD Brigham and Women’s Hospital Harvard Medical School Fellow, American Statistical Association Advisory Board on Ethics, International Statistical Institute
Improving health care outcomes with responsible data scienceWessel Kraaij
Keynote presentation by Wessel Kraaij at the Dutch pattern recognition and impage processing society (NVPBV) 29/5/2018, Eindhoven.
This talk discusses
1. trends in health care and respondible data science and their intersection
2. Secure federated analytics on distributed data repositories
3. Generating clinically relevant hypotheses from patient forum discussions.
Ehealth: enabling self-management, public health 2.0 and citizen scienceKathleen Gray
Invited presentation, Technology in Diabetes Joint Symposium, Australian Diabetes Society & Australian Diabetes Educators Association Annual Scientific Meeting, August 2014.
Learn how social media intelligence can fill gaps in your research mix.
- Six unique benefits of social media research
- Four key layers of the social intelligence stack
- Eight core applications of social media data for modern businesses
April 3, 2019
Digital innovation is transforming health care, and the amount of digital health care data being generated will likely have increasing research utility over time. Despite the seemingly logical and inevitable application of health care data from deceased persons for research and health care both now and in the future, the issue of how best to manage posthumous medical records is currently unclear, including elements of resource governance, issues of law, and infrastructural challenges.
This presentation explored current issues surrounding how to manage the medical records of the dead, integrating evidence from the field of body donation to inform and guide the discussion on the utilisation of posthumous medical information. It also delivered results from a year-long study on posthumous health care data utility that explored the views of the general population on the use of posthumous medical records, which showed a centrally collated and government-governed resource of posthumous health care data was almost universally supported, with varying caveats around how such a resource should be utilized.
The Digital Health @ Harvard series features speakers from Harvard as well as collaborators and colleagues from other institutions who research the intersection between health and digital technology. The series is cosponsored by the Berkman Klein Center for Internet & Society at Harvard University and the Petrie-Flom Center for Health Law Policy, Biotechnology, and Bioethics at Harvard Law School. The goal of the series is to discuss ongoing research in this research area, share new developments, identify opportunities for collaboration, and explore the digital health ecosystem more generally.
Learn more on the website: https://petrieflom.law.harvard.edu/events/details/digital-health-harvard-april-2019
This presentation outlines a mechanism for using the power of "Big Data", social networking and technology infrastructure to speed the process of curing a horrible disease.
“Big data” in human services organisations: Practical problems and ethical di...husITa
“Big data” initiatives that aim to bring together and mine data from multiple databases across government and non-government agencies promise new insights into human service delivery. Specifically they aim to provide information about what services are being used, how, by whom and with what outcome. However, the process of achieving such insights poses both practical problems and ethical dilemmas. In this presentation, drawing from an extensive literature review and research with government and non-government human service organisations focussing on the design and redevelopment of electronic information systems, the most significant problems and dilemmas will be explored. It will be argued that current frameworks for ethical social work and human service practice will need to be expanded to accommodate developments in technology which have made ‘Big data’ projects possible.
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Round table discussion of vector databases, unstructured data, ai, big data, real-time, robots and Milvus.
A lively discussion with NJ Gen AI Meetup Lead, Prasad and Procure.FYI's Co-Found
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Data and AI
Discussion on Vector Databases, Unstructured Data and AI
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs.This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
Show drafts
volume_up
Empowering the Data Analytics Ecosystem: A Laser Focus on Value
The data analytics ecosystem thrives when every component functions at its peak, unlocking the true potential of data. Here's a laser focus on key areas for an empowered ecosystem:
1. Democratize Access, Not Data:
Granular Access Controls: Provide users with self-service tools tailored to their specific needs, preventing data overload and misuse.
Data Catalogs: Implement robust data catalogs for easy discovery and understanding of available data sources.
2. Foster Collaboration with Clear Roles:
Data Mesh Architecture: Break down data silos by creating a distributed data ownership model with clear ownership and responsibilities.
Collaborative Workspaces: Utilize interactive platforms where data scientists, analysts, and domain experts can work seamlessly together.
3. Leverage Advanced Analytics Strategically:
AI-powered Automation: Automate repetitive tasks like data cleaning and feature engineering, freeing up data talent for higher-level analysis.
Right-Tool Selection: Strategically choose the most effective advanced analytics techniques (e.g., AI, ML) based on specific business problems.
4. Prioritize Data Quality with Automation:
Automated Data Validation: Implement automated data quality checks to identify and rectify errors at the source, minimizing downstream issues.
Data Lineage Tracking: Track the flow of data throughout the ecosystem, ensuring transparency and facilitating root cause analysis for errors.
5. Cultivate a Data-Driven Mindset:
Metrics-Driven Performance Management: Align KPIs and performance metrics with data-driven insights to ensure actionable decision making.
Data Storytelling Workshops: Equip stakeholders with the skills to translate complex data findings into compelling narratives that drive action.
Benefits of a Precise Ecosystem:
Sharpened Focus: Precise access and clear roles ensure everyone works with the most relevant data, maximizing efficiency.
Actionable Insights: Strategic analytics and automated quality checks lead to more reliable and actionable data insights.
Continuous Improvement: Data-driven performance management fosters a culture of learning and continuous improvement.
Sustainable Growth: Empowered by data, organizations can make informed decisions to drive sustainable growth and innovation.
By focusing on these precise actions, organizations can create an empowered data analytics ecosystem that delivers real value by driving data-driven decisions and maximizing the return on their data investment.
1. Societal Issues & Big Data
Sylvia Ogweng
March 2015
Concerns Associated
With Big Data And Its Uses
2. Societal
Issues:
How is big data perceived?
The necessity of data sharing to
validate research results.
Reducing costs and increasing
productivity with big data;
sparking the next wave of
scientific progress
Public mistrust
Saving us from ourselves: when
computers help us avoid incorrect
decisions
Hubris & hyperbole in Big Data
Six
Important
Topics
2
3. How is
Big Data
Perceived?
Gumshoe Hypothesis: Hoarding information about individuals
for investigative purposes.
Big Brother Hypothesis: Collecting information about a
population of individuals to control ever member of the
population.
Borg Hypothesis: Collecting information about a population of
individuals to learn everything about the population.
George Carlin Hypothesis: A place to put a lot of stuff.
Scavenger Hunt Hypothesis: Big Data is a collection of
everything, created for the purpose of searching for individual
items and facts.
Egghead Hypothesis: Collecting information to draw
generalized scientific conclusions.
Facebook Hypothesis: A social archive that generates money.
Nihilist Hypothesis: Big Data does not exist as a definable field
of endeavour; it represents what we have always done, but with
more data.
Eight
Hypotheses by
Jules Berman
3
4. The
Necessity of
Data Sharing
Scientists prepare data to serve a particular
scientific purpose, and sometimes the data serves
unanticipated goals.
Two examples in history:
• Astronomer Tycho Brache (16th century) gave
his charts to Johannes Kepler —> Kepler used
the charts to develop three general laws
describing the movements of the planets —>
Newton’s Laws of Motion
• Climate data indicates that the average power
of tropical cyclones has increased in the past
few decades. Also, ocean temperatures have
also increased in the same time frame. —> New
method for correlating the intensity of tropical
storms and hurricanes with ocean
temperatures.
Historical
Examples
4
5. • Past decade has witnessed a powerful movement
among scientists to share data.
• Without access to primary data, there is no way to
verify final conclusions of experiments, and
research cannot be extended beyond questions
contained in the original manuscript.
• U.S. National Academy of Sciences has called for
scientists to provide publishers with the primary
data that supports the conclusions contained in
their manuscripts.
• Scientific Mischief: Big Data provides a multitude
of opportunities for scientific mischief - poor data,
poor data annotation, poor analyses, distorted
visualization, skewed interpretations, etc.
• Scientific Credibility: In the realm of Big Data,
scientific credibility is based on public access to the
primary data and to all the protocols accounting
for the collection, annotation, extraction, and
analysis of the data.
5
The
Necessity of
Data Sharing
6. The
Necessity of
Data Sharing
De-identification of data is difficult to do.
Common diversions from organizations
include:
• “We’re working on it”
• “Some of our data has been de-identified
and released to the public”: leaves users
with small data set. Stifles creativity.
• “We provide full access to our staff”: staff
restricted to analysis that benefits the
company
• “We access to our data to trusted
individuals outside our institution”
Conflicts that
Exists With
Data Sharing &
Privacy
6
7. The
Necessity of
Data Sharing
Set of recommendation for de-
identification for the healthcare
industry:
• Collect a variety of public domain
algorithms and software
implementation that can be used for
the common tasks in data de-
identification.
• Assign unique code to identify each
patient within a healthcare
environment.
Conflicts that
Exists With
Data Sharing &
Privacy
7
8. Reducing
Costs &
Increasing
Productivity
in Science
• For many fields of science, the
traditional approach to
experimentation has reached its
limits. Lack of money and time to do
the research the old-fashioned way.
• Growing work force; increased
funding. Modern science has reached
a state of diminishing returns on
investment.
• Funders of medical research are
learning that there is simply not
enough money or time to conduct all
the clinical trials needed to advance
medical science at a pace
comparable to the first half of 20th
century.
8
9. How can Big Data address this problem?
• Accelerate scientific progress by giving
research data that can expand or
bypass the clinical trial process.
• Validation tool for small data
experiments. Using Big Data it may be
feasible to confirm that experimental
findings based on small, prospective
studies are consistent with observations
made on very large populations.
• Drug Trials: Big Data may uncover
potential side effects and non
responsive subpopulations.
9
Reducing
Costs &
Increasing
Productivity
in Science
10. • Reluctance to share data is based on
mistrust
• Corporations, medical centres, and
other entities have a fiduciary
responsibility to individuals whose
data is held in their repositories.
Sharing would violate privacy.
• De-identification of data is key.
• Zero risk is not obtainable.
• Without data, society is less safe, less
healthy, less smart.
10
Public
Mistrust
11. Computers can play a crucial role in
helping us avoid incorrect decisions.
Examples in health:
• Drug prescription errors such as
suspected abuse
• Blood transfusion errors
11
Saving Us
From
Ourselves
12. • Assertion based on Big Data analyses
can sometimes be validated, but they
can never be proven true.
• To obtain proof, you must enlist the
help of a mathematician.
• Trends, clusters, and so on, may
appear to be valid over a limited
range of observations, but may fail
miserably in tests conducted over
time with a broader range of data.
12
Hubris &
Hyperbole
13. The Future
of Big Data
• Big Data is a social endeavour
• The future directions of Big Data will be
strongly influenced by social, political
and economic forces
• Public archiving, adopting useful
standards…how scientists use data over
the next few years may provide the
strongest argument for or against the
proliferation of Big Data resources.
13