The document discusses the African Open Science Platform, which aims to promote open science across Africa. It provides context on initiatives like the Square Kilometre Array telescope project and H3ABioNet, which involve large-scale data collection and sharing across multiple African countries. The platform seeks to develop policies, build capacity, and provide incentives to support open data practices in line with FAIR principles. It also outlines the current research infrastructure landscape in Africa and initiatives to strengthen national research networks and cyberinfrastructure to enable open sharing and analysis of genomic and other data for the benefit of African societies.
The Academy of Science of South Africa (ASSAf) takes proud in the implementation of this new initiative. We are looking forward working with all African continents in populating this platform with information.
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Challenges
Planning traditionally done based on ideal targets, less based on current facts, often due to “lack of information”.
Information often exists, but
largely underutilized (analysis capacities)
not shared/ not accessible (practices)
often heterogeneous and scattered (data management)
sectoral perspectives (planning processes)
→ Challenges of information access and integration across sectors and levels
High-level Meeting & Workshop on Environmental and Scientific Open Data for Sustainable Development Goals in Developing Countries. Madagascar, 4-6 December 2017
The Academy of Science of South Africa (ASSAf) takes proud in the implementation of this new initiative. We are looking forward working with all African continents in populating this platform with information.
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Challenges
Planning traditionally done based on ideal targets, less based on current facts, often due to “lack of information”.
Information often exists, but
largely underutilized (analysis capacities)
not shared/ not accessible (practices)
often heterogeneous and scattered (data management)
sectoral perspectives (planning processes)
→ Challenges of information access and integration across sectors and levels
High-level Meeting & Workshop on Environmental and Scientific Open Data for Sustainable Development Goals in Developing Countries. Madagascar, 4-6 December 2017
Presented at a NeDICC (Network of Data and Information Curation Communities) meeting, 14 March 2019, CSIR, and at the University of Pretoria and the Carnegie Corporation of New York Capstone Conference, 24-29 March 2019, Kieviets Kroon.
Presentation during the 14th Association of African Universities (AAU) Conference and African Open Science Platform (AOSP)/Research Data Alliance (RDA) Workshop in Accra, Ghana, 7-8 June 2017.
Federation and Interoperability in the Nectar Research CloudOpenStack
Audience Level
Beginner
Synopsis
The Nectar Research Cloud provides an OpenStack cloud for Australia’s academic researchers. Since its inception in 2012 it has grown steadily to over 30,000 CPUs, with over 10,000 registered users from more than 50 research institutions. It is different to many clouds in being a federation across eight organisations, each of which runs cloud infrastructure in one or more data centres and contributes to a distributed help desk and user support. A Nectar core services team runs centralised cloud services. This presentation will give an overview of the experiences, challenges and benefits of running a federated OpenStack cloud and a short demonstration on using the Nectar cloud. We will also describe some current approaches that are looking to extend this federation to encompass other institutions including some in New Zealand, to extend the infrastructure using commercial cloud providers, and to move towards interoperability with the growing number of international science and research clouds through the new Open Research Cloud initiative.
Speaker Bio
Dr Paul Coddington is a Deputy Director of Nectar, responsible for the Nectar national Research Cloud, and also Deputy Director of eResearch SA. He has over 30 years experience in eResearch including computational science, high performance and distributed computing, cloud computing, software development, and research data management.
I am text block. Click edit button to change this text. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
Presentation on behalf of the SA Weather Service presented during SA National Science Week - The harsh realities of climate change, 29 July to 2 August 2019.
Presented on 30 August 2018: Deployment of Open Data Driven Solutions for Socio-economic Value thorough Good Governance and Efficient Public Service Delivery -
Data Centers - Striving Within A Narrow Range - Research Report - MCG - May 2...pchutichetpong
M Capital Group (“MCG”) expects to see demand and the changing evolution of supply, facilitated through institutional investment rotation out of offices and into work from home (“WFH”), while the ever-expanding need for data storage as global internet usage expands, with experts predicting 5.3 billion users by 2023. These market factors will be underpinned by technological changes, such as progressing cloud services and edge sites, allowing the industry to see strong expected annual growth of 13% over the next 4 years.
Whilst competitive headwinds remain, represented through the recent second bankruptcy filing of Sungard, which blames “COVID-19 and other macroeconomic trends including delayed customer spending decisions, insourcing and reductions in IT spending, energy inflation and reduction in demand for certain services”, the industry has seen key adjustments, where MCG believes that engineering cost management and technological innovation will be paramount to success.
MCG reports that the more favorable market conditions expected over the next few years, helped by the winding down of pandemic restrictions and a hybrid working environment will be driving market momentum forward. The continuous injection of capital by alternative investment firms, as well as the growing infrastructural investment from cloud service providers and social media companies, whose revenues are expected to grow over 3.6x larger by value in 2026, will likely help propel center provision and innovation. These factors paint a promising picture for the industry players that offset rising input costs and adapt to new technologies.
According to M Capital Group: “Specifically, the long-term cost-saving opportunities available from the rise of remote managing will likely aid value growth for the industry. Through margin optimization and further availability of capital for reinvestment, strong players will maintain their competitive foothold, while weaker players exit the market to balance supply and demand.”
Levelwise PageRank with Loop-Based Dead End Handling Strategy : SHORT REPORT ...Subhajit Sahu
Abstract — Levelwise PageRank is an alternative method of PageRank computation which decomposes the input graph into a directed acyclic block-graph of strongly connected components, and processes them in topological order, one level at a time. This enables calculation for ranks in a distributed fashion without per-iteration communication, unlike the standard method where all vertices are processed in each iteration. It however comes with a precondition of the absence of dead ends in the input graph. Here, the native non-distributed performance of Levelwise PageRank was compared against Monolithic PageRank on a CPU as well as a GPU. To ensure a fair comparison, Monolithic PageRank was also performed on a graph where vertices were split by components. Results indicate that Levelwise PageRank is about as fast as Monolithic PageRank on the CPU, but quite a bit slower on the GPU. Slowdown on the GPU is likely caused by a large submission of small workloads, and expected to be non-issue when the computation is performed on massive graphs.
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...John Andrews
SlideShare Description for "Chatty Kathy - UNC Bootcamp Final Project Presentation"
Title: Chatty Kathy: Enhancing Physical Activity Among Older Adults
Description:
Discover how Chatty Kathy, an innovative project developed at the UNC Bootcamp, aims to tackle the challenge of low physical activity among older adults. Our AI-driven solution uses peer interaction to boost and sustain exercise levels, significantly improving health outcomes. This presentation covers our problem statement, the rationale behind Chatty Kathy, synthetic data and persona creation, model performance metrics, a visual demonstration of the project, and potential future developments. Join us for an insightful Q&A session to explore the potential of this groundbreaking project.
Project Team: Jay Requarth, Jana Avery, John Andrews, Dr. Dick Davis II, Nee Buntoum, Nam Yeongjin & Mat Nicholas
1. The African Open Science Platform
Presented by Susan Veldsman
Director: Scholarly Publishing Porgramme
Academy of Science of South Africa (ASSAf)
Research Infrastructure Workshop, 14 May 2018
3. Trusted Research & Data
• Trust is at the centre of the process of science
• Trust research & researchers who have your
best interest at heart
• Build new research on existing research/data
• To be trusted, it needs to be managed
4. Square Kilometre Array (SKA)
• Data collection on a massive scale
• Telescope array to consist of 250,000 radio
antennas between Australia & SA
• Investment in machine learning and artificial
intelligence software tools to enable data analysis
• 400+ engineers and technicians in infrastructure,
fibre optics, data collection
• Supercomputers to process data (IBM)
• To come: super computer 3x times power of
world’s current fastest computer (Tianhe-2) to cope
with SKA data
7. • African human genomic research; Central node at University of
Cape Town
• Using NetMap to monitor connectivity
• Data transfer: Africa Globus Online (668,622 files transferred
between Rhodes University & UCT; 140TB data transferred from
USA to SA
• Challenges: slow & unstable Internet, unreliable power supply,
continent-wide obsolete computer infrastructure that varies
between medium-scale server infrastructure to a small number of
workstations, with multiple operating systems, lack of centralized,
secure data storage
• Other: database of participants (H3APRDB, REDCap), data analysis
incl. Galaxy, Job Management System, eBiokits, REDCap,
WebProtege, Pipelines for data execution, data repository
(European Genome-Phenome Archive)
8. Open Science Defined
“Open Science is the practice of science in such a
way that others can collaborate and contribute,
where research data, lab notes and other
research processes are freely available, under
terms that enable reuse, redistribution and
reproduction of the research and its
underlying data and methods.” - FOSTER Project,
funded by the European Commission
11. African Open Science Platform
• Platform = opportunity to engage in dialogue,
create awareness, connect all, provide continental
view
• Funded by SA Dept. of Science & Technology
through National Research Foundation
• 3 years (1 Nov. 2016 – 31 Oct. 2019)
• Managed by Academy of Science of South Africa
(ASSAf)
• Through ASSAf hosting ICSU Regional Office for Africa
(ICSU ROA)
• Direction from CODATA
http://africanopenscience.org.za/
12. Accord on Open Data in a
Big Data World
• Values of open data in
emerging scientific culture
of big data
• Need for an international
framework
• Proposes comprehensive
set of principles
• FAIR Principles
• Provides framework & plan
for African data science
capacity mobilization
initiative
• Proposes African Platform
Call to Endorse
13. Key Stakeholders
• Global Network of Science Academies (IAP)
• International Council for Science (ICSU)
• The World Academy of Sciences (TWAS)
• Research Data Alliance (RDA)
• NRENs (Internet Service Providers for Education)
• Association of African Universities (AAU)
• Network of African Science Academies (NASAC)
• African Research Councils (incl. DIRISA, funders)
• African Universities
• African Governments
• Other
17. Click to view Initiatives/Country
https://www.targetmap.com/viewer.aspx?reportId=56245
Please note: this is just a preview and data still to be cleaned and
updated and corrected.
19. Policy Framework
• Policy provide guidance & see to well-being of
all citizens - political will
• Policies to address (also see existing policies):
• FAIR Principles
• Raw vs Processed/other data
• Licensing
• Sensitive data
• Intellectual Property Issues
20. Policy Framework
• JKUAT (Kenya) Institutional Open Data Policy
• Uganda Draft Open Data Policy
• Madagascar Lobbying for Open Data Policy
• Towards a White Paper on Open Research Data
Strategy in Botswana
• White Paper on Science , Technology and Innovation in
South Africa
• South Africa Open Science Framework—EU/SA
dialogue
• Funder Policy: National Research Foundation (NRF)(SA)
• OECD Principles & Guidelines for Access to Research
Data from Public Funding
21. Capacity Building Framework
• Data collector vs data user vs data manager
Therefore the following are core aspects to capacity building:
• Research Data Management Planning
• Repositories
• Command Line Interpretation
• Software Development
• Data Organisation
• Data Cleaning
• Data Management & Databases
• Data Analysis & Visualisation (incl. programming)
22. Capacity Building Framework
• Engineers, Statisticians, Data Scientists, Librarians, Data
Curators, Researchers, System Administrators,
Policymakers, Auditors, Data Centre Managers, Data
Architects – Wim Hugo
• Different skills for different categories of data workers
• Existing workshops presented
• Tertiary curricula need to adapt more rapidly
• Never too early to learn to work with data, program
23. Incentives Framework
• Funder requirements changing
• Mechanisms that acknowledge publication of
datasets and to promote data sharing
• How do we deal with difficulties in sharing
data—what are the solutions
• Why is sharing essential
• How do we make sharing successful
• How do we lay the fears down and ensure buy-
in
25. South African Research and Infrastructure
Roadmap, DST
• Focus on global infrastructure in South Africa,
• South African Research Infrastructure
Roadmap (SARIR) has been developed to
facilitate a research infrastructure investment
programme
• “SARIR is intended to provide a strategic,
rational, medium to long term framework for
planning, implementing, monitoring and
evaluating the provision of research
infrastructures (RIs) necessary for a
competitive and sustainable national system of
innovation” (DST, 2016:2).
26. National Integrated
Cyberinfrastructure Service (NICIS)
• Research Infrastructure is dependent on cyber-
infrastructure.
• This dependency refers to access to physical sites, data
sharing, curation, provenance, protection, and developing
interoperability and metadata standards.
• From the start of any RI programme, E-science and cyber-
infrastructure need to allow virtual access and open access
to national and international data.
• Centre of High Performance Computing,
• South African National Research Network
• Data Intensive Research Initiative of South Africa
• will provide the necessary cyber-infrastructure capabilities
for the successful operation of all the RIs on a generic basis,
this will be known as the National Integrated
Cyberinfrastructure Service” (DST, 206:55
27. Ilifu
• http://www.researchsupport.uct.ac.za/ilifu
• Consortium of 6 Western Cape institutions
• Data-centric, high-performance computing facility
for data-intensive research
• Proto-typing distributed, federated cloud-based
infrastructure as a platform for data-intensive
research (African Research Cloud)
• Data-processing pipelines and e-science research
tools for big data analysis, visualisation and analytics
• Development and implementation of research data
management systems and tools
• Development of platforms, portals and middleware
to support access and collaborative research by
distributed teams on data-intensive projects
28. • Towards strategy and action plan,
implementation plan and governance structure
• Support strategic plans on Science, Technology,
Innovation
• Guide on creating and enabling environment to
harness science, technology and innovation
• Impact socio-economic development & industrialization
• Enhance education in developing & using technologies
• Support collaborative research development &
innovation
SADC Cyber-Infrastructure Framework
29. • Cyber-infrastructure is a key driver for a
knowledge based economy
• Comprises of technologies, skills, people and
policies which support generation, analysis,
transport, sharing, stewardship of information
(incl. data)
• Framework provides Roadmap towards Cyber-
infrastructure Strategy
31. Closing Remarks
• Collaborate & learn from one another –
strength in diversity
• Take ownership & collect/curate data in ethical
way
• Downloaders vs Uploaders
• Trusted & valid data managed in trusted way
• Exploit data for the benefit of society (Min
Naledi Pandor)
• Tell the African story, in an African way
33. Objectives
• Identify the issues that need to be dealt with when
drafting framework/roadmap of RI
• Raise awareness within REN’s regarding services
beyond connectivity
• Consult with YOU as experts to shape
• Our own ideas about the way forward
• Possible Phase two of project
• Identify a”writer”
• Identify a “team” to assist with a
framework/roadmap
We are living in an increasingly data driven world – facebook, twitter, air bnb, uber
Malaria outbreak 2014-2015
World Economic Forum 2018
How to get rid of fake data
Collaborative projects in Biomedical Sciences – genomics research – catching up with outbreaks, ebola, malaria and more
Bioinformatics legs of H3Africa (Human Heridity and Health in Africa)
Work among 30 institutions, 15 Afrucan countries, 2 partners outside Africa
To get Africa talking to one another
Engineers, Statisticians, Data Scientists, Librarians, Data Curators, Researchers, System Administrators, Policymakers, Auditors, Data Centre Managers, Data Architects – Wim Hugo
Ilifu is a consortium of Western Cape institutions that together will establish and operate a data-centric, high-performance computing facility for data-intensive research. The partner institutions are
Cape Peninsula University of Technology
Stellenbosch University
Sol Plaatje University
South African Radio Astronomy Observatory (SARAO, formerly SKA South Africa).
University of Cape Town (lead institute)
University of the Western Cape.
In addition to establishing and operating a data-intensive computing facility, the consortium will – in collaboration with local and international collaborators and partners – undertake research and development programmes for
Proto-typing a distributed, federated cloud-based infrastructure as a platform for data-intensive research, the African Research Cloud.
Development of data-processing pipelines and e-science research tools for big data analysis, visualisation and analytics.
Development and implementation of research data management systems and tools.
Development of platforms, portals and middleware to support access and collaborative research by distributed teams on data-intensive projects.