Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

The African Open Science Platform: Policy | Infrastructure | Skills | Incentives/Ina Smith

48 views

Published on

Presented during the IST-Africa2018 Conference, Botswana.

Published in: Data & Analytics
  • Be the first to comment

  • Be the first to like this

The African Open Science Platform: Policy | Infrastructure | Skills | Incentives/Ina Smith

  1. 1. The African Open Science Platform Policy | Infrastructure | Skills | Incentives Presented by Ina Smith IST-Africa2018, 10 May 2018
  2. 2. Data Driven World
  3. 3. Fake Data, Fake Research http://www.bbc.com/news/science-environment-39357819
  4. 4. Open Science (incl. Data) Defined “Open Science is the practice of science in such a way that others can collaborate and contribute, where research data, lab notes and other research processes are freely available, under terms that enable reuse, redistribution and reproduction of the research and its underlying data and methods.” - FOSTER Project, funded by the European Commission
  5. 5. Open Data, Open Science & Research Lifecycle (Foster)
  6. 6. Original Research Data Lifecycle image from University of California, Santa Cruz http://guides.library.ucsc.edu/datamanagement/ Repositories Repositories Tools Plan Policy&Infrastructure
  7. 7. Benefits of Open Research Data (1) • Predict trends, help make informed decisions, informs policy • Collaboration advances science, discovery • Drives development, improves livelihoods of citizens of countries • Increases return on investment (funders), avoid duplication – research is expensive • More and more entrepreneurs are using data in innovative ways, creating more jobs which is much needed on our continent
  8. 8. Benefits of Open Research Data (2) • Helps improve service delivery e.g. mobile apps, robots, artificial intelligence (AI) • Provides evidence for research conducted • Data potentially has far more outcomes when open, with a higher impact • Use for tenure/promotion/measure contribution of researchers (data citation)
  9. 9. https://theodi.org/news/new-report-reveals-how-open-data-is-fuelling-problemsolving-in-the-developing- world-from-mapping-ebola-to-protecting-banana-crops Data in support of the SDGs
  10. 10. Protecting banana farmers’ livelihoods (Uganda) Using maps to increase access to education (Kenya) Monitoring child malnutrition (Uganda)
  11. 11. http://theconversation.com/what-115-years-of-data-tells-us-about-africas-battle-with-malaria-past-and- present-85482 The prevalence of malaria infection in sub- Saharan Africa today is at the lowest point since 1900.
  12. 12. http://www.nature.com/news/data-sharing-make- outbreak-research-open-access-1.16966
  13. 13. Tunisia Data Computing Centre el Khawarizmi Kenya Data Centre & Services @ Research & Education Network (KENET) South Africa DIRISA, Data Intensive Research Cloud Infrastructure Initiatives – ARC, SADIRC, Ilifu High Performance Computing Centres Botswana, Lesotho, Mozambique, SA, Tanzania, Zambia, Zimbabwe Open Data for Africa African Development Bank Initiatives,HPCCs,Services
  14. 14. Ilifu • http://www.researchsupport.uct.ac.za/ilifu • Consortium of 6 Western Cape institutions • Data-centric, high-performance computing facility for data-intensive research • Proto-typing distributed, federated cloud-based infrastructure as a platform for data-intensive research (African Research Cloud) • Data-processing pipelines and e-science research tools for big data analysis, visualisation and analytics • Development and implementation of research data management systems and tools • Development of platforms, portals and middleware to support access and collaborative research by distributed teams on data-intensive projects
  15. 15. Square Kilometre Array H3ABioNet Genomics Data GBIF Biodiversity Data InternationalCollaborations
  16. 16. H3ABioNet (H3Africa) 30 institutions, 15 African countries, 2 partners outside Africa
  17. 17. Square Kilometre Array (SKA) • Data collection on a massive scale • Telescope array to consist of 250,000 radio antennas between Australia & SA • Investment in machine learning and artificial intelligence software tools to enable data analysis • 400+ engineers and technicians in infrastructure, fibre optics, data collection • Supercomputers to process data (IBM) • To come: super computer 3x times power of world’s current fastest computer (Tianhe-2) to cope with SKA data
  18. 18. Testing Albert Einstein’s general theory of relativity; imaging neutral hydrogen—the building blocks for stars – in the distant universe; and examining galaxies that were formed billions of years ago. “Construction of the SKA is due to begin in 2018 and finish sometime in the middle of the next decade. Data acquisition will begin in 2020, requiring a level of processing power and data management know-how that outstretches current capabilities. Astronomers estimate that the project will generate 35,000- DVDs-worth of data every second. This is equivalent to “the whole world wide web every day,” said Fanaroff.”
  19. 19. African Open Science Platform (AOSP) • Platform = opportunity to engage in dialogue, create awareness, connect all, provide continental view • Funded by SA Dept. of Science & Technology through National Research Foundation • 3 years (1 Nov. 2016 – 31 Oct. 2019) • Managed by Academy of Science of South Africa (ASSAf) • Through ASSAf hosting ICSU Regional Office for Africa (ICSU ROA) • Direction from CODATA http://africanopenscience.org.za/
  20. 20. Accord on Open Data in a Big Data World • Proposes comprehensive set of principles • FAIR Principles • Data as open possible, as closed necessary • Provides framework & plan for African data science capacity mobilization initiative – AOSP Call to Endorse
  21. 21. “Several open science activities are underway across Africa, but a great deal will be gained if, in the context of developing inter-regional links, these activities were to be coordinated and developed through such a coordinating initiative.” - CODATA
  22. 22. Key Stakeholders • Global Network of Science Academies (IAP) • International Council for Science (ICSU) • The World Academy of Sciences (TWAS) • Research Data Alliance (RDA) • NRENs (Internet Service Providers for Education) • Association of African Universities (AAU) • Network of African Science Academies (NASAC) • African Research Councils (incl. DIRISA, funders) • African Universities • African Governments • Other
  23. 23. AOSP Focus Areas (Frameworks & Roadmaps) Policy Infrastructur Capacity Building Incentives SA, Uganda, Botswana, Madagascar, Kenya, Ethiopia
  24. 24. Intellectual Property Rights Policy “In many African countries, intellectual property protection is undeveloped, ineffective, expensive and unenforced and in some African countries there exists uncertainty on protection of IP and the threat of innovation being stolen away from inventors.” https://ipstrategy.com/2016/12/05/a-new-look-at-intellectual-property-and- innovation-in-africa/
  25. 25. ICTInfrastructure(NRENs) ASREN WACREN UbuntuNet
  26. 26. Africa Data Consensus Study • Adopted in March 2015 at High Level Conference on Data Revolution • Strategy for implementing data revolution in Africa • Plan of action to be guided by United Nations Economic Commission for Africa (UNECA), African Union Commission (AUC), African Development Bank (AfDB), supported by UN Development Programme (UNDP), UN Populations Fund (UNFPA) • Implemented in collaboration with partner institutions from public & private sectors, civil society organisations
  27. 27. Services offered by UbuntuNet NRENs [Source: Colin Wright SADC/ET-ST1/1/2016/11 Document]
  28. 28. Grid & Cloud-based ICT Infrastructure Support Activity Tools (some discipline specific) Bandwidth Internet Connectivity NetMap, Eduroam Data Transfer Africa Globus Online Data Organisation, Storage, Access (Data Centres) Spreadsheets, REDCap, H3APRDP, MIPAR, Data Repositories e.g. Dataverse, Invenio, Dspace, Zenodo, re3data.org. Also FigShare (Ilifu), BaseGeo Data Management Spreadsheets, REDCap, H3APRDP, MIPAR, Data Repositories e.g. Dataverse, Invenio, Dspace, Zenodo, re3data.org. Also FigShare (Ilifu), BaseGeo, SQL, Hadoop(for really large datasets), MySQL, PostgreSQL
  29. 29. Activity Tools (some discipline specific) Data Analysis & Visualisation Galaxy, Job Management System, WebProtege, eBiokits, R, R Markdown, ggplot2, Python, MATLAB, C, C++ Data Processing Medical Image Processor and Repository (MIPAR) Data Execution (pipelines/workflow) Futuregateway, OAR, PHP, JSON, WAMP, DAGMan, Pegasus, Makeflow Data Collection Raspberry Pi, sensors & sensor networks, electron & astronomical microscopes, medical imaging Data Mining Data Cleaning OpenRefine
  30. 30. Activity Tools (some discipline specific) Data Description Metadata standards incl. DOIs Batch/Grid Computing Africa Grid Science Gateway, European Grid Infrastructure, SURFSara (Dutch National Grid) Research Data Management Planning DMPTool (DMPRoadmap), FigShare Supercomputers/High Performance Computing Tunisia (Data Computing Centre el Khawarizmi), Kenya (Data Centre & Services), South Africa (DIRISA, Data Intensive Research Cloud Infrastructure Initiatives – ARC, SADIRC, Ilifu), Countries: Botswana, Lesotho, Mozambique, SA, Tanzania, Zambia, Zimbabwe. Machine Learning R, Python Artificial Intelligence (Artificial Neural Networks) R, Python, Tensorflow, deep convolutional networks
  31. 31. Activity Tools (some discipline specific) Identity Provider Federated Identity Services Certification Authority Accredited by EUGRIDPMA. Digital certificates to access eduGain resources (including open research ID - ORCID) Security Data Licensing Copyright, Creative Commons, GNU Data Citation Command Line Interface UNIX Shell, bash (incl. editors e.g. Nano, Emacs, Vim, etc.) Software Development Git, GitHub, Mercurial Other User controlled light paths, videoconferencing, connecting e-resources e.g. simulators, accelerators, dedicated point-to-point Internet Protocol circuits General Publishing Open Access Publications, Copyright, Creative Commons Open Peer Review, Data Citations, DOIs, ORCID IDs, working with data using LaTeX, XML, Markdown, BiBTeX, JSON, JabRef, Zotero
  32. 32. https://www.targetmap.com/viewer.aspx?reportId=56245 Please note: this is just a preview and data still to be cleaned and updated and corrected. African Open Science Platform (AOSP) Landscape Study
  33. 33. AOSP ICT Infrastructure Framework • Pascal Hoba (Chief Executive Officer, UbuntuNet Alliance) • Yousef Torman (Managing Director, ASREN) • Ousmane Moussa Tessa (Chief Executive Officer, NigerREN & Board Member WACREN) • Duncan Greaves (Chief Executive Officer, TENET) • David Walwyn (University of Pretoria) • Nicky Mulder (Head: Computational Biology, UCT & Lead: H3ABioNet) • Mélianie Raymond, Senior Programme Officer for Node Development, GBIF Secretariat – TBC – via Skype • Jasper Horrell (SKA) • Leon Staphorst (Head Head of the South African Research Network (SANReN), CSIR) • Jean-Claude Burgelman (EU)
  34. 34. http://internationaldataweek.org/
  35. 35. “You can have data without information, but you cannot have information without data.” – Daniel Keys Moran, an American computer programmer and science fiction writer. “Data is a precious thing and will last longer than the systems themselves.” – Tim Berners-Lee, inventor of the World Wide Web.
  36. 36. Thank you Ina Smith Project Manager, African Open Science Platform Project, Academy of Science of South Africa (ASSAf) ina@assaf.org.za Visit http://africanopenscience.org.za

×