Description of the way in which the software sustainability institute engages the software in research community. It covers why, how, the programmes, how to select people, activities those selected do, benefits, recommendations and more.
Federation and Interoperability in the Nectar Research CloudOpenStack
Audience Level
Beginner
Synopsis
The Nectar Research Cloud provides an OpenStack cloud for Australia’s academic researchers. Since its inception in 2012 it has grown steadily to over 30,000 CPUs, with over 10,000 registered users from more than 50 research institutions. It is different to many clouds in being a federation across eight organisations, each of which runs cloud infrastructure in one or more data centres and contributes to a distributed help desk and user support. A Nectar core services team runs centralised cloud services. This presentation will give an overview of the experiences, challenges and benefits of running a federated OpenStack cloud and a short demonstration on using the Nectar cloud. We will also describe some current approaches that are looking to extend this federation to encompass other institutions including some in New Zealand, to extend the infrastructure using commercial cloud providers, and to move towards interoperability with the growing number of international science and research clouds through the new Open Research Cloud initiative.
Speaker Bio
Dr Paul Coddington is a Deputy Director of Nectar, responsible for the Nectar national Research Cloud, and also Deputy Director of eResearch SA. He has over 30 years experience in eResearch including computational science, high performance and distributed computing, cloud computing, software development, and research data management.
I am text block. Click edit button to change this text. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
Description of the way in which the software sustainability institute engages the software in research community. It covers why, how, the programmes, how to select people, activities those selected do, benefits, recommendations and more.
Federation and Interoperability in the Nectar Research CloudOpenStack
Audience Level
Beginner
Synopsis
The Nectar Research Cloud provides an OpenStack cloud for Australia’s academic researchers. Since its inception in 2012 it has grown steadily to over 30,000 CPUs, with over 10,000 registered users from more than 50 research institutions. It is different to many clouds in being a federation across eight organisations, each of which runs cloud infrastructure in one or more data centres and contributes to a distributed help desk and user support. A Nectar core services team runs centralised cloud services. This presentation will give an overview of the experiences, challenges and benefits of running a federated OpenStack cloud and a short demonstration on using the Nectar cloud. We will also describe some current approaches that are looking to extend this federation to encompass other institutions including some in New Zealand, to extend the infrastructure using commercial cloud providers, and to move towards interoperability with the growing number of international science and research clouds through the new Open Research Cloud initiative.
Speaker Bio
Dr Paul Coddington is a Deputy Director of Nectar, responsible for the Nectar national Research Cloud, and also Deputy Director of eResearch SA. He has over 30 years experience in eResearch including computational science, high performance and distributed computing, cloud computing, software development, and research data management.
I am text block. Click edit button to change this text. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.
presented by Stuart Macdonald at the College of Science and Engineering - "What's new for you in the Library“, Murray Library, Kings Buildings, University of Edinburgh. 28 May 2014
Covers research data, research data management, funder policies and the University's RDM policy, RDM services and support, awareness raising, training, progress so far.
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Sandra Gesing
Research computing is in an exciting era and has never as fast evolved as in the last 20 years. We can nowadays answer research questions that we could not even ask two decades ago. This has led to discoveries such as the analyses of DNA from Next-Generation Sequencing technologies. The increased complexity of software, data, hardware and lab instruments demands for more openness and sharing of data and methods. Researchers and educators are not necessarily IT specialists though. Thus, a further trend in research computing is the shift from system-centric design to user-centric design and interdisciplinary teams – complex solutions are offered in self-explanatory user interfaces, so-called science gateways or virtual research environments. I will present solutions and projects supporting users to be able to focus on their research questions without the need to become acquainted with the nitty-gritty details of the complex research computing infrastructure. Key aspects of the presented projects are usability and interoperability of computational methods, reproducibility of research results as well as sustainability of research software. Sustainability of research software has many facets. I advocate for improving the diversity in workforce development, career paths for research software engineers and for incentivizing their work via means beyond the traditional academic rewarding system.
Rebecca Grant DAH Research Presentationdri_ireland
Presentation given by Rebecca Grant of the Digital Repository of Ireland at the Research in the Digital Age symposium at the Trinity Long Room Hub, 14 July, 2015. The presentation gives an overview of some of the key concepts and drivers in research data management for the arts and humanities, and introduces the Digital Repository of Ireland as potential place of deposit for such data.
Slides | Research data literacy and the libraryColleen DeLory
Slides from the Dec. 8, 2016 Library Connect webinar "Research data literacy and the library" with Sarah Wright, Christian Lauersen and Anita de Waard. See the full webinar at: http://libraryconnect.elsevier.com/library-connect-webinars?commid=226043
Slides from presentation at CHI2015:
Paper Title: Designing for Citizen Data Analysis: A Cross-Sectional Case Study of a Multi-Domain Citizen Science Platform
Abstract:
Designing an effective and sustainable citizen science (CS) project requires consideration of a great number of factors. This makes the overall process unpredictable, even when a sound, user-centred design approach is followed by an experienced team of UX designers. Moreover, when such systems are deployed, the complexity of the resulting interactions challenges any attempt to generalisation from retrospective analysis. In this paper, we present a case study of the largest single platform of citizen driven data analysis projects to date, the Zooniverse. By eliciting, through structured reflection, experiences of core members of its design team, our grounded analysis yielded four sets of themes, focusing on Task Specificity, Community Development, Task Design and Public Relations and Engagement. For each, we propose a set of design claims (DCs), drawing comparisons to the literature on crowdsourcing and online communities to contextualise our findings.
Dissertation proposal defense for a comparative case study of virtual citizen science projects, focusing on the concepts of virtuality, technology, organizing, participation, and outcomes.
Successfully defended with no revisions on 5 May, 2010.
Understanding the Big Picture of e-ScienceAndrew Sallans
A. Sallans. "Understanding the Big Picture of e-Science." Presented at the 2011 eScience Bootcamp at the University of Virginia's Claude Moore Health Sciences Library. 4 March 2011
About the Webinar
Presenters will discuss the role of the library in the academic research enterprise and provide an overview of new librarian strategies, tools, and technologies developed to support the lifecycle of scholarly production and data curation. Specific challenges that face research libraries will be described and potential responses will be explored, along with a discussion of the types of skills and services that will be required for librarians to effectively curate research output.
presented by Stuart Macdonald at the College of Science and Engineering - "What's new for you in the Library“, Murray Library, Kings Buildings, University of Edinburgh. 28 May 2014
Covers research data, research data management, funder policies and the University's RDM policy, RDM services and support, awareness raising, training, progress so far.
Bridging Gaps and Broadening Participation inToday's and Future Research Com...Sandra Gesing
Research computing is in an exciting era and has never as fast evolved as in the last 20 years. We can nowadays answer research questions that we could not even ask two decades ago. This has led to discoveries such as the analyses of DNA from Next-Generation Sequencing technologies. The increased complexity of software, data, hardware and lab instruments demands for more openness and sharing of data and methods. Researchers and educators are not necessarily IT specialists though. Thus, a further trend in research computing is the shift from system-centric design to user-centric design and interdisciplinary teams – complex solutions are offered in self-explanatory user interfaces, so-called science gateways or virtual research environments. I will present solutions and projects supporting users to be able to focus on their research questions without the need to become acquainted with the nitty-gritty details of the complex research computing infrastructure. Key aspects of the presented projects are usability and interoperability of computational methods, reproducibility of research results as well as sustainability of research software. Sustainability of research software has many facets. I advocate for improving the diversity in workforce development, career paths for research software engineers and for incentivizing their work via means beyond the traditional academic rewarding system.
Rebecca Grant DAH Research Presentationdri_ireland
Presentation given by Rebecca Grant of the Digital Repository of Ireland at the Research in the Digital Age symposium at the Trinity Long Room Hub, 14 July, 2015. The presentation gives an overview of some of the key concepts and drivers in research data management for the arts and humanities, and introduces the Digital Repository of Ireland as potential place of deposit for such data.
Slides | Research data literacy and the libraryColleen DeLory
Slides from the Dec. 8, 2016 Library Connect webinar "Research data literacy and the library" with Sarah Wright, Christian Lauersen and Anita de Waard. See the full webinar at: http://libraryconnect.elsevier.com/library-connect-webinars?commid=226043
Slides from presentation at CHI2015:
Paper Title: Designing for Citizen Data Analysis: A Cross-Sectional Case Study of a Multi-Domain Citizen Science Platform
Abstract:
Designing an effective and sustainable citizen science (CS) project requires consideration of a great number of factors. This makes the overall process unpredictable, even when a sound, user-centred design approach is followed by an experienced team of UX designers. Moreover, when such systems are deployed, the complexity of the resulting interactions challenges any attempt to generalisation from retrospective analysis. In this paper, we present a case study of the largest single platform of citizen driven data analysis projects to date, the Zooniverse. By eliciting, through structured reflection, experiences of core members of its design team, our grounded analysis yielded four sets of themes, focusing on Task Specificity, Community Development, Task Design and Public Relations and Engagement. For each, we propose a set of design claims (DCs), drawing comparisons to the literature on crowdsourcing and online communities to contextualise our findings.
Dissertation proposal defense for a comparative case study of virtual citizen science projects, focusing on the concepts of virtuality, technology, organizing, participation, and outcomes.
Successfully defended with no revisions on 5 May, 2010.
Understanding the Big Picture of e-ScienceAndrew Sallans
A. Sallans. "Understanding the Big Picture of e-Science." Presented at the 2011 eScience Bootcamp at the University of Virginia's Claude Moore Health Sciences Library. 4 March 2011
About the Webinar
Presenters will discuss the role of the library in the academic research enterprise and provide an overview of new librarian strategies, tools, and technologies developed to support the lifecycle of scholarly production and data curation. Specific challenges that face research libraries will be described and potential responses will be explored, along with a discussion of the types of skills and services that will be required for librarians to effectively curate research output.
Science Gateways – Leveraging Modeling and Simulations in HPC Infrastructure...Sandra Gesing
A tutorial on science gateways at the cHiPSet Training School – New Trends in Modeling and Simulation in HPC Systems, 21-23 September 2016 in Bucharest, Romania.
As células son estruturas químicas que funcionan a temperatura constante, creando e mantendo a súa organización, ordenada e complexa, a expensas da materia que obteñen do contorno e que transforman nos millares de reaccións químicas que constitúen o metabolismo celular.
Administrador de Infraestructuras Ferroviarias (ADIF)Editorial CEP
Tema de muestra del manual que contiene el material adecuado para la preparación de las pruebas selectivas de acceso a las plazas de ingreso en el Administrador de Infraestructuras Ferroviarias (ADIF) para determinadas categorías profesionales.
O risco ambiental caracterízase principalmente por catro variables: magnitude do fenómeno, frecuencia con que ocorre, duración e extensión do espacio afectado.
En la presentación en el VII Congreso de Directores de Proyectos PMI conté mis experiencias en una parada de Refinería La Pampilla en Perú desarrollada en 2010.
SGCI Science Gateways Landscape in North AmericaSandra Gesing
Presentation at RDA
A) Approaches to interoperability among Science Gateways
B) Key ingredients for successful and vibrant virtual research communities
C) Sustainability of Science Gateways - what are the current models that work (and conversely have failed))
A description of software as infrastructure at NSF, and how Apache projects may be similar. What lessons can be shared from one organization to the other? How does science software compare with more general software?
Talk given at the Sciencedigital@UNGA75 on 29th September as part of a series of side events to mark the 75th anniversary of the United Nations General Assembly.
SGCI - The Science Gateways Community Institute: International Collaboration ...Sandra Gesing
Science gateways - also called virtual research environments or virtual labs - allow science and engineering communities to access shared data, software, computing services, instruments, and other resources specific to their disciplines. The US Science Gateways Community Institute (SGCI), opened in August 2016, provides free resources, services, experts, and ideas for creating and sustaining science gateways. It offers five areas of services to the science gateway developer and user communities: the Incubator, Extended Developer Support, the Scientific Software Collaborative, Community Engagement and Exchange, and Workforce Development. While all these services are available to US-based communities, the Incubator, the Scientific Software Collaborative and the Community Engagement and Exchange serve also the international communities. SGCI aims at supporting beyond borders on international scale with diverse measures and to form and deepen collaborations with partner organizations and coalitions beneficial and/or related to the science gateways community. Research topics are independent of national borders and researchers spread worldwide can benefit from each other’s research results, software, data and from lessons learned — via online materials and publications or at international events. The gateway community has benefitted from this type of exchange for years and one mission of SGCI is to support the international community. This talk will present related work describing the benefits of international collaborations generally, and specifically as they relate to science gateways. It will go into detail regarding SGCI’s ongoing work on an international scale and SGCI's work planned in the near future to foster collaborations under consideration of challenges such as different timezones and long distances between collaborators.
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
Presentation given to the Summer Institute for Earth Surface Dynamics (SIESD) 2014 at St. Anthony Falls Laboratory, University of Minnesota, about the Sediment Experimentalist Network (SEN). SEN is an EarthCube Research Coordination Network, whose goal is to integrate the efforts of sediment experimentalists and build a knowledge base for guidance on best practices for data collection and management.
The talk gives an overview on current trends for developing science gateways also called virtual labs or virtual research environments. It presents the services of the US Science Gateways Community Institute and international collaborations in the context of science gateways.
Big Data Europe SC6 WS 3: Ron Dekker, Director CESSDA European Open Science A...BigData_Europe
Slides for keynote talk at the Big Data Europe workshop nr 3 on 11.9.2017 in Amsterdam co-located with SEMANTiCS2017 conference by Ron Dekker, Director CESSDA: European Open Science Agenda: where we are and where we are going?
Australia's Environmental Predictive CapabilityTERN Australia
Federating world-leading research, data and technical capabilities to create Australia’s National Environmental Prediction System (NEPS).
Community consultation presentation.
3-12 February 2020
Dr Michelle Barker (Facilitator)
(Presentation v5)
Enabling Research without Geographical Boundaries via Collaborative Research ...Sandra Gesing
Collaborative research infrastructures on global scale for earth and space sciences face a plethora of challenges from technical implementations to organizational aspects. Science gateways – also known as virtual research environments (VREs) or virtual laboratories - address part of such challenges by providing end-to-end solutions to aid researchers to focus on their specific research questions without the need to become acquainted with the technical details of the complex underlying infrastructures. In general, they provide a single point of entry to tools and data irrespective of organizational boundaries and thus make scientific discoveries easier and faster. The importance of science gateways has been recognized on national as well as on international level by funding bodies and by organizations. For example, the US NSF has just funded a Science Gateways Community Institute, which offers support, consultancy and open accessible software repositories for users and developers; Horizon 2020 provides funding for virtual research environments in Europe, which has led to projects such as VRE4EIC (A Europe-wide Interoperable Virtual Research Environment to Empower Multidisciplinary Research Communities and Accelerate Innovation and Collaboration); national or continental research infrastructures such as XSEDE in the USA, Nectar in Australia or EGI in Europe support the development and uptake of science gateways; the global initiatives International Coalition on Science Gateways, the RDA Virtual Research Environment Interest Group as well as the IEEE Technical Area on Science Gateways have been founded to provide global leadership on future directions for science gateways in general and facilitate awareness for science gateways. This presentation will give an overview on these projects and initiatives aiming at supporting domain researchers and developers with measures for the efficient creation of science gateways, for increasing their usability and sustainability under consideration of the breadth of topics in the context of science gateways. It will go into detail for the challenges the community faces for collaborative research on global scale without geographical boundaries and will provide suggestions for further enhancing the outreach to domain researchers.
Trust and Accountability: experiences from the FAIRDOM Commons Initiative.Carole Goble
Presented at Digital Life 2018, Bergen, March 2018. In the Trust and Accountability session.
In recent years we have seen a change in expectations for the management and availability of all the outcomes of research (models, data, SOPs, software etc) and for greater transparency and reproduciblity in the method of research. The “FAIR” (Findable, Accessible, Interoperable, Reusable) Guiding Principles for stewardship [1] have proved to be an effective rallying-cry for community groups and for policy makers.
The FAIRDOM Initiative (FAIR Data Models Operations, http://www.fair-dom.org) supports Systems Biology research projects with their research data, methods and model management, with an emphasis on standards and sensitivity to asset sharing and credit anxiety. Our aim is a FAIR Research Commons that blends together the doing of research with the communication of research. The Platform has been installed by over 30 labs/projects and our public, centrally hosted FAIRDOMHub [2] supports the outcomes of 90+ projects. We are proud to support projects in Norway’s Digital Life programme.
2018 is our 10th anniversary. Over the past decade we learned a lot about trust between researchers, between researchers and platform developers and curators and between both these groups and funders. We have experienced the Tragedy of the Commons but also seen shifts in attitudes.
In this talk we will use our experiences in FAIRDOM to explore the political, economic, social and technical, social practicalities of Trust.
[1] Wilkinson et al (2016) The FAIR Guiding Principles for scientific data management and stewardship Scientific Data 3, doi:10.1038/sdata.2016.18
[2] Wolstencroft, et al (2016) FAIRDOMHub: a repository and collaboration environment for sharing systems biology research Nucleic Acids Research, 45(D1): D404-D407. DOI: 10.1093/nar/gkw1032
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of Information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies needs to be explicitly articulated, and we need to develop theories of change in context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Sgci iwsg-a-10-10-16
1. science gateway /sī′ əәns gāt′ wā′/ n.
1. an online community space for science and engineering research and
education.
2. a Web-based resource for accessing data, software, computing services,
and equipment specific to the needs of a science or engineering discipline.
Science Gateways: History, Successes,
Path Forward
A
tale
of
many
slide
templates
J
Thank
you
to
Michelle
Barker,
Richard
Sinnott and
David
Abramson
for
the
invitation
to
speak
2. Remember 2004?
• Microsoft, AOL and Jeeves ruled
the Web
• Facebook launched
– More users today than were on the
entire internet in 2004
• Google 5th most popular brand
behind AOL and Yahoo in
popularity
• Time magazine recommends
friendster as website of the year
• I first start working with science
gateways
3. Beginnings of the TeraGrid program
• TeraGrid develops Deep, Wide and Open strategy
• For the first time we are targeting not just the high-
end HPC user community
Despite the technological progress of grid technology and deployment, only a
minority of the scientific, engineering, and education community use
today’s national computing infrastructure. Our WIDE strategy addresses this
situation by working directly with specific community leaders who are building
discipline-specific cyberinfrastructure capabilities and resources for their
communities.
TeraGrid proposal, 2003
4. April 2006
Science Gateways
A new initiative for the TeraGrid
• Increasing investment by
communities in their own
cyberinfrastructure, but
heterogeneous:
• Resources
• Users – from expert to K-12
• Software stacks, policies
• Science Gateways
– Provide “TeraGrid Inside”
capabilities
– Leverage community investment
• Three common forms:
– Web-based Portals
– Application programs running on
users' machines but accessing
services in TeraGrid
– Coordinated access points
enabling users to move
seamlessly between TeraGrid and
other grids.
Workflow Composer
5. April 2006
But in the beginning, we had no services
We paid science teams to help us develop them
Science Gateway Prototype Discipline Science Partner(s) TeraGrid Liaison
Linked Environments for
Atmospheric Discovery
(LEAD)
Atmospheric Droegemeier (OU) Gannon (IU), Pennington (NCSA)
National Virtual Observatory
(NVO)
Astronomy Szalay (Johns Hopkins) Williams (Caltech)
Network for Computational
Nanotechnology (NCN) and
“nanoHUB”
Nanotechnology Lundstrum (PU) Goasguen (PU)
Open Life Sciences Gateway Biomedicine and Biology Schneewind (UC), Osterman
(Burnham/UCSD), DeLong
(MIT), Dusko (INRA)
Stevens (UC/Argonne)
Biology and Biomedical
Science Gateway
Biomedicine and Biology Cunningham (Duke), Magnuson
(UNC)
Reed (UNC), Blatecky (UNC)
Neutron Science Instrument
Gateway
Physics Cobb (ORNL) Cobb (ORNL)
Grid Analysis Environment High-Energy Physics Newman (Caltech) Bunn (Caltech)
Transportation System
Decision Support
Homeland Security Stephen Eubanks (LANL) Beckman (Argonne)
Groundwater/Flood Modeling Environmental Wells (UT-Austin), Engel (ORNL) Boisseau (TACC)
Science Grid
[GrPhyN/ivDGL/Grid3]
Multiple Pordes (FNAL), Huth (Harvard),
Avery (Uflorida)
Foster (UC/Argonne), Kesselman (USC-
ISI), Livny (UW)
6. So how will we meet all these needs?
• With RATS! (Requirements
Analysis Teams)
• Collection, analysis and
consolidation of requirements to
jump start the work
– Interviews with 10 Gateways
– Common user models,
accounting needs, scheduling
needs
• Summarized requirements for
each TeraGrid working group
– Accounting, Security, Web
Services, Software
• Areas for more study identified
• Primer outline for new Gateways
in progress
• And milestones
April 2006
7. April 2006
Linked Environments for Atmospheric Discovery
LEAD
•Providing tools that are needed to make accurate
predictions of tornados and hurricanes
•Data exploration and Grid workflow
8. Social Informatics Data Grid
Collaborative access to large, complex datasets
•SIDGrid is unique among
social science data archive
projects
–Streaming data which change
over time
•Voice, video, images (e.g. fMRI),
text, numerical (e.g. heart rate,
eye movement)
–Investigate multiple datasets,
collected at different time
scales, simultaneously
•Large data requirements
•Sophisticated analysis tools
9. Viewing multimodal data like a
symphony conductor
•“Music-score” display and
synchronized playback of video
and audio files
– Pitch tracks
– Text
– Head nods, pause, gesture
references
•Central archive of multi-modal
data, annotations, and analyses
– Distributed annotation efforts by
multiple researchers working on a
common data set
•History of updates
•Computational tools
– Distributed acoustic analysis using
Praat
– Statistical analysis using R
– Matrix computations using Matlab
and Octave
Source: Studying Discourse and Dialog with SIDGrid, Levow, 2008
10. Over the years, the program developed
I gave lots and lots of talks
11. Eventually we had a program
• And
customers
• Starting
in
2013,
gateway
users
surpass
command
line
users
in
XSEDE
Gateways
Login
13. Cyberinfrastructure for Phylogenetic Research
(CIPRES)
PI Mark Miller, SDSC, www.phylo.org
• 210
US
research
universities
– Harvard,
Yale,
UC
Berkeley,
Stanford,
etc.
– Non-‐PhD
granting
colleges
(including
one
all-‐
women’s
college,
community
colleges,
and
Hispanic-‐serving
institutions)
• 3 K-‐12
school
systems
• 43
non-‐governmental
organizations,
– Museums
including
the
Smithsonian
Institution,
the
American
Museum
of
Natural
History,
and
the
Field
Museum),
– Botanical
gardens,
(e.g.
Chicago,
Rancho
Santa
Ana,
and
New
York)
– Institutes
(e.g.
JCVI
and
Broad)
• 10
US
governmental
agencies
– Including
NIH,
USDA,
NOAA,
US
Forest
Service
• Curriculum
delivery
(76)
• 2000+
publications
since
2010
• 47%
of
all
XSEDE
users
in
Q4
2015
14. CIPRES’ reach is deep and wide
Nature article, Feb 2016
Mass. state science fair, July 2012
15. Saving wetlands with the Simulocean science gateway
Football field-sized parcel of land lost every hour
It's
important
to
enhance
the
collaboration
among
earth
scientists,
computer
scientists,
cyberinfrastructure
specialists
and
coastal
engineers
tasked
with
solving
the
sustainability
issues
of
deltaic
coasts
like
those
in
Louisiana.
Dr.
Jian
Tao,
research
scientist,
LSU
Source:
XSEDE
External
Relations
16. Some NSF programs even specify the use of gateways
This is the right direction to go! Gateways as cost-effective infrastructure
17. • Developers
typically
– work
in
isolation
– must
bridge
to
variety
of
resources
– need
building
blocks
in
order
to
focus
on
higher-‐level
functionality
– struggle
to
secure
sustainable
funding
Despite many successes, there are still
challenges
Gateways often funded as 3-year research projects
Early
adopters
Publicity
Wider
adoption
Funding
ends
Scientists
disillusioned
New
project
prototype
18. In 2014, we sent a survey to 29,000 NSF PIs
and academic CIOs and CTOs
5000 responded
We wanted to understand both the
importance of gateways and challenges
developers face
19. Specialized
Resources
Percent
Data
collections
75%
Data
analysis
tools,
including
visualization
and
mining 72%
Computational
tools 72%
Tools
for
rapidly
publishing
and/or
finding
articles
and
data
specific
to
my
domain
69%
Educational
tools 67%
Platforms
for
fostering
group
or
community
collaboration 63%
Simplified
interfaces
that
eliminate
the
need
to
learn
coding 62%
Citizen
science
and
other
public
engagement
resources 47%
Workflows
that
automate
or
capture
tasks
or
processes 42%
Scientific
instruments,
such
as
telescopes,
microscopes,
or
sensors 39%
We learned that 88% of respondents felt Web-
based applications were important to their work
n=4,004,
or
88%
of
4,538
researcher/educators.
Percentage
indicates
these
resources
are
“somewhat”
or
“very”
important
to
their
work.
20. 57% played some role in gateway creation
and these gateways were used for a variety of purposes
n
of
application
types=7,805,
by
2,756
creators
(out
of
2,819);
mean=2.8
application
types
per
application
creator
21. 34% 36%
20%
17%
31%
26%
42%
16%
30%
18%
45% 44%
14% 15%
0%
5%
10%
15%
20%
25%
30%
35%
40%
45%
50%
Usability
Consultant
Graphic
Designer
Community
Liaison/
Evangelist
Project
Manager
Professional
Software
Developer
Security
Expert
Quality
Assurance
and
Testing
Expert
Wished
we
had
this
Yes,
we
had
this
A variety of expertise was needed for successful
gateway development
n=2,756
respondents
or
98%
of
application
creators
24. A successful gateway institute will provide
leadership to
– 1)
bring
science
gateway
developers
together
with
each
other
and
with
the
developers
and
operators
of
existing
and
potential
cyberinfrastructure elements
that
science
gateways
integrate
and
enable
the
use
of
• in
order
to
promote
the
efficient,
effective,
and
sustainable
development
of
scientific
web
and
mobile
interfaces
– 2)
educate
developers
and
the
next
generation
of
investigators
to
effectively
use
the
gateway
software
ecosystem to
solve
real
research
problems;
and
– 3)
educate
the
next
generation
of
researchers
to
enable
them
to
create
the
software
cyberinfrastructure required
to
both
advance
fundamental
understanding
of
science
gateways
and
enable
researchers
to
address
the
grand
challenge
problems
of
the
future
26. • Incubator
– Modeled
after
business
incubators
– Diverse
expertise
on
demand
• Extended
Developer
Support
– We
help
others
build
gateways
and
teach
them
in
the
process
• Scientific
Software
Collaborative
– Listing
of
both
functional
gateways
and
gateway
software
• Community
Engagement
and
Exchange
– Annual
conference
– Gateways
in
the
news
– Job
postings
– International
and
inter-‐
agency
community
building
– Campus
expertise
• Workforce
Development
– Student
interns
– Gateways
in
the
classroom
SGCI Highlights
27. • New
developments
in
electron
detectors
and
electron
microscopes
now
provide
images
of
macromolecules
(protein,
RNA,
etc.)
that
can
be
determined
to
atomic
resolution
• Inherent
low
signal
to
noise
ratio
means
300,000
images
needed
for
one
object
of
interest
• HPC
resources
to
calculate
atomic
structures
based
upon
these
thousands
of
images
• Now
discovering
the
structures
of
macromolecules
that
were
previously
unattainable
using
traditional
methods
• The
importance
of
these
discoveries
has
brought
global
interest
into
our
field
from
scientists
without
HPC
training
Our first customer
Dr. Michael Cianfrocco, Cryo-EM gateway
Source:
Michael
Cianfrocco
28. The
cryo-‐EM
science
gateway
will
offer
users
access
to
HPC
resources,
requiring
only
that
they
have
raw
cryo-‐EM
data.
This
will
have
a
wide-‐ranging
impact
as
national
cryo-‐EM
centers
are
coming
online
in
the
coming
years,
requiring
that
users
have
a
location
to
process
their
data.
We
are
building
a
gateway
that
can
handle
all
cryo-‐EM
data,
whereas,
currently,
every
user
has
to
navigate
the
complex
work
of
HPC
data
analysis
to
either
install
software
on
local
clusters
or
get
access
to
national
centers
for
data
analysis.
Long
term,
we
will
incorporate
workflows
that
will
guide
new
users
through
the
processing
pipeline,
helping
them
assess
data
quality
along
the
way.
This
pipeline
will
be
the
first
of
its
kind.
Source:
Michael
Cianfrocco
29. The Institute allows us to expand our focus
beyond HPC
• Sensor-‐based
gateways
• Interfaces
to
instruments
– Telescopes,
microscopes,
ultracentrifuges,
more
• Gateways
that
access
data
collections
• Citizen
science
• Gateways
that
use
clouds
or
campus
resources
There is a whole wide world of gateways
out there
30. Sage Bionetworks developing predictors of
disease
• Synapse
is
an
open
computational
platform
used
by
Challenge
teams
spread
across
the
globe
to
crowdsource
questions
in
biology
and
medicine
http://sagebase.org/challenges/
33. How do you find a gateway?
We plan to design a marketplace
One that would interact with other marketplaces
34. Vision for SGCI success
5-10 years from now
• Science
gateways
form
a
vibrant
community
– Inter-‐agency,
international,
collegial
• Creating
gateways
is
easier
– Created
with
more
thoughtfulness,
so
they
are
more
sustainable
• Gateway
developers
have
stable
career
paths
– More
efficient
environments
on
campuses
• Students
are
excited
to
stay
in
the
sciences
• Radical
changes
in
how
research
is
conducted
35. Beyond the institute
• The
effects
of
the
democratization
on
science
• Gateways’
role
in
reproducibility
36. Benefits of democratization
New areas of study
• Breakthroughs
don’t
always
come
from
assembling
the
best
and
the
brightest
in
a
closed
room
• Lowering
barriers
to
resources
encourages
experimentation
– 2010
study
from
MIT
and
UCSD
compared
research
from
the
National
Institutes
for
Health
(NIH)
and
the
non-‐profit,
Howard
Hughes
Medical
Institute
(HHMI)
• Riskier
HHMI
grants
produced
more
innovative
and
influential
research
37. Gateways’ role in reproducibility
Exploring collaboration between SGCI and Whole Tale
• How
can
we
design
gateways
in
support
of
reproducible
science?
With
ties
to
publishing?
39. Jupyter notebooks
Wonderful examples of interactive gateway development
https://anaconda.org/jbednar/nyc_taxi/notebook
• Additional
work
needed
to
support
very
large
user
communities?
Very
large
data?
40. Gateways interfacing with
other gateways
Science
Gateway
Platform
as
a
Service
(SciGaP)
provides
application
programmer
interfaces
(APIs)
to
hosted
generic
infrastructure
services
that
can
be
used
by
domain
science
communities
to
create
Science
Gateways.
RDF:
a
directed,
labeled
graph
data
format
for
representing
information
in
the
Web
SPARQL:
query
language
for
RDF
across
diverse
data
sources
41. • US
workshops
– Gateway
Computing
Environments
workshops
since
2005
• European
workshops
– International
Workshop
on
Science
Gateways
since
2009
• Australian
workshops
– IWSG-‐A
since
2015
• Joint
special
issue
journals
combine
submissions
from
all
of
the
above
Final note: International collaborations
42. • Provide
leadership
on
future
directions
for
science
gateways
• Facilitate
awareness
and
international,
regional
and
national
developments
in
science
gateways
• Identify
and
share
best
practice
in
the
field
• Science
Gateways
Community
Institute (USA)
• NeCTAR (Australia)
• NESI (New
Zealand)
• Sci-‐GaIA (Africa)
• Academia
Sinica
Grid
Computing
Center (Taiwan)
• Software
Sustainability
Institute (UK)
• VRE4E1C (Europe)
• IWSG (Europe)
• CANARIE (Canada)
• Research
Data
Canada (Canada)
• IEEE
Technical
Area
on
Science
Gateways (International)
International Coalition on Science Gateways
Michelle Barker, Nectar providing leadership
http://www.icsciencegateways.org/