Video: https://www.simonsfoundation.org/event/collaborations-in-the-extreme-the-rise-of-open-code-development-in-the-scientific-community/
The internet is changing the scientific landscape by fostering international, interdisciplinary and collaborative software development. More than ever before, software is a crucial component of any scientific result. The ability to easily share code is reshaping expectations about reproducibility -- a fundamental tenet of the scientific process. In this lecture, Kelle Cruz will briefly provide the backstory of how these shifts have come about, describe some of the most impactful open source projects, and discuss efforts currently underway aimed at ensuring these community-led projects are sustainable and receive support.
Talk given at the SSSW 2013 Semantic Web Summerschool.
Part 1: What is "Semantic Web" (in 4 principles and 1 movie)
Part 2: What question can we ask now that we couldn't ask 10 years ago
Part 3: Treat Computer Science as a *science*, not just as engineering!
(this part a short version of http://slidesha.re/SaUhS4 )
Discover why Python is better for Data Science: the whole workflow of Data Analysis is covered by Python. Tools for various tasks are shown, including: workflow, data analysis, data visualization, integration with Hadoop ecosystem, and communication.
Talk given at the SSSW 2013 Semantic Web Summerschool.
Part 1: What is "Semantic Web" (in 4 principles and 1 movie)
Part 2: What question can we ask now that we couldn't ask 10 years ago
Part 3: Treat Computer Science as a *science*, not just as engineering!
(this part a short version of http://slidesha.re/SaUhS4 )
Discover why Python is better for Data Science: the whole workflow of Data Analysis is covered by Python. Tools for various tasks are shown, including: workflow, data analysis, data visualization, integration with Hadoop ecosystem, and communication.
'Scikit-project': How open source is empowering open science – and vice versaNathan Shammah
Open-source pipelines are accelerating scientific discovery, by empowering not only reproducibility of research results but also generalizability of methods. I address the rise of open source in scientific research in quantum physics and quantum information and introduce `scikit-project` a cookbook with best practices for (data) scientists.
See also https://github.com/Machine-Learning-Tokyo/MLT_Talks
https://bigscience.huggingface.co/
EN: Presentation of the BigScience project: a research initiative launched by HuggingFace and aiming to build a large language model (inspired by OpenAI and GPTx) over multiple languages and a very large processing cluster. The participants plan to investigate the dataset and the model from all angles: bias, social impact, capabilities, limitations, ethics, potential improvements, specific domain performances, carbon impact, general AI/cognitive research landscape.
FR : Présentation du projet Bigscience : un projet de recherche ouvert lancé par HuggingFace et qui a pour objectif de contruire un modèle de langue (ie un peu comme openAI et GPT-3) mais en explorant les problèmes liés au jeux de données et au modèle selon les angles des biais cognitifs, de l'impact social et environemental, des limites éthiques, des possibles gain de performance et de l'impact général de ce type d'approche lorsque le but n'est pas seulement "d'avoir un plus gros modèle".
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...3TU.Datacentrum
3TU.Datacentrum Symposium Research Data Management:
Funder requirements, Questions and Solutions
At this symposium the funding organisation NWO and the European Commission explained their vision, plans and requirements. Researchers from the three universities of technology shared their experiences of data management in different stages of research. And the Research Data Services team informed the audience about research data management services offered by 3TU.Datacentrum.
The 3TU.Datacentrum symposium took place at the TU Delft (26 May), University of Twente (2 June) and TU Eindhoven (11 June) for and with local researchers.
More information on: datacentrum.3tu.nl/over-3tudatacentrum/symposium-2014
Digital Identity is fundamental to collaboration in bioinformatics research and development because it enables attribution, contribution, publication to be recorded and quantified.
However, current models of identity are often obsolete and have problems capturing both small contributions "microattribution" and large contributions "mega-attribution" in Science. Without adequate identity mechanisms, the incentive for collaboration can be reduced, and the utility of collaborative social tools hindered.
Using examples of metabolic pathway analysis with the taverna workbench and myexperiment.org, this talk will illustrate problems and solutions to identifying scientists accurately and effectively in collaborative bioinformatics networks on the Web.
Keynote on software sustainability given at the 2nd Annual Netherlands eScience Symposium, November 2014.
Based on the article
Carole Goble ,
Better Software, Better Research
Issue No.05 - Sept.-Oct. (2014 vol.18)
pp: 4-8
IEEE Computer Society
http://www.computer.org/csdl/mags/ic/2014/05/mic2014050004.pdf
http://doi.ieeecomputersociety.org/10.1109/MIC.2014.88
http://www.software.ac.uk/resources/publications/better-software-better-research
Presentation for Harvard's ABCD Technology in Education group:
The Institute for Quantitative Social Science (IQSS) is a unique entity at Harvard - it combines research, software development, and specialized services to provide innovative solutions to research and scholarship problems at Harvard and beyond. I will talk about the software projects that IQSS is currently working on (Dataverse, Zelig, Consilience, and OpenScholar), including the research and development processes, the benefits provided to the Harvard community, and the impacts on research and scholarship.
Open Source Collaboration in Drug Discovery in PharmaKees van Bochove
How pre-competitive collaboration in the pharmaceutical sector through open source platforms enables joint innovation of academics, pharma, SMEs and non-profits.
OpenStack is an IaaS provider software written in Python. As such, it provides a massive scalable operating system and services like: Image, Storage, Object, Compute, etc.
This talks aims to give the audience an overview about OpenStack, its capabilities, its modules, coding styles, workflow and organization.
As a successful community driven development case, it’s definitely a good reference for anyone willing to take that road or maybe joining existing projects.
Outlines the vision and philosophy for Wakari.io with a basic overview of popular python data analysis packages. Most of the talk is conducted in Wakari and is not visible on these slides. 90 minutes for PyData NYC, November 8th 2013.
Crab - A Python Framework for Building Recommendation SystemsMarcel Caraciolo
Keynote introducing the Framework Crab: A Python toolkit for bulding recommendation engines. It is a open source project as an alternative for Mahout Taste for Python developers.
Presented at XII Python User Group Pernambuco, 07-05-2011 at CIN/UFPE.
(Keynote presented at the WEESR and REVE 2022 workshop colocated with SPLC2022)
Social coding platforms centred around git provide explicit facilities to share code between projects: forks, pull requests, cherry-picking to name but a few. Variant forks are an interesting phenomenon in that respect, as it permits for different projects to peacefully co-exist, yet explicitly acknowledge the common ancestry. The phenomenon of variant forks is quite common: in a recent study we found more than 400 open source projects originating from a common code-base. In this keynote we share our insights on the phenomenon of variant forks on social coding platforms. First, we report the results of an exploratory qualitative analysis on the motivations for creating variant forks. Next we illustrate how bug fixes may (should?) be transferred from one variant to another. As such we hope to inspire researchers to study the phenomenon of variant forks.
The Trials and Tribulations of Software Development in Astronomy: Where are w...Kelle Cruz
With the ever-growing role of software development and computer programming in astronomical research, our community is faced with many new challenges in nearly every aspect of our profession. In this talk, I first will describe the various types of software and the many different roles it plays in the research cycle. I will also describe my personal journey — involving Astropy, the AAS, a National Academies panel, and the 2020 Decadal Survey — which has both shaped my understanding and enabled me to influence the environment around software development. Throughout the talk, I will highlight resources and tactics which individual researchers and teams can use to adapt to the rapidly changing landscape. I will also describe actions which could be taken to influence the prevailing culture to further recognize and value software development as a fundamental and crucial aspect of the entire astronomical research process.
https://doi.org/10.5281/zenodo.7719585
Lessons from the Astropy Code of ConductKelle Cruz
Talk given on Oct 15, 2019 at the Inclusive Astronomy 2 conference held in at STScI in Baltimore, MD. Video is here: https://cloudproject.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=5bf2ea68-b046-4c13-9a4a-aaea00dae5f8
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. DOI:10.5281/zenodo.3515334.
More Related Content
Similar to Collaborations in the Extreme: The rise of open code development in the scientific community
'Scikit-project': How open source is empowering open science – and vice versaNathan Shammah
Open-source pipelines are accelerating scientific discovery, by empowering not only reproducibility of research results but also generalizability of methods. I address the rise of open source in scientific research in quantum physics and quantum information and introduce `scikit-project` a cookbook with best practices for (data) scientists.
See also https://github.com/Machine-Learning-Tokyo/MLT_Talks
https://bigscience.huggingface.co/
EN: Presentation of the BigScience project: a research initiative launched by HuggingFace and aiming to build a large language model (inspired by OpenAI and GPTx) over multiple languages and a very large processing cluster. The participants plan to investigate the dataset and the model from all angles: bias, social impact, capabilities, limitations, ethics, potential improvements, specific domain performances, carbon impact, general AI/cognitive research landscape.
FR : Présentation du projet Bigscience : un projet de recherche ouvert lancé par HuggingFace et qui a pour objectif de contruire un modèle de langue (ie un peu comme openAI et GPT-3) mais en explorant les problèmes liés au jeux de données et au modèle selon les angles des biais cognitifs, de l'impact social et environemental, des limites éthiques, des possibles gain de performance et de l'impact général de ce type d'approche lorsque le but n'est pas seulement "d'avoir un plus gros modèle".
[3.6] Beyond Data Sharing - Pieter van Gorp [3TU.Datacentrum Symposium 2014, ...3TU.Datacentrum
3TU.Datacentrum Symposium Research Data Management:
Funder requirements, Questions and Solutions
At this symposium the funding organisation NWO and the European Commission explained their vision, plans and requirements. Researchers from the three universities of technology shared their experiences of data management in different stages of research. And the Research Data Services team informed the audience about research data management services offered by 3TU.Datacentrum.
The 3TU.Datacentrum symposium took place at the TU Delft (26 May), University of Twente (2 June) and TU Eindhoven (11 June) for and with local researchers.
More information on: datacentrum.3tu.nl/over-3tudatacentrum/symposium-2014
Digital Identity is fundamental to collaboration in bioinformatics research and development because it enables attribution, contribution, publication to be recorded and quantified.
However, current models of identity are often obsolete and have problems capturing both small contributions "microattribution" and large contributions "mega-attribution" in Science. Without adequate identity mechanisms, the incentive for collaboration can be reduced, and the utility of collaborative social tools hindered.
Using examples of metabolic pathway analysis with the taverna workbench and myexperiment.org, this talk will illustrate problems and solutions to identifying scientists accurately and effectively in collaborative bioinformatics networks on the Web.
Keynote on software sustainability given at the 2nd Annual Netherlands eScience Symposium, November 2014.
Based on the article
Carole Goble ,
Better Software, Better Research
Issue No.05 - Sept.-Oct. (2014 vol.18)
pp: 4-8
IEEE Computer Society
http://www.computer.org/csdl/mags/ic/2014/05/mic2014050004.pdf
http://doi.ieeecomputersociety.org/10.1109/MIC.2014.88
http://www.software.ac.uk/resources/publications/better-software-better-research
Presentation for Harvard's ABCD Technology in Education group:
The Institute for Quantitative Social Science (IQSS) is a unique entity at Harvard - it combines research, software development, and specialized services to provide innovative solutions to research and scholarship problems at Harvard and beyond. I will talk about the software projects that IQSS is currently working on (Dataverse, Zelig, Consilience, and OpenScholar), including the research and development processes, the benefits provided to the Harvard community, and the impacts on research and scholarship.
Open Source Collaboration in Drug Discovery in PharmaKees van Bochove
How pre-competitive collaboration in the pharmaceutical sector through open source platforms enables joint innovation of academics, pharma, SMEs and non-profits.
OpenStack is an IaaS provider software written in Python. As such, it provides a massive scalable operating system and services like: Image, Storage, Object, Compute, etc.
This talks aims to give the audience an overview about OpenStack, its capabilities, its modules, coding styles, workflow and organization.
As a successful community driven development case, it’s definitely a good reference for anyone willing to take that road or maybe joining existing projects.
Outlines the vision and philosophy for Wakari.io with a basic overview of popular python data analysis packages. Most of the talk is conducted in Wakari and is not visible on these slides. 90 minutes for PyData NYC, November 8th 2013.
Crab - A Python Framework for Building Recommendation SystemsMarcel Caraciolo
Keynote introducing the Framework Crab: A Python toolkit for bulding recommendation engines. It is a open source project as an alternative for Mahout Taste for Python developers.
Presented at XII Python User Group Pernambuco, 07-05-2011 at CIN/UFPE.
(Keynote presented at the WEESR and REVE 2022 workshop colocated with SPLC2022)
Social coding platforms centred around git provide explicit facilities to share code between projects: forks, pull requests, cherry-picking to name but a few. Variant forks are an interesting phenomenon in that respect, as it permits for different projects to peacefully co-exist, yet explicitly acknowledge the common ancestry. The phenomenon of variant forks is quite common: in a recent study we found more than 400 open source projects originating from a common code-base. In this keynote we share our insights on the phenomenon of variant forks on social coding platforms. First, we report the results of an exploratory qualitative analysis on the motivations for creating variant forks. Next we illustrate how bug fixes may (should?) be transferred from one variant to another. As such we hope to inspire researchers to study the phenomenon of variant forks.
The Trials and Tribulations of Software Development in Astronomy: Where are w...Kelle Cruz
With the ever-growing role of software development and computer programming in astronomical research, our community is faced with many new challenges in nearly every aspect of our profession. In this talk, I first will describe the various types of software and the many different roles it plays in the research cycle. I will also describe my personal journey — involving Astropy, the AAS, a National Academies panel, and the 2020 Decadal Survey — which has both shaped my understanding and enabled me to influence the environment around software development. Throughout the talk, I will highlight resources and tactics which individual researchers and teams can use to adapt to the rapidly changing landscape. I will also describe actions which could be taken to influence the prevailing culture to further recognize and value software development as a fundamental and crucial aspect of the entire astronomical research process.
https://doi.org/10.5281/zenodo.7719585
Lessons from the Astropy Code of ConductKelle Cruz
Talk given on Oct 15, 2019 at the Inclusive Astronomy 2 conference held in at STScI in Baltimore, MD. Video is here: https://cloudproject.hosted.panopto.com/Panopto/Pages/Viewer.aspx?id=5bf2ea68-b046-4c13-9a4a-aaea00dae5f8
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. DOI:10.5281/zenodo.3515334.
Career Path Strategy and Work-Life BalanceKelle Cruz
Sildes for workshop given at Postdoctoral Fellows Workshop. Topics include identifying goals, decision making strategy, working towards achieving work-life balance, commitment budgets, and saying no. The slides include prompts for 4 activities.
My methodical approach to finding professional success and personal happinessKelle Cruz
I would like to share with you the methods that I have used to identify my dream job as an academic scientist and to guide my personal and professional choices. I will present this framework in a way which individuals could potentially adapt to their own situation and life goals. The framework is adaptable for all career paths and it just so happens that in my case, being a professor was the best solution given my constraints. I will further describe some of the philosophies which have helped me to find both success and happiness as a woman in a patriarchal profession.
It is widely recognized that not all groups of people succeed in equal numbers in academia despite our best intentions to make the profession a fair and objective meritocracy. I will give a brief review of the historical context of oppression in the US and some statistics which demonstrate the current situation in science and astronomy. I will spend the majority of the talk discussing ways in which individuals at all seniority levels can take actions to increase the inclusion of marginalized groups in our profession.
OpenMetadata Community Meeting - 5th June 2024OpenMetadata
The OpenMetadata Community Meeting was held on June 5th, 2024. In this meeting, we discussed about the data quality capabilities that are integrated with the Incident Manager, providing a complete solution to handle your data observability needs. Watch the end-to-end demo of the data quality features.
* How to run your own data quality framework
* What is the performance impact of running data quality frameworks
* How to run the test cases in your own ETL pipelines
* How the Incident Manager is integrated
* Get notified with alerts when test cases fail
Watch the meeting recording here - https://www.youtube.com/watch?v=UbNOje0kf6E
Do you want Software for your Business? Visit Deuglo
Deuglo has top Software Developers in India. They are experts in software development and help design and create custom Software solutions.
Deuglo follows seven steps methods for delivering their services to their customers. They called it the Software development life cycle process (SDLC).
Requirement — Collecting the Requirements is the first Phase in the SSLC process.
Feasibility Study — after completing the requirement process they move to the design phase.
Design — in this phase, they start designing the software.
Coding — when designing is completed, the developers start coding for the software.
Testing — in this phase when the coding of the software is done the testing team will start testing.
Installation — after completion of testing, the application opens to the live server and launches!
Maintenance — after completing the software development, customers start using the software.
Zoom is a comprehensive platform designed to connect individuals and teams efficiently. With its user-friendly interface and powerful features, Zoom has become a go-to solution for virtual communication and collaboration. It offers a range of tools, including virtual meetings, team chat, VoIP phone systems, online whiteboards, and AI companions, to streamline workflows and enhance productivity.
Code reviews are vital for ensuring good code quality. They serve as one of our last lines of defense against bugs and subpar code reaching production.
Yet, they often turn into annoying tasks riddled with frustration, hostility, unclear feedback and lack of standards. How can we improve this crucial process?
In this session we will cover:
- The Art of Effective Code Reviews
- Streamlining the Review Process
- Elevating Reviews with Automated Tools
By the end of this presentation, you'll have the knowledge on how to organize and improve your code review proces
Introducing Crescat - Event Management Software for Venues, Festivals and Eve...Crescat
Crescat is industry-trusted event management software, built by event professionals for event professionals. Founded in 2017, we have three key products tailored for the live event industry.
Crescat Event for concert promoters and event agencies. Crescat Venue for music venues, conference centers, wedding venues, concert halls and more. And Crescat Festival for festivals, conferences and complex events.
With a wide range of popular features such as event scheduling, shift management, volunteer and crew coordination, artist booking and much more, Crescat is designed for customisation and ease-of-use.
Over 125,000 events have been planned in Crescat and with hundreds of customers of all shapes and sizes, from boutique event agencies through to international concert promoters, Crescat is rigged for success. What's more, we highly value feedback from our users and we are constantly improving our software with updates, new features and improvements.
If you plan events, run a venue or produce festivals and you're looking for ways to make your life easier, then we have a solution for you. Try our software for free or schedule a no-obligation demo with one of our product specialists today at crescat.io
May Marketo Masterclass, London MUG May 22 2024.pdfAdele Miller
Can't make Adobe Summit in Vegas? No sweat because the EMEA Marketo Engage Champions are coming to London to share their Summit sessions, insights and more!
This is a MUG with a twist you don't want to miss.
Quarkus Hidden and Forbidden ExtensionsMax Andersen
Quarkus has a vast extension ecosystem and is known for its subsonic and subatomic feature set. Some of these features are not as well known, and some extensions are less talked about, but that does not make them less interesting - quite the opposite.
Come join this talk to see some tips and tricks for using Quarkus and some of the lesser known features, extensions and development techniques.
Artificia Intellicence and XPath Extension FunctionsOctavian Nadolu
The purpose of this presentation is to provide an overview of how you can use AI from XSLT, XQuery, Schematron, or XML Refactoring operations, the potential benefits of using AI, and some of the challenges we face.
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxrickgrimesss22
Discover the essential features to incorporate in your Winzo clone app to boost business growth, enhance user engagement, and drive revenue. Learn how to create a compelling gaming experience that stands out in the competitive market.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Launch Your Streaming Platforms in MinutesRoshan Dwivedi
The claim of launching a streaming platform in minutes might be a bit of an exaggeration, but there are services that can significantly streamline the process. Here's a breakdown:
Pros of Speedy Streaming Platform Launch Services:
No coding required: These services often use drag-and-drop interfaces or pre-built templates, eliminating the need for programming knowledge.
Faster setup: Compared to building from scratch, these platforms can get you up and running much quicker.
All-in-one solutions: Many services offer features like content management systems (CMS), video players, and monetization tools, reducing the need for multiple integrations.
Things to Consider:
Limited customization: These platforms may offer less flexibility in design and functionality compared to custom-built solutions.
Scalability: As your audience grows, you might need to upgrade to a more robust platform or encounter limitations with the "quick launch" option.
Features: Carefully evaluate which features are included and if they meet your specific needs (e.g., live streaming, subscription options).
Examples of Services for Launching Streaming Platforms:
Muvi [muvi com]
Uscreen [usencreen tv]
Alternatives to Consider:
Existing Streaming platforms: Platforms like YouTube or Twitch might be suitable for basic streaming needs, though monetization options might be limited.
Custom Development: While more time-consuming, custom development offers the most control and flexibility for your platform.
Overall, launching a streaming platform in minutes might not be entirely realistic, but these services can significantly speed up the process compared to building from scratch. Carefully consider your needs and budget when choosing the best option for you.
AI Genie Review: World’s First Open AI WordPress Website CreatorGoogle
AI Genie Review: World’s First Open AI WordPress Website Creator
👉👉 Click Here To Get More Info 👇👇
https://sumonreview.com/ai-genie-review
AI Genie Review: Key Features
✅Creates Limitless Real-Time Unique Content, auto-publishing Posts, Pages & Images directly from Chat GPT & Open AI on WordPress in any Niche
✅First & Only Google Bard Approved Software That Publishes 100% Original, SEO Friendly Content using Open AI
✅Publish Automated Posts and Pages using AI Genie directly on Your website
✅50 DFY Websites Included Without Adding Any Images, Content Or Doing Anything Yourself
✅Integrated Chat GPT Bot gives Instant Answers on Your Website to Visitors
✅Just Enter the title, and your Content for Pages and Posts will be ready on your website
✅Automatically insert visually appealing images into posts based on keywords and titles.
✅Choose the temperature of the content and control its randomness.
✅Control the length of the content to be generated.
✅Never Worry About Paying Huge Money Monthly To Top Content Creation Platforms
✅100% Easy-to-Use, Newbie-Friendly Technology
✅30-Days Money-Back Guarantee
See My Other Reviews Article:
(1) TubeTrivia AI Review: https://sumonreview.com/tubetrivia-ai-review
(2) SocioWave Review: https://sumonreview.com/sociowave-review
(3) AI Partner & Profit Review: https://sumonreview.com/ai-partner-profit-review
(4) AI Ebook Suite Review: https://sumonreview.com/ai-ebook-suite-review
#AIGenieApp #AIGenieBonus #AIGenieBonuses #AIGenieDemo #AIGenieDownload #AIGenieLegit #AIGenieLiveDemo #AIGenieOTO #AIGeniePreview #AIGenieReview #AIGenieReviewandBonus #AIGenieScamorLegit #AIGenieSoftware #AIGenieUpgrades #AIGenieUpsells #HowDoesAlGenie #HowtoBuyAIGenie #HowtoMakeMoneywithAIGenie #MakeMoneyOnline #MakeMoneywithAIGenie
Custom Healthcare Software for Managing Chronic Conditions and Remote Patient...Mind IT Systems
Healthcare providers often struggle with the complexities of chronic conditions and remote patient monitoring, as each patient requires personalized care and ongoing monitoring. Off-the-shelf solutions may not meet these diverse needs, leading to inefficiencies and gaps in care. It’s here, custom healthcare software offers a tailored solution, ensuring improved care and effectiveness.
Mobile App Development Company In Noida | Drona InfotechDrona Infotech
Looking for a reliable mobile app development company in Noida? Look no further than Drona Infotech. We specialize in creating customized apps for your business needs.
Visit Us For : https://www.dronainfotech.com/mobile-application-development/
GraphSummit Paris - The art of the possible with Graph TechnologyNeo4j
Sudhir Hasbe, Chief Product Officer, Neo4j
Join us as we explore breakthrough innovations enabled by interconnected data and AI. Discover firsthand how organizations use relationships in data to uncover contextual insights and solve our most pressing challenges – from optimizing supply chains, detecting fraud, and improving customer experiences to accelerating drug discoveries.
Transform Your Communication with Cloud-Based IVR SolutionsTheSMSPoint
Discover the power of Cloud-Based IVR Solutions to streamline communication processes. Embrace scalability and cost-efficiency while enhancing customer experiences with features like automated call routing and voice recognition. Accessible from anywhere, these solutions integrate seamlessly with existing systems, providing real-time analytics for continuous improvement. Revolutionize your communication strategy today with Cloud-Based IVR Solutions. Learn more at: https://thesmspoint.com/channel/cloud-telephony
A Study of Variable-Role-based Feature Enrichment in Neural Models of CodeAftab Hussain
Understanding variable roles in code has been found to be helpful by students
in learning programming -- could variable roles help deep neural models in
performing coding tasks? We do an exploratory study.
- These are slides of the talk given at InteNSE'23: The 1st International Workshop on Interpretability and Robustness in Neural Software Engineering, co-located with the 45th International Conference on Software Engineering, ICSE 2023, Melbourne Australia
10. I care about maximizing efficiency of science
The goal of this blog, AstroBetter, is to provide
information and tips about streamlining all the ways
we use our Macs everyday.
11. There is now astrolib (https://trac6.assembla.com/astrolib ), also pyAstroLib (which
appears to be something entirely different than astrolib? http://sourceforge.net/projects/
pyastrolib/develop), astropysics (http://packages.python.org/Astropysics/), now
pyAstronomy, and probably more. It looks like PyAstronomy contains wavelength conversions
(already implemented in pyastrolib and astropysics) and coordinate conversions and
precession (already in coords, astropysics, pySLALIB, & more).
I'm going to be provocative here: As a community, we are doing something wrong if everyone
wants to start their own new module rather than contributing to a common, shared, open-
source core. We are clearly doing something wrong if people repeatedly implement the same
basic functions rather than building on what's already there. What do we need to do
differently? How can we make it easier to use a shared repository and shared namespace for
all this?
https://mail.scipy.org/pipermail/astropy/2011-June/thread.html
12. https://mail.scipy.org/pipermail/astropy/2011-June/thread.html
I think a sit-down is desperately needed to resolve these issues, figure out the
mgmt structure (aka, pecking order), for the BDFL to emerge, and for progress to
occur.
I'd be happy to participate as a non-python/programming expert and maybe provide
the voice of the “users”.
…if there are > 3 ppl on this list already planning on going, then you guys
should definitely get together to have something a bit more formal than just a
chat. You could plan the next meeting, evaluate the existing packages, start to
sketch out a roadmap, maybe even assign some tasks, or create some leadership
roles (e.g., Lead of Phot packages, Lead of Spectra packages, Lead of Xray), etc.
13. Describing code is hard
To calculate distances and areas on the solar-disc, firstly helioprojective
coordinates are converted to heliographic coordinates using the world coordinate
system available in the standard solar data analysis software… doi: 10.1007/s11207-015-0808-7
Courtesy Danny Ryan
17. Open Source Definition
Software whose source code is made
available under an open source license.
An open source license grants anyone the rights to
inspect, modify and distribute the code
(with some possible caveats).
https://opensource.org/osd
19. Open Source Misconceptions
Putting code on my webpage makes it open.
No license = copyright, all rights reserved.
Nobody else can use, copy, distribute, or modify your work
without being at risk of take-downs or litigation.
True
False
https://choosealicense.com/no-permission/
21. Open Source Misconceptions
If I make my code open, I must support and maintain it.
Making code open means other people can
use and modify it,
even if you abandon it.
True
False
23. Open Source Misconceptions
Anyone can participate in the development of open code
Code can be distributed with an open
source license but
developed behind closed doors
True
False
24. GitHub has made open development feasible
• github is the thing which makes open development possible.
• github can be used to work on closed software.
32nd most popular internet site in the US
https://www.alexa.com/topsites/countries/US
A repository to store, exchange, trade,
and talk about code.
25. • Version Control: clean, simple-to-track coding history
• Accessible: projects are discoverable and widely accessible
• Transparent: work in progress is visible
• Collaborative: Revisions are debated publicly
• Educational: critical coding knowledge is amassed via all of the above
https://resources.whitesourcesoftware.com/blog-whitesource/the-octocat-that-changed-the-world
GitHub has made open development feasible
30. • Python plays well with others
highly interoperable with other languages
• Python is powerful
large ecosystem of 3rd party modules
• Python is easy to learn
low barrier to entry
• Python is open
helps with reproducibility problem
• Python is zero cost
Nobody likes to pay for things
Scientists are converging on Python
https://www.python.org/about/
https://www.datacamp.com/community/blog/python-scientific-computing-case
35. 7
§Total downloads via PyPi,
Jan. 2016 — Feb. 2018
*Estimated cost based on $100k average salary per year
**defined as more than 40 commits since Jan. 1, 2017
***defined as more than 100 commits since Jan. 1, 2017
openhub.net
Contributor stats via Libraries.io
Maintainers stats via github.com
6
Current Maintainers
**
Current Maintainers
Downloads
27.7 Million§
Estimated Cost
$7 Million*
Codebase
267,775 lines
Contributors
703
70
person-years
4
latest release: v0.22.0
(development began in 2009)
***
latest release: v1.14.0
(development began in 2001)
76
person-years
Downloads
49 Million§
Estimated Cost
$7.57 Million*
Codebase
285,480 lines
Contributors
564
***
Current Maintainers
Downloads
13.8 Million§
Estimated Cost
$6.36 Million*
Codebase
242,431 lines
Contributors
580
64
person-years
5
latest release: v2.2.0
(development began in 2003)
Vital code
is
maintained
by a small
number of
people
36.
37.
38.
39.
40. Implications for the future of academic science
• Software development skills will be
considered fundamental & essential.
• Software will be considered a research
product, similar to a journal article.
• Code review will part of peer review process.
• All code will be expected to be open and
accessible.
• Contributing to open, collaborative projects
will be the norm.
42. Conclusions
• Science is software
• GitHub facilitates open development and
has changed the way we interact with
software.
• Python is rapidly becoming the common
language of science.
• NumFOCUS needs to be recognized as a a
vital part of science infrastructure.
• Software development needs to be taught
and integrated into the formal STEM
curriculum.
43. Collaborations in the Extreme:
The rise of open code development in the
scientific community
Kelle Cruz
CUNY Hunter College
@kellecruz