Machine learning, health data & the limits of knowledgePaul Agapow
Lecture for Imperial College London's MSc in Health Data Analytics, critiquing a recent paper on COVID diagnosis and moving out to talk about good practices (& limits) in ML and model building
Presentation on how past medical records can be used to provide appropriate and timely treatment for patients using Genetic Algorithm and Feature Selection
Scientific research: What Anna Karenina teaches us about useful negative resultsDaniel S. Katz
a panel talk for the 1st Workshop on E-science ReseaRch leading tO negative Results (ERROR), held in conjunction with the 11th eScience conference on 3 September 2015 in Munich, Germany
Machine learning, health data & the limits of knowledgePaul Agapow
Lecture for Imperial College London's MSc in Health Data Analytics, critiquing a recent paper on COVID diagnosis and moving out to talk about good practices (& limits) in ML and model building
Presentation on how past medical records can be used to provide appropriate and timely treatment for patients using Genetic Algorithm and Feature Selection
Scientific research: What Anna Karenina teaches us about useful negative resultsDaniel S. Katz
a panel talk for the 1st Workshop on E-science ReseaRch leading tO negative Results (ERROR), held in conjunction with the 11th eScience conference on 3 September 2015 in Munich, Germany
A replication crisis in the making: how we reward unreliable scienceBjörn Brembs
Presentation at the 2016 annual meeting of the Mind and Brain College of the university of Lisbon on the infrastructural causes for the apparent replication crisis in the experimental/biomedical sciences.
This presentation was given as a part of the Microsoft eScience panel discussion in Sao Paulo, Brazil. The panel discussion was in regards to Going Native, a reference to a quote from Jim Gray along the lines of “in order to really understand the computing needs of a scientist you have to go native”. Jim himself did this, immersing himself in astronomy to build what would become the WorldWide Telescope. Bridging the gap between experimental scientists and the computing that underpins their discoveries is an ongoing challenge for eScience. The panel explored what it means to go native and gave examples of where they have seen this work well and shared lesson’s learned from working in this way.
sience 2.0 : an illustration of good research practices in a real studywolf vanpaemel
a presentation explaining the what, how and why of some of the features of science 2.0 (replication, registration, high power, bayesian statistics, estimation, co-pilot multi-software approach, distinction between confirmatory and exploratory analyses, and open science) using steegen et al. (2014) as a running example.
The data sets you are about to analyze are only as good and valid as the methodology used to gather the data and create the data set. The presentation by Tom Johnson and Cheryl Phillips was made at the 2012 meeting of the National Institute for Computer-Assisted Reporting, Feb. 2012, in St. Louis.
IMRAD format
An acronym for Introduction – Method – Results – and – Discussion. The IMRaD format is a way of structuring a scientific article. It is often used in health care and the natural sciences. Unlike theses in the social sciences, the IMRaD format does not include a separate theory chapter
JALA Editor-in-Chief Edward Kai-Hua Chow, Ph.D., of National University of Singapore shared step-by-step advice on how to design and write scientific research papers more clearly and effectively to improve their chances for successful publication at the recently held conference in Washington, DC. Learn what editors want, what they don't want and how reviewers evaluate manuscripts by reviewing slides from the session.
A replication crisis in the making: how we reward unreliable scienceBjörn Brembs
Presentation at the 2016 annual meeting of the Mind and Brain College of the university of Lisbon on the infrastructural causes for the apparent replication crisis in the experimental/biomedical sciences.
This presentation was given as a part of the Microsoft eScience panel discussion in Sao Paulo, Brazil. The panel discussion was in regards to Going Native, a reference to a quote from Jim Gray along the lines of “in order to really understand the computing needs of a scientist you have to go native”. Jim himself did this, immersing himself in astronomy to build what would become the WorldWide Telescope. Bridging the gap between experimental scientists and the computing that underpins their discoveries is an ongoing challenge for eScience. The panel explored what it means to go native and gave examples of where they have seen this work well and shared lesson’s learned from working in this way.
sience 2.0 : an illustration of good research practices in a real studywolf vanpaemel
a presentation explaining the what, how and why of some of the features of science 2.0 (replication, registration, high power, bayesian statistics, estimation, co-pilot multi-software approach, distinction between confirmatory and exploratory analyses, and open science) using steegen et al. (2014) as a running example.
The data sets you are about to analyze are only as good and valid as the methodology used to gather the data and create the data set. The presentation by Tom Johnson and Cheryl Phillips was made at the 2012 meeting of the National Institute for Computer-Assisted Reporting, Feb. 2012, in St. Louis.
IMRAD format
An acronym for Introduction – Method – Results – and – Discussion. The IMRaD format is a way of structuring a scientific article. It is often used in health care and the natural sciences. Unlike theses in the social sciences, the IMRaD format does not include a separate theory chapter
JALA Editor-in-Chief Edward Kai-Hua Chow, Ph.D., of National University of Singapore shared step-by-step advice on how to design and write scientific research papers more clearly and effectively to improve their chances for successful publication at the recently held conference in Washington, DC. Learn what editors want, what they don't want and how reviewers evaluate manuscripts by reviewing slides from the session.
Presented 11/2/13 at National Association of Science Writers 2013 in Gainesville, Florida, for the panel "Rising Above the Noise: Using Statistics-Based Reporting." Tips for journalists.
Data visualization has become increasingly more important and sits at the center of how people learn about and experience the world. We process information about politics, business insights and every day decisions through “visual soundbites”. As data journalists, we have incredible power to both positively influence as well as misguide conversations with the choices that we make when presenting graphical results.
In this presentation, we will share some of the best practices that help deliver stories that matter and avoid creating those that mislead.
IT support and services are the backbone of an organization. Having round-the-clock IT support solutions strategically supports your organization's ability to run effectively. This enables you to focus on your core business operations. Jerait.co.uk comprehend organizational needs and can navigate the business through updated technology recommendations. They majorly offer their services in locations like Edinburgh, Aberdeen, and Glasgow. They can help you enhance your IT infrastructure and future-proof your business with highly reliable, secure, and optimized IT support solutions.
BIG DATA | How to explain it & how to use it for your career?Tuan Yang
If you ask people what BIG DATA is they often say it is about a lot of data. But the world has ALWAYS had a lot of data. It is about datafication – a word so new even spellcheck functions don’t know it is a real word!
Learn more about:
» How BIG DATA changes career paths of even the most unsuspecting?
» How BIG DATA changes the way business decision are made?
» How BIG DATA changes who makes those decisions & the reshuffle of the balance of power it causes?
» What BIG DATA skills can you bring to the office tomorrow to increase your value to the firm
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...Neo4j
Leonard Jayamohan, Partner & Generative AI Lead, Deloitte
This keynote will reveal how Deloitte leverages Neo4j’s graph power for groundbreaking digital twin solutions, achieving a staggering 100x performance boost. Discover the essential role knowledge graphs play in successful generative AI implementations. Plus, get an exclusive look at an innovative Neo4j + Generative AI solution Deloitte is developing in-house.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
Pushing the limits of ePRTC: 100ns holdover for 100 daysAdtran
At WSTS 2024, Alon Stern explored the topic of parametric holdover and explained how recent research findings can be implemented in real-world PNT networks to achieve 100 nanoseconds of accuracy for up to 100 days.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Climate Impact of Software Testing at Nordic Testing DaysKari Kakkonen
My slides at Nordic Testing Days 6.6.2024
Climate impact / sustainability of software testing discussed on the talk. ICT and testing must carry their part of global responsibility to help with the climat warming. We can minimize the carbon footprint but we can also have a carbon handprint, a positive impact on the climate. Quality characteristics can be added with sustainability, and then measured continuously. Test environments can be used less, and in smaller scale and on demand. Test techniques can be used in optimizing or minimizing number of tests. Test automation can be used to speed up testing.
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfPeter Spielvogel
Building better applications for business users with SAP Fiori.
• What is SAP Fiori and why it matters to you
• How a better user experience drives measurable business benefits
• How to get started with SAP Fiori today
• How SAP Fiori elements accelerates application development
• How SAP Build Code includes SAP Fiori tools and other generative artificial intelligence capabilities
• How SAP Fiori paves the way for using AI in SAP apps
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
GridMate - End to end testing is a critical piece to ensure quality and avoid...ThomasParaiso2
End to end testing is a critical piece to ensure quality and avoid regressions. In this session, we share our journey building an E2E testing pipeline for GridMate components (LWC and Aura) using Cypress, JSForce, FakerJS…
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
The Art of the Pitch: WordPress Relationships and Sales
Co-working on Omics Data
1. Subio Platform
is NOT a “statistical” tool,
but a “biological” analysis software.
Subio Inc.
2. Many Biologists Uses Excel
LINUX Server
Desktop/Laptop
NGS Tools
• Mapping
• Quantification
• Variant calling
• etc.
R
Bioinformaticians Biologists
Many biologists
receive Excel files
as analysis results.
CLC
Other Stat. Tools
Galaxy Excel PDF
3. Like This
(A List of Differentially Expressed Genes)
You can sort a list of
differentially expressed
genes by fold change and/or
p-values.
But are the genes
at top of the list
biologically
important?
4. Aren’t “Statistical” Tools Enough?
Statisticians say
“Statistical significance is
NOT biological significance.”
What does it mean?
• Statistics never answers to biological questions.
• Statistics sorts data for a better overviewing.
• Biological significance can be proven only by
experiments.
5. Ability of Computer & Human
Computers are RATIONAL FOOLS.
• They make enormous calculations and tasks.
But, they never hit on a new idea out of logic.
Human should STAY FOOLISH, because
• We don’t have whole knowledge about the life.
• Human’s “serendipity” brings innovations.
• You prove your idea by experiments.
How to maximize the chance of biological discovery?
6. Data Analysis Needs Both Wheels
DRY Side WET Side
Tools for Bioinformaticians Tools for Biologists
1001011101010110
0101101010100100
0111010100101000
0100101110101001
0011001010101101
0101000101001100
1010100010101001
• Interactive
visualizations for
exploratory mining.
• Data sharing for open
discussions.
• Reviewing the results
of statistical analysis.
Prediction
for new experiments
Idea
of new experiments
Utilizing Computer Power Maximizing Human Ability
7. Challenging Super Complex Problem
Bioinformaticians Biologists
Need better
communications
The goal is so far!
8. How to Make It Better?
LINUX
Windows/MAC
NGS Tools
• Mapping
• Quantification
• Variant calling
• etc.
R
Bioinformaticians Biologists
Passing “SSA” file
instead of Excel file.
CLC
Other Stat. Tools
Galaxy Excel PDF
Subio Platform
9. Data Sharing via SSA on Subio Platform
Subio Platform
Subio Platform
Subio Platform Subio Platform
Subio Platform
Subio Platform
Subio Platform
is free. They
can import the
SSA on their
PC/Mac.
All data are
packed in SSA.
(raw data, gene
annotation, sample
information,
pathways, statistical
analysis results)
DRY Side
WET Side
10. It’s Not Only A Gene List!
You can see like
Excel here.
For example, you
can pass a list of
genes, which are
down-regulated by
aging.
11. It’s Not Only A Gene List!
You can
search genes
by keywords.
12. It’s Not Only A Gene List!
You can see how
apparently
deviating,
comparing to
others.
13. It’s Not Only A Gene List!
You can see if
the raw data are
valid and
comparable.
14. It’s Not Only A Gene List!
You can
distinguish
genes which are
likely to be
expressing only
in young people.
15. It’s Not Only A Gene List!
You can see if
they are closely
located. If so,
what/where
they are.
16. It’s Not Only A Gene List!
You can expand
the gene list to
a cluster of
sharing similar
expression
patterns.
17. It’s Not Only A Gene List!
You can see
where those
genes are on
pathways.
18. Not A Cascade, But A Cycle
DRY Side WET Side
New Idea to
new experiments
• Make the data viewable.
• Add statistical analysis
for better overviewing.
1.
• Report what they found.
• Ask for applying other
methods to clarify the
findings.
2.
Poor Findings
19. What Did Prevent The Cycle?
Costs of introducing software
• Commercial Software
• R and other command line based tools.
• Free Software with GUI by Academic Projects
Free Zero Cost
Purchasing cost Communication cost
Learning cost Communication cost
Learning cost Maintenance cost
Maintenance cost
20. A Solution to Reduce Cost In Total
• Subio Platform (free data browser)
• You can install on any PC/Macs.
• You can pass data via SSA files.
• Low-priced plug-ins
• Free online training.
Communication cost
Learning cost
Purchasing cost Maintenance cost
Learning cost
21. Subio Platform Works Best In A Team
Everyone can
go mining the
data by themselves,
but not all
need plug-ins.
Subio Platform is
not a replacement,
but a complement
of bioinformatics tools.
DRY Side
WET Side
Subio Platform
R / Stat. ToolsLinux Tools
Basic Plug-in
Advanced Plug-in
Subio Platform
Subio Platform
Subio Platform Subio Platform
Subio Platform
Subio Platform
AP
AP
22. Omics Data Are Juicy.
Nobody can extract all knowledge
by him/herself.
Teamwork
to maximize chances.
Design New
Experiments
to prove the hypothesis.
• More idea!
• More discussions!
• More discoveries!