My presentation at the 5th Sequencing FInishing and Analysis in the Future (SFAF -- http://www.lanl.gov/conferences/finishfuture/2010SFAF_Meeting_Guide.pdf) June 3, 2010
Detection of Low Level Sequence Variants by Sanger Sequencing | ESHG 2015 Pos...Thermo Fisher Scientific
Sanger sequencing using fluorescent BigDye® terminator chemistry and semi‐automated capillary electrophoresis (CE) has long been considered the gold standard for identifying sequence variations such as disease‐causing mutations. The robustness, low error rate, ease of use, human interpretable visual displays of the signals generated by the instruments, and low cost per sample and target have all contributed to this reputation. Homozygous and heterozygous germ line mutations are reliably detected and reported using established DNA sequencing analysis software such as the Applied Biosystems Variant Reporter™ software. However, somatic variants with an allelic proportion of 25% or less are often undetected (i.e. not "called") by the software and thus escape awareness if not detected by careful visual inspection of the electropherograms. With the rapid adoption of next generation sequencing technology (NGS) and its use for characterization of specific and discrete mutations in tumor samples, an urgent need has emerged to establish an orthogonal technology for reliable and sensitive detection of somatic mutations which may occur at proportions of 10% or lower compared to the normal allele.
To this end, we have developed an innovative algorithm, software, and a protocol that specialize in the detection and reporting of minor mutations by Sanger sequencing. Moreover the algorithm preserves the ability to generate the familiar displays of the data to facilitate human review. Using panels of prepared mixtures of minor alleles in the range of 2.5%, 5%, 10% and 20%, we have achieved 94.6% sensitivity and 99.8% specificity for automated detection of mutations present at the 5% level with high quality data.
In conclusion, we have demonstrated that standard protocols for fluorescent dye terminator Sanger sequencing in conjunction with the new algorithm delivered in Variant Finder software may enable the identification of de novo somatic mutations to a level of 5%. This technology will also be useful for the confirmation of minor variants identified by NGS platforms.
Next generation sequencing & microarray-- Genotypic TechnologyGenotypic Technology
Greetings from Genotypic Technology, Bangalore (www.genotypic.co.in). We are a 13 year old genomics and bioinformatics company ( 65+ employees, Service. Products and R & D) based in Bangalore, India, primarily working on applications of Microarrays and Next Generation Sequencing in Human Health and Disease, including in Molecular Diagnostics, Prognosis, Therapeutics, Vaccine Research, Basic Science Research, Veterinary Science, Agriculture, Industrial Biotechnology, Microbial Genetics and more.
Our major strength is in customized genomics solutions, particularly in your field, we can develop panel of markers for specific diseases, optimize, validate and help commercialize on open platforms or specific instrument platforms- in microarrays and sequencing based methods/ assays. We can also use genomic markers to aid in treatment of specific disease using personalized medicine approaches. All this can be done on a comprehensive end-to-end manner in our company as we have a very good blend of molecular biology and bioinformatics with totally 6 Ph.Ds. We work closely with Agilent's R &D as their partner.
my students use ideas from my class on business models to develop a business model for ion proton's DNA sequencer. This sequencer uses semiconductor technology to read an organism's DNA sequence and is faster and cheaper than existing sequencers. This presentation describes the value proposition, customer selection, method of value capture and other aspects of a business model for Ion Proton's DNA sequencer
Detection of Low Level Sequence Variants by Sanger Sequencing | ESHG 2015 Pos...Thermo Fisher Scientific
Sanger sequencing using fluorescent BigDye® terminator chemistry and semi‐automated capillary electrophoresis (CE) has long been considered the gold standard for identifying sequence variations such as disease‐causing mutations. The robustness, low error rate, ease of use, human interpretable visual displays of the signals generated by the instruments, and low cost per sample and target have all contributed to this reputation. Homozygous and heterozygous germ line mutations are reliably detected and reported using established DNA sequencing analysis software such as the Applied Biosystems Variant Reporter™ software. However, somatic variants with an allelic proportion of 25% or less are often undetected (i.e. not "called") by the software and thus escape awareness if not detected by careful visual inspection of the electropherograms. With the rapid adoption of next generation sequencing technology (NGS) and its use for characterization of specific and discrete mutations in tumor samples, an urgent need has emerged to establish an orthogonal technology for reliable and sensitive detection of somatic mutations which may occur at proportions of 10% or lower compared to the normal allele.
To this end, we have developed an innovative algorithm, software, and a protocol that specialize in the detection and reporting of minor mutations by Sanger sequencing. Moreover the algorithm preserves the ability to generate the familiar displays of the data to facilitate human review. Using panels of prepared mixtures of minor alleles in the range of 2.5%, 5%, 10% and 20%, we have achieved 94.6% sensitivity and 99.8% specificity for automated detection of mutations present at the 5% level with high quality data.
In conclusion, we have demonstrated that standard protocols for fluorescent dye terminator Sanger sequencing in conjunction with the new algorithm delivered in Variant Finder software may enable the identification of de novo somatic mutations to a level of 5%. This technology will also be useful for the confirmation of minor variants identified by NGS platforms.
Next generation sequencing & microarray-- Genotypic TechnologyGenotypic Technology
Greetings from Genotypic Technology, Bangalore (www.genotypic.co.in). We are a 13 year old genomics and bioinformatics company ( 65+ employees, Service. Products and R & D) based in Bangalore, India, primarily working on applications of Microarrays and Next Generation Sequencing in Human Health and Disease, including in Molecular Diagnostics, Prognosis, Therapeutics, Vaccine Research, Basic Science Research, Veterinary Science, Agriculture, Industrial Biotechnology, Microbial Genetics and more.
Our major strength is in customized genomics solutions, particularly in your field, we can develop panel of markers for specific diseases, optimize, validate and help commercialize on open platforms or specific instrument platforms- in microarrays and sequencing based methods/ assays. We can also use genomic markers to aid in treatment of specific disease using personalized medicine approaches. All this can be done on a comprehensive end-to-end manner in our company as we have a very good blend of molecular biology and bioinformatics with totally 6 Ph.Ds. We work closely with Agilent's R &D as their partner.
my students use ideas from my class on business models to develop a business model for ion proton's DNA sequencer. This sequencer uses semiconductor technology to read an organism's DNA sequence and is faster and cheaper than existing sequencers. This presentation describes the value proposition, customer selection, method of value capture and other aspects of a business model for Ion Proton's DNA sequencer
Customer Case Study: How Novel Compute Technology Transforms Medical and Life...Amazon Web Services
This session outlines how to deal with “big” (many samples) and “wide” (many features per sample) data on Apache Spark, how to keep runtime constant by using instantaneously scalable micro services (AWS Lambda), and how AWS technology has enabled inspirational real-world research use cases at CSIRO.
Speaker: Denis Bauer, Transformational Bioinformatics Team Leader, CSIRO
Level: 200
Accelerate Your Discovery with QIAGEN Service Solutions for Biomarker Researc...QIAGEN
This slidedeck will highlight QIAGEN’s service capabilities in sample isolation, microarray and NGS-sequencing, qPCR panel and custom assay development and bioinformatics as we look at the identification of potential biomarkers and gene signatures. The applications of QIAGEN Service Core in microRNA discovery for toxicology markers in serum and plasma and in identification of RNA signatures for tumor stratification are featured. Learn how you can accelerate your research with QIAGEN service solutions.
CD Genomics provides viral genome sequencing service within Illumina and PacBio Platforms. We can create high-quality de novo assembly of large viral genomes and highest possible data quality at low cost.
Digital RNAseq for Gene Expression Profiling: Digital RNAseq Webinar Part 2QIAGEN
Traditional RNA sequencing (RNA-Seq) is a powerful tool for expression profiling, but is hindered by PCR amplification bias and inaccuracy at low expressing genes. QIAseq RNA is a flexible and precise tool developed for mitigating these complications, allowing digital gene expression analysis. In this webinar we will cover, in depth, the sample requirements, experimental design, NGS platform specific challenges, and workflow for gene enrichment, library prep and sequencing. The applications of QIASeq RNA Panels in cancer research, stem cell differentiation and elucidating the effects small molecules on signaling pathways will be highlighted.
RNA Sequence data analysis,Transcriptome sequencing, Sequencing steady state RNA in a sample is known as RNA-Seq. It is free of limitations such as prior knowledge about the organism is not required.
RNA-Seq is useful to unravel inaccessible complexities of transcriptomics such as finding novel transcripts and isoforms.
Data set produced is large and complex; interpretation is not straight forward.
Sequencing 60,000 Samples: An Innovative Large Cohort Study for Breast Cancer...QIAGEN
This slidedeck focuses on the design of a large cohort study for assessing breast cancer risk and how an innovative digital sequencing approach is able to solve the previously unmet challenges of this type of NGS study design. Our speaker, Dr. Fergus J. Couch of the Mayo Clinic, presents on the design of this NCI-funded project, which comprises the sequencing of 60,000 samples to assess the risk of breast cancer through association with targeted genes. The design and size of the study requires an accurate, robust and high-throughput sequencing method. The investigators are using a digital DNA sequencing approach from QIAGEN that incorporates molecular barcodes to tag and remove PCR duplicates and increase NGS assay sensitivity. The approach also uses proprietary chemistry that enables uniform sequencing to efficiently utilize sequencing power and deliver optimized results.
A preview of Dr. Joseph Michelli's book, The Starbucks Experience.
Created by Ethos3 Communications
Web: http://www.ethos3.com
Blog: http://blog.ethos3.com
Analysis and Interpretation of Cell-free DNAQIAGEN
Identification and monitoring of cancer mutations from cell free DNA-Seq data is a key application in liquid biopsy. In this part of the webinar we will show how mutations can be best identified from this type of data and how they can be interpreted. Furthermore, potential challenges when analyzing this type of data will be discussed together with relevant strategies.
AGRF in conjunction with EMBL Australia recently organised a workshop at Monash University Clayton. This workshop was targeted at beginners and biologists who are new to analysing Next-Gen Sequencing data. The workshop also aimed to provide users with a snapshot of bioinformatics and data analysis tips on how to begin to analyse project data. An introduction to RNA-seq data analysis was presented by AGRF Senior Bioinformatician Dr. Sonika Tyagi.
Presented: 1st August 2012
Customer Case Study: How Novel Compute Technology Transforms Medical and Life...Amazon Web Services
This session outlines how to deal with “big” (many samples) and “wide” (many features per sample) data on Apache Spark, how to keep runtime constant by using instantaneously scalable micro services (AWS Lambda), and how AWS technology has enabled inspirational real-world research use cases at CSIRO.
Speaker: Denis Bauer, Transformational Bioinformatics Team Leader, CSIRO
Level: 200
Accelerate Your Discovery with QIAGEN Service Solutions for Biomarker Researc...QIAGEN
This slidedeck will highlight QIAGEN’s service capabilities in sample isolation, microarray and NGS-sequencing, qPCR panel and custom assay development and bioinformatics as we look at the identification of potential biomarkers and gene signatures. The applications of QIAGEN Service Core in microRNA discovery for toxicology markers in serum and plasma and in identification of RNA signatures for tumor stratification are featured. Learn how you can accelerate your research with QIAGEN service solutions.
CD Genomics provides viral genome sequencing service within Illumina and PacBio Platforms. We can create high-quality de novo assembly of large viral genomes and highest possible data quality at low cost.
Digital RNAseq for Gene Expression Profiling: Digital RNAseq Webinar Part 2QIAGEN
Traditional RNA sequencing (RNA-Seq) is a powerful tool for expression profiling, but is hindered by PCR amplification bias and inaccuracy at low expressing genes. QIAseq RNA is a flexible and precise tool developed for mitigating these complications, allowing digital gene expression analysis. In this webinar we will cover, in depth, the sample requirements, experimental design, NGS platform specific challenges, and workflow for gene enrichment, library prep and sequencing. The applications of QIASeq RNA Panels in cancer research, stem cell differentiation and elucidating the effects small molecules on signaling pathways will be highlighted.
RNA Sequence data analysis,Transcriptome sequencing, Sequencing steady state RNA in a sample is known as RNA-Seq. It is free of limitations such as prior knowledge about the organism is not required.
RNA-Seq is useful to unravel inaccessible complexities of transcriptomics such as finding novel transcripts and isoforms.
Data set produced is large and complex; interpretation is not straight forward.
Sequencing 60,000 Samples: An Innovative Large Cohort Study for Breast Cancer...QIAGEN
This slidedeck focuses on the design of a large cohort study for assessing breast cancer risk and how an innovative digital sequencing approach is able to solve the previously unmet challenges of this type of NGS study design. Our speaker, Dr. Fergus J. Couch of the Mayo Clinic, presents on the design of this NCI-funded project, which comprises the sequencing of 60,000 samples to assess the risk of breast cancer through association with targeted genes. The design and size of the study requires an accurate, robust and high-throughput sequencing method. The investigators are using a digital DNA sequencing approach from QIAGEN that incorporates molecular barcodes to tag and remove PCR duplicates and increase NGS assay sensitivity. The approach also uses proprietary chemistry that enables uniform sequencing to efficiently utilize sequencing power and deliver optimized results.
A preview of Dr. Joseph Michelli's book, The Starbucks Experience.
Created by Ethos3 Communications
Web: http://www.ethos3.com
Blog: http://blog.ethos3.com
Analysis and Interpretation of Cell-free DNAQIAGEN
Identification and monitoring of cancer mutations from cell free DNA-Seq data is a key application in liquid biopsy. In this part of the webinar we will show how mutations can be best identified from this type of data and how they can be interpreted. Furthermore, potential challenges when analyzing this type of data will be discussed together with relevant strategies.
AGRF in conjunction with EMBL Australia recently organised a workshop at Monash University Clayton. This workshop was targeted at beginners and biologists who are new to analysing Next-Gen Sequencing data. The workshop also aimed to provide users with a snapshot of bioinformatics and data analysis tips on how to begin to analyse project data. An introduction to RNA-seq data analysis was presented by AGRF Senior Bioinformatician Dr. Sonika Tyagi.
Presented: 1st August 2012
Course: Bioinformatics for Biomedical Research (2014).
Session: 4.1- Introduction to RNA-seq and RNA-seq Data Analysis.
Statistics and Bioinformatisc Unit (UEB) & High Technology Unit (UAT) from Vall d'Hebron Research Institute (www.vhir.org), Barcelona.
Microbial Pathogenesis and Host Immune ResponseQIAGEN
The research community has begun correlating the makeup of individual microbiomes with disorders and diseases such as obesity, atherosclerosis and cancer. To accomplish this, researchers must first identify and characterize these microbial communities and understand the complex immune interactions between host and pathogen. This webinar provides you with a complete overview of the microbiome, metagenomics and host-pathogen interactions. Experimental strategies, from sample to insight, which can facilitate your microbiology and immunology research, are highlighted.
Next-generation sequencing (NGS) has revolutionized the way we analyze diseases and commercial outfits such as Illumina, Helicos, QIAGEN and Pacific Biosciences have made significant contributions. In addition, the launch of direct-to-consumer genetic testing solutions has dramatically changed the way consumers access genomics data. Until a few years ago, the cost of sequencing was a major bottleneck. Recent developments have reduced the cost from thousands of dollars to a couple of cents per megabase. When did these changes start? What were the changes in the commercial sector in the last 15 years? This infographic is a timeline of the NGS commercial marketplace.
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to ...Bonnie Hurwitz
iMicrobe and iVirus: Extending the iPlant cyberinfrastructure from plants to microbes. Overview of work underway to add applications and computational analysis pipelines to iPlant for metagenomics and microbial ecology.
How novel compute technology transforms life science researchDenis C. Bauer
Unprecedented data volumes and pressure on turnaround time driven by commercial applications require bioinformatics solutions to evolve to meed these new demands. New compute paradigms and cloud-based IT solutions enable this transition. Here I present two solution capable of meeting these demands for genomic variant analysis, VariantSpark, as well as genome engineering applications, GT-Scan2.
VariantSpark classifies 3000 individuals with 80 Million genomic variants each in under 30 minutes. This Hadoop/Spark solution for machine learning application on genomic data is hence capable to scale up to population size cohorts.
GT-Scan2, identifies CRISPR target sites by minimizing off-target effects and maximizing on-target efficiency. This optimization is powered by AWS Lambda functions, which offer an “always-on” web service that can instantaneously recruit enough compute resources keep runtime stable even for queries with several thousand of potential target sites.
Production Bioinformatics, emphasis on ProductionChris Dwan
Production bioinformatics at Sema4 can be thought of as data ops - a peer to the lab ops organization. We operate 24/7 to deliver correct and timely results on NGS and other data for thousands of samples per week. This deck introduces the Prod BI organization and systems architecture with a focus on what it takes to run bioinformatics in production rather than for R&D or pure research.
Whole Genome Sequencing - Data Processing and QC at SciLifeLab NGIPhil Ewels
Slides presented at the "Rare Disease Genomics" course held at the Centre for Molecular Medicine (Karolinska Institute, Stockholm, Sweden). Phil Ewels, 4th December 2017.
Next generation sequencing: research opportunities and bioinformatic challenges. A seminar I gave for the Computational Life Science (Univ. of Oslo) seminar series, March 2, 2011
VariantSpark: applying Spark-based machine learning methods to genomic inform...Denis C. Bauer
Genomic information is increasingly used in medical practice giving rise to the need for efficient analysis methodology able to cope with thousands of individuals and millions of variants. Here we introduce VariantSpark, which utilizes Hadoop/Spark along with its machine learning library, MLlib, providing the means of parallelisation for population-scale bioinformatics tasks. VariantSpark is the interface to the standard variant format (VCF), offers seamless genome-wide sampling of variants and provides a pipeline for visualising results.
To demonstrate the capabilities of VariantSpark, we clustered more than 3,000 individuals with 80 Million variants each to determine the population structure in the dataset. VariantSpark is 80% faster than the Spark-based genome clustering approach, ADAM, the comparable implementation using Hadoop/Mahout, as well as Admixture, a commonly used tool for determining individual ancestries. It is over 90% faster than traditional implementations using R and Python. These benefits of speed, resource consumption and scalability enables VariantSpark to open up the usage of advanced, efficient machine learning algorithms to genomic data.
The package is written in Scala and available at https://github.com/BauerLab/VariantSpark.
This is a talk I gave at a Northwestern University - Complete Genomics Workshop on April 21, 2011 about using clouds to support research in genomics and related areas.
State of ICS and IoT Cyber Threat Landscape Report 2024 previewPrayukth K V
The IoT and OT threat landscape report has been prepared by the Threat Research Team at Sectrio using data from Sectrio, cyber threat intelligence farming facilities spread across over 85 cities around the world. In addition, Sectrio also runs AI-based advanced threat and payload engagement facilities that serve as sinks to attract and engage sophisticated threat actors, and newer malware including new variants and latent threats that are at an earlier stage of development.
The latest edition of the OT/ICS and IoT security Threat Landscape Report 2024 also covers:
State of global ICS asset and network exposure
Sectoral targets and attacks as well as the cost of ransom
Global APT activity, AI usage, actor and tactic profiles, and implications
Rise in volumes of AI-powered cyberattacks
Major cyber events in 2024
Malware and malicious payload trends
Cyberattack types and targets
Vulnerability exploit attempts on CVEs
Attacks on counties – USA
Expansion of bot farms – how, where, and why
In-depth analysis of the cyber threat landscape across North America, South America, Europe, APAC, and the Middle East
Why are attacks on smart factories rising?
Cyber risk predictions
Axis of attacks – Europe
Systemic attacks in the Middle East
Download the full report from here:
https://sectrio.com/resources/ot-threat-landscape-reports/sectrio-releases-ot-ics-and-iot-security-threat-landscape-report-2024/
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfPaige Cruz
Monitoring and observability aren’t traditionally found in software curriculums and many of us cobble this knowledge together from whatever vendor or ecosystem we were first introduced to and whatever is a part of your current company’s observability stack.
While the dev and ops silo continues to crumble….many organizations still relegate monitoring & observability as the purview of ops, infra and SRE teams. This is a mistake - achieving a highly observable system requires collaboration up and down the stack.
I, a former op, would like to extend an invitation to all application developers to join the observability party will share these foundational concepts to build on:
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
UiPath Test Automation using UiPath Test Suite series, part 4DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 4. In this session, we will cover Test Manager overview along with SAP heatmap.
The UiPath Test Manager overview with SAP heatmap webinar offers a concise yet comprehensive exploration of the role of a Test Manager within SAP environments, coupled with the utilization of heatmaps for effective testing strategies.
Participants will gain insights into the responsibilities, challenges, and best practices associated with test management in SAP projects. Additionally, the webinar delves into the significance of heatmaps as a visual aid for identifying testing priorities, areas of risk, and resource allocation within SAP landscapes. Through this session, attendees can expect to enhance their understanding of test management principles while learning practical approaches to optimize testing processes in SAP environments using heatmap visualization techniques
What will you get from this session?
1. Insights into SAP testing best practices
2. Heatmap utilization for testing
3. Optimization of testing processes
4. Demo
Topics covered:
Execution from the test manager
Orchestrator execution result
Defect reporting
SAP heatmap example with demo
Speaker:
Deepak Rai, Automation Practice Lead, Boundaryless Group and UiPath MVP
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Albert Hoitingh
In this session I delve into the encryption technology used in Microsoft 365 and Microsoft Purview. Including the concepts of Customer Key and Double Key Encryption.
Essentials of Automations: The Art of Triggers and Actions in FMESafe Software
In this second installment of our Essentials of Automations webinar series, we’ll explore the landscape of triggers and actions, guiding you through the nuances of authoring and adapting workspaces for seamless automations. Gain an understanding of the full spectrum of triggers and actions available in FME, empowering you to enhance your workspaces for efficient automation.
We’ll kick things off by showcasing the most commonly used event-based triggers, introducing you to various automation workflows like manual triggers, schedules, directory watchers, and more. Plus, see how these elements play out in real scenarios.
Whether you’re tweaking your current setup or building from the ground up, this session will arm you with the tools and insights needed to transform your FME usage into a powerhouse of productivity. Join us to discover effective strategies that simplify complex processes, enhancing your productivity and transforming your data management practices with FME. Let’s turn complexity into clarity and make your workspaces work wonders!
Elevating Tactical DDD Patterns Through Object CalisthenicsDorra BARTAGUIZ
After immersing yourself in the blue book and its red counterpart, attending DDD-focused conferences, and applying tactical patterns, you're left with a crucial question: How do I ensure my design is effective? Tactical patterns within Domain-Driven Design (DDD) serve as guiding principles for creating clear and manageable domain models. However, achieving success with these patterns requires additional guidance. Interestingly, we've observed that a set of constraints initially designed for training purposes remarkably aligns with effective pattern implementation, offering a more ‘mechanical’ approach. Let's explore together how Object Calisthenics can elevate the design of your tactical DDD patterns, offering concrete help for those venturing into DDD for the first time!
Enhancing Performance with Globus and the Science DMZGlobus
ESnet has led the way in helping national facilities—and many other institutions in the research community—configure Science DMZs and troubleshoot network issues to maximize data transfer performance. In this talk we will present a summary of approaches and tips for getting the most out of your network infrastructure using Globus Connect Server.
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...SOFTTECHHUB
The choice of an operating system plays a pivotal role in shaping our computing experience. For decades, Microsoft's Windows has dominated the market, offering a familiar and widely adopted platform for personal and professional use. However, as technological advancements continue to push the boundaries of innovation, alternative operating systems have emerged, challenging the status quo and offering users a fresh perspective on computing.
One such alternative that has garnered significant attention and acclaim is Nitrux Linux 3.5.0, a sleek, powerful, and user-friendly Linux distribution that promises to redefine the way we interact with our devices. With its focus on performance, security, and customization, Nitrux Linux presents a compelling case for those seeking to break free from the constraints of proprietary software and embrace the freedom and flexibility of open-source computing.
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionAggregage
Join Maher Hanafi, VP of Engineering at Betterworks, in this new session where he'll share a practical framework to transform Gen AI prototypes into impactful products! He'll delve into the complexities of data collection and management, model selection and optimization, and ensuring security, scalability, and responsible use.
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
12. Swiss Army Knife of NGS Analysis SDK Intuitive GUI Traditional Bioinformatics Visualization Desktop Solutions EnterpriseSolutions High Performance File Format Conversion Tools Integration Epigenomics Transcriptomics Genomics RNA-Seq miRNA Read Mapping De Novo Assembly SNP/DIP Detection CHIP-Seq
13. Why not use free tools? Are tools free or “free”? Tools vs solutions True cost of ownership Ease of Use Tools integration Support
14. Small RNA Analysis(in Beta soon) Identify and filter/trim adapters annotate using mirBASE and other resources - target species of interest Merge/group by mature, precursor/reference Fully integrated with expression analysis
15. De Novo Assembler Human assembly of 38x Illumina paired-end CLC Quality equivalent to Abyss CLC: 7 hrs, 1 node, 42 Gbof RAM Abyss: 80 hrs, 21 nodes, 336 Gbof RAM Metagenomics Assembly METAHIT Dataset MH0041 40M 75bp paired end 3 hrs on desktop, 6 Gb RAM Higher N50 and Total Contig Size than Reported
16. Viral Sequencing at JCVI(See Nadia Fedorova’s Poster!) Amplify and Barcode using SISPA, 454 + Illumina Sequencing Depth of coverage sometimes >1000x De novo Assembly of Consensus for all Segments For each segment: Map reads from each technology independently using best full length reference from NCBI, call variations Update reference with variations confirmed by multiple technologies Map reads using updated reference and all reads Convert to consed, analyze, order Sanger closure reactions Source: Jessica Hostetler, Nadia Federova, Tim Stockwell, Danny Katzel
17. Why CLC bioTools? CLC handled hybrid sequencing technologies directly Very biased coverage confounded other assemblers that expect random arrival stats. CLC didn’t seem to suffer from biased coverage. Very accurate SNP calls in areas of deep coverage. Tim Stockwell Director of Viral Informatics J. Craig Venter Institute
18. Targeted Resequencing QC Assessment of targeted sequencing technology Coverage Statistics for Targeted Regions Very short schedule, limited bioinformatics staff Plug-in development leveraging CLC tools to automate the process and meet short deadline QC Report now available as plug-in
19. Professional Services Developing customized solutions Integration with LIMS, workflows, DB Bioinformatics Algorithm Development Cloud and Grid Integration Data Analysis
20. Questions? Saul A. Kravitz, PhD skravitz@clcbio.com (301)355-0813 Thank you for listening
21. Questions Saul A. Kravitz, PhD skravitz @ clcbio.com 301)355-0813 Thank you for listening
Editor's Notes
GUI-driven tools and workflows
- miRNA workflow leveraging mirBASE and other resources
Very fast, small memory footprint
SISPA = Sequence Independent Single Primer Amplification (if that needs spelling out) – amplifies and barcodes DNA moleculesAlso, if people are interested, can also mention availability of Danny Katzel’s cas2consed software.
CustomizationJava plug-in architecture for Server and WorkbenchOptimized “Cell” command line tools for efficient HPCWizard-based integration of customer toolsServer integration via SOAP and Command Line*