1. The National Library of Medicine (NLM) was established to assist the advancement of medical and related sciences and to aid the dissemination and exchange of scientific and other information important to the progress of medicine and public health.
2. NLM will focus on understanding how searches are initiated, how information is used, and how questions are posed and answered through corroboration.
3. NLM's strategic plan for 2017-2027 aims to provide answers to questions from clinicians, patients, and consumers by linking research literature and clinical information through technologies like question answering systems and clinical decision support.
IMPROVING PERFORMANCE OF ALGORITHMS TO POWER UNMET NEED AND EFFECTIVENESS IN ...Schiffon Wong
The purpose of this workshop is to highlight best practices to improve observational research performance through 1) a methodological overview of algorithm development and validation in electronic health records (EHR) and healthcare claims-based databases, 2) an understanding of novel approaches using text-mining and natural language processing (NLP), and 3) case studies of algorithms developed to identify multiple sclerosis (MS) and its clinical subgroups to generate real world evidence (RWE). By offering an interactive forum, participants will be able to identify and share their successes and challenges in algorithm development and validation.
In this talk, we present our work on developing large-scale text mining and machine learning tools as well as their uses in real-world applications in PubMed search, biocuration and healthcare (medical image analysis).
Interest has increased in the use of prognosis factors as a cursor for breast cancer personalized treatment. For clinicians, early detection of those factors can be helpful for a good management of the disease and for the choice of an efficient treatment. Moreover, it exists a huge amount of meaningful information in pathological reports, biological measurements and clinical information in a patient journey that remain unexploited. In that context, I propose to develop and apply novel machine learning techniques to predict cancer outcome such as recurrence or survival from multi-modal breast cancer patient data (including medical notes in natural languages and the outcome of various lab analyses). For that, I use a deep neural sequence transduction for electronic health records called BEHRT1. This model is inspired from one of the most powerful transformer-based architecture in Natural Language Processing: BERT2.
IMPROVING PERFORMANCE OF ALGORITHMS TO POWER UNMET NEED AND EFFECTIVENESS IN ...Schiffon Wong
The purpose of this workshop is to highlight best practices to improve observational research performance through 1) a methodological overview of algorithm development and validation in electronic health records (EHR) and healthcare claims-based databases, 2) an understanding of novel approaches using text-mining and natural language processing (NLP), and 3) case studies of algorithms developed to identify multiple sclerosis (MS) and its clinical subgroups to generate real world evidence (RWE). By offering an interactive forum, participants will be able to identify and share their successes and challenges in algorithm development and validation.
In this talk, we present our work on developing large-scale text mining and machine learning tools as well as their uses in real-world applications in PubMed search, biocuration and healthcare (medical image analysis).
Interest has increased in the use of prognosis factors as a cursor for breast cancer personalized treatment. For clinicians, early detection of those factors can be helpful for a good management of the disease and for the choice of an efficient treatment. Moreover, it exists a huge amount of meaningful information in pathological reports, biological measurements and clinical information in a patient journey that remain unexploited. In that context, I propose to develop and apply novel machine learning techniques to predict cancer outcome such as recurrence or survival from multi-modal breast cancer patient data (including medical notes in natural languages and the outcome of various lab analyses). For that, I use a deep neural sequence transduction for electronic health records called BEHRT1. This model is inspired from one of the most powerful transformer-based architecture in Natural Language Processing: BERT2.
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
An April 2023 presentation to the AMIA working group on natural language processing. The talk focuses on three current trends in NLP and how they apply in healthcare: Large language models, No-code, and Responsible AI.
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...Kees van Bochove
Talk from Kees van Bochove, The Hyve at SCOPE Summit, Real World Data track, Jan 26, 2017, Miami
A large open source initiative for standardisation and epidemiological analysis for real world data is OHDSI: Observational Health Data Sciences and Informatics. OHDSI leverages the OMOP common data model for observational data, and provides data analysis tools for a broad range of use cases. This talk will explain OMOP and OHDSI with case study IMI EMIF, in which health data from over 50 million patients from 13 national and regional European registries is brought together.
Managing Health and Disease Using Omics and Big DataLaura Berry
Presented at the NGS Tech and Applications Congress: USA. To find out more, visit:
www.global-engage.com
Michael Snyder is a Professor, Chair of Genetics and Director of the Stanford Center for Genomics and Personalized Medicine at Stanford University. In this presentation Michael discusses using omics and big data to predict disease risk and catch early disease onset.
Learn how to use Pathway Studio to explore biomarkers and brain regions. With the addition of highly sophisticated visualization tools, users can interactively explore the vast number of connections created to help unravel disease biology. In addition, an innovative new taxonomy based on brain region identifications will be presented. Together, these innovations can be applied to rapidly increase the knowledge of diseases based on published findings.
Next generation electronic medical records and search a test implementation i...lucenerevolution
Presented by David Piraino, Chief Imaging Information Officer, Imaging Institute Cleveland Clinic, Cleveland Clinic
& Daniel Palmer, Chief Imaging Information Officer, Imaging Institute Cleveland Clinic, Cleveland Clinic
Most patient specifc medical information is document oriented with varying amounts of associated meta-data. Most of pateint medical information is textual and semi-structured. Electronic Medical Record Systems (EMR) are not optimized to present the textual information to users in the most understandable ways. Present EMRs show information to the user in a reverse time oriented patient specific manner only. This talk discribes the construction and use of Solr search technologies to provide relevant historical information at the point of care while intepreting radiology images.
Radiology reports over a 4 year period were extracted from our Radiology Information System (RIS) and passed through a text processing engine to extract the results, impression, exam description, location, history, and date. Fifteen cases reported during clinical practice were used as test cases to determine if ""similar"" historical cases were found . The results were evaluated by the number of searches that returned any result in less than 3 seconds and the number of cases that illustrated the questioned diagnosis in the top 10 results returned as determined by a bone and joint radiologist. Also methods to better optimize the search results were reviewed.
An average of 7.8 out of the 10 highest rated reports showed a similar case highly related to the present case. The best search showed 10 out of 10 cases that were good examples and the lowest match search showed 2 out of 10 cases that were good examples.The talk will highlight this specific use case and the issues and advances of using Solr search technology in medicine with focus on point of care applications.
Measuring and improving the impacts of Health IT on clinical, cost and efficiency outcomes. Presented by Steven Shaha, Center for Policy & Public Administration, UK, at HINZ 2014, 12 November 2014, 12.22pm, Marlborough Room 3
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryJean-Claude Bradley
Mel Reichman, senior investigator and director of the LIMR Chemical Genomics Center at the Lankenau Institute for Medical Research presents at the chemistry department at Drexel University on November 12, 2009.
Modern drug discovery by high-throughput screening (HTS) begins with testing hundreds of thousands of compounds in biological assays. The confirmed hit rate for typical HTS is less than 0.5%; therefore, 99.5% of the costs of HTS are for generating null data. Orthogonal convolution of compound libraries (OCL) is 500% more efficient than present HTS practice. The OCL method combines 10 compounds per well. An advantage of this method is that each compound is represented twice in two separately arrayed pools. The potential for the approach to better enable academic centers of excellence to validate medicinally relevant biological targets is discussed.
Sepsis is one of the top causes of inpatient mortality and rapid detection presents numerous challenges. In March, 2016, an interdisciplinary team consisting of top clinicians, data scientists and machine learning experts at a large academic medical center (AMC) embarked on an innovation pilot to develop a novel machine learning model to detect sepsis. A computable sepsis definition and deep learning model were developed using a curated dataset capturing over 43,000 inpatient admissions between October 1, 2014 and December 31, 2015. Ten computable sepsis definitions were compared and our clinicians agreed on the following: >= 2 SIRS criteria, blood culture order, and end organ damage. This sepsis phenotype identified patients early in the hospital course: 38% of cases occur an average of 1.3 hours after presentation to the ED and 42% of cases occur an average of 15 hours after hospital admission. At 4 hours prior to sepsis, the best deep learning model generated 1.4 false alarms per true alarm at a sensitivity of 80%, compared to 3.2 false alarms per true alarm for National Early Warning System (NEWS).
Purpose
Sepsis Watch detects sepsis early, guides completion of appropriate treatment, and supports front-line providers with minimal interruption of clinical workflows. Key Performance Indicators include emergency department (ED) length of stay, hospital length of stay, inpatient mortality, intensive care unit requirement, and time to antibiotics for patients who develop sepsis.
Description
The core technology components of Sepsis Watch are web services to extract electronic health record (EHR) data in real-time, a data pipeline to normalize features, a computable sepsis definition, a deep learning sepsis prediction model, a web application (Figure 1), an automated report that calculates KPI performance, and a model input and output monitoring tool. A suite of education, training, communication, and workflow materials were also prepared with nurse educators and are hosted on an intranet training site. After a three-month silent period, Sepsis Watch was deployed in the ED of the 1,000 bed flagship hospital on November 5, 2018.
Conclusions
Sepsis Watch is the first deployment of deep learning model in real-time to detect sepsis integrated with an EHR. The tool is used by Rapid Response Team (RRT) nurses to provide proactive support to ED providers to identify and manage sepsis. A six-month clinical trial will be completed in May 2019 to rigorously assess the clinical and operational impact of the program.
How to Improve the Accuracy of the Initial Evaluation, Using a System Developed By Johns Hopkins Hospital Doctors by Nelson Hendler in Examines in Physical Medicine & Rehabilitation
The Personalized Health Risk Profile: A New Tool for Safety and Occupational ...Richard Hartman, Ph.D.
This presentation introduces the Personalized Risk Health Profile (PRHP), a mathematical process to quantitatively evaluate personalized health risks by integrating workplace, lifestyle, and environmental exposure (the root cause of disease) data from traditional and new personal monitoring technologies combined with individual health histories and genomic data to provide a new and novel capability for the safety and health professionals and policymakers. The PRHP creates for the first time a mechanism to better understand the relationships between a worker's health, genetic predispositions, and exposures through mathematical expression and process, ultimately providing a modern tool to better understand the effects of exposures from the workplace, environment, as well as day-to-day activities. More importantly, the PRRP displays individual and population risks through user-friendly visualizations bridging the gap between "Population Health" and "Personalized Medicine" so safety and health professionals can recommend data-driven interventions to mitigate individual risks to improve health/performance, and policymakers and decision-makers can make more informed policy and resource decisions.
Portée de la négation : détection par apprentissage supervisé en français et ...CORIA-TALN 2018
Poster issu de la session du jeudi midi de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
https://project.inria.fr/coriataln2018/
More Related Content
Similar to NLP support for clinical tasks and decisions
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
An April 2023 presentation to the AMIA working group on natural language processing. The talk focuses on three current trends in NLP and how they apply in healthcare: Large language models, No-code, and Responsible AI.
SCOPE Summit - Applying the OMOP data model & OHDSI software to national Euro...Kees van Bochove
Talk from Kees van Bochove, The Hyve at SCOPE Summit, Real World Data track, Jan 26, 2017, Miami
A large open source initiative for standardisation and epidemiological analysis for real world data is OHDSI: Observational Health Data Sciences and Informatics. OHDSI leverages the OMOP common data model for observational data, and provides data analysis tools for a broad range of use cases. This talk will explain OMOP and OHDSI with case study IMI EMIF, in which health data from over 50 million patients from 13 national and regional European registries is brought together.
Managing Health and Disease Using Omics and Big DataLaura Berry
Presented at the NGS Tech and Applications Congress: USA. To find out more, visit:
www.global-engage.com
Michael Snyder is a Professor, Chair of Genetics and Director of the Stanford Center for Genomics and Personalized Medicine at Stanford University. In this presentation Michael discusses using omics and big data to predict disease risk and catch early disease onset.
Learn how to use Pathway Studio to explore biomarkers and brain regions. With the addition of highly sophisticated visualization tools, users can interactively explore the vast number of connections created to help unravel disease biology. In addition, an innovative new taxonomy based on brain region identifications will be presented. Together, these innovations can be applied to rapidly increase the knowledge of diseases based on published findings.
Next generation electronic medical records and search a test implementation i...lucenerevolution
Presented by David Piraino, Chief Imaging Information Officer, Imaging Institute Cleveland Clinic, Cleveland Clinic
& Daniel Palmer, Chief Imaging Information Officer, Imaging Institute Cleveland Clinic, Cleveland Clinic
Most patient specifc medical information is document oriented with varying amounts of associated meta-data. Most of pateint medical information is textual and semi-structured. Electronic Medical Record Systems (EMR) are not optimized to present the textual information to users in the most understandable ways. Present EMRs show information to the user in a reverse time oriented patient specific manner only. This talk discribes the construction and use of Solr search technologies to provide relevant historical information at the point of care while intepreting radiology images.
Radiology reports over a 4 year period were extracted from our Radiology Information System (RIS) and passed through a text processing engine to extract the results, impression, exam description, location, history, and date. Fifteen cases reported during clinical practice were used as test cases to determine if ""similar"" historical cases were found . The results were evaluated by the number of searches that returned any result in less than 3 seconds and the number of cases that illustrated the questioned diagnosis in the top 10 results returned as determined by a bone and joint radiologist. Also methods to better optimize the search results were reviewed.
An average of 7.8 out of the 10 highest rated reports showed a similar case highly related to the present case. The best search showed 10 out of 10 cases that were good examples and the lowest match search showed 2 out of 10 cases that were good examples.The talk will highlight this specific use case and the issues and advances of using Solr search technology in medicine with focus on point of care applications.
Measuring and improving the impacts of Health IT on clinical, cost and efficiency outcomes. Presented by Steven Shaha, Center for Policy & Public Administration, UK, at HINZ 2014, 12 November 2014, 12.22pm, Marlborough Room 3
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryJean-Claude Bradley
Mel Reichman, senior investigator and director of the LIMR Chemical Genomics Center at the Lankenau Institute for Medical Research presents at the chemistry department at Drexel University on November 12, 2009.
Modern drug discovery by high-throughput screening (HTS) begins with testing hundreds of thousands of compounds in biological assays. The confirmed hit rate for typical HTS is less than 0.5%; therefore, 99.5% of the costs of HTS are for generating null data. Orthogonal convolution of compound libraries (OCL) is 500% more efficient than present HTS practice. The OCL method combines 10 compounds per well. An advantage of this method is that each compound is represented twice in two separately arrayed pools. The potential for the approach to better enable academic centers of excellence to validate medicinally relevant biological targets is discussed.
Sepsis is one of the top causes of inpatient mortality and rapid detection presents numerous challenges. In March, 2016, an interdisciplinary team consisting of top clinicians, data scientists and machine learning experts at a large academic medical center (AMC) embarked on an innovation pilot to develop a novel machine learning model to detect sepsis. A computable sepsis definition and deep learning model were developed using a curated dataset capturing over 43,000 inpatient admissions between October 1, 2014 and December 31, 2015. Ten computable sepsis definitions were compared and our clinicians agreed on the following: >= 2 SIRS criteria, blood culture order, and end organ damage. This sepsis phenotype identified patients early in the hospital course: 38% of cases occur an average of 1.3 hours after presentation to the ED and 42% of cases occur an average of 15 hours after hospital admission. At 4 hours prior to sepsis, the best deep learning model generated 1.4 false alarms per true alarm at a sensitivity of 80%, compared to 3.2 false alarms per true alarm for National Early Warning System (NEWS).
Purpose
Sepsis Watch detects sepsis early, guides completion of appropriate treatment, and supports front-line providers with minimal interruption of clinical workflows. Key Performance Indicators include emergency department (ED) length of stay, hospital length of stay, inpatient mortality, intensive care unit requirement, and time to antibiotics for patients who develop sepsis.
Description
The core technology components of Sepsis Watch are web services to extract electronic health record (EHR) data in real-time, a data pipeline to normalize features, a computable sepsis definition, a deep learning sepsis prediction model, a web application (Figure 1), an automated report that calculates KPI performance, and a model input and output monitoring tool. A suite of education, training, communication, and workflow materials were also prepared with nurse educators and are hosted on an intranet training site. After a three-month silent period, Sepsis Watch was deployed in the ED of the 1,000 bed flagship hospital on November 5, 2018.
Conclusions
Sepsis Watch is the first deployment of deep learning model in real-time to detect sepsis integrated with an EHR. The tool is used by Rapid Response Team (RRT) nurses to provide proactive support to ED providers to identify and manage sepsis. A six-month clinical trial will be completed in May 2019 to rigorously assess the clinical and operational impact of the program.
How to Improve the Accuracy of the Initial Evaluation, Using a System Developed By Johns Hopkins Hospital Doctors by Nelson Hendler in Examines in Physical Medicine & Rehabilitation
The Personalized Health Risk Profile: A New Tool for Safety and Occupational ...Richard Hartman, Ph.D.
This presentation introduces the Personalized Risk Health Profile (PRHP), a mathematical process to quantitatively evaluate personalized health risks by integrating workplace, lifestyle, and environmental exposure (the root cause of disease) data from traditional and new personal monitoring technologies combined with individual health histories and genomic data to provide a new and novel capability for the safety and health professionals and policymakers. The PRHP creates for the first time a mechanism to better understand the relationships between a worker's health, genetic predispositions, and exposures through mathematical expression and process, ultimately providing a modern tool to better understand the effects of exposures from the workplace, environment, as well as day-to-day activities. More importantly, the PRRP displays individual and population risks through user-friendly visualizations bridging the gap between "Population Health" and "Personalized Medicine" so safety and health professionals can recommend data-driven interventions to mitigate individual risks to improve health/performance, and policymakers and decision-makers can make more informed policy and resource decisions.
Portée de la négation : détection par apprentissage supervisé en français et ...CORIA-TALN 2018
Poster issu de la session du jeudi midi de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
https://project.inria.fr/coriataln2018/
Construction d'un corpus multilingue annoté en relations de traductionCORIA-TALN 2018
Poster issu de la session du jeudi midi de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
https://project.inria.fr/coriataln2018/
Analyse des noms agentifs dans des espaces vectoriels distributionnelsCORIA-TALN 2018
Slides de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
https://project.inria.fr/coriataln2018/
Marine Wauquiez
Article soumis à la conférence RJC en format long.
Décodeur neuronal pour la transcription de documents manuscrits anciensCORIA-TALN 2018
Slides de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
Adeline Granet, Emmanuel Morin, Harold Mouchère, Solen Quiniou et Christian Viard-gaudin
Article soumis à la conférence TALN en format long.
De l’usage réel des emojis à une prédiction de leurs catégoriesCORIA-TALN 2018
Slides de la conférence conjointe CORIA-TALN 2018 qui s'est déroulé du 14 au 18 mai 2018 à Rennes.
Gaël Guibon, Magalie Ochs et Patrice Bellot
Article soumis à la conférence TALN en format court.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.Sérgio Sacani
The return of a sample of near-surface atmosphere from Mars would facilitate answers to several first-order science questions surrounding the formation and evolution of the planet. One of the important aspects of terrestrial planet formation in general is the role that primary atmospheres played in influencing the chemistry and structure of the planets and their antecedents. Studies of the martian atmosphere can be used to investigate the role of a primary atmosphere in its history. Atmosphere samples would also inform our understanding of the near-surface chemistry of the planet, and ultimately the prospects for life. High-precision isotopic analyses of constituent gases are needed to address these questions, requiring that the analyses are made on returned samples rather than in situ.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Sérgio Sacani
We characterize the earliest galaxy population in the JADES Origins Field (JOF), the deepest
imaging field observed with JWST. We make use of the ancillary Hubble optical images (5 filters
spanning 0.4−0.9µm) and novel JWST images with 14 filters spanning 0.8−5µm, including 7 mediumband filters, and reaching total exposure times of up to 46 hours per filter. We combine all our data
at > 2.3µm to construct an ultradeep image, reaching as deep as ≈ 31.4 AB mag in the stack and
30.3-31.0 AB mag (5σ, r = 0.1” circular aperture) in individual filters. We measure photometric
redshifts and use robust selection criteria to identify a sample of eight galaxy candidates at redshifts
z = 11.5 − 15. These objects show compact half-light radii of R1/2 ∼ 50 − 200pc, stellar masses of
M⋆ ∼ 107−108M⊙, and star-formation rates of SFR ∼ 0.1−1 M⊙ yr−1
. Our search finds no candidates
at 15 < z < 20, placing upper limits at these redshifts. We develop a forward modeling approach to
infer the properties of the evolving luminosity function without binning in redshift or luminosity that
marginalizes over the photometric redshift uncertainty of our candidate galaxies and incorporates the
impact of non-detections. We find a z = 12 luminosity function in good agreement with prior results,
and that the luminosity function normalization and UV luminosity density decline by a factor of ∼ 2.5
from z = 12 to z = 14. We discuss the possible implications of our results in the context of theoretical
models for evolution of the dark matter halo mass function.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
3. National Library of Medicine
Purpose and establishment
In order to assist the advancement of medical and related sciences
and to aid the dissemination and exchange of scientific and other
information important to the progress of medicine and to the public
health, there is established the National Library of Medicine
https://www.gpo.gov/fdsys/pkg/USCODE-2011-title42/html/USCODE-2011-title42-chap6A-subchapIII-partD.htm
3
3
5. o When making health-related decisions, asking questions
is a natural and preferred approach to satisfying
information needs
o Corroboration: NLM will focus on understanding how
searches are initiated, how information is used, and how
questions are posed and answered.
A Platform for Biomedical Discovery and Data-Powered Health
National Library of Medicine
Strategic Plan 2017–2027
Report of the NLM Board of Regents
5
6. What is the best plan of
care for this patient?
what are the causes of
abdominal pain or cramps?
What was the pre-op
echocardiogram result?
6
8. Summarization (“bottom-
line” advice)
Linking research and clinical
information
GUI
API
Clinical question answering
Linking patient
records and
literature (CDS)
Identifying gaps in clinical research
Consumer Health Question
Answering
Repository for
Informed Decision
Making
8
9. o Central to clinical NLP tasks
o MetaMap Lite: A new implementation of the ontology-
based (UMLS) Named Entity Recognition tool
o "Impressive...it looks like MetaMap Lite is around 20 times faster *and* has
better performance! I understand there are caveats [in the evaluation]
(e.g., focus on disorders and only using default options for regular
MetaMap), but this is good news.”
Demner-Fushman D. Rogers WJ, Aronson SR. MetaMap Lite: an evaluation of a
new Java implementation of MetaMap. J Am Med Inform Assoc. 2017;0(0):1-5.
Collection / Tool
MetaMap cTAKES (DL) DNorm MetaMap Lite
P R F-1 P R F-1 P R F-1 P R F-1
NCBI disease 60.3 68.3 64.1 47.0 53.8 47.4 74.1 67.6 70.7 73.1 71.9 72.5
ShARe (entities) 59.5 48.1 53.2 46.3 46.2 46.2 N/A N/A N/A 74.2 42.1 53.8
i2b2 2010 38.1 35.7 36.8 31.9 34.1 32.9 N/A N/A N/A 47.0 31.9 38.0
LHC clinical 58.8 77.2 66.8 42.6 59.9 49.8 71.5 58.2 64.2 69.4 74.9 70.0
LHC biological 46.8 75.6 57.8 47.1 60.6 53.0 67.7 62.8 65.2 67.5 77.9 72.4
9
11. Roberts, K. & Demner-Fushman, D. (2016).
Annotating Logical Forms for EHR
Questions. Proceedings of the Language
Resources and Evaluation Conference
(LREC).
Roberts K, Demner-Fushman D. Toward a
Natural Language Interface for EHR
Questions. AMIA Joint Summit 2015
11
12. o Ask: “What was my last A1c?” OR look for it:
12
13. o Traditional QA systems search over unstructured data
o Not compatible with EHRs: free text + structured data
o Each EHR organizes unstructured/structured data differently
à Structured query
“What was my last A1c?”
Latest Test: A1C
13
14. ① Can EHR questions be converted to logical forms?
② What logical operations are necessary to represent EHR
questions?
③ Can human annotators achieve sufficient agreement?
④ Will a logical form method scale to the diversity of potential
EHR questions?
14
15. o From Li (2012): Structured database (17 questions) and
specific note (432 questions)
o Sample 100 questions to maximize representativeness
of question categories:
o Temporal: admission, discharge, PMH, visit, status, plan,
time range
o Concept: problem, treatment, test
o Answer type: boolean, count, trend, medical unit, time,
person/org, other
15
16. o Take advantage of existing NLP components:
o Concept recognition/normalization
“Was she hypertensive on admission?”
Was patient UMLSFinding(C057121) on admission?
“Do I have diabetes?”
Does patient have UMLSDisease(C0011849)?
16
17. o First-Order Logic (FOL) + Lambda Calculus (λ)
o Atomic objects C0004057
o Boolean predicates has_treatment(x, y, z)
o Functions max(…)
o λ-expressions λx.condition(x)
o λx.has_treatment(x, C0004057, visit)
o All events of the patient taking aspirin in this hospital visit
o “What was the volume of her urine last night?”
o δ(λx.has_function(x, C0232856, visit) ^ time_within(x, “last
night”))
o “When was the patient first discharged from the ward”
o time(earliest(λx.has_event(x, discharge, visit) ^ at_location(x,
“the ward”)))
17
18. o Q 1-25: double-annotated w/o instruction to determine
initial set of logical elements (LE)
o Q 26-50: double-annotated: 81% agreement on LEs,
40% agreement on complete logical form
o Q 51-100: double annotated: 85% agreement on LEs,
50% agreement on complete logical form
③ Can human annotators achieve sufficient
agreement?
18
19. o All 100 questions could be structured as a logical form
o From 100 questions
o 113 non-CUI objects (7 unique)
o 104 CUI objects (84 unique)
o 136 predicates (21 unique)
o 226 functions (10 unique)
o 110 lambda expressions
① Can EHR questions be converted to logical forms?
② What logical operations are necessary to represent
EHR questions?
19
20. o Long tail in frequency distribution of LEs
o 12 unique LEs make up 86% of non-CUI LEs
o has_treatment predicate used 32 times, but
14 predicates used only once
o Future work: further annotation, integration of
semantic parser for automatic question understanding
④ Will a logical form method scale to the diversity of
potential EHR questions? (mostly)
20
21. Roberts, K. & Patra, B. A Semantic Parsing Method for Mapping Clinical Questions to Logical
Forms. AMIA 2017
21
22. Deardorff A, Masterton K, Roberts K, Kilicoglu H,
Demner-Fushman D. A protocol-driven approach to
automatically finding authoritative answers to
consumer health questions in online resources.
Journal of the Association for Information Science
and Technology. 2017 July;68(7):1724–1736
Ben Abacha A, Demner-Fushman D. Recognizing
Question Entailment for Medical Question
Answering. AMIA 2016
Demner-Fushman D, Kilicoglu H, Roberts K,
Masterton K, Deardorff A. Consumer Health Question
Answering to Automatically Support NLM Customer
Services September, 2015, Technical Report to the
LHNCBC Board of Scientific Counselors
YASSINE ME
RABET
M R A B E T Y @ M A I L . N I H .G O V
ASMA BEN ABACH A
A S M A . B E N A B A C H A @ N I H .G O V
22
23. o Variety of styles
o Mostly informal language
o Ungrammatical sentences
o Inconsistent capitalization & punctuation
o Abbreviations
o Misspellings
o Extraneous information interspersed among questions
o Abundance of anaphora and ellipses
o Unclear information needs
o Collection annotated with 15 question types focused on diseases and drugs:
https://ceb.nlm.nih.gov/ridem/infobot_docs/CHQA-NER-Corpus_1.0.zip
Kilicoglu H, Ben Abacha A, Mrabet Y, Shooshan SE, Rodriguez L, Masterton K, Demner-Fushman D. Semantic annotation
of consumer health questions. BMC Bioinformatics. 2018 Feb 6;19(1):34. doi: 10.1186/s12859-018-2045-1.
23
25. o Misspellings can hinder automatic question
understanding
My mom is 82 years old suffering from anixity and
depression for the last 10 years was dianosed early on set
deminita 3 years ago. Do yall have a office in Greensboro
NC? Can you recommend someone. she has seretona
syndrome and nonething helps her.
o Error types:
o Not a real word: deminita à dementia
o Misuse of a real word: bowl movement à bowel movement
o Merge: for along time à for a long time
o Split: early on set à early onset
25
26. Detector Candidates Ranker Corrector
H1
H2
.
.
.
Hi
.
.
.
Hn-1
Hn
T1
T2
.
.
.
Ti
.
.
.
Tw-1
Tw
C1
C2
.
.
.
Ci
.
.
.
Cw-1
Cw
Input Layer
(Context)
w x n
Word2Vec
Input Matrix
n x w
Word2Vec
Output Matrix
Hidden Layer
(Word Embedding)
Output Layer
(Target Word)
SoftMax
P1
P2
.
.
.
Pi
.
.
.
Pw-1
Pw
Probability Score
(Target Word)
26
27. * An Ensemble Method for Spelling Correction in Consumer Health Questions. Kilicoglu H, Fiszman M,
Roberts K, Demner-Fushman D. AMIA 2015
Non-word:
Real-word Included:
Method Precision Recall F1 Time
Baseline * 66.91% 71.32% 0.6904 <1 hr.
CSpell 82.90% 78.29% 0.8053 < 1 min.
Method Precision Recall F1 Time
Baseline 72.01% 53.63% 0.6147 < 1 hr.
CSpell 82.80% 64.94% 0.7279 < 5 min.
Tested on 471 consumer health questions
27
29. o The focus is the primary entity or event of interest
o At least one per question, but occasionally multiple when
the consumer is interested in the interactions,
associations and comparisons
o UMLS entities à SVM à boundary adjustment à focus
o 56% (73% inexact) F1
o KODA à SVM
o 66% F1 (P ~ 70%, R ~ 62%)
o BiLSTM
o 59% (78% inexact) F1
Mrabet Y, Kilicoglu H, Roberts K, Demner-Fushman D.
Combining Open-domain and Biomedical Knowledge for
Topic Recognition in Consumer Health Questions. AMIA
2016 Annual Symposium, Chicago, IL, November 12-16,
2016.
Roberts K, Masterton K, Kilicoglu H, Fiszman M, Demner-
Fushman D. Annotating Question Decomposition on
Complex Medical Questions. LREC 2014.
29
33. How to use existing
question & answer
pairs to answer
new questions?Websites
Ben Abacha A & Demner-Fushman
D. Recognizing Question Entailment
for Medical Question Answering.
AMIA 2016
33
34. — Proposed definition: Question A entails Question B if
every answer to B is also an exact or partial answer to A.
A1 à B1 (An exact answer)
• A1 (CHQ): Hi I have retinitis pigmentosa for 3years. Im
suffering from this disease. Please introduce me any way to
treat mg eyes such as stem cell ....I am 25 years old and I have
only central vision. Please help me. Thank you
• B1 (FAQ): Are there treatments for RP?
A2 à B2 (A partial answer)
• A2 (CHQ): Can sepsis be prevented? Can someone get this
from a hospital?
• B2 (FAQ): Who gets sepsis?
34
35. q RQE Data (4k pairs of entailment questions) constructed
automatically from clinical questions (Ely & Osheroff, 2000).
Ø Data available on Github:
q Compared Machined Learning (ML) & Deep Learning (DL)
methods trained on open-domain and medical collections
of textual entailment and question similarity/entailment
(e.g. SNLI, Multi-NLI, cQA-SemEval, Quora).
q Logistic Regression trained on medical RQE data achieved
the best performance (75% Accuracy) on test data of
consumer health questions & NIH FAQs.
35
36. Recognizing Question
Entailment (RQE)
Similar Question
Retrieval (QR)
Question-Answer
Selection
Top-K Question Candidates
Question
Question
Index
Question-Answer
Collection
Top-N Entailed Questions
Collection of
47k QA pairs
will be available
Search Engine +
MetaMapLite
Answers
Logistic Regression
+ RQE Data
36
37. o Organization of a medical QA task @ TREC LiveQA 2017
o New benchmark for medical QA:
Ø Variety of consumer health questions, with reference
answers and annotations (Question Foci, Types & Keywords)
Ben Abacha A., Agichtein
E., Pinter Y. & Demner-
Fushman D. Overview of
the Medical QA Task @
TREC 2017 LiveQA Track.
Data available on Github:
37
40. Results on TREC’17 LiveQA medical test questions
MEASURES QR System QR+RQE
System
LiveQA’17
Best Results
LiveQA’17
Median Results
AvgScore (0-3) 0.711 0.827 0.637 0.431
Success@2+ 0.442 0.461 0.392 0.245
Precision@2+ 0.46 0.475 0.404 0.331
MAP@10 0.282 0.311 -- --
§ The best LiveQA team combined deep neural networks to retrieve similar
answered questions from the web.
Ø Relevance of this approach vs. classical QA methods.
§ Using QR+RQE and QA collection led to a 29.8% increase over the best official
score at LiveQA’17.
Ø Efficiency of recognizing question entailment and restricting answer sources
to trusted medical resources.
40
43. New Research Topic:
• Question: What does transverse
ct image demonstrate?
• Answer: focal defect in inflamed
appendiceal wall and
periappendiceal inflammatory
stranding.
Example:
QA
over images
43
44. ØOur first Deep Learning
VQA models achieved
good results: Second
best WBSS.
Ø To advance research in VQA, we built a first
manually annotated medical VQA collection
44
45. Dina Demner-Fushman D, Mork JG, Rogers W,
Shooshan SE, Rodriguez LM, Aronson AR.
Finding medication doses in the literature.
Submitted to AMIA 2018
Rodriguez LM, Demner-Fushman D.
Uncovering Knowledge Gaps in the Scientific
Literature on Maternal Morbidity and
Mortality using EHR Data. Submitted to AMIA
2018
45
46. o Doses determine medication safety and effectiveness
o Dose extraction from clinical text is extensively studied (i2b2)
o No studies on complete prescription information extraction
from the literature: Medication name, Dosage, Route of
administration, Frequency of administration, Duration of
administration, Reason for giving medication
o Questions:
o Will the approaches developed for clinical text work?
o Which sections of scientific papers provide dose information?
o Is sequence-to-sequence learning with neural networks a viable
approach to extraction of dose information?
46
47. o 694 documents fully annotated with drug
doses/strengths, forms, routes of administration,
frequencies and durations of administration, and the
reasons for administration
47
48. o MedEx
o DoseRegEx: numbers preceded or followed by units of
measure
o DoseRegEx + Chemical filter
o Long Short-Term Memory (LSTM) neural network with a
conditional random field (CRF) layer using character
embeddings.
https://guillaumegenthial.github.io/sequence-tagging-
with-tensorflow.html
Xu H, Stenner SP, Doan S, Johnson KB, Waitman LR, Denny JC. MedEx: a medication information
extraction system for clinical narratives. J Am Med Inform Assoc. 2010 Jan-Feb;17(1):19-24.
48
49. o Publicly available collection of scientific articles annotated with drug
doses/strengths, forms, routes of administration, frequencies and
durations of administration and the reasons for administration.
o Drop in performance when switching from clinical text.
o Dose information is predominantly reported in the full text, but
about 45% of the articles provide dose information in the titles and
abstracts as well.
49