AAPM Foster July 2009

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    Medicine is approaching a profound transition as the methods of molecular medicine start to transform the nature of health care.What is the significance of such methods? For the researcher, it is a paradigm shift, as the number of things that can be measured increases dramatically.

    Researchers express a vision for a scientific revolution in health care, from the qualitative to the quantitative-- A revolution based on information and thus computing

    However, even as we talk about transformation and revolution, we must recognize that computing is poorly used in health care today.These are the words of a recent National Research Council report.Thus, I will seek in my remarks today to shed light on three questions: how information technology is evolving, how this evolution may impact medicine, and how changes in medicine and health care will stress information techology.

    The story of computers is one of exponentials

    The story of computers is one of exponentials

    The story of computers is one of exponentials

    But things are not quite as bad as that

    What does this mean for medicine?We will certainly continue to see increasingly sophisticated computer applications aiding the physician in their tasks of observing, diagnosing, and treating – what used to be solely the domain the human senses, the brain, and the hands.More accurate, higher resolution, and more automated data acquisition systems.Computer-aided diagnosis and treatment planning systems that use large-scale data analysis and computer simulations.Automated radiation treatment and surgery systems. However, I want to focus here on some larger systems issues relating to quantitative medicine.

    Using gene expression microarrays, we find that these two diseases have quite different phenotypes—that quite different genes are expressed in the two conditions.Here, columns are patients; rows are genes.Not sure what is the significance of the Stage 1/Stage 2.”The beauty of gene expression profiling data is that it is quantitative and highly reproducible. Because of this, these data can be used to generate multivariate statistical models of the clinical behavior of cancer that have great predictive power.” -- http://lymphochip.nih.gov/Staudt_Adv_Immunol_2005.pdf

    And of course, we must not forget image-based biomarkers, as used in computer aided diagnosis of breast cancer, or as shown here, in an attempt to identify biomarkers for traumatic brain injury.ROIs used in a study at UIC(A) forceps minor (green), cortico-spinal tract (purple), inferior frontal-occipital fasciculus (red), external capsule (yellow), sagittal stratum (blue) (B) anterior corona radiata (green), superior longitudinal fasciculus (red), posterior corona radiata (blue); (C) cingulum (red), corpus callosum body (blue), splenium (yellow), and genu (green), and forceps major (purple).

    Then, by tracking the personalized treatment plan, we collect more patient data.Success demands that we integrate, to a far greater degree than previously possible, clinical practice, basic research, and clinical trials. A profound challenge for health care system and for information technology.

    Collecting and managing the enormous quantities of data that are now feasible, and required for EBM, is a huge challenge.However, merely putting in place the systems required to collect large quantities of data is not enough.We then need to make sense of that data. A challenge both for the physician and the researcher.

    These problems arise at multiple scales. E.g. …

    What these (and other examples that we will not have time to review) have in common …

    We cite [Rouse, Health Care as a CAS: Implications for Design… , NAE 2008] for the righthand side aprt.Must supportDynamic composition for a specific purposeEvolving community, function, environmentMessy data, failure, incomplete knowledgeNice, but insufficientData standardsPlatform standardsFederal policies

    Another perspective on the problem. A few words of explanation. If we are deploying a hospital IT system, we are (hopefully) in the bottom left hand corner.“You can’t achieve success via central planning.” Quoted in Crossing the Quality Chasm, p. 312In our scenarios, we don’t have that ability to control.

    What is the alternative? We can put in place mechanisms that facilitate groups with some common goal to form and function.Over time, things change, these groups evolve.If we are successful, they can expand, perhaps merge.Challenges: make this easy. Leverage scale effects.

    These are issues that the grid community has been working on for many years. We call these groupings Virtual Organizations.In healthcare today, there are of course many such “VOs.”But they are hard to form, fragmented, …

    Principles and mechanisms that has been under development for some years.First CS, then physical sciences, then biology, most recently biomedicine –

    What are these grid mechanisms and concepts, then? Hard to say something sensible in a few minutes.But basically it is about separating out concerns in a way that reduces barriers to entry and permits flexible use.

    API vs. protocol? “Illities”?

    [Create an image here.]For example DICOM and HL7 combine messaging and data model in the same interoperability standard. People are contextualizing this problem at the data interoperability level.  Systems interoperability often neglected.  An area of differentiation, bringing in best practice in industry and science into health care space. Open source platform.  Experience with systems interoperability standards: IETF, OASIS, W3C, 

    Scaling via automating data adaptersRepresentations of those things and semantics of those representations.Talk about how services are published, data modeling, etc.Publish data basesPublish servicesName published objects

    Loose coupling and encapsulationInteroperability through integration based on data mediation Evolutionary in nature Set of scalable systems and methods Explicit in architecture – data integration layerDemonstrated in GSI, GridFTP, MDS, ECOG

    Most images are never seen—and are not available—outside their originating institution

    Recap …

    6 representative challenges

    Data sharing is the most important

    1 Favorite

    AAPM Foster July 2009 - Presentation Transcript

    1. The present and future role of computers in medicine
      Ian Foster
      Computation Institute
      Argonne National Lab & University of Chicago
    2. Credits
      Thanks for support from
      Chan Soon-Shiong Foundation
      Department of Energy
      National Institutes of Health
      National Science Foundation
      And for many helpful conversations, Carl Kesselman, Jonathan Silverstein, Steve Tuecke, Stephan Erberich, Steve Graham, Ravi Madduri, and Patrick Soon-Shiong
    3. Biology is shifting from being an observational science to a quantitative molecular science
      Old biology: measure one/two things in two/three conditions
      High cost per measurement
      Analysis straightforward as little data
      Enormously difficult to work out pathways due to inadequate data
      New biology: measure 10,000 things under many conditions
      Low cost per measurement
      Analysis no longer straightforward
      Payoff can be bigger: potential to understand a complex system
      Ajay Jain, UCSF
    4. Change health care
      from
      an empirical, qualitative systemof silos of information
      to a model of
      predictive, quantitative, shared,evidence-based outcomes
    5. The health care information technology chasm
      Health care IT [is] rarely used to provide clinicians with evidence-based decision support and feedback; to support data-driven process improvement; or to link clinical care and research.
      Computational Technology for Effective Health Care, NRC, 2009
    6. Digital power =
      computing x communicationxstorage x content
      Moore’s law
      doubles
      every 18
      months
      disk law
      doubles
      x every 12
      months
      fiber law
      doubles
      xevery9
      months
      community law
      n
      x 2
      where n is # people
      John SeelyBrown
    7. (Intel)
    8. Microprocessor trends
      AMD
      Dual core (April 2005)
      Quad core (October 2007)
      Intel
      Dual core (July 2005)
      Quad core (December 2006)
      Sun
      Niagara: 8 cores * 4 threads/core (November 2005)
      Niagara2: 8 cores * 8 threads/core (August 2007)
      IBM POWER6
      2 cores * 4 threads/core (May 2007)
      Tilera 64 cores
      Dan Reed, Microsoft
      11
    9. Marching towards manycore
      Intel’s 80 core prototype
      2-D mesh interconnect
      62 W power
      Tilera 64 core system
      8x8 grid of cores
      5 MB coherent cache
      4 DDR2 controllers
      2 10 GbE interfaces
      IBM Cell
      PowerPC and 8 cores
      Dan Reed, Microsoft
      12
    10. 1E+17
      multi-Petaflop
      Petaflop
      Blue Gene/L
      1E+14
      Thunder
      Red Storm
      Earth
      Blue Pacific
      ASCI White, ASCI Q
      SX-5
      ASCI Red Option
      ASCI Red
      T3E
      SX-4
      NWT
      CP-PACS
      1E+11
      CM-5
      Paragon
      T3D
      Delta
      SX-3/44
      Doubling time = 1.5 yr.
      i860 (MPPs)
      VP2600/10
      SX-2
      CRAY-2
      Y-MP8
      S-810/20
      X-MP4
      Peak Speed (flops)
      Cyber 205
      X-MP2 (parallel vectors)
      1E+8
      CRAY-1
      CDC STAR-100 (vectors)
      CDC 7600
      ILLIAC IV
      CDC 6600 (ICs)
      IBM Stretch
      1E+5
      IBM 7090 (transistors)
      IBM 704
      IBM 701
      UNIVAC
      ENIAC (vacuum tubes)
      1E+2
      1940
      1950
      1960
      1970
      1980
      1990
      2000
      2010
      Year Introduced
      The evolution of the fastest supercomputer
      Argonne
      My laptop
    11. The Argonne IBM BG/P
    12. www.top500.org
      1
      2
      3-4
      >128K
    13. Simulation of the human arterial tree on the TeraGrid
      G. Karniadakis et al.
    14. Storage costs
      (PC Magazine, Oct 2, 2007)
    15. Informationbig bang
      All informationper year
      100Exabytes
      Uniqueinformationper year
      All human documentsproduced last 40,000 years(to 1997)
      12 Exabytes
      2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
      © Stuart Card, basedon Lesk, Berkeley SIMS, Landauer, EMC
    16. Growth of Genbank(1982-2005)
      Broad Institute
    17. More data does not always mean more knowledge
      Folker Meyer, Genome Sequencing vs. Moore’s Law: Cyber Challenges for the Next Decade, CTWatch, August 2006.
    18. The Red Queen’s race
      "Well, in our country," said Alice … "you'd generally get to somewhere else — if you run very fast for a long time, as we've been doing.”
      "A slow sort of country!" said the Queen. "Now, here, you see, it takes all the running you can do, to keep in the same place. If you want to get somewhere else, you must run at least twice as fast as that!"
    19. Computing ondemand
      Public PUMA knowledge base
      Information about proteins analyzed against ~2 million gene sequences
      Back officeanalysis on Grid
      Millions of BLAST, BLOCKS, etc., onOSG and TeraGrid
      Natalia Maltsev et al.
    20. 1.6 Tbps
      Cost perGigabit-Mile
      320 Gbps
      Moore’sLaw
      2.5 Gbps
      50 Mbps
      1984
      1994
      1998
      2000
      1993
      1998
      2002
      Opticalnetworkingbreakthrough!
      Revolution
      Capacity increase and new economics
      Nortel
    21. Optical switches
      Lucent
    22. Empiricism
      Theory
      Simulation
      Data
      New ways of knowing
      300 BCE
      1700
      1950
      1990
      Enhanced by the power of collaboration
    23. Quantitative medicine is the key to reducing healthcare costs and improving healthcare outcomes
      Patients with same diagnosis
    24. Quantitative medicine is the key to reducing healthcare costs and improving healthcare outcomes
      Non-responderstoxic responders
      Non-toxic responders
      Patients with same diagnosis
      Misdiagnosed
    25. Leukemia and Lymphoma
      After Mara Aspinall, GenzymeGenetics; Felix W. Frueh, FDA
    26. Leukemia and Lymphoma
      After Mara Aspinall, GenzymeGenetics; Felix W. Frueh, FDA
    27. Currently, 17% of Burkitt's Lymphoma are incorrectly diagnosed as Diffuse Large B Cell Lymphoma
      Classic
      Burkitt’sLymphoma
      Atypical
      Burkitt’sLymphoma
      Diffuse Large
      B Cell Lymphoma
      Louis Staudt, National Cancer Institute
    28. Classic Burkitt Lymphoma Cure Rate
      Diagnosis
      Atypical Burkitt Lymphoma Cure Rate
      Treatment
      Patients’ Actual Disease
      Burkitt’sLymphoma
      60 %
      80 %
      Intensive chemotherapy
      Burkitt’sLymphoma
      Diffuse Large B Cell Lymphoma
      0 %
      15 %
      CHOPregimen
      Burkitt’sLymphoma
    29. Survival estimates for patients with Burkitt's Lymphoma
      Best treatment for Burkitt’s Lymphoma
      Best treatment for Diffuse Large B Cell Lymphoma
      Dave et al, NEJM, June 8, 2006.
    30. Burkitt’s
      Lymphoma
      Diffuse Large
      B-cell Lymphoma
      Classic Atypical
      Louis Staudt, National Cancer Institute
    31. Imaging biomarkers: Diffusion Tensor Imaging and brain injury
      Kraus et al., Brain (2007), 130, 2508-2519
    32. Enabling quantitative medicine
      Collect a lot of patient data
      Analyze data to infer effective treatments
      Identify personalized treatment plans
      Clinical practice
      Basic research
      Clinical trials
    33. Challenges
      Increasing volumes of data, types of data: genomics, blood proteins, imaging, …
      New science and treatments are hidden in the data, not the biology (biomarkers)
      Too much for the individual physician or researcher to absorb
      … have to pay attention to cognitive support … computer-based tools and systems that offer clinicians and patients assistance for thinking about and solving problems related to specific instances of health care.
      NRC Report on Computational Technology for Effective Health Care: Immediate Steps and Strategic Directions, 2009
    34. Bridging silos to enable quantitative medicine
      Basic research
      ongoing investigative studies
      Outcomes, tissue bank
      screening tests
      pathways
      library
      Clinical practice
      Clinical trials
      trial subjects, outcomes
    35. Addressing urban health needs
    36. Important characteristics
      We must integrate systems that may not have worked together before
      These are human systems, with differing goals, incentives, capabilities
      All components are dynamic—change is the norm, not the exception
      Processes are evolving rapidly too
      We are not building something simple like a bridge or an airline reservation system
    37. Healthcare is acomplex adaptive system
      A complex adaptive system is a collection of individual agents that have the freedom to act in ways that are not always predictable and whose actions are interconnected such that one agent’s actions changes the context for other agents.
      Crossing the Quality Chasm, IOM, 2001; pp 312-13
      • Non-linear and dynamic
      • Agents are independent and intelligent
      • Goals and behaviors often in conflict
      • Self-organization through adaptation and learning
      • No single point(s) of control
      • Hierarchical decomp-osition has limited value
    38. We need to function in the zone of complexity
      Low
      Chaos
      Zone ofcomplexity
      Agreement
      about
      outcomes
      Plan and control
      High
      Low
      High
      Certainty about outcomes
      Ralph Stacey, Complexity and Creativity in Organizations, 1996
    39. We need to function in the zone of complexity
      Low
      Chaos
      Agreement
      about
      outcomes
      Plan and control
      High
      Low
      High
      Certainty about outcomes
      Ralph Stacey, Complexity and Creativity in Organizations, 1996
    40. We call these groupingsvirtual organizations (VOs)
      A set of individuals and/or institutions engaged in the controlled sharing of resources in pursuit of a common goal
      But U.S. health system is marked by fragmented and inefficient VOs with insufficient mechanisms for controlled sharing
      Healthcare = dynamic, overlapping VOs, linking
      Patient – primary care
      Sub-specialist – hospital
      Pharmacy – laboratory
      Insurer – …
      I advocate … a model of virtual integration rather than true vertical integration … G. Halvorson, CEO Kaiser
    41. The Grid paradigm
      Principles and mechanisms for dynamic VOs
      Leverage service oriented architecture (SOA)
      Loose coupling of data and services
      Open software,architecture
      Engineering
      Biomedicine
      Computer science
      Physics
      Healthcare
      Astronomy
      Biology
      1995 2000 2005 2010
    42. The Grid paradigm and healthcare information integration
      [Grid architecture joint work with Carl Kesselman, Steve Tuecke, Stephan Erberich, and others]
      Manage who can do what
      Make data usable and useful
      Platform services
      Name data and move it around
      Make data accessible over the network
      Data sources
      Radiology
      Medical records
      Pathology
      Genomics
      Labs
      RHIO
    43. The Grid paradigm and healthcare information integration
      Enhance user cognitive processes
      Security and policy
      Incorporate into business processes
      Transform data into knowledge
      Integration
      Platform services
      Management
      Publication
      Data sources
      Radiology
      Medical records
      Pathology
      Genomics
      Labs
      RHIO
    44. The Grid paradigm and healthcare information integration
      Cognitive support
      Security and policy
      Valueservices
      Applications
      Analysis
      Integration
      Platform services
      Management
      Publication
      Data sources
      Radiology
      Medical records
      Pathology
      Genomics
      Labs
      RHIO
    45. We partition the multi-faceted interoperability problem
      Process interoperability
      Integrate work across healthcare enterprise
      Data interoperability
      Syntactic: move structured data among system elements
      Semantic: use information across system elements
      Systems interoperability
      Communicate securely, reliably among system elements
      Applications
      Analysis
      Integration
      Management
      Publication
    46. Publication:Make information accessible
      Make data available in a remotely accessible, reusable manner
      Leave mediation for integration layer
      Gateway from local policy/protocol into wide area mechanisms (transport, security, …)
    47. Imaging clinical trials
      NeuroblastomaCancerFoundation
      Childrens Oncology Group
    48. Stephan Erberich,
      Carl Kesselman, et al.
    49. As of Oct19, 2008:
      122 participants
      105 services
      70 data
      35 analytical
    50. Data movement in clinical trials
      (Center for Health Informatics)
    51. Community public health:Digital retinopathy screening network
      (Center for Health Informatics)
    52. Integration:Making data usable and useful
      ?
      Adaptive approach
      100%
      Degree of communication
      Loosely coupled approach
      Rigid standards-based approach
      0%
      0% 100%
      Degree of prior syntactic and semantic agreement
    53. Integration via mediation
      • Map between models
      • Scoped to domain use
      • Multiple concurrent use
      • Bottom up mediation
      • between standards and versions
      • between local versions
      • in absence of agreement
      Global Data Model
      Query reformulation
      Query in union of exported source schema
      Query optimization
      Distributed query execution
      Query execution engine
      Wrapper
      Wrapper
      Query in the
      sourceschema
      Alon Halevy, 2000
    54. Analytics:Transform data into knowledge
      “The overwhelming success of genetic and genomic research efforts has created an enormous backlog of data with the potential to improve the quality of patient care and cost effectiveness of treatment.”
      — US Presidential Council of Advisors on Science and Technology, Personalized Medicine Themes, 2008
    55. Published
      The imagepyramid
      Enrolled/ evaluated
      Eligible patients
      Created
      Michael Vannier
    56. Query and retrieve microarray data from a caArray data service:cagridnode.c2b2.columbia.edu:8080/wsrf/services/cagrid/CaArrayScrub
      Normalize microarray data using GenePattern analytical servicenode255.broad.mit.edu:6060/wsrf/services/cagrid/PreprocessDatasetMAGEService
      Hierarchical clustering using geWorkbench analytical service: cagridnode.c2b2.columbia.edu:8080/wsrf/services/cagrid/HierarchicalClusteringMage
      Microarray clustering using Taverna
      Workflow in/output
      caGrid services
      “Shim” services
      others
      Wei Tan et al.
    57. Many many tasks:Identifying potential drug targets
      2M+ ligands
      Protein xtarget(s)
      Benoit Roux et al.
    58. 6 GB
      2M structures
      (6 GB)
      ~4M x 60s x 1 cpu
      ~60K cpu-hrs
      FRED
      DOCK6
      Select best ~5K
      Select best ~5K
      ~10K x 20m x 1 cpu
      ~3K cpu-hrs
      Amber
      Select best ~500
      ~500 x 10hr x 100 cpu
      ~500K cpu-hrs
      GCMC
      ZINC
      3-D
      structures
      Manually prep
      DOCK6 rec file
      Manually prep
      FRED rec file
      NAB scriptparameters
      (defines flexible
      residues,
      #MDsteps)
      NAB
      Script
      Template
      DOCK6
      Receptor
      (1 per protein:
      defines pocket
      to bind to)
      FRED
      Receptor
      (1 per protein:
      defines pocket
      to bind to)
      PDB
      protein
      descriptions
      1 protein
      (1MB)
      BuildNABScript
      Amber prep:
      2. AmberizeReceptor
      4. perl: gen nabscript
      NAB
      Script
      start
      Amber Score:
      1. AmberizeLigand
      3. AmberizeComplex
      5. RunNABScript
      For 1 target:
      4 million tasks500,000 cpu-hrs
      (50 cpu-years)
      end
      report
      ligands
      complexes
    59. DOCK on BG/P: ~1M tasks on 118,000 CPUs
      CPU cores: 118784
      Tasks: 934803
      Elapsed time: 7257 sec
      Compute time: 21.43 CPU years
      Average task time: 667 sec
      Relative Efficiency: 99.7% (from 16 to 32 racks)
      Utilization:
      Sustained: 99.6%
      Overall: 78.3%
      Time (secs)
      Ioan Raicu et al.
    60. The health care information technology chasm
      Health care IT [is] rarely used to provide clinicians with evidence-based decision support and feedback; to support data-driven process improvement; or to link clinical care and research.
      Computational Technology for Effective Health Care, NRC, 2009
    61. Six research challenges for information technology and healthcare
      Patient-centered cognitive support
      Modeling—an individualized virtual patient
      Automation—integrated use, adaptivity
      Data sharing and collaboration
      Data management at scale
      Automated full capture of physician-patient interactions
      Computational Technology for Effective Health Care, NRC, 2009
    62. Six research challenges for information technology and healthcare
      Patient-centered cognitive support
      Modeling—an individualized virtual patient
      Automation—integrated use, adaptivity
      Data sharing and collaboration
      Data management at scale
      Automated full capture of physician-patient interactions
      Computational Technology for Effective Health Care, NRC, 2009
    63. Functioning in the zone of complexity
      Low
      Chaos
      Agreement
      about
      outcomes
      Plan and control
      High
      Low
      High
      Certainty about outcomes
      Ralph Stacey, Complexity and Creativity in Organizations, 1996
    64. The Grid paradigm and healthcare information integration
      Cognitive support
      Security and policy
      Valueservices
      Applications
      Analysis
      Integration
      Platform services
      Management
      Publication
      Data sources
      Radiology
      Medical records
      Pathology
      Genomics
      Labs
      RHIO
    65. “People tend to overestimate the short-term impact of change, and underestimate the long-term impact.”
      — Roy Amara
      “The computer revolution hasn’t happened yet.”
      — Alan Kay, 1997
    66. Thank you!
      Computation Institutewww.ci.uchicago.edu

    + Ian FosterIan Foster, 4 months ago

    custom

    388 views, 1 favs, 0 embeds more stats

    I gave this talk in the "Presidential Symposium" at more

    More info about this document

    CC Attribution License

    Go to text version

    • Total Views 388
      • 388 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 1
    • Downloads 17
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories