This document provides summaries of 10 applications that were presented at the NIH IC Applications Show & Tell Program on September 4, 2014. The applications cover a variety of functions including abstract classification, technology showcasing, large screen display, research services matching, idea generation, cloud computing, data warehousing, and meeting registration. Contact information is provided for each application submitter.
Artificial Intelligence, Machine Learning, Deep Learning and Robot Process Au...Constantinos Galanakis
With the occasion of ACI's Conference DIGITALISATION IN SHIPPING that took place in Hamburg (9th & 10th October 2019) Elvictor presented its IT infrastructure, by illustrating Elvictor's successful story of Digitalisation in Crew Management and Recruitment. The presentation of Constantinos S Galanakis : "Artificial Intelligence, Machine Learning, Deep Learning and Robot Process Automation applied to Crew Recruitment" may be a road map to a successful digital transformation of a crew manager.
DROIDSWAN: Detecting Malicious Android Applications Based on Static Feature A...csandit
Android being a widely used mobile platform has witnessed an increase in the number of malicious samples on its market place. The availability of multiple sources for downloading
applications has also contributed to users falling prey to malicious applications. Classification of an Android application as malicious or benign remains a challenge as malicious applications maneuver to pose themselves as benign. This paper presents an approach which extracts various features from Android Application Package file (APK) using static analysis and subsequently classifies using machine learning techniques. The contribution of this work includes deriving, extracting and analyzing crucial features of Android applications that aid in efficient classification. The analysis is carried out using various machine learning algorithms
with both weighted and non-weighted approaches. It was observed that weighted approach depicts higher detection rates using fewer features. Random Forest algorithm exhibited high detection rate and shows the least false positive rate.
Search engines, and Apache Solr in particular, are quickly shifting the focus away from “big data” systems storing massive amounts of raw (but largely unharnessed) content, to “smart data” systems where the most relevant and actionable content is quickly surfaced instead. Apache Solr is the blazing-fast and fault-tolerant distributed search engine leveraged by 90% of Fortune 500 companies. As a community-driven open source project, Solr brings in diverse contributions from many of the top companies in the world, particularly those for whom returning the most relevant results is mission critical.
Out of the box, Solr includes advanced capabilities like learning to rank (machine-learned ranking), graph queries and distributed graph traversals, job scheduling for processing batch and streaming data workloads, the ability to build and deploy machine learning models, and a wide variety of query parsers and functions allowing you to very easily build highly relevant and domain-specific semantic search, recommendations, or personalized search experiences. These days, Solr even enables you to run SQL queries directly against it, mixing and matching the full power of Solr’s free-text, geospatial, and other search capabilities with the a prominent query language already known by most developers (and which many external systems can use to query Solr directly).
Due to the community-oriented nature of Solr, the ecosystem of capabilities also spans well beyond just the core project. In this talk, we’ll also cover several other projects within the larger Apache Lucene/Solr ecosystem that further enhance Solr’s smart data capabilities: bi-directional integration of Apache Spark and Solr’s capabilities, large-scale entity extraction, semantic knowledge graphs for discovering, traversing, and scoring meaningful relationships within your data, auto-generation of domain-specific ontologies, running SPARQL queries against Solr on RDF triples, probabilistic identification of key phrases within a query or document, conceptual search leveraging Word2Vec, and even Lucidworks’ own Fusion project which extends Solr to provide an enterprise-ready smart data platform out of the box.
We’ll dive into how all of these capabilities can fit within your data science toolbox, and you’ll come away with a really good feel for how to build highly relevant “smart data” applications leveraging these key technologies.
Artificial Intelligence, Machine Learning, Deep Learning and Robot Process Au...Constantinos Galanakis
With the occasion of ACI's Conference DIGITALISATION IN SHIPPING that took place in Hamburg (9th & 10th October 2019) Elvictor presented its IT infrastructure, by illustrating Elvictor's successful story of Digitalisation in Crew Management and Recruitment. The presentation of Constantinos S Galanakis : "Artificial Intelligence, Machine Learning, Deep Learning and Robot Process Automation applied to Crew Recruitment" may be a road map to a successful digital transformation of a crew manager.
DROIDSWAN: Detecting Malicious Android Applications Based on Static Feature A...csandit
Android being a widely used mobile platform has witnessed an increase in the number of malicious samples on its market place. The availability of multiple sources for downloading
applications has also contributed to users falling prey to malicious applications. Classification of an Android application as malicious or benign remains a challenge as malicious applications maneuver to pose themselves as benign. This paper presents an approach which extracts various features from Android Application Package file (APK) using static analysis and subsequently classifies using machine learning techniques. The contribution of this work includes deriving, extracting and analyzing crucial features of Android applications that aid in efficient classification. The analysis is carried out using various machine learning algorithms
with both weighted and non-weighted approaches. It was observed that weighted approach depicts higher detection rates using fewer features. Random Forest algorithm exhibited high detection rate and shows the least false positive rate.
Search engines, and Apache Solr in particular, are quickly shifting the focus away from “big data” systems storing massive amounts of raw (but largely unharnessed) content, to “smart data” systems where the most relevant and actionable content is quickly surfaced instead. Apache Solr is the blazing-fast and fault-tolerant distributed search engine leveraged by 90% of Fortune 500 companies. As a community-driven open source project, Solr brings in diverse contributions from many of the top companies in the world, particularly those for whom returning the most relevant results is mission critical.
Out of the box, Solr includes advanced capabilities like learning to rank (machine-learned ranking), graph queries and distributed graph traversals, job scheduling for processing batch and streaming data workloads, the ability to build and deploy machine learning models, and a wide variety of query parsers and functions allowing you to very easily build highly relevant and domain-specific semantic search, recommendations, or personalized search experiences. These days, Solr even enables you to run SQL queries directly against it, mixing and matching the full power of Solr’s free-text, geospatial, and other search capabilities with the a prominent query language already known by most developers (and which many external systems can use to query Solr directly).
Due to the community-oriented nature of Solr, the ecosystem of capabilities also spans well beyond just the core project. In this talk, we’ll also cover several other projects within the larger Apache Lucene/Solr ecosystem that further enhance Solr’s smart data capabilities: bi-directional integration of Apache Spark and Solr’s capabilities, large-scale entity extraction, semantic knowledge graphs for discovering, traversing, and scoring meaningful relationships within your data, auto-generation of domain-specific ontologies, running SPARQL queries against Solr on RDF triples, probabilistic identification of key phrases within a query or document, conceptual search leveraging Word2Vec, and even Lucidworks’ own Fusion project which extends Solr to provide an enterprise-ready smart data platform out of the box.
We’ll dive into how all of these capabilities can fit within your data science toolbox, and you’ll come away with a really good feel for how to build highly relevant “smart data” applications leveraging these key technologies.
ITAC 2016 Where Open Source Meets Audit AnalyticsAndrew Clark
Open source software is taking the computer science community and IT departments by storm. The breadth of options, the timeliness of updates, the price, and the sense of community are all contributing factors to the rise of open source computing. For many years audit analytics has been confined to the Computer Assisted Auditing Techniques, CAAT, software vendors ACL, IDEA and now Arbutus. However, these software programs require extensive training to use effectively, are not very flexible, and in most cases fail to provide the outcome auditors are expecting. Moving to an open source platform based around the python ecosystem allows for true customization of analytics, and provides a common language to interact with your IT department. By using the same set of tools, an auditing department can move from rudimentary AP duplicate tests all the way to advanced classification and clustering machine learning tests. Although the barrier to entry for open source software is higher than for most CAATs, with cross-functional collaboration, a truly customized, sustainable, and highly effective analytics program can be created.
Program for Show & Tell #1 (10 December 2013)nihshowandtell
This guide describes the applications demonstrated during the inaugural IC Applications Show & Tell event. A brief description of each application is included, along with contact information for the current provider or owner of the application.
The global need to securely derive (instant) insights, have motivated data architectures from distributed storage, to data lakes, data warehouses and lake-houses. In this talk we describe Tag.bio, a next generation data mesh platform that embeds vital elements such as domain centricity/ownership, Data as Products, Self-serve architecture, with a federated computational layer. Tag.bio data products combine data sets, smart APIs, statistical and machine learning algorithms into decentralized data products for users to discover insights using FAIR Principles. Researchers can use its point and click (no-code) system to instantly perform analysis and share versioned, reproducible results. The platform combines a dynamic cohort builder with analysis protocols and applications (low-code) to drive complex analysis workflows. Applications within data products are fully customizable via R and Python plugins (pro-code), and the platform supports notebook based developer environments with individual workspaces.
Join us for a talk/demo session on Tag.bio data mesh platform and learn how major pharma industries and university health systems are using this technology to promote value based healthcare, precision healthcare, find cures for disease, and promote collaboration (without explicitly moving data around). The talk also outlines Tag.bio secure data exchange features for real world evidence datasets, privacy centric data products (confidential computing) as well as integration with cloud services
Outlining the common challenges encountered when structuring clinical and research datasets for deep learning training.
Typically the datasets are so unstructured that they are impossible to analyze by any deep learning practitioners. And the cleaning and data wrangling ends up taking most of the time which could have been planned properly even before the clinical data acquisition.
One could argue that especially for medical data, the annotated data is the new gold, and not just the Big Data scattered all over the place. This is practice translates to efforts to design as intelligent as possible data labelling pipelines for efficient use of expert clinician annotation work.
Alternative download link:
https://www.dropbox.com/s/bbgc21yc86h0t14/Efficient_Ocular_Data_Labelling.pdf?dl=0
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
Presentation given at the BD2K All Hands meeting in Bethesda, MD, USA in November 2015
https://datascience.nih.gov/bd2k/events/NOV2015-AllHands
Video cast of this presentation:
http://videocast.nih.gov/summary.asp?Live=17480&bhcp=1
talk starts at 2hrs 40min (its about 55mins long) - includes video!
Document describing the Commons : https://datascience.nih.gov/commons
Arabidopsis Information Portal: A Community-Extensible Platform for Open DataMatthew Vaughn
Araport is an innovative model organism database resource that offers users the ability to bring their own visualizations, data sets, algorithms, and genome browser tracks and share them with their colleagues.
Interoperability is the key: repositories networks promoting the quality and ...Pedro Príncipe
Presentation from José Carvalho and Pedro Principe, University of Minho, at ETD 2019 Conference (22nd International Symposium on Electronic Theses and Dissertations), Porto, Nov 7, 2019.
This session provides a comprehensive overview of the latest updates to the Uniform Administrative Requirements, Cost Principles, and Audit Requirements for Federal Awards (commonly known as the Uniform Guidance) outlined in the 2 CFR 200.
With a focus on the 2024 revisions issued by the Office of Management and Budget (OMB), participants will gain insight into the key changes affecting federal grant recipients. The session will delve into critical regulatory updates, providing attendees with the knowledge and tools necessary to navigate and comply with the evolving landscape of federal grant management.
Learning Objectives:
- Understand the rationale behind the 2024 updates to the Uniform Guidance outlined in 2 CFR 200, and their implications for federal grant recipients.
- Identify the key changes and revisions introduced by the Office of Management and Budget (OMB) in the 2024 edition of 2 CFR 200.
- Gain proficiency in applying the updated regulations to ensure compliance with federal grant requirements and avoid potential audit findings.
- Develop strategies for effectively implementing the new guidelines within the grant management processes of their respective organizations, fostering efficiency and accountability in federal grant administration.
ITAC 2016 Where Open Source Meets Audit AnalyticsAndrew Clark
Open source software is taking the computer science community and IT departments by storm. The breadth of options, the timeliness of updates, the price, and the sense of community are all contributing factors to the rise of open source computing. For many years audit analytics has been confined to the Computer Assisted Auditing Techniques, CAAT, software vendors ACL, IDEA and now Arbutus. However, these software programs require extensive training to use effectively, are not very flexible, and in most cases fail to provide the outcome auditors are expecting. Moving to an open source platform based around the python ecosystem allows for true customization of analytics, and provides a common language to interact with your IT department. By using the same set of tools, an auditing department can move from rudimentary AP duplicate tests all the way to advanced classification and clustering machine learning tests. Although the barrier to entry for open source software is higher than for most CAATs, with cross-functional collaboration, a truly customized, sustainable, and highly effective analytics program can be created.
Program for Show & Tell #1 (10 December 2013)nihshowandtell
This guide describes the applications demonstrated during the inaugural IC Applications Show & Tell event. A brief description of each application is included, along with contact information for the current provider or owner of the application.
The global need to securely derive (instant) insights, have motivated data architectures from distributed storage, to data lakes, data warehouses and lake-houses. In this talk we describe Tag.bio, a next generation data mesh platform that embeds vital elements such as domain centricity/ownership, Data as Products, Self-serve architecture, with a federated computational layer. Tag.bio data products combine data sets, smart APIs, statistical and machine learning algorithms into decentralized data products for users to discover insights using FAIR Principles. Researchers can use its point and click (no-code) system to instantly perform analysis and share versioned, reproducible results. The platform combines a dynamic cohort builder with analysis protocols and applications (low-code) to drive complex analysis workflows. Applications within data products are fully customizable via R and Python plugins (pro-code), and the platform supports notebook based developer environments with individual workspaces.
Join us for a talk/demo session on Tag.bio data mesh platform and learn how major pharma industries and university health systems are using this technology to promote value based healthcare, precision healthcare, find cures for disease, and promote collaboration (without explicitly moving data around). The talk also outlines Tag.bio secure data exchange features for real world evidence datasets, privacy centric data products (confidential computing) as well as integration with cloud services
Outlining the common challenges encountered when structuring clinical and research datasets for deep learning training.
Typically the datasets are so unstructured that they are impossible to analyze by any deep learning practitioners. And the cleaning and data wrangling ends up taking most of the time which could have been planned properly even before the clinical data acquisition.
One could argue that especially for medical data, the annotated data is the new gold, and not just the Big Data scattered all over the place. This is practice translates to efforts to design as intelligent as possible data labelling pipelines for efficient use of expert clinician annotation work.
Alternative download link:
https://www.dropbox.com/s/bbgc21yc86h0t14/Efficient_Ocular_Data_Labelling.pdf?dl=0
The NIH Data Commons - BD2K All Hands Meeting 2015Vivien Bonazzi
Presentation given at the BD2K All Hands meeting in Bethesda, MD, USA in November 2015
https://datascience.nih.gov/bd2k/events/NOV2015-AllHands
Video cast of this presentation:
http://videocast.nih.gov/summary.asp?Live=17480&bhcp=1
talk starts at 2hrs 40min (its about 55mins long) - includes video!
Document describing the Commons : https://datascience.nih.gov/commons
Arabidopsis Information Portal: A Community-Extensible Platform for Open DataMatthew Vaughn
Araport is an innovative model organism database resource that offers users the ability to bring their own visualizations, data sets, algorithms, and genome browser tracks and share them with their colleagues.
Interoperability is the key: repositories networks promoting the quality and ...Pedro Príncipe
Presentation from José Carvalho and Pedro Principe, University of Minho, at ETD 2019 Conference (22nd International Symposium on Electronic Theses and Dissertations), Porto, Nov 7, 2019.
This session provides a comprehensive overview of the latest updates to the Uniform Administrative Requirements, Cost Principles, and Audit Requirements for Federal Awards (commonly known as the Uniform Guidance) outlined in the 2 CFR 200.
With a focus on the 2024 revisions issued by the Office of Management and Budget (OMB), participants will gain insight into the key changes affecting federal grant recipients. The session will delve into critical regulatory updates, providing attendees with the knowledge and tools necessary to navigate and comply with the evolving landscape of federal grant management.
Learning Objectives:
- Understand the rationale behind the 2024 updates to the Uniform Guidance outlined in 2 CFR 200, and their implications for federal grant recipients.
- Identify the key changes and revisions introduced by the Office of Management and Budget (OMB) in the 2024 edition of 2 CFR 200.
- Gain proficiency in applying the updated regulations to ensure compliance with federal grant requirements and avoid potential audit findings.
- Develop strategies for effectively implementing the new guidelines within the grant management processes of their respective organizations, fostering efficiency and accountability in federal grant administration.
ZGB - The Role of Generative AI in Government transformation.pdfSaeed Al Dhaheri
This keynote was presented during the the 7th edition of the UAE Hackathon 2024. It highlights the role of AI and Generative AI in addressing government transformation to achieve zero government bureaucracy
Monitoring Health for the SDGs - Global Health Statistics 2024 - WHOChristina Parmionova
The 2024 World Health Statistics edition reviews more than 50 health-related indicators from the Sustainable Development Goals and WHO’s Thirteenth General Programme of Work. It also highlights the findings from the Global health estimates 2021, notably the impact of the COVID-19 pandemic on life expectancy and healthy life expectancy.
Jennifer Schaus and Associates hosts a complimentary webinar series on The FAR in 2024. Join the webinars on Wednesdays and Fridays at noon, eastern.
Recordings are on YouTube and the company website.
https://www.youtube.com/@jenniferschaus/videos
Donate to charity during this holiday seasonSERUDS INDIA
For people who have money and are philanthropic, there are infinite opportunities to gift a needy person or child a Merry Christmas. Even if you are living on a shoestring budget, you will be surprised at how much you can do.
Donate Us
https://serudsindia.org/how-to-donate-to-charity-during-this-holiday-season/
#charityforchildren, #donateforchildren, #donateclothesforchildren, #donatebooksforchildren, #donatetoysforchildren, #sponsorforchildren, #sponsorclothesforchildren, #sponsorbooksforchildren, #sponsortoysforchildren, #seruds, #kurnool
Jennifer Schaus and Associates hosts a complimentary webinar series on The FAR in 2024. Join the webinars on Wednesdays and Fridays at noon, eastern.
Recordings are on YouTube and the company website.
https://www.youtube.com/@jenniferschaus/videos
Jennifer Schaus and Associates hosts a complimentary webinar series on The FAR in 2024. Join the webinars on Wednesdays and Fridays at noon, eastern.
Recordings are on YouTube and the company website.
https://www.youtube.com/@jenniferschaus/videos
Presentation by Jared Jageler, David Adler, Noelia Duchovny, and Evan Herrnstadt, analysts in CBO’s Microeconomic Studies and Health Analysis Divisions, at the Association of Environmental and Resource Economists Summer Conference.
Preliminary findings _OECD field visits to ten regions in the TSI EU mining r...OECDregions
Preliminary findings from OECD field visits for the project: Enhancing EU Mining Regional Ecosystems to Support the Green Transition and Secure Mineral Raw Materials Supply.
Preliminary findings _OECD field visits to ten regions in the TSI EU mining r...
Show and tell program 04 2014-09-04
1. NIH IC Applications Show & Tell Program
September 4, 2014
Contents Page Show Time
Introduction ......................................................................................................... 8:35
Prevention Abstract Classification Tool (PACT) ................................................ 2 8:45
NIH Library Technology Sandbox ...................................................................... 3 9:05
Low cost display solution for Large screen TVs using Raspberry Pi.................. 4 9:25
CCR Research Exchange (CREx) ........................................................................ 5 9:45
IdeaScale ........................................................................................................... 6 10:05
Amazon Cloud AWS EC2 ................................................................................... 7 10;25
nVision Data Warehouse .................................................................................. 8 10:45
Meeting Registration System - MREGS ............................................................. 9 11:05
Meeting Registration System - MREGS ........................................................... 10 11:25
NLM MeSH on Demand .................................................................................. 11 11:45
2. 1
Introduction
This event is the fourth quarterly Show & Tell event organized by the Enterprise Architecture program of National Heart, Lung, and Blood Institute’s Office of Management, IT and Applications Center (ITAC). We are thrilled to offer this forum for application users and technical teams from across the NIH to share technology that improves how our work is done, and we look forward to transitioning this stewardship activity to a trans-NIH group for the future with the potential for developing separate scientific, administrative, and IT oriented demonstration venues.
Still, while every application may not interest everyone in the NIH, the willingness to share both technologies and the stories behind them is essential for our collective success.
3. 2
App Name:
PREVENTION ABSTRACT CLASSIFICATION TOOL (PACT)
Access URL:
N/A
Docs URL: (if any)
None
IC:
OD/ODP
Submitter Name:
Patricia L. Mabry, Ph.D.
Email Address:
mabryp@od.nih.gov
Phone
(301) 402-1753
Functional Area Where Used:
Extramural
How Acquired?
Custom Development
App IT Contact:
Rich Panzer
Operating Sys:
Windows
DBMS
SQL
Front End:
.NET, HTML, CSS
We intend to:
Just an FYI / other
FUNCTIONALITY PROVIDED
The Prevention Abstract Classification Tool (PACT) facilitates coding of grant applications used in portfolio analysis. ODP is working with NIH Office of Portfolio Analysis (OPA) and Center for Information Technology (CIT) to create a machine-learning based, computerized portfolio analysis tool to characterize the Prevention Research Portfolio. This requires “gold standards” (a training set) for the machine to learn from. “Gold standards” are grant abstracts that have been accurately classified (manually) according to ODP’s Prevention Research Taxonomy. Many thousand gold standards, generated by manual classification (aka coding), are needed for the machine to “learn.” Accurate coding requires a team approach: each abstract is read and coded by three individuals who then discuss and reach a “consensus code” that becomes the gold standard. A 23-page protocol dictates the rules for coding based on the Taxonomy. For a subset of coded abstracts, a second team of three coders recodes the abstracts following the same procedure. The two consensus codes are reconciled to yield a revised gold standard, if indicated. This double coding provides quality control.
PACT was requisitioned by ODP to facilitate the coding of the large number of gold standards needed. PACT is a software tool containing a large database of grant abstracts imported from NIH RePORTER. PACT provides a user interface for coding abstracts in accordance with the Taxonomy and allows coders to seamlessly access relevant parts of the protocol in real time while coding, enhancing accuracy and reliability. PACT captures and stores data on individual and consensus codes. Because there are multiple categories within the Taxonomy to be coded, multiple coders, and multiple teams of coders, PACT calculates 96 inter-rater reliability statistics (Kappas) for each abstract.
A training and testing environment in PACT parallels the “production” environment. Would-be coders use PACT in training mode to learn the Taxonomy and protocol by coding a large number of abstracts selected for their pedagogical value. After training is completed, would-be coders code a test set of abstracts. Kappas are reviewed to ensure that each coder’s test coding is at or above a predetermined threshold for inter-rater reliability when compared to the correct codes. During production, Kappas are used to identify abstracts that need coding by a second team, to identify problem coders, and as indicators of quality control.
ODP and IQ Solutions, Inc. are showing this app to other NIH staff in order to demonstrate the utility of PACT for supporting abstract coding for portfolio analysis. We think others who are doing any type of portfolio analysis that relies on manual coding will be interested in our approach and the software that facilitates this process. Interested parties may contact ODP or IQ Solutions about adapting the approach
4. 3
or tool for their own needs.
Number and Type of Current Users
The current users are ODP staff and staff at IQ Solutions who have been hired by ODP to perform abstract coding.
EXTERNAL DATA USED
External data consists of abstracts imported from NIH RePORTER (IMPAC II database).
ADDITIONAL NOTES
Contact Patricia Mabry, Ph.D. at ODP (mabryp@od.nih.gov) or Rich Panzer at IQ Solutions (rpanzer@iqsolutions.com) for further information about PACT.
App Name:
NIH LIBRARY TECHNOLOGY SANDBOX
Access URL:
http://nihlibrary.nih.gov/Sandbox
Docs URL: (if any)
IC:
NIH Library (ORS)
Submitter Name:
Ben Hope
Email Address:
ben.hope@nih.gov
Phone
301-594-6473
Functional Area Where Used:
Intramural & training programs
How Acquired?
Library Sponsored
App IT Contact:
Ben Hope (NIH Library)
Operating Sys:
n/a
DBMS
n/a
Front End:
n/a
We intend to:
Partner on future development or acquisitions
FUNCTIONALITY PROVIDED
The Technology Sandbox is designed to highlight technology-based projects at the NIH, inspire curiosity and exploration in emerging technologies, and facilitate the development of meaningful partnerships between the NIH Library staff and its customers.
NUMBER AND TYPE OF CURRENT USERS
All of NIH and HHS are welcome
ADDITIONAL NOTES
The Technology Sandbox has a physical presence in the NIH Library in building 10 and a virtual presence on http://nihlibrary.nih.gov/Sandbox
5. 4
App Name:
LOW COST DISPLAY SOLUTION FOR LARGE SCREEN TVS using Raspberry Pi
Access URL:
http://www.raspberrypi.org
Docs URL: (if any)
IC:
NIAID, NIH Library (ORS)
Submitter Name:
Dawei Lin, Verma Walker, Ben Hope
Email Address:
dawei.lin@nih.gov, walkerve@mail.nih.gov, ben.hope@nih.gov
Phone
301-312-3986
Functional Area Where Used:
Intramural & training programs
How Acquired?
Custom development
App IT Contact:
Dawei Lin (NIAID), Verma Walker and Ben Hope (NIH Library)
Operating Sys:
Raspbian
DBMS
Front End:
We intend to:
Partner on future development or acquisitions
FUNCTIONALITY PROVIDED
This project is a use case of the Technology Sandbox presented in March, 2014 by NIH Library. It uses Raspberry Pi, a ~$40 computer, to display rotating static images using Quick Image Viewer (QIV, http://spiegl.de/qiv/) on large screen TVs via HDMI. The project demonstrated that Raspberry Pi is simple yet powerful enough to support major information display functions using large screen TVs.
NUMBER AND TYPE OF CURRENT USERS
NIH Library may use it as a pilot to replace laptop computers to drive large screen displays to save cost and energy use.
ADDITIONAL NOTES
Some of the work was done by a fifth grader, which indicated that such projects might be suitable for future outreach and education activities.
6. 5
App Name:
CCR RESEARCH EXCHANGE (CREX)
Access URL:
https://nci.assaydepot.com
Docs URL: (if any)
Information is available at https://nci.assaydepot.com/pages/faq. CREx is accessible to NIH IRP using active directory credentials.
IC:
CCR/NCI
Submitter Name:
Mariam Malik
Email Address:
malikm@mail.nih.gov
Phone
301-496-2593
Functional Area Where Used:
Intramural
How Acquired?
Custom Development
App IT Contact:
Sherman Tang (Assay Depot)
Front End:
Web interface – developed using Ruby on Rails
We intend to:
Provide this service to others
FUNCTIONALITY PROVIDED
CREx is a “one-stop-shop” for research services available through over 70 CCR/NCI cores and collaborative resources, as well as over 10,000 commercial vendors. Services can be quickly and easily identified using service taxonomy or a Google-like search field, and results refined using available filters to identify vendors with specific certifications or qualifications. The platform also enables communication with multiple cores and vendors simultaneously, to gather capabilities and quotes, exchange files and project reports, through the investigator’s private dashboard. In addition, CREx allows users to review and rate vendors and their services, thus capturing and sharing the organization’s collective consumer experience and knowledge of best practices.
NUMBER AND TYPE OF CURRENT USERS
More than 400 NCI scientists (Principal Investigators, postdoctoral fellows, staff scientists etc.) are using the platform to search and compare research services available through internal NCI cores and external vendors.
EXTERNAL DATA USED
NCI core managers maintain and update information on the CREx backoffice website, while commercial vendor information is maintained and updated by Assay Depot, and used on their public site (assaydepot.com) and for all their private research exchanges.
ADDITIONAL NOTES
Broad adoption throughout the NIH would enhance transparency of services and resources available through the other IRP cores, and facilitate sharing of core resources amongst the NIH Intramural Program. In addition, the platform can enable NIH to benefit both from their collective consumer experience and from savings of research dollars, based on economies of scale or pre-negotiated pricing.
7. 6
App Name:
IDEASCALE
Access URL:
n/a
Docs URL: (if any)
http://www.ideascale.com/
IC:
NHLBI
Submitter Name:
David Phillips
Email Address:
David.Phillips@nih.gov
Phone
301-402-1039
Functional Area
Where Used:
Administrative
How Acquired?
COTS/GOTS Procurement
App Acq Contact:
Katherine Nguyen (CIT Acquisitions)
Operating Sys:
Web- cloud based
DBMS:
Cloud
Front End:
Major browsers
FUNCTIONALITY PROVIDED
Ideation Programs allow stakeholders to generate, develop, rate, communicate, improve and overall engage in a process that leads to innovative ideas to address organizational issues and challenges. There is a growing recognition across many branches of the US Government that better mechanisms are needed for harnessing the collective talent and expertise of agency employees, as well as external stakeholders including the general public, to solve problems. Too often, identifying and connecting innovative thinkers who are most capable of generating truly pioneering solutions can be challenging. IdeaScale is a Cloud-based platform which NHLBI OM ITAC is using for its regular IT Needs Data Call to users across the IC and which our IC Director intends to leverage for a strategic visioning process.
NUMBER AND TYPE OF CURRENT USERS
Several hundred, across all functional areas.
EXTERNAL DATA USED
None
8. 7
App Name:
AMAZON CLOUD AWS EC2
Access URL:
http://pub.ncibbrb.org/ , others
Docs URL: (if any)
http://aws.amazon.com/
IC:
NCI
Submitter Name:
David Tabor
Email Address:
david.tabor@nih.gov
Phone
(301) 402-7087
Functional Area Where Used:
IT
How Acquired?
COTS/GOTS Procurement
App IT Contact:
David Tabor (david.tabor@nih.gov, (301) 402-7087
Operating Sys:
Windows / linux
DBMS
Any, n/a
Front End:
Web-based
We intend to:
Just an FYI / other
FUNCTIONALITY PROVIDED
We have found that using the Amazon Cloud as an adjunct to our in-house Data Center allows us to very rapidly configure and deploy new applications, demo and train for internal and external customers, stand up and configure servers and work out the “kinks.” It also provides a platform for secure collaboration inside and outside the enterprise, and a unique platform for monitoring our public-facing application, http://biospecimens.cancer.gov/brd/
NUMBER AND TYPE OF CURRENT USERS
The version of Biospecimen Research Database (BRD) we have running in the Amazon Cloud is for User Acceptance Testing (UAT). We also have a Windows server that we use to test access to BRD through the firewall.
The BRD is a free and publicly accessible literature database that contains peer-reviewed primary and review articles in the field of human Biospecimen Science. Each entry has been created by a Ph.D. level scientist to capture
(1) relevant parameters that include the biospecimen investigated (type and location, patient diagnosis), preservation method, analyte(s) of interest and technology platform(s) used for analysis;
(2) the pre-analytical factors investigated, including those encountered during the lifecycle of a biospecimen (e.g. ischemia time, fixation parameters, storage conditions); and
(3) an original summary of relevant results.
The Comprehensive Data Resource (CDR) is a distributed web-based bioinformatics system that manages and maintains multi-dimensional data models on biospecimens. The CDR was developed and is currently utilized to collect data on biospecimens obtained from cancer patients and post-mortem donors, for the Biospecimen Pre-analytical Variables (BPV) and Genotype-tissue Expression (GTEx) programs. The CDR is an extranet hosted at the NCI Frederick. We use the Amazon Cloud to host a training server for the CDR, with test data, that does not have to be behind the NIH firewall.
Grand total over 150 users.
EXTERNAL DATA USED
PubMed, numerous scientific journals, UMLS
9. 8
App Name:
NVISION DATA WAREHOUSE
Access URL:
https://nvision.nih.gov/nVision_Portlets/nVision_Launch_Pad/nVision_User_Access.cfm
Docs URL: (if any)
https://nvision.nih.gov/
IC:
CIT-Center for Information Technology
Submitter Name:
Denise Dmuchowski
Email Address:
Denise.dmuchowski@nih.gov
Phone
301-443-4446
Functional Area Where Used:
Administrative
How Acquired?
Custom Development
App IT Contact:
Ryan.wilvert@nih.gov
Operating Sys:
Mix of Microsoft Windows and Unix
DBMS
Oracle 11g
Front End:
Mix of Adobe ColdFusion Web Apps, SAP BusinessObjects, and QlikView Applications (I’m leaving our Hummingbirsd and IBM Cognos)
We intend to:
Provide this service to others
FUNCTIONALITY PROVIDED
nVision is the enterprise information solution that delivers a centralized data repository, desktop analytics and reporting tools to streamline business management at the NIH. nVision gathers data from XX source systems and then applies business rules to over 59,000 business transactions per day. The result is useful, actionable information placed at the fingertips of 4,000 users and available to over 100 key business systems across the NIH. nVision provides information necessary to achieve essential business objectives via standard and custom reports, dashboards, and analytics.
NUMBER AND TYPE OF CURRENT USERS
Administrative management community ~4,000 users
EXTERNAL DATA USED
25 systems feed nVision data
10. 9
App Name:
MEETING REGISTRATION SYSTEM - MREGS
Access URL:
Sample site: http://meetings.nigms.nih.gov/meetings/NIGMSWebaaps/
Docs URL: (if any)
n/a
IC:
NIGMS
Submitter Name:
Alex Naneyshvili
Email Address:
naneyshvilial@nigms.nih.gov
Phone
301-594-2190
Functional Area Where Used:
Administrative
How Acquired?
Custom Development
App IT Contact:
Anjum Dahya 301-435-8575
Operating Sys:
Windows 2003
DBMS
Oracle 11gR2
Front End:
ColdFusion 8
We intend to:
Provide this service to others
FUNCTIONALITY PROVIDED
The NIGMS Meeting Registration System (MREGS) is a web-based tool for organizers of NIGMS- sponsored conferences and meetings held on or off the NIH campus.
Features of MREGS:
Publishes meeting information including agenda and logistics
Provides web-based meeting registration
Allows the meeting organizer to:
manage registration process
generate customized rosters
send e-mails to registrants
NUMBER AND TYPE OF CURRENT USERS
MREGS is currently used by NIGMS, FIC, NINR, OD, and NINDS
EXTERNAL DATA USED
Meeting Registrants’ Data only.
11. 10
App Name:
MEETING REGISTRATION SYSTEM - MREGS
Access URL:
Sample site: http://meetings.nigms.nih.gov/meetings/NIGMSWebaaps/
Docs URL: (if any)
n/a
IC:
NIGMS
Submitter Name:
Alex Naneyshvili
Email Address:
naneyshvilial@nigms.nih.gov
Phone
301-594-2190
Functional Area Where Used:
Administrative
How Acquired?
Custom Development
App IT Contact:
Anjum Dahya 301-435-8575
Operating Sys:
Windows 2003
DBMS
Oracle 11gR2
Front End:
ColdFusion 8
We intend to:
Provide this service to others
FUNCTIONALITY PROVIDED
The NIGMS Meeting Registration System (MREGS) is a web-based tool for organizers of NIGMS-sponsored conferences and meetings held on or off the NIH campus.
Features of MREGS:
Publishes meeting information including agenda and logistics
Provides web-based meeting registration
Allows the meeting organizer to:
manage registration process
generate customized rosters
send e-mails to registrants
NUMBER AND TYPE OF CURRENT USERS
MREGS is currently used by NIGMS, FIC, NINR, OD, and NINDS
EXTERNAL DATA USED
Meeting Registrants’ Data only.
ADDITIONAL NOTES
[Type any additional notes if needed.]
12. 11
App Name:
NLM MESH ON DEMAND
Access URL:
http://www.nlm.nih.gov/mesh/MeSHonDemand.html
Docs URL: (if any)
n/a
IC:
NLM
Submitter Name:
Dan Cho
Email Address:
NLMMESH-MOD@mail.nih.gov
Phone
301-594-2086
Functional Area Where Used:
Intramural
How Acquired?
Custom Development
App IT Contact:
Jim Mork
Operating Sys:
Windows 2003
DBMS
Oracle 11gR2
Front End:
ColdFusion 8
We intend to:
Provide this service to others
FUNCTIONALITY PROVIDED
Identifies MeSH Terms in your text using the NLM Medical Text Indexer (MTI) program. After processing, MeSH on Demand returns a list of MeSH Terms relevant to your text and PMID of top ten related articles.
NUMBER AND TYPE OF CURRENT USERS
Peak users at 150,000+ in first month after rollout
EXTERNAL DATA USED
Meeting Registrants’ Data only.
ADDITIONAL NOTES
For more information about MeSH on Demand, please see our NLM Technical Bulletin article and more detail here.