The group’s work falls into three main categories: (1) Infrastructure or Platforms for Science-Oriented Analytics, (2) Advanced Computational Techniques and Algorithms, and (3) Foundational Capabilities. Together, these contribute to developing computationally enabled measurements with trust in computing and support for high-throughput instruments built in by design.
Infrastructure or Platforms for Science-Oriented Analytics
- CDCS (Configurable Data Curation System) focuses on data registries, the curation of document-oriented data, and the provision of persistent IDs. It provides web interfaces for retrieving and querying data, including text-oriented search, and is being augmented with advanced tools, inspired by Natural Language Processing (NLP), to provide semantic search.
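The semantic-search idea can be illustrated by ranking curated records by vector similarity between a query and each record's text. The sketch below is a toy illustration only (the record IDs and texts are made up): it uses term-frequency vectors and cosine similarity, whereas a production system would use learned embeddings from an NLP model such as BERT.

```python
import math
from collections import Counter

def embed(text):
    # Toy "embedding": a term-frequency vector over lowercase tokens.
    # A real semantic-search system would use a learned model (e.g., BERT).
    return Counter(text.lower().split())

def cosine(a, b):
    # Cosine similarity between two sparse term-frequency vectors.
    num = sum(a[t] * b[t] for t in set(a) & set(b))
    den = math.sqrt(sum(v * v for v in a.values())) * \
          math.sqrt(sum(v * v for v in b.values()))
    return num / den if den else 0.0

# Hypothetical curated records, keyed by made-up persistent IDs.
docs = {
    "rec-001": "optical microscopy image of a cell colony",
    "rec-002": "neutron imaging of a fuel cell stack",
    "rec-003": "curated metadata for additive manufacturing builds",
}

def search(query, docs):
    # Return record IDs ranked by similarity to the query, best first.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(docs[d])), reverse=True)

print(search("microscopy image", docs))  # best match first
```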
- WIPP (Web Image Processing Pipeline) focuses on image analytics over terabyte-sized image collections running on distributed computational hardware (clusters and clouds). It provides web interfaces for managing and viewing images or subsets of images, and for the traceable and reproducible processing of images via workflows of software containers from a WIPP registry.
Advanced Computational Techniques and Algorithms
- Image Analytics—the group is developing approaches that combine conventional feature engineering with techniques rooted in Artificial Intelligence/Deep Learning (AI/DL) for analyzing a variety of image types: optical microscopy, electron microscopy, Cryogenic Electron Microscopy (Cryo-EM) images, neutron images, etc. Several of these image types go beyond 3D by adding a time dimension (T) or multiple channels (C) and can be very large (approaching 1 TB).
- Text Analytics—the group is applying Natural Language Processing (NLP) techniques and language models (e.g., BERT) to analyze curated scientific publications and answer more sophisticated queries than traditional Information Retrieval (IR) systems can. The group is also helping to define an emerging subdomain of NLP, Technical Language Processing (TLP), which aims to tackle text-related problems in technical domains with limited data availability (e.g., maintenance logs).
- Algorithmic Acceleration—the group is continuing its development of specialized algorithms with reduced operation counts in areas including Monte Carlo sampling for Molecular Dynamics, mixed- and reduced-precision computation, and stochastic algorithms.
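The trade-off behind mixed-precision computation can be shown with a small, hypothetical example (not the group's actual algorithms): accumulating a long float32 sum in a float32 register loses accuracy, while keeping the data in float32 but accumulating in float64 recovers it at little extra cost.

```python
import numpy as np

# One million copies of 0.1 stored in single precision.
x = np.full(1_000_000, 0.1, dtype=np.float32)

# Naive single-precision accumulation: every addition rounds to float32,
# and the error grows as the running sum dwarfs each new term.
s32 = np.float32(0.0)
for v in x:
    s32 += v

# Mixed precision: the data stays in float32, but the accumulator is
# float64, so per-step rounding error is far smaller.
s64 = float(x.sum(dtype=np.float64))

# The exact sum of the stored float32 values is ~100000.0015.
print(s32, s64)  # s32 drifts far from 100000; s64 stays close
```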
Foundational Capabilities
- Artificial Intelligence/Deep Learning for Imaging and NLP—most group members have become quite proficient in the use of DL tools. The group now uses DL approaches as a foundational building block to solve problems in multiple domains: imaging across multiple modalities, text, specialized signal processing, and computer security (trojan detection). Furthermore, the group is collaborating with groups in several NIST OUs (EL, MML, NCNR) to apply AI/DL techniques to OU-specific problems, has made code available to NIST researchers for automating training on AI-oriented hardware resources at NIST, and has given AI-related presentations to NIST researchers.
The group is administering a public competition to detect Trojans (hidden classes) in AI Deep Learning models (Neural Networks) on behalf of IARPA.
- Scalability & Performance—the group continues to extend its work on Program Execution Models for obtaining high performance in a range of applications. This work identified Data Flow Graphs as a promising execution model that makes it easy to take advantage of accelerators (e.g., GPUs). The group released Hedgehog, a library and runtime system for implementing Multi-threaded Asynchronous Data Flow Graphs on high-end single compute nodes, along with FastLoader, a companion library for the multithreaded asynchronous reading of large objects from files (e.g., very large images), to simplify the development of performance-oriented applications. The group has used this execution model to develop performance-oriented applications (e.g., analysis of very large microscopy images [100K x 50K pixels]).
At a conceptual level, the group is cooperating with a University of Utah research team, led by Prof. Martin Berzins, to extend this programming model beyond a single compute node so that it applies to a cluster. The group is also exploring Ray Tracing as a programming model to accelerate and simplify particle transport simulations, which are of interest to multiple NIST OUs.
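The data-flow execution model can be sketched, at a much smaller scale, as stages connected by queues that run concurrently and forward results downstream as soon as they are produced. Hedgehog itself is a C++ library with its own API; the Python sketch below, with made-up stage functions, illustrates only the execution-model idea, not Hedgehog's interface.

```python
import queue
import threading

STOP = object()  # sentinel marking end-of-stream

def stage(fn, inbox, outbox):
    # Each stage runs in its own thread, consuming tokens as they arrive
    # and emitting results immediately (multithreaded, asynchronous).
    while True:
        item = inbox.get()
        if item is STOP:
            outbox.put(STOP)
            break
        outbox.put(fn(item))

def run_pipeline(items, *fns):
    # Build a linear data-flow graph: one queue between consecutive stages.
    qs = [queue.Queue() for _ in range(len(fns) + 1)]
    threads = [threading.Thread(target=stage, args=(fn, qs[i], qs[i + 1]))
               for i, fn in enumerate(fns)]
    for t in threads:
        t.start()
    for item in items:
        qs[0].put(item)
    qs[0].put(STOP)
    results = []
    while True:
        out = qs[-1].get()
        if out is STOP:
            break
        results.append(out)
    for t in threads:
        t.join()
    return results

# Hypothetical two-stage pipeline, e.g., "load tile" then "process tile";
# both stages run concurrently as data streams through.
tiles = run_pipeline(range(5), lambda i: i * 10, lambda x: x + 1)
print(tiles)
```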
- Trustworthy Computing—the group is developing or extending approaches to enhance trust in computing in three areas: (a) numerical reproducibility—by associating a numerical uncertainty with a computed result; (b) explainable AI for OMICS problems—by combining simulations of neural networks, interactive visualizations of sequencing data, and perturbation-based metrics for AI models; (c) reproducible image analysis—by organizing imaging computations as reproducible workflows using containers and tracking data and result provenance.
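One way to illustrate the idea of attaching a numerical uncertainty to a computed result (a hypothetical sketch, not necessarily the group's actual method) is to re-evaluate a floating-point sum under several random operand orderings and report the spread of the results as a rough reproducibility bound:

```python
import random

def sum_with_uncertainty(values, trials=20, seed=0):
    # Evaluate the sum under several random operand orderings; the spread
    # of the results exposes order-dependent floating-point rounding.
    rng = random.Random(seed)
    results = []
    for _ in range(trials):
        perm = values[:]
        rng.shuffle(perm)
        s = 0.0
        for v in perm:
            s += v
        results.append(s)
    center = sum(results) / len(results)
    spread = max(results) - min(results)
    return center, spread

# A contrived, ill-conditioned input: the exact sum is 200, but adding
# 1.0 to a running sum near 1e16 is absorbed by rounding, so different
# orderings produce visibly different results.
vals = [1e16, 1.0, -1e16, 1.0] * 100
center, spread = sum_with_uncertainty(vals)
print(center, spread)
```

Reporting the result as `center ± spread` makes the computation's numerical sensitivity explicit instead of hiding it behind a single value.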