User Guide for NIST Media Forensic Challenge (MFC) Datasets

Haiying Guan; Andrew Delgado; Yooyoung Lee; Amy Yates; Daniel Zhou; Timothée N. Kheyrkhah; Jonathan G. Fiscus

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

PUBLICATIONS

User Guide for NIST Media Forensic Challenge (MFC) Datasets

Published

July 6, 2021

Author(s)

Haiying Guan, Andrew Delgado, Yooyoung Lee, Amy Yates, Daniel Zhou, Timothée N. Kheyrkhah, Jonathan G. Fiscus

Abstract

NIST released a set of Media Forensic Challenge (MFC) datasets developed in DARPA MediFor (Media Forensics) project to the public in the past 5 years. More than 300 individuals, 150 organizations, from 26 countries and regions worldwide use our datasets for their research. During this period, we received many questions from users about the datasets. The questions vary from what the datasets contain, what is the definition of different kinds of metadata, how to use them, to how to modify/extract the data for their own research purposes, etc. This document aims to provide a systematical user guide for MFC datasets to answer those questions. In this document, first, an introduction about the MFC is given. Second, to provide users a deep understanding of what data and metadata are collected and how they are used in MFC datasets, a simple example with the manipulation journal graph is shown. The corresponding MFC dataset reference files of the given example are explained in detail. Third, the historical evolution of MFC datasets is introduced, and the releasable MFC datasets are summarized. Two commonly used approaches to illustrate the manipulation operation histogram of a dataset are presented. Moreover, a flexible approach for the special subset selection, selective scoring, is introduced to sample the test probes from the full test set for special task evaluation. Finally, suggestions to use MFC datasets beyond of MFC evaluations are presented for user interests.

Citation

NIST Interagency/Internal Report (NISTIR) - 8377

Report Number

8377

NIST Pub Series

NIST Interagency/Internal Report (NISTIR)

Pub Type

NIST Pubs

Download Paper

https://doi.org/10.6028/NIST.IR.8377

Local Download

Keywords

Media Forensics, NIST Media Forensic Challenge (MFC) Evaluation, Journaling Tool (JT), Manipulation journal graph, Image Manipulation Detection, Image Manipulation Localization, Manipulation Localization Reference Mask, JPEG 2000, Manipulation Reference Ground-truth, Localization Mask, DARPA MediFor (Media Forensic) program

Video analytics, Information retrieval, Forensic Science, Digital evidence and Artificial intelligence

Citation

Guan, H. , Delgado, A. , Lee, Y. , Yates, A. , Zhou, D. , Kheyrkhah, T. and Fiscus, J. (2021), User Guide for NIST Media Forensic Challenge (MFC) Datasets, NIST Interagency/Internal Report (NISTIR), National Institute of Standards and Technology, Gaithersburg, MD, [online], https://doi.org/10.6028/NIST.IR.8377, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=931782 (Accessed May 8, 2026)

Additional citation formats

Issues

If you have any questions about this publication or are having problems accessing it, please contact [email protected].

Created July 6, 2021, Updated October 14, 2021

Was this page helpful?