An official website of the United States government
Here’s how you know
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
Secure .gov websites use HTTPS
A lock (
) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.
NIST Special Database 2. NIST Structured Forms Reference Set of Binary Images (SFRS)
Published
Author(s)
Michael Garris
Abstract
The NIST Structured Forms Database consists of 5,590 pages of binary, black-and-white images of synthesized documents.This database is a valuable tool for measurement of system performance and system comparison on complex forms.The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real forms prepared by individuals, but the images havebeen automatically derived and synthesized using a computer. There are 900 simulated tax submissions represented in the database averaging 6.2 form faces per submission. This significant new database totals approximately 5.9 gigabytes of uncompressed image data including image format documentation and example software.
Citation
World Wide Web-Internet and Web Information Systems
Pub Type
Journals
Keywords
ASCII reference, automated character recognition, binary image database, forms identification, image format documentation, IRS, OCR, printed characters, synthesized documents
Citation
Garris, M.
(2008),
NIST Special Database 2. NIST Structured Forms Reference Set of Binary Images (SFRS), World Wide Web-Internet and Web Information Systems
(Accessed December 27, 2024)