NIST Special Database 2
NIST Structured Forms Reference Set of Binary Images (SFRS)
Effective immediately, a minimum $30.00 shipping charge applies to all international shipments of databases via UPS International. Customers are responsible for their own duties, taxes, and VAT. Contact 301 975 2200 or email@example.com if you have questions.
The NIST Structured Forms Database consists of 5,590 pages of binary, black-and-white images of synthesized documents.
The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE.
Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database.
The document images in this database appear to be real forms prepared by individuals, but the images have been automatically derived and synthesized using a computer.
There are 900 simulated tax submissions represented in the database averaging 6.2 form faces per submission. This significant new database totals approximately 5.9 gigabytes of uncompressed image data including image format documentation and example software.
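The figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and the per-image size assumes decimal gigabytes and treats the whole 5.9 GB as image data, so it is at best a rough upper estimate.

```python
# Figures taken from the database description.
FORM_TYPES = 12          # distinct tax forms and schedules
TWO_FACE_FORMS = 8       # forms that have two pages (form faces)
TOTAL_PAGES = 5590       # binary page images in the database
SUBMISSIONS = 900        # simulated tax submissions
TOTAL_BYTES = 5.9e9      # assumption: "gigabytes" means 10**9 bytes


def form_faces(form_types, two_face_forms):
    """Each form contributes one face; two-face forms contribute one more."""
    return form_types + two_face_forms


def faces_per_submission(total_pages, submissions):
    """Average number of form faces per simulated submission."""
    return total_pages / submissions


def bytes_per_image(total_bytes, total_pages):
    """Rough average uncompressed size of one page image, in bytes."""
    return total_bytes / total_pages


if __name__ == "__main__":
    print("form faces:", form_faces(FORM_TYPES, TWO_FACE_FORMS))
    print("faces per submission:",
          round(faces_per_submission(TOTAL_PAGES, SUBMISSIONS), 1))
    print("approx. MB per image:",
          round(bytes_per_image(TOTAL_BYTES, TOTAL_PAGES) / 1e6, 2))
```

Under these assumptions the counts agree with the description: 12 forms plus 8 second faces gives 20 form faces, and 5,590 pages over 900 submissions averages about 6.2 faces each, with each uncompressed image on the order of one megabyte.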
The database has the following features:
System Requirements: CD-ROM drive with software to read ISO-9660 format.
A PDF version of the Users' Guide is available.
For more information on Special Database 2 please contact:
The scientific contact for this database is:
Keywords: ASCII Reference, automated character recognition, automated data capture, Binary Image Database, forms identification, image format documentation, IRS, NIST, Machine Print, OCR, optical character recognition, printed characters, software recognition, synthesized documents, tax forms