Skip to main content
U.S. flag

An official website of the United States government

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Adapting natural language processing for technical text

Published

Author(s)

Alden A. Dima, Sarah Lukens, Melinda Hodkiewicz, Thurston Sexton, Michael Brundage

Abstract

Despite recent dramatic successes, Natural Language Processing (NLP) is not ready to address a variety of real-world problems. Its reliance on large standard corpora, a training and evaluation paradigm that favors the learning of shallow heuristics, and large computational resource requirements, makes domain-specific application of even the most successful NLP techniques difficult. This paper discusses Technical Language Processing (TLP) which brings engineering principles and practices to NLP specifically for the purpose of extracting actionable information from language generated by experts in their technical tasks, systems, and processes. TLP envisages NLP as a socio-technical system rather than algorithmic pipeline. We describe how the TLP approach to meaning and generalization differs from that of NLP, how data quantity and quality can be addressed in engineering technical domains, and the potential risks of not adapting NLP for technical use cases. Engineering problems can benefit immensely from the inclusion of knowledge from unstructured data, currently unavailable due to issues with out of the box NLP packages. We illustrate the TLP approach by focusing on maintenance in industrial organizations as a case-study.
Citation
Applied AI Letters

Keywords

tural language processing, technical language processing, technical data, maintenance records, domain adaptation

Citation

Dima, A. , Lukens, S. , Hodkiewicz, M. , Sexton, T. and Brundage, M. (2021), Adapting natural language processing for technical text, Applied AI Letters, [online], https://doi.org/10.1002/ail2.33, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=931810 (Accessed November 21, 2024)

Issues

If you have any questions about this publication or are having problems accessing it, please contact reflib@nist.gov.

Created June 29, 2021, Updated October 14, 2021