Addressing misclassification costs in machine learning through asymmetric loss functions

Bryan Barnes; Mark-Alexander Henn

Published

April 27, 2023

Author(s)

Bryan Barnes, Mark-Alexander Henn

Abstract

Background: Patterning defect metrology requires data interpretation and classification, each well suited to machine learning (ML). Defect classification, however, carries notable misclassification costs: mislabeling a defect as nominal has greater impact than the converse. Aim: Though quantified costs are not publicly available, total economic misclassification cost (total cost) is optimized across orders-of-magnitude variation in the cost ratio C and across classification thresholds 0.01 < τ < 0.99. Approach: Convolutional neural networks are trained using the intrinsically weighted and scaled asymmetric focal losses (AFL, sAFL) with hyperparameter γ; networks trained with the weighted and unweighted binary cross-entropy (wBCE, BCE) functions serve as comparisons. Optimal functions and conditions are identified for reducing total cost. For reproducibility, publicly available ML data sets serve as surrogates for industrial imaging data. Results: For these data, the sAFL minimizes total cost at τ = 0.5, C ≥ 16. The AFL reduces total cost at 0.1 ≤ τ < 0.5, C > 128. Asymmetric loss functions lower total cost versus wBCE by 15 % to 40 % for 0.2 < τ < 0.5, C > 64. Conclusions: Total economic misclassification cost can be tailored using asymmetric focal losses. Estimations are presented to allow the extension of reported trends to industrial applications with strong class imbalances between defect-indicative and nominal-indicative data.
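The quantities named in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the paper: total cost is modeled as C times the number of missed defects plus the number of false alarms, `weighted_bce` uses a single hypothetical defect-class weight, and `asymmetric_focal` is only one possible asymmetric variant of the focal loss, not the authors' AFL or sAFL definition. The toy labels and scores are made up.

```python
import math


def total_cost(labels, scores, tau, cost_ratio):
    """Total misclassification cost in units of one false-alarm cost.

    labels: 1 = defect, 0 = nominal; scores: predicted defect probability.
    Assumed cost model: a missed defect (false negative) costs
    `cost_ratio` times as much as a false alarm (false positive).
    """
    fn = sum(1 for y, p in zip(labels, scores) if y == 1 and p < tau)
    fp = sum(1 for y, p in zip(labels, scores) if y == 0 and p >= tau)
    return cost_ratio * fn + fp


def weighted_bce(y, p, w_defect=1.0, eps=1e-12):
    """Binary cross-entropy with an extra weight on the defect class."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(w_defect * y * math.log(p) + (1 - y) * math.log(1.0 - p))


def asymmetric_focal(y, p, gamma=2.0, eps=1e-12):
    """Illustrative asymmetric focal loss (hypothetical form).

    Defect samples keep the full cross-entropy term; nominal samples are
    scaled by p**gamma, so confidently nominal samples contribute little.
    """
    p = min(max(p, eps), 1.0 - eps)
    if y == 1:
        return -math.log(p)
    return -(p ** gamma) * math.log(1.0 - p)


# Toy data, invented for this example only.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.4, 0.2, 0.6, 0.3, 0.1, 0.05, 0.45]

# Sweep the cost ratio C and report the cheapest threshold tau for each.
for C in (1, 16, 128):
    taus = [0.1, 0.3, 0.5, 0.7]
    best = min(taus, key=lambda t: total_cost(labels, scores, t, C))
    print(f"C={C}: best tau={best}, cost={total_cost(labels, scores, best, C)}")
```

On this toy data, raising C moves the cheapest threshold toward lower τ, because missed defects come to dominate the total. That shift is the kind of trade-off the abstract describes when it scans C and τ together.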

Proceedings Title

Proceedings Volume 12496, Metrology, Inspection, and Process Control XXXVII

Volume

12496

Conference Dates

February 26-March 2, 2023

Conference Location

San Jose, CA, US

Conference Title

SPIE 2023 Advanced Lithography + Patterning

Pub Type

Conferences

Download Paper

https://doi.org/10.1117/12.2662027

Keywords

asymmetric loss functions, asymmetric focal loss, goal-oriented metrics, defect metrology, binary classification, machine learning, convolutional neural networks, scaled asymmetric focal loss

Process measurement and control, Nanofabrication / manufacturing and Machine learning

Citation

Barnes, B. and Henn, M. (2023), Addressing misclassification costs in machine learning through asymmetric loss functions, Proceedings Volume 12496, Metrology, Inspection, and Process Control XXXVII, San Jose, CA, US, [online], https://doi.org/10.1117/12.2662027, https://tsapps.nist.gov/publication/get_pdf.cfm?pub_id=936548 (Accessed April 23, 2025)

Created April 27, 2023, Updated May 7, 2023
