Teaching AI when to care about gender
Journal Article
·
· Code4Lib Journal
OSTI ID:1885750
- Los Alamos National Lab. (LANL), Los Alamos, NM (United States)
Natural Language Processing (NLP) is a branch of Artificial Intelligence (AI) concerned with solving language tasks by modeling large amounts of textual data. Some NLP techniques use word embeddings which are semantic models where machine learning (ML) is used to learn to cluster semantically related words by learning about word co-occurrences in the original training text. Unfortunately, these models tend to reflect or even exaggerate biases that are present in the training corpus. Here we describe the Word Embedding Navigator (WEN), which is a tool for exploring word embedding models. We examine a specific potential use case for this tool: interactive discovery and neutralization of gender bias in word embedding models, and compare this human-in-the-loop approach to reducing bias in word embeddings with a debiasing post-processing technique.
- Research Organization:
- Los Alamos National Laboratory (LANL), Los Alamos, NM (United States)
- Sponsoring Organization:
- USDOE
- Grant/Contract Number:
- 89233218CNA000001
- OSTI ID:
- 1885750
- Report Number(s):
- LA-UR-22-27833
- Journal Information:
- Code4Lib Journal, Journal Name: Code4Lib Journal Vol. 54; ISSN 1940-5758
- Publisher:
- code4lib.orgCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
LEARNING SEMANTICS-ENHANCED LANGUAGE MODELS APPLIED TO UNSUEPRVISED WSD
PhysBERT: A text embedding model for physics scientific literature
Computationally Efficient Learning of Quality Controlled Word Embeddings for Natural Language Processing
Conference
·
Sun Jan 28 23:00:00 EST 2007
·
OSTI ID:985889
PhysBERT: A text embedding model for physics scientific literature
Journal Article
·
Mon Oct 28 20:00:00 EDT 2024
· APL Machine Learning
·
OSTI ID:2564799
Computationally Efficient Learning of Quality Controlled Word Embeddings for Natural Language Processing
Conference
·
Mon Jul 01 00:00:00 EDT 2019
·
OSTI ID:1545208