@inproceedings{levy-etal-2024-gender,
    title = "Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts",
    author = "Levy, Sharon and
      Adler, William and
      Karver, Tahilin Sanchez and
      Dredze, Mark and
      Kaufman, Michelle R",
    editor = "Al-Onaizan, Yaser and
      Bansal, Mohit and
      Chen, Yun-Nung",
    booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-emnlp.331/",
    doi = "10.18653/v1/2024.findings-emnlp.331",
    pages = "5777--5800",
    abstract = "Large language models (LLMs) acquire beliefs about gender from training data and can therefore generate text with stereotypical gender attitudes. Prior studies have demonstrated model generations favor one gender or exhibit stereotypes about gender, but have not investigated the complex dynamics that can influence model reasoning and decision-making involving gender. We study gender equity within LLMs through a decision-making lens with a new dataset, DeMET Prompts, containing scenarios related to intimate, romantic relationships. We explore nine relationship configurations through name pairs across three name lists (men, women, neutral). We investigate equity in the context of gender roles through numerous lenses: typical and gender-neutral names, with and without model safety enhancements, same and mixed-gender relationships, and egalitarian versus traditional scenarios across various topics. While all models exhibit the same biases (women favored, then those with gender-neutral names, and lastly men), safety guardrails reduce bias. In addition, models tend to circumvent traditional male dominance stereotypes and side with {\textquotedblleft}traditionally female{\textquotedblright} individuals more often, suggesting relationships are viewed as a female domain by the models."
}
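For reference, a minimal sketch of reading the BibTeX record above in Python, assuming the third-party bibtexparser package (v1 API) and a hypothetical local copy of the entry; field names follow the record itself:

    # Minimal sketch: load the BibTeX record above and pull out a few fields.
    import re
    import bibtexparser  # third-party package; v1 API assumed

    with open("levy-etal-2024-gender.bib") as f:  # hypothetical filename
        db = bibtexparser.load(f)

    entry = db.entries[0]        # the single @inproceedings record above
    print(entry["ID"])           # levy-etal-2024-gender
    print(entry["title"])
    print(entry["pages"])        # 5777--5800
    # "author" is one string; names are separated by "and", possibly across lines
    authors = re.split(r"\s+and\s+", entry["author"])
    print(authors)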
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="levy-etal-2024-gender">
    <titleInfo>
      <title>Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Sharon</namePart>
      <namePart type="family">Levy</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">William</namePart>
      <namePart type="family">Adler</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Tahilin</namePart>
      <namePart type="given">Sanchez</namePart>
      <namePart type="family">Karver</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Mark</namePart>
      <namePart type="family">Dredze</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Michelle</namePart>
      <namePart type="given">R</namePart>
      <namePart type="family">Kaufman</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2024-11</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Findings of the Association for Computational Linguistics: EMNLP 2024</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Yaser</namePart>
        <namePart type="family">Al-Onaizan</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Mohit</namePart>
        <namePart type="family">Bansal</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Yun-Nung</namePart>
        <namePart type="family">Chen</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Miami, Florida, USA</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Large language models (LLMs) acquire beliefs about gender from training data and can therefore generate text with stereotypical gender attitudes. Prior studies have demonstrated model generations favor one gender or exhibit stereotypes about gender, but have not investigated the complex dynamics that can influence model reasoning and decision-making involving gender. We study gender equity within LLMs through a decision-making lens with a new dataset, DeMET Prompts, containing scenarios related to intimate, romantic relationships. We explore nine relationship configurations through name pairs across three name lists (men, women, neutral). We investigate equity in the context of gender roles through numerous lenses: typical and gender-neutral names, with and without model safety enhancements, same and mixed-gender relationships, and egalitarian versus traditional scenarios across various topics. While all models exhibit the same biases (women favored, then those with gender-neutral names, and lastly men), safety guardrails reduce bias. In addition, models tend to circumvent traditional male dominance stereotypes and side with “traditionally female” individuals more often, suggesting relationships are viewed as a female domain by the models.</abstract>
    <identifier type="citekey">levy-etal-2024-gender</identifier>
    <identifier type="doi">10.18653/v1/2024.findings-emnlp.331</identifier>
    <location>
      <url>https://aclanthology.org/2024.findings-emnlp.331/</url>
    </location>
    <part>
      <date>2024-11</date>
      <extent unit="page">
        <start>5777</start>
        <end>5800</end>
      </extent>
    </part>
  </mods>
</modsCollection>
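The same metadata can be extracted from the MODS record using only the Python standard library; a minimal sketch, assuming the XML above is saved locally under a hypothetical filename (the namespace is the one declared in the record):

    # Minimal sketch: read title, authors, and DOI from the MODS record above.
    import xml.etree.ElementTree as ET

    NS = {"m": "http://www.loc.gov/mods/v3"}  # namespace from the record

    tree = ET.parse("levy-etal-2024-gender.xml")  # hypothetical filename
    mods = tree.getroot().find("m:mods", NS)

    title = mods.find("m:titleInfo/m:title", NS).text

    # Authors are <name> elements whose roleTerm is "author"; editors live
    # under <relatedItem>, so searching direct children of <mods> skips them.
    authors = []
    for name in mods.findall("m:name[@type='personal']", NS):
        role = name.find("m:role/m:roleTerm", NS)
        if role is not None and role.text == "author":
            parts = [p.text for p in name.findall("m:namePart", NS)]
            authors.append(" ".join(parts))

    doi = mods.find("m:identifier[@type='doi']", NS).text

    print(title)
    print("; ".join(authors))
    print(doi)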
%0 Conference Proceedings
%T Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts
%A Levy, Sharon
%A Adler, William
%A Karver, Tahilin Sanchez
%A Dredze, Mark
%A Kaufman, Michelle R.
%Y Al-Onaizan, Yaser
%Y Bansal, Mohit
%Y Chen, Yun-Nung
%S Findings of the Association for Computational Linguistics: EMNLP 2024
%D 2024
%8 November
%I Association for Computational Linguistics
%C Miami, Florida, USA
%F levy-etal-2024-gender
%X Large language models (LLMs) acquire beliefs about gender from training data and can therefore generate text with stereotypical gender attitudes. Prior studies have demonstrated model generations favor one gender or exhibit stereotypes about gender, but have not investigated the complex dynamics that can influence model reasoning and decision-making involving gender. We study gender equity within LLMs through a decision-making lens with a new dataset, DeMET Prompts, containing scenarios related to intimate, romantic relationships. We explore nine relationship configurations through name pairs across three name lists (men, women, neutral). We investigate equity in the context of gender roles through numerous lenses: typical and gender-neutral names, with and without model safety enhancements, same and mixed-gender relationships, and egalitarian versus traditional scenarios across various topics. While all models exhibit the same biases (women favored, then those with gender-neutral names, and lastly men), safety guardrails reduce bias. In addition, models tend to circumvent traditional male dominance stereotypes and side with “traditionally female” individuals more often, suggesting relationships are viewed as a female domain by the models.
%R 10.18653/v1/2024.findings-emnlp.331
%U https://aclanthology.org/2024.findings-emnlp.331/
%U https://doi.org/10.18653/v1/2024.findings-emnlp.331
%P 5777-5800
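The block above is the Endnote/refer tagged format, one %-prefixed field per line; a minimal line-based reader sketch, again assuming a hypothetical local copy of the record:

    # Minimal sketch: collect the %-tagged fields above into a dict, keeping
    # repeatable tags (%A authors, %Y editors, %U urls) as ordered lists.
    from collections import defaultdict

    fields = defaultdict(list)
    with open("levy-etal-2024-gender.enw") as f:  # hypothetical filename
        for line in f:
            line = line.strip()
            if line.startswith("%") and len(line) > 2:
                tag, value = line[:2], line[3:]
                fields[tag].append(value)

    print(fields["%T"][0])   # title
    print(fields["%A"])      # authors, in order
    print(fields["%P"][0])   # pages: 5777-5800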
Markdown (Informal)
[Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts](https://aclanthology.org/2024.findings-emnlp.331/) (Levy et al., Findings 2024)
ACL
Sharon Levy, William Adler, Tahilin Sanchez Karver, Mark Dredze, and Michelle R. Kaufman. 2024. Gender Bias in Decision-Making with Large Language Models: A Study of Relationship Conflicts. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 5777–5800, Miami, Florida, USA. Association for Computational Linguistics.