alignment tax
English
Etymology
First attested in a 2019 speech by computer scientist Paul Christiano (see quote), who attributed the idea to AI researcher and writer Eliezer Yudkowsky.
Noun
alignment tax (plural alignment taxes)
- (artificial intelligence) A cost to the capabilities of an artificial intelligence incurred by aligning it with human ethics and morality. [from late 2010s]
- 2019 August 29, Paul Christiano, Current work in AI alignment[1], EA Global San Francisco 2019:
- I like this notion of an "alignment tax" […] the reason I might compromise is if there's some tension, between having the AI that's robustly trying to do what I want, and having the AI that is competent or intelligent, and the alignment tax is intended to capture that gap—that cost that I incur if I insist on alignment.