ERef Bayreuth

Anmelden

Literatur vom gleichen Autor/der gleichen Autor*in

bei Google Scholar

Bibliografische Daten exportieren

Explaining AI through mechanistic interpretability

Titelangaben

Kästner, Lena ; Crook, Barnaby:
Explaining AI through mechanistic interpretability.
In: European Journal for Philosophy of Science. Bd. 14 (2024) Heft 4 . - 52.
ISSN 1879-4920
DOI: https://doi.org/10.1007/s13194-024-00614-4

Volltext

Link zum Volltext (externe URL):

Angaben zu Projekten

Projekttitel:	Offizieller Projekttitel Projekt-ID Open Access Publizieren Ohne Angabe
Projektfinanzierung:	VolkswagenStiftung

Abstract

Recent work in explainable artificial intelligence (XAI) attempts to render opaque AI systems understandable through a divide-and-conquer strategy. However, this fails to illuminate how trained AI systems work as a whole. Precisely this kind of functional understanding is needed, though, to satisfy important societal desiderata such as safety. To remedy this situation, we argue, AI researchers should seek mechanistic interpretability, viz. apply coordinated discovery strategies familiar from the life sciences to uncover the functional organisation of complex AI systems. Additionally, theorists should accommodate for the unique costs and benefits of such strategies in their portrayals of XAI research.

Weitere Angaben

Publikationsform:	Artikel in einer Zeitschrift
Begutachteter Beitrag:	Ja
Keywords:	AI; ANN; Deep learning; Discovery; Explanation; Mechanistic; interpretability; XAI
Institutionen der Universität:	Fakultäten Fakultäten > Kulturwissenschaftliche Fakultät Fakultäten > Kulturwissenschaftliche Fakultät > Institut für Philosophie Fakultäten > Kulturwissenschaftliche Fakultät > Institut für Philosophie > Lehrstuhl Philosophie, Informatik und Künstliche Intelligenz Fakultäten > Kulturwissenschaftliche Fakultät > Institut für Philosophie > Lehrstuhl Philosophie, Informatik und Künstliche Intelligenz > Lehrstuhl Philosophie, Informatik und Künstliche Intelligenz - Univ.-Prof. Dr. Lena Kästner Forschungseinrichtungen > Zentrale wissenschaftliche Einrichtungen > Research Center for AI in Science and Society Forschungseinrichtungen Forschungseinrichtungen > Zentrale wissenschaftliche Einrichtungen
Titel an der UBT entstanden:	Ja
Themengebiete aus DDC:	100 Philosophie und Psychologie > 100 Philosophie
Eingestellt am:	08 Mär 2025 22:00
Letzte Änderung:	28 Nov 2025 10:05
URI:	https://eref.uni-bayreuth.de/id/eprint/92739