===== Critical Perspectives on Large Language Models (LLM) such as ChatGPT ===== **Half-day Input** Z-Modul [[https://intern.zhdk.ch/?vorlesungsverzeichnis&semester_id=183609&search=kohli&course_id=251118|Kunst und Künstliche Intelligenz]] by Andreas Kohli, 12-17.02.2023 **The eternal hype cycle of tech?** * self-driving cars, [[https://www.nytimes.com/2015/03/20/business/elon-musk-says-self-driving-tesla-cars-will-be-in-the-us-by-summer.html|2015]] * Blockchain, [[https://www.pwc.com/gx/en/industries/technology/publications/blockchain-report-transform-business-economy.html|2019]] * AI Sentience, 2023 **Main Sources:** * Bender, Emily M., Timnit Gebru, Angelina McMillan-Major, and Shmargaret Shmitchell. 2021. “On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜.” In [[https://doi.org/10.1145/3442188.3445922|Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency]], 610–23. Virtual Event Canada: ACM. \\ “we understand the term language model (LM) to refer to systems which are trained on string prediction tasks: that is, predicting the likelihood of a token (character, word or string) given either its preceding context or (in bidirectional and masked LMs) its surrounding context.” (Bender et al., 2021, p. 611) \\ resource use \\ bias (gender, class, language, geography) \\ false narrative (coherence is not sentience) \\ "LMs are not performing natural language understanding (NLU), and only have success in tasks that can be approached by manipulating linguistic form” (Bender et al., 2021, p. 610) * Bender, Emily. 2022. [[https://www.youtube.com/watch?v=wuU-5rGPbyg|Resisting dehumanization in the age of AI.]] Talk at CogSci: Interdisciplinary Study of the Mind (07.29), 62 Min * Mozilla Internet Health Report. 2022. [[https://2022.internethealthreport.org/facts/|Who Has Power Over AI?]] * [[https://wiki.zhdk.ch/fs/lib/exe/detail.php?id=what_kind_of_ai_do_we_want&media=biases.png|Avoidable and Unavoidable Bias in the AI Pipeline]] by Felix Stalder, 2022 **Training Data for ChatGPT** * Colossal Clean Crawled Corpus * [[https://commoncrawl.org/the-data/|Common Crawl]] Data Set * Dodge, Jesse, Maarten Sap, Ana Marasović, William Agnew, Gabriel Ilharco, Dirk Groeneveld, Margaret Mitchell, and Matt Gardner. 2021. “[[https://doi.org/10.48550/ARXIV.2104.08758|Documenting Large Webtext Corpora: A Case Study on the Colossal Clean Crawled Corpus.]]” arxiv.org **Simple example of bias in machine translation:** * [[https://www.deepl.com/translator#en/de/doctors%20and%20nurses%20work%20at%20hospitals|Assumption of gender]], statistical vs linguistic view **Exploring Bias by artistic means** [[https://wwwwwwwwwwwwwwwwwwwwww.bitnik.org/sor|Mediengruppe Bitnik, State of Reference]], 2017 **Group Work** * create groups of ~4 people * play around with ChatGPT to document a specific bias /missinformation (30 minutes) * brainstorm an application by which to make use of this bias/missinformation at scale (30 minutes) * Make a short (5-minute) presentation on the bias and it application for (ab)use