Return to search

A Contribution to the Philosophy of Artificial Intelligence and Chatbot Communication

The purpose of this thesis is to attempt to answer the question why chatbot computer programs are often not very good at communicating in human natural language. It is argued that one possible reason why chatbots are often not good conversationalists is because they model communication in terms of only encoding and decoding processes. Human communication, however, involves making inferences about the mental states of others.
Chapter one begins by exploring a popular theory about how communication works called the code model of communication. The code model describes human communication as having to do with speakers encoding thoughts into utterances and listeners decoding utterances to recover a representation of the thought that the speaker wanted to communicate. A variation on the code model is also explained; this is referred to as the information-theoretic code model. Two arguments against the code model will then be presented. Finally, an alternative to the code model is considered, called the ostensive-inferential model of communication.
Chapter two begins with an explanation of how chatbots work. Chatbots are made up of several different components. The language model component provides chatbots with the ability to produce and interpret utterances. Next, an explanation of how language models work, and how chatbots can represent the meanings of words is provided. The chapter concludes by documenting the fact that chatbots communicate using only encoding and decoding processes – that is, that chatbots communicate within the paradigm of the code model of communication.
Chapter three explains how the fact that chatbots communicate using only encoding and decoding processes can help explain why chatbots often cannot communicate effectively in human natural language. The poor conversational abilities of chatbots are a result of the fact that chatbots only access linguistic context, whereas listeners need access to non-linguistic context to be able to grasp utterance meaning. The question of whether chatbots are able to make inferences about non-linguistic properties of context at all is also considered. It is argued that they cannot, precisely because the neural language models that they rely upon for their linguistic competence are natural codes that merely associate percepts with output behaviours using encoding and decoding processes. / Thesis / Master of Philosophy (MA) / The purpose of this thesis is to explore the question why chatbot computer programs are often not very good at communicating in human natural language. It is argued that one possible reason why chatbots are often not good conversationalists is because they model communication in terms of only encoding and decoding processes. Human communication, however, involves making inferences about the mental states of others.

Identiferoai:union.ndltd.org:mcmaster.ca/oai:macsphere.mcmaster.ca:11375/27066
Date January 2021
CreatorsRaman, Siddharth
ContributorsLapointe, Sandra, Philosophy
Source SetsMcMaster University
LanguageEnglish
Detected LanguageEnglish
TypeThesis

Page generated in 0.0067 seconds