Speech perception

A big focus on this blog has been centered around understanding and producing speech, but something that I have ignored up until this point is how speech is perceived. Speech perception is focused on hearing, decoding and interpreting speech. As we will see today, our brains are often not as reliable as we might think.

Photo by shutter_speed on Pexels.com

So rather than just turn this into a lecture about speech perception and the multitude of theories behind it (let’s face it, this is an educational blog, not a university course) I am just going to show off something weird and wild that our brains do and talk a little bit about the mechanics behind it. Alright, so raise your hand if you have heard of the McGurk effect. (Oh wait, sorry. Blog, not lecture)

The McGurk effect is an auditory illusion where certain speech sounds are miscategorized and misheard based on a conflict in what we are hearing versus what we are seeing. We can see this in action by watching the short video below.

So what is actually going on here? The audio that is being played in all three of those clips is exactly the same. You are hearing the same speaker say “ba ba” over and over. But when the audio is played over a video of someone mouthing “da da” or “va va” we are able to hear it as those instead.

Well as it turns out, this illusion provides positive evidence for something called the motor theory of perception. This theory argues that people perceive speech by identifying how the sounds were articulated in the vocal tract as opposed to solely relying on the information that the sound contains.

This motor theory is supported by something like the McGurk effect because we are taking this audio information and supplementing it with what we are visually observing in the video in order to decide what is being said. It also explains why it is easier to hear someone in a crowded or noisy setting if you can look at their mouth and watch them speak as opposed to not being able to see their mouth.

But it’s not as though we are following along with what people are saying by moving our own articulators or imagining how their mouths are moving while we are listening to them. Supporters of the motor theory argue for the process with specialized cells in our brains known as mirror neurons.

A mirror neuron is a specialized neuron in the brain that activates (or fires if you prefer) in two different conditions, it will activate when the individual performs an action and it will also activate when an individual observes another performing the same action. In speech, this would mean the same part of your brain that activates when you move your mouth to produce a “ba” sound will also activate when you watch someone else produce a “ba” sound.

With this knowledge in mind, it should be easier to see why we are able to get something like the McGurk effect to occur. If perception of speech is influenced by visual information, and we are observing someone producing a sound that is activating these mirror neurons, it makes sense that our perceptions might change slightly so that what we are hearing matches what we are seeing.

Photo by meo on Pexels.com

It is important to note that, as I mentioned earlier, this is not the only theory of speech perception that we have right now, and the motor theory is not without its flaws. It relies on a persons ability to produce the sounds themselves. According to the motor theory, if you were unable to produce the sound yourself, and you could not visually see how the speaker was articulating the sound, you should not be able to perceive it.

So what about prelinguistic infants? An infant who has not developed the ability to speak yet should not be able to perceive the difference between a “ba” and a “da” without visual assistance because acoustically these sounds are quite similar.

Some studies have used a novel methodology where the infant will suck on a specialized soother of sorts that will measure the rate at which they are sucking. Using this soother and presenting the infants with audio stimuli through a speaker (no visual input), they have found that presenting infants with new and novel stimuli causes them to suck faster and presenting them with familiar stimuli means that they will suck at a slower rate.

So, by presenting these infants with a series of “ba ba ba” followed by a sudden change to “da da da” will result in an increased sucking rate. These findings are contradictory to the motor theory of speech perception because the infants in this study are too young to speak on their own and their articulators are not refined enough to be able to produce both a “ba” and “da” sound. Because the infants cannot produce these sounds at this point, their mirror neurons would not activate because they would not have developed fully yet.

This is not to say that the motor theory of perception is wrong though. The fact that we are able to perceive the McGurk effect means that their must be some truth to it. It just calls into question whether this theory captures the whole story. This is something that almost every science deals with at some point. There is almost never a perfect explanation or theory that deals with every problem. If you look hard enough, there will be counter evidence to almost any theory, but it becomes a matter of refining theories as we learn more and more about the way that the world works.

There are many other theories of speech perception that have their own explanations and their own problems. I will likely return to discuss some of the other big ones such as Exemplar theory, but for now I think this is a good place to leave this one.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

The anatomy of speech

Photo by Pixabay on Pexels.com

Have you ever thought about how you talk? I don’t just mean the way that you say certain words, or maybe the fact that you slur your words after a few too many drinks. I mean HOW you talk. The anatomy of the mouth and the way that your tongue does such quick and precise movements is truly fascinating. I also want to issue a pre-emptive apology because if you are anything like me, after reading this you will spend way too much time being aware your tongue. But enough of the preamble, let’s just get into it.

If you think about it for too long, tongues are just gross muscular things in our mouths. We use them when we eat to move food around in our mouths and to get food that was trapped between our teeth free, and of course they are primarily responsible for tasting. An often underappreciated function of tongues is their involvement in speech. This is not to say that tongues are essential for all speech, but they play a major part in the formation of both consonant and vowel sounds.

For reference, of the 23 English consonants in the International Phonetic Alphabet, only 7 of them do not directly involve the tongue. But this is just a little taste of what is to come. For now, lets talk about all of the things we need to classify a sound. When it comes to identifying a sound, there are three things we need to consider: voicing, place of articulation, and manner of articulation.

Voicing is not something that involves the tongue at all, but it is something that we have talked about previously. As a reminder, voiced sounds are produced with your vocal folds being held close together so they vibrate when air passes through them. You can feel this in a word like “zit” by placing your fingers on your neck as you say it. Compare this to a word like “sit” which has a voiceless sound at the beginning. Voiceless sounds are produced by keeping your vocal folds spread open so that there is no vibration.

Moving up from the vocal folds, let’s get back to the tongue. We will begin talking about the tongue by discussing the different places of articulation. The places of articulation are mostly self explanatory with names like inter-dental (between the teeth) and bilabial (involving both lips), but the one we will discuss first deals with the “s” and “z” sounds we have discussed previously. These sounds are classified as alveolar sounds, meaning that they are articulated with the tongue at a place in your mouth known as the alveolar ridge. The alveolar ridge is just behind your upper front teeth and if you feel around with your tongue, you can feel a small protuberance where the roof of your mouth raises slightly. The diagram below shows a mid sagittal cross section of an oral cavity which shows the alveolar ridge, and all of the other places of articulation in the mouth.

Places of articulation

Not all of these places involve the tongue as we previously discussed. All bilabial sounds like “b”, “p” and “m” are produced with only the lips and the tongue is not involved at all. Sounds like “f” and “v” combine two articulators (the teeth and the lips) to produce sound and these are known as labiodental sounds which, again do not use the tongue.

Before we move onto the manner of articulation, I want to talk about “r” for a second. “r” is a unique sound in English because it can be produced in two different ways depending on how you move your tongue. So now is when I ask you, are you a buncher or a curler?

To figure out whether you are a buncher or a curler, there is a simple test you can do. Go grab a toothpick or something similar that you are comfortable putting in your mouth and just poke your tongue as you are producing an “r” sound. If the thing you are poking is the bottom of your tongue, congratulations, that means you are a curler. If you are poking the top, then also congratulations, you are a buncher.

It turns out that an “r” sound can either be produced by curling your tongue tip back toward the rear of your mouth, or by just bunching up your tongue blade toward the back of the tongue. It is important for speech language pathologists to know about this so they can be prepared to teach techniques for both of them. There is no advantage or disadvantage to either technique, bunchers and curlers can both produce “r” sounds just fine.  This is just a weird quirk of our bodies that we can observe.

Now back to the different sounds. Let’s talk about manner of articulation. Manner of articulation deals with the finer aspects of the tongue and how it directly impacts the airflow in the oral cavity. For example, lets return to the “s” and “z” alveolar sounds that we talked about earlier. These sounds are known as fricatives because they are produced by having the tongue very close to the place of articulation, but not touching it so that there is a small amount of space between them that results in a small amount of frication in the airflow, hence the name.

So now think about a sound like “t” or “d”. These sounds are both alveolar sounds as well, but they are produced by having the tongue touching the place of articulation and momentarily stopping the airflow entirely. Unsurprisingly, these are called stops. Now what about a sound like an “n” or an “m”. When you produce these sounds, you are producing them like you would a stop, but you can feel a little bit of reverberation in your sinus as you are producing them. These sounds are nasal stops, and they are produced by lowering the velum at the back of your oral cavity which allows the air to flow into your nasal cavity and resonate like that.

The amazing thing about all these actions is that they are not things that you actively need to think about to do. In fact, you probably put zero thought into how this works until you read this post. Our bodies can do all of this effortlessly and automatically.

As always, this is just a brief overview. We don’t have time to get into all the different places and manners of articulation. We will likely return to talk about more unique language sounds (like clicks), but for now, I think this is a good place to leave it.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

When you can’t see the sentence for the trees

Syntax is the greatest subfield of linguistics and I say this as a syntactician with absolutely zero bias (wink wink). The field of syntax cares about the ordering of words in a sentence, and the operations that took place to create the word order known as the derivation. The thing that I love about syntax is that is basically a series of math and logic problems. We can take a sentence and work backwards from it to learn how it was constructed. Now before everyone panics about the fact that I am trying to equate math and language lets all just take a deep breath and I will walk us through how a syntactic derivation works.

Photo by Brandon Montrone on Pexels.com

Before we start talking about full sentences though, we need to start a little bit smaller. We will start by talking about verbs. A verb, as you already know, is an action word that tells us what happened, what is happening, or what will happen depending on the tense used. Think of the verb in a sentence as the conceptual seed of a sentence. There are three major verb types that we will talk about today that are separated based on how many arguments they have. These verb types are transitive, intransitive, and ditransitive.

When you picture a basic sentence with a subject, a verb, and an object, you are likely picturing a transitive verb. Transitive verbs are verbs that have two obligatory arguments (the subject and the predicate). What this means is that the information contained in a sentence with a transitive verb includes: an action (the verb itself), the thing that is doing the action (the subject), and the thing that the action is happening to (the predicate). For instance, you can take the verb ‘discuss’ as a good example of a transitive verb. ‘Discuss’ needs at least two arguments (nouns in this case) in order to create a grammatical sentence. This means that you are no able to say something like “John discussed.” or “Discussed the contract.”, you would need to say “John discussed the contract” (Note that you can say “Discuss the contract!”. This is a null subject imperative though and we will talk about those another time).

So, to bring the math connection back around; one way that we can represent sentences like “John kicked the ball.” is by using parentheses. It would look something like this:

                [John [discussed [the contract]]]

Now I am simplifying things quite a bit from how it would actually be represented, but for the purposes of this article it’s good enough. Let’s break down the bracketing so you can see why it is organized like this. In the innermost layer, we have “the contract” which is the thing that is being discussed. This is why it is contained within the bracketing for the verb “discuss” because there is a connection between these two words. On the outermost layer, we have “John”, who is the one that is doing the discussing. “John” has the verb contained within his bracketing because he has the same sort of connection to the act of discussing that the action of discussing does to the contract. Again, this is simplifying things a bit, but I am just trying to explain why we have this embedded bracketing as opposed to something like [John][discussed][the contract] where there is no clear connection between any of the words.

Now, lets branch out to the other verb types. Intransitive verbs are verbs which only have one obligatory argument rather than two. Intransitive verbs are a little bit tricky because they are actually divided into two subtypes, unergatives and unaccusatives. Full disclosure; I constantly get tripped up on the difference between these two because it is subtle and a little unintuitive, but it is my hope that maybe teaching people about it on the internet will also help to clear it up in my head!

The biggest difference between these two verbs deal with whether their subject is semantically an agent or not. If you kick something or hit something, then YOU are a semantic agent in that case because (it is safe to assume) you are doing those things intentionally. Conversely, if you fall, it is not likely that you are doing this on purpose so we can say that you are not the agent in this case but rather that you are the experiencer; the one who experiences the fall.

Unergative intransitive verbs are single argument verbs that only have an obligatory subject (which is a semantic agent) and no obligatory object. I have used the word obligatory twice in this sentence to really drive home the fact that there is a difference between an object that needs to be there and one that does not. Take the sentence “John ate the cake” for example. This sentence has two arguments, “John” and “the cake”, but the fact that it has two arguments does not make it transitive. It is perfectly grammatical and acceptable to simply say “John ate” and leave “the cake” off of it. This is because “the cake” is an optional argument in this sentence, or what we call an ADJUNCT argument in syntax terms. Adjuncts are optional arguments that can be removed without making the sentence ungrammatical.

So we see that a verb like “eat” is a great example of an unergative verb because you only need to specify the thing that is doing the eating, and you are not required to specify the thing that is eaten. Other good examples are things like “run” or “walk” because they are things that require intent and agency to do, but there is no need to specify where you are running or walking to. You can simply just specify that movement is occurring and leave it at that.

An unaccusative intransitive verb is a verb whose subject is not a semantic agent. The best example of an unaccusative sentence would be something like “the tree fell” or “the window broke” because things like trees and windows do not have any agency and are certainly not falling or breaking on their own accord. It is also important to note that unaccusative verbs cannot have any type of object after the verb.

Both unergative and unaccusative verb sentences have simple bracket representations like this:
                [John [ran]]

                [The tree [fell]]

The third type of verb that we will talk about is the ditransitive verb. A ditransitive verb is a verb that requires three arguments (two nouns and a preposition usually) in order to be grammatical. Take the verb “put” for instance. You can’t use “put” as a transitive verb and simply say something like “John put the book”, you also need to specify where the book was put! This is why we would need to make it “John put the book on the table” instead. The labeled bracketing for these ditransitives gets a little more complicated:

                [John [put [the book][on the table]]]

Now keep in mind that these are just simple sentences, but you can imagine as they get bigger and more complex that the labeled bracketing will become very hard to read. Luckily, syntacticians have figured out a more visually pleasing way to represent these sentences that serves the same purpose. Allow me to introduce you to the sentence tree:

John put the book on the table

These trees are drawn with a program known as LaTeX, which is a typesetting used in many scientific and academic settings. If you haven’t worked with LaTeX before, it is hard to describe but it is essentially a halfway point between typing and programming. For example, for this sentence tree above I had to provide LaTeX with a command to draw the tree, and that command makes use of the bracketing that I have been talking about all along. Here is the command that I used:

\documentclass[12pt]{article}
\usepackage{qtree}

\begin{document}

\Tree [ John [ put [{the book} {on the table} ] ] ]

\end{document}

So essentially, this command is taking the same information that is contained in the bracketing and turning it into a visual representation that allows us to easily see how all of the parts connect without having to count all of the brackets by hand.

Now my biggest fear at this point is that my supervisor will somehow stumble across this post and think less of me for these trees. It is at this point I will remind you that there is SOOOOOOOO MUCH that I am glossing over. I am just trying to give a brief overview of the things that I do so that people like my mom will have a better understanding of some of the things that I do. There are entire four-month university courses dedicated to almost all of the subjects that I talk about on this blog so I can’t talk about them all in detail, but if people are interested, I will certainly keep writing more about it. I am a syntactician at heart and I could go on about this stuff until I pass out, but I don’t want to keep you here forever either.

This is a logical stopping point for this one, but I am sure that I will be returning to syntax again in the future so keep an eye out for that. Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

What makes a language? (Part 4)

For the past few weeks, I have been discussing the design features of language, and how we can use them to compare our communication systems to those of animals. Today we will discuss the final language design features and how they set us as humans apart from all the other animals.

First, let’s talk about the idea of discreteness. Discreteness is the idea that spoken language can be broken down into individual parts that can be combined in many different ways to create many words and phrases. If you say a word like “pan”, all you are really doing is saying a combination of a “p” sound, an “a” sound, and an “n” sound to make up the word. These same sounds also exist in other words with completely different meaning. The “p” sound also shows up at the end of the world “rip” while the “a” sound can show up at the beginning of “apple” and all three of these sounds show up in different places of a word like “eggplant”!

Photo by Toa Heftiba u015einca on Pexels.com

The point here is that we have infinite ways to combine these same sounds to make infinitely complex words and phrases. Like we discussed previously with recursion, there is no limit to the length and complexity of human language other than our own mortality.

This design feature also pairs nicely with the next feature, duality of patterning. Duality of patterning is defined as the ability to combine these meaningless individual units of sounds into something meaningful. In human language, we can create a sort of hierarchy of meaning so to speak. We can start at the bottom with individual sounds for instance. Like we have already discussed, these sounds can be combined into words, which we can use to form sentences and it just builds up from there.

There is a basic unit of meaning though at some stage between a sound and a word that we haven’t really talked about, and that unit is called a morpheme. A morpheme is a combination of sounds that carries some sort of a meaning to it. Some words can be made up of several morphemes and some words stand alone as a single morpheme. We know that morphemes are different than words because there are some morphemes that cannot stand on their own.

Take the word “unrecoverable” for instance. This word is composed of three morphemes; the base word “recover”, the suffix “-able”, and the prefix “un-“. These three morphemes have their own meaning that they contribute to the word, but of the three of them, only “recover” can stand on it’s own. You cannot use the word “able” on it’s own… well… you can, but it doesn’t quite carry the same meaning as it does when used as a suffix of a word. The stronger example in this case is the prefix “un-“. A prefix like this absolutely not stand on it’s own. There is an entry for the word “un” in the Oxford English Dictionary, but it is listed as rare and the only two cited instances of it are being used to refer to refer to several “un” prefixed words.

All of this is to say that we have evidence for combining units in human language in meaningful ways, but this does not show up to the same degree in animal communication. Of course, there is evidence of animals using the same sounds repeatedly such as bird calls, but we do not have evidence that they are combining smaller units in unique ways to form new, novel meanings like we do as humans.

And speaking of novel meaning, this leads us nicely into our final feature which is productivity and creativity. This feature is self explanatory, but it is extremely important. This feature is the main thing that allows something like this blog to exist! I mean think about it. Everything that I am saying here is from the top of my head. The actual ideas and concepts are not brand new. Like I mentioned in part 1, have been discussed since the 1960’s. Even though they are not my own original concepts, I am still able to create new sentences and find unique ways to express these concepts that have never been used before.

Every single day, you are being creative with language. You are saying things that have likely never been said before. You are expressing old concepts in new ways. It is not as if you are creating new sounds and using them in unique ways either. You are using the same sounds over and over in different ways to make new sentences. We can do so much as humans with what seems like a finite language. Our ability to utilize all of these features that I have discussed over these past four weeks is what sets us apart.

At this point, we simply don’t have the evidence to support animals being creative in the same way that we are. Sure, animals might produce unique sounds from time to time, but there is no way for any other animal in that species to innately understand what they mean by that new sound the same way we as humans can.

For example, let’s take a completely made-up verb like “flup”. Now let’s assign a meaning to this verb. Let’s say that “to flup” means to hide an object under your desk. With this verb, we can start to describe objects around us according to their flupability. I mean, a car is certainly not a very flupable object, but a paper clip is quite flupable. This word does not exist. I completely made this up, but we can have intuitions about it and use existing meaningful morphemes to do creative things with it!

This has been a very long and drawn-out series. I feel like I have really broken form on these posts several times and turned into something that is much to “lectury” for my taste (more creativity!). Once I started part 1 though, I kinda had to finish it so I really want to thank everyone for sticking it out with me. Everything will return the casual format starting next week I promise, so be sure to come back then so we can talk about more fun language facts. In the meantime, remember to speak up and give linguists more data.

What makes a language? (Part 3)

For the past few weeks, I have been discussing the design features of language, and how we can use them to compare our communication systems to those of animals. So far, we have discussed features that are present in both human and animal communication. Today, we will start talking about a final feature that is present in humans and animals before discussing some features that are only present in human communication or have very limited examples in animal communication.

The first feature to discuss today is the idea of tradition and cultural transmission. This feature states that communication is passed down from the elder generations of a species. This is not widely spread in animal communication, but it does exist in primates and birds. The alarm calls that we have discussed previously in primates like capuchin monkeys are often specific to a specific troop and not the species as a whole. This means that the way the calls were formed needed to be passed down from the elders of the troop to the younger generation. These predator calls fit the definition of cultural transmission because we do not have any evidence of a primate raised in isolation developing it’s own system of calls.

In birds, the songs that they produce have a cultural transmission aspect to them. Each individual bird does have its own unique voice, but the songs that are produced by birds raised in isolation will often be vastly different from other birds of the same species. This means that similar to our language, it needs to be passed down from generation to generation in these birds. This does not quite completely fit the definition of this design feature though because even birds raised in isolation will still develop some sort of birdsong for mating purposes. In humans, it is impossible to learn language without interacting with other humans and having it passed down in some way.

Photo by Pixabay on Pexels.com

We know that cultural transmission is necessary in humans because if you remove an infant from interactions with all humans, that infant will have an extremely difficult time learning language at a later date. Ethically, we are not able to prove this in an experimental setting, but we do have some evidence that this is true in some extreme cases. The most famous example of this that is widely discussed in psychology and linguistics literature is the case of Genie.

Genie is the pseudonym of a child discovered in 1970 at the age of 13 who suffered from extreme neglect and abuse. At the time of her discovery, Genie had little to no language skills, which made Genie a highly sought-after case study by linguists and psychologists at the time. After time and training, Genie was able to learn limited language and social skills, but was never able to fully acquire a first language in the same way that most children do.

The first publicly released picture of Genie, taken in 1970, just after authorities took control of her care at the age of 13 – From Wikipedia

Cases like Genie, while tragic and extremely horrible, have done a lot to inform language acquisition research. The case of Genie provides evidence of the importance of transferring language from one generation to the next because without it, humans are incapable of acquiring a first language when left to their own devices. Compare this to cases of animals that are raised in isolation away from other members of their species who are able to fully develop a complete vocal range that is indistinguishable from a member of that species raised around others.

The next feature we will talk about today is learnability. Learnability means that the learner of a communication variant can learn to communicate using other variants. In the case of us humans, this consists of our ability to learn another language. As a speaker of English, I am able to learn a language like German or Mandarin with little difficulty and eventually I would be able communicate to native speakers of those language with no problem.

This does not mean it is easy to learn another language, but it is possible for humans. Compare this to the animal kingdom where we have no evidence of an animal learning to communicate with another species using a communication system that is not their own. You could try to make an argument for gorillas learning sign language in certain cases, but as we will see in some of these other features, they are not truly learning the system. The most famous example of this is Koko the gorilla who knew more than 1000 unique signs and was able to combine them in novel ways to communicate. The biggest thing that prevents us from saying that Koko fully learned the system is the fact that there was an upper limit to her knowledge.

Koko in December 2015 – From Wikipedia

Although Koko knew over 1000 signs, that is only a tenth of the estimated total amount of signs in American Sign Language. In addition to this, there is also the ability to finger spell words which do not have their own unique sign yet, meaning that there is an infinite number of ways that ASL users are able to communicate ideas. This is not to diminish the impressive knowledge that Koko had, but it is inaccurate to claim that she was a full and fluent user of sign language as we know it.

Another design feature that sets us apart from animals is known as recursion. Recursion is the ability to use a communication system to make infinitely large utterances through a process known as self embedding. This is the feature that allows us to craft infinitely long sentences and ideas by embedding clauses within clauses within clauses.

Human ideas are not limited by anything other than our own mortality and lung capacity. The best way to see this is add onto a sentence to make it infinitely long. Take the worlds longest sentence recognized by the Guinness Book of World Records, a sentence that is 1287 words long in William Faulkner’s novel ‘Absalom, Absalom!’ This sentence is excessive, but we can make it a bit longer by simply saying something like “Faulkner wrote” and then quoting the entire sentence. And now you could imagine infinite iterations of adding on a word here and there to extend this sentence out until the end of time.

The final feature we will discuss today is reflexiveness. Reflexiveness is the ability that we have as humans to use our communication to talk about our communication system. This one is a little bit heady and meta, but it is the core design feature that allows a field like linguistics to exist!

The fact that we are able to use the English language to make observations about the English language and then communicate those observations to another person using the English language is quite amazing once you take a step back and look at it from the outside. No animals are able to use their own communication system to talk about their communication system (or at least we don’t have evidence of it). It is unlikely that we are simply missing something in animals because it does require a sophisticated level of intelligence.

So it is plain to see that these features definitely set us apart from other animals. There are still four more to talk about though, so come back next week when we finally conclude this series on the design features of language!

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week when we start to talk about the uniquely human design features. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

What makes a language (part 2)

Last week I started talking about some of the design features of language, and how they do or do not apply to various types of animal communication. Today, we will talk about a few more of these features that are present in both human and animal communication.

As you may recall, these design features are what set apart human communication from the communication of other animals. If you haven’t already, I suggest you go and read the first part of this series as I give a much better explanation there. For now, let’s dive back into some more design features.

Photo by Pixabay on Pexels.com

The design feature known as arbitrariness means that there is no logical or intrinsic connection between a sound or word and the thing that it refers to. Now it’s important to note that a language doesn’t need ALL of the words to be arbitrary, but the vast majority of them should be. Some sounds and words that we have in English are closely related to, or attempted imitations of sounds, like onomatopoeias. Onomatopoeias like “meow” or “bark” are our attempts to develop words that sound like the sounds that cats and dogs produce. This is not to say that onomatopoeias are not valid words or not actually a part of our language (because they are), I am just trying to provide an example of non-arbitrary things.

Sign language also has a large amount of non-arbitrary signs. For instance, the sign for “house” in American Sign Language is a moving sign where the hands make a shape that resembles what we imagine a simple house to look like. Again, this design feature does not say that we cannot have iconic or logically connected words or signs, simply that there is a large amount of them that have no such connection.

Take the word “chair” for instance. A chair is typically a four-legged piece of furniture with a flat seat to sit on and a back to rest against. There is nothing about the word “chair” that contains any sort of information which may indicate anything I just told you about a chair. Because of this, we can say that this word is arbitrary.

There are concepts and ideas that have been studied within linguistics that seem to challenge parts of this design feature. Let’s run through a quick example. Look at the two shapes in the picture below and tell me which one is called “kiki” and which one is called “bouba”.

Drawing by Quinn Goddard

You probably guessed the one on the left was called a kiki and the one on the right was called bouba. This touches on the idea of sound symbolism. I won’t go too deep on this concept (I need something for when this series is over), but essentially it boils down to the fact that sounds that are rounder like an “o” or noisy like a “b” are associated with rounded shapes. Conversely, sounds that are sharp like an “I” or a “k” are often associated with pointy objects.

This idea of sound symbolism turns out to inform the design of some things in our world, but that’s a story for another week. The larger point is that this doesn’t mean that words that use sound symbolism are not arbitrary still, there just might be things we can predict based on how a word sounds sometimes.

Another design feature is known as prevarication. Prevarication is the ability for someone to lie or deceive another using their communication system. As humans, we are certainly aware of our ability to lie or to tell stories about things that do not exist. This should not come as a surprise unless you are stuck in the world of Ricky Gervais’s 2009 film “The Invention of Lying” where the ability to lie has not been invented yet.

Lying and deception is something that exists in the animal kingdom, although it is not as widespread as it is in humans. We can look at the behaviour of capuchin monkeys to see a good example of this.

Capuchin monkeys are small, nimble monkeys that use alarm calls to alert members of their troupe to predators that they may not be able to see. This is a common thing in smaller primates, but a study published in 2009 found some interesting behaviour in capuchin monkeys.

Photo by Charl Durand on Pexels.com

The study found that capuchin monkeys who are weaker or lower in the dominance hierarchy will use false alarm calls to make their stronger counterparts run away and drop high quality food sources rather than fight them for it. This behaviour is a good example of prevarication because they are knowingly using their communication system to deceive others. Obviously, this is not as broad as a humans ability to lie or tell a false story, but it is lying none the less.

The final feature we will talk about today is displacement. Displacement is the ability that we have as humans to talk about things that are not physically present, or event to talk about things that do not exist at all. This is one of the key features that can separate us from primate species that can only communicate about things that are within their eyesight.

There are some animals that can communicate about things that they cannot see. If you check out part one of this series, I talked about the honey bee and how the foragers are able to perform dances to let others know where the food is. This is a very good example of displacement because it is clearly a communication system that the other forager bees can understand, and it deals with a food source that is not visible from where the bees are (which is why they need this dance in the first place).

Photo by mostafa eissa on Pexels.com

The three design features discussed today are all present in both human and animal communication systems, but next week we will start to talk about features that are only present in human communication. Being able to draw these parallels between animals and humans can help us understand the intelligence and capabilities of these animals, while also helping us understand what sets us apart from them.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week when we start to talk about the uniquely human design features. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

What makes a language? (Part 1)

Now that I am over two months into this blog about linguistics and language, I suppose it would be nice to define what exactly a language is. Even more specifically, what makes a natural human language, and what can we learn about the complexity of our own languages by looking at the language of animals.

Photo by Nirav Shah on Pexels.com

I will start like most average university papers do; Websters dictionary defines language as “the words, their pronunciation, and the methods of combining them used and understood by a community” as in “I studied the French language”. This definition is entirely serviceable, but we can break down the concept of what makes a language even further by introducing some more terminology and definitions. Specifically, what characteristics of human language can differentiate them from non-human language.

These characteristics I am alluding to are known as ‘language design features’. These design features were first defined in the 1960’s by a linguist named Charles F. Hockett and are what set apart human communication from animal communication. There are a total of 13 design features that he initially proposed, which is far too many to talk about in one post, so this will be split up into at least three parts (this is what we call milking for content).

It is worth noting that these are not necessarily all characteristics that are unique to human language. Many forms of animal communication posses some of these features, but as we will see over the next few weeks, there is no form of animal communication that possesses ALL of these features. So, let’s begin with some of the shared characteristics, and in later weeks we will talk about what makes human language unique.

The first feature we will talk about is specialization. Specialization means that the communication system is only used for communication and nothing else. In other words, we do not use our speech for any other biological purposes than for communication. This is not something that is unique to humans though. Many animals such as dogs also communicate with specialized systems. When a dog barks or howls, this serves the purpose of communicating their location to other animals or to communicate that something is in their territory.

While it may seem like many small dogs will bark just for the fun of it, they are still communicating in some way. For a counter example of specialization in dogs though, we can look at an action like panting. When you see a dog panting on a hot day, you know that the dog is hot, but this action of panting is not only a way of communicating that they are hot. It also serves as a method of sweating and trying to cool themselves off. Because it also serves some biological function to the animal, we can say that this is not a specialized system.

Photo by Thirdman on Pexels.com

In humans, speech is simply speech. We only use it communicate a message to others. We do not get any biological benefit or other functions from speech so we can say that this is indeed a specialized system for us.

The next feature is that of interchangeability. Interchangeability means that all members of a species are capable of both sending and receiving messages. This is certainly something that we as humans have. It is not just restricted to spoken languages. Sign language is also a natural human language, so even people who are unable to hear or speak can satisfy this feature by communicating in sign language.

As you can imagine, many animals who communicate in some way also meet this feature. In the majority of animals, all members of the species will be capable of roaring, meowing, or hooting to their hearts content. There are cases where this feature does not show up for all members of a species though.

An often studied and fascinating communication method (which will come up a few times in this series) is the bee dance performed by forager bees which serves to communicate the location of food to others in the hive. This particular phenomenon (also known as the waggle dance) is a complex figure eight like pattern that these forager bees will move in to share with others what the angle and distance of a particular food source is located in relative to the sun. It is a complex, multistage form of communication that we will be returning to when discussing other design features, but for the sake of this one it is important to simply note that it is only performed by the forager bees for other foragers.

Photo by mostafa eissa on Pexels.com

Forager bees make up a subset of the honeybee species and serve a different function in the bee ecosystem from other subsets like drones. Both drones and foragers are members of the honeybee species though, and since the drones are incapable of performing or understanding the waggle dance, we can say that this particular communication system lacks interchangeability.

The final feature that we will talk about today is feedback. Feedback is defined as the fact that users of a system are aware of the signal they are transmitting. This means that we as humans can understand the speech that we are saying and if there are any errors, we are able to correct them.

This is yet another obvious one to see in human language. When you say something, but it doesn’t quite come out right because your voice cracks or you have a slip of the tongue or something, you are able to recognize the error and correct yourself to make sure you are understood.

This is a tough one to analyze in animal communication because we simply don’t understand their communication systems. We don’t have evidence of a cat saying something like “meow. Pardon me, I meant MEOW!” so we can’t confidently say that this is a feature that these animals have.

We can infer that at least some animals have this feature in their communication system though. Take the warning calls of vervet monkeys as an example. Vervet monkeys have three specific calls that they use to signal to other that there are predators nearby. Depending on the call that they use, it could signal that the predator is in the sky, or that a leopard is in the trees with them, or even that a snake is on the ground. The fact that these three specific calls exist and are shared by members of a tribe means that they would need to be learned by the young primates.

Photo by Charl Durand on Pexels.com

Part of learning any speech sounds in this instance is a trial an error period. It’s not as if there is an automatic system in the vervet monkeys to produce these particular calls that they are all born with innately. If we then assume that these primates are having to learn how to make these sounds, we can assume that there are instances where they get them wrong and they would need to be able to recognize that somehow. Because primates are our closest biological analogue, we can give them a little bit of the doubt on this one.

This covers the first three design features of language. These are all commonly shared features with many animal species, but they are certainly not universal features. Come back next week for and we will talk about semanticity, symbolism and displacement. Hopefully everything that I said here is clear and interesting. If you can think of any other examples of communication that fits these features, let me know in the comments.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

What is a question?

Photo by Pixabay on Pexels.com

For many years I have been a fan of the TV gameshow Jeopardy! I have been recounted tales of when I was two years old, and I would be dancing in front of the television to the Final Jeopardy! music. I haven’t watched it in quite some time now unfortunately (partly because cable is too expensive and partly because I am still not entirely over the passing of Alex Trebek) but for some reason, the show popped into my head, and I started to think about questions.

For those of you who may not have seen the show, Jeopardy! is a trivia gameshow where contestants are given an answer as a prompt, and they then must respond to the host with the question that would generate that answer. It is not as tricky as it may sound though. For instance, contestants would be read the prompt “Snake Island off Brazil’s coast is filled with golden lanceheads a deadly pit type of this snake” and would respond with the question “what is a viper?”

We won’t focus on the other rules regarding the dollar amounts of the questions, the daily doubles, or the wagering in final jeopardy. For now, let’s just focus on the question formation.

In a random survey of 60 Jeopardy! categories from the Jeopardy! YouTube page (the equivalent of 5 games), contestants responded with 185 ‘what’ questions, and 115 ‘who’ questions. Let’s also note that this number may be slightly skewed thanks in part to current reigning champion Matt Amodio who has drawn some heat for his propensity to respond to every prompt with ‘what’s’, even if a ‘who’ would be more appropriate. Just take a listen to his response from the category named Audible (most contestants would respond with “Who is (Matthew) McConaughey?”).

Unexpected responses aside, we do see that using ‘who’ and ‘what’ are the two question words that these contestants are using. This is not mandated by the rules of Jeopardy! in any way. Although I was not able to find examples, there are reported instances of contestants responding with ‘where’ questions. According to the Jeopardy! rules, you are only required to respond in the form of a question. So how can you know for sure if your Jeopardy! response is valid?

The first thing we need to ask ourselves is: what is a question and how is it formed in English? Questions are sentences that are aimed to have the addressee (the person you are speaking to) to provide information. There are several different types of questions that a person could ask, and not all of them are acceptable Jeopardy! responses.

Typical Jeopardy! responses are known as wh-questions. These types of questions use words like who, what, where, when, or why to signal to the addressee what kind of response you are expecting. If you ask a question with “when”, you are likely expecting some sort of response dealing with time. In the same vein, if you ask a “where” question, you would be looking for a place.

The funny thing about Jeopardy! is that even though there are plenty of responses that are formed with place names or specific years, the contestants will use “what” for these questions. There are a few reasons for this. In the case of the place names, using a where would seem a little strange based on how the answers are usually worded. If you asked someone the question “Where is New Orleans?”, you would probably be confused if someone answered with something like “This city was founded in 1718 by Jean Baptiste le Moyne, Sieur de Bienville, y’all” (A Round of Gulf Coast category from June 11, 2013).

The second reason for these “what” questions is that contestants are under a large amount of pressure to perform quickly. If you were on the show, and you were asked to respond to a prompt quickly in the form of a question, it would just be easier to use a default “what” than it would be to stress about whether a “when” or a “where” could apply.

So this tells us “what” a question is, but “how” are questions made? For wh-questions, the sentence is first generated to mirror how the answer would look. So, imagine you had a declarative statement like “The capital of Ontario is Toronto.” To ask what the capital of Ontario is as a question, the initial form of it would be “The capital of Ontario is what?” This is acceptable as it is, although you could imagine someone saying something like this in an incredulous way perhaps (“The capital of Ontario is WHAT?!?!).

Once we have this form of the sentence, the wh-word will move to the front of the sentence and give us the question “What is the capital of Ontario?” What this means for Jeopardy! contestants is that, when faced with the prompt “This city is the capital of Ontario”, it would be perfectly fine for them to answer with a question like “Toronto is what?” This would likely raise a few eyebrows though, and it would be tough to do on the spot. Again, with the pressure these contestants are under, it’s easier to try and keep things simple and consistent.

Another way to form a question is through the process of auxiliary inversion, where the auxiliary verb (can, may, is) is moved to the front of the sentence. These questions are known as yes/no questions because the answer to them is, unsurprisingly, a yes or a no. This type of question is, surprisingly, permitted by the rules of Jeopardy! It would result in seemingly strange question and answer pairings; “Is it Toronto?” is not the type of question that would elicit the answer “It is the capital of Ontario.”

This type of question, apparently, has been used in the past by some contestants. I was unable to track down any physical evidence of this, but it is possible within the rules.

The rules of Jeopardy! (while not explicitly published anywhere) do not require the questions provided by the contestants to be grammatical or to match up with the answer explicitly. The questions are only required to be clearly identifiable as questions.

Questions that present two or more possible options are known as alternative questions. This would be the type of question that you might ask your child at desert time. “Would you like cake, or ice cream?” These types of questions could not be used in a game of Jeopardy! because the goal of Jeopardy! is to provide a question response that would satisfy the answer provided, and the answer to an alternative question is one of the alternatives presented. It would be confusing and difficult to construct an answer prompt that would elicit this type of a question. I also don’t imagine you could convince the judges that “What is Queen Elizabeth or Queen Victoria?” is a good response to “This female ruler was the first member of the Royal family to live at Buckingham Palace” even though one of those alternatives is correct.

Another question type not allowed in the rules of Jeopardy! is known as a tag question. It is called this because you are adding a ‘tag’ to a declarative sentence that turns it into a question. An example of this would be if you think that Tobias likes jean shorts, but you aren’t 100% confident and you would like some confirmation, so you would say “Tobias likes jean shorts, doesn’t he?” The first portion of the sentence is just a declarative statement (Tobias likes jean shorts), and it is the “doesn’t he” that turns the whole thing into a question.

The response to a question like this would either be true because the initial statement was true, or false for because the initial statement was false. Because of this, it would be difficult to construct a Jeopardy! style answer where the contestant could provide a tag question response. This has not been tried by a contestant, but I cannot imagine it being accepted by the judges.

Photo by Andrea Piacquadio on Pexels.com

A final type of question that would not be allowed by the rules of Jeopardy! is an inflection question. This is done in English, not by using question words or by rearranging the sentence in any way. Instead, these questions are made by raising the pitch of your voice at the end of the sentence as if to say “I think this right??” A question like this would not be permitted in Jeopardy! because you are not explicitly forming a question with your statement, you are instead questioning whether the statement that you said is correct.

All of this is not to say that there are no fun ways to bend the rules of Jeopardy! So long as your response is a proper question, you can still respond in some clever, corner-case ways.

For example, if the answer provided was something like “This book series has children around the globe searching crowded malls and beaches for the title character, portrayed in a red and white sweater and toque.” A perfectly acceptable and legal response question would be “Where’s Waldo?” The Jeopardy! team even cites similar instances of this situation that are permitted in this article.

So now we have seen that there are several ways that you can make a question in English, and many of them are permitted in Jeopardy! However, if you ever find yourself on the show, it is likely easier to just stick with the traditional “What/Who is __?” type responses. Under those bright studio lights with real money on the line, it would be a shame for you to make a silly error that get’s your response disqualified just so you could look a little bit clever.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

This is a wug.

A few weeks ago, I talked about part of my tattoo, but I neglected the blue bird looking thing in the middle of it. Well, gather round and let me tell you the story of how this simple thing became the mascot of linguists around the world.

This is a wug.

Now if there were two of them, you would have two _____.

No this is not a trick question, the answer is “wugs”, but think about the sound that you use at the end of it when you say it out loud. The word “wugs” has a “z” sound at the end of it.

Now what if this were called a “heaf” instead. Then it would be an “s” sound at the end of it, right?

What if it was called a gutch? Then we would have a different spelling by inserting a short unstressed vowel (hey, didn’t we talk about those recently?) and calling them gutches.

Now keep in mind that these are all made up words that you have likely never seen before, but somehow you knew exactly what to do with them. Now you might be saying “okay, so we have plural rules… What’s the big deal?” Well, discovering this back in the 1960’s was a big deal because there was debate among many linguists and psychologists whether we used rules to pluralize nouns, or whether we simply memorized all of the words and their plurals individually.

Think about this in simpler terms. Imagine that your brain has a library of all the words it knows stored on file cards, and we can call it a lexicon. Researchers at the time were unsure whether people had two separate file cards for things like “dog” and “dogs”, or whether we just have the file card for the word “dog” and there is some universal rule we can apply to this that lets us know what sound to make when we pluralize it.

From an organizational standpoint, it is much cleaner to imagine the latter scenario here, but this test (known as “the wug test”) now gave us a unique way to prove it by asking children what they would do.

Dr. Berko’s wug test. (from Wikipedia)

In 1958, an article published by Dr. Jean Berko Gleason introduced the world to these loveable little wugs. Like I described above, she had presented this test to children and asked them to pluralize words like “wug”, “heaf”, and “gutch”.

The children in this study were divided into two groups, pre-school children and first-grade children. What she found was that first-grade students were significantly more likely to produce the correct forms “wugs”, and while performing highly on words like “heafs” it was not significantly better than the pre-school children. Irregular words like “gutches” had very low performance, but the children will still be able to produce a word like “glasses” which implies that they had not quite learned that rule yet and likely just memorized that particular word.

So, from this test, we can formalize a pattern in English for how we pluralize words. Words that end in a “voiced sound” like dog or mug will get a “z” sound for their plural. Words that end with an “unvoiced sound” like cat or map get an “s” sound. Words that end in a “ch” like sound, like glitch, will have the unstressed vowel inserted as well and have an “es” ending.

Now let’s back up a bit. I introduced a concept in that last paragraph that probably caused some confusion. In any spoken language, the consonant sounds can vary in their voice property making them either voiced or voiceless. Voiced sounds are consonant sounds where your vocal folds are pressed closed together so when the air passes by them, they vibrate. In contrast to this, voiceless sounds are produced with your vocal folds spread apart so the air does not cause them to vibrate.

You can feel this difference when you are speaking by placing your fingers on your esophagus as you say the words “sit” and “zit”. You can feel that when you start the word “sit” that there is no vibration in your throat, but when you say “zit” the “z” sound causes your throat to vibrate a bit.

These properties of the sounds end up affecting the type of plural suffix that gets added to a word, and this just becomes a process that we do automatically without thinking about it. In fact, if you do think really hard about it, and try to produce to word “dogs” with an “s” sound at the end of it, it is actually quite difficult to do.

A word like ‘glitch’ ends up getting the extra vowel added on because it would be a very large cluster of consonant sounds at the end of a word without it. This is the same reason that you may have heard people in your life pronounce the plural of “text” as “textes”. It’s just easier to insert an extra unstressed schwa than it is to try and produce a large sequence of similar consonants in a row.

So there you have it. Because of this simple test done way back in the 1960’s, we now have a cute little bird like thing that we can use to identify ourselves as language nerds by getting it inked into our skin. There are other things that were covered in Dr. Berko’s test related to verb tense, but we will save the past tense work for the future.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.

Is it a big green house, or a green big house?

Quite often people will ask me if there is a particular reason why adjectives go in a particular order. Unfortunately, the answer isn’t super deep and complex, but there are some interesting things we can learn about native intuition because of it.

According to the Cambridge dictionary website there are a total of ten categories of adjectives in English and they go in the following order:

  1. Opinion – unusual
  2. Size – big
  3. Physical quality – thin
  4. Shape – round
  5. Age – old
  6. Colour – blue
  7. Origin – Dutch
  8. Material – metal
  9. Type – U-shaped
  10. Purpose – cooking

So based on this order, you can have an unusual big thin round old blue Dutch metal general-purpose cooking utensil, but you wouldn’t dare call it a metal unusual cooking round blue Dutch old metal general-purpose thin utensil.

Unusual big thin round old blue Dutch metal general-purpose cooking utensil – By: Amy Block

But this order is not mandated by any sort of laws. Just because this is coming from the Cambridge dictionary does not mean that we are obliged to follow it. We do it because breaking this order just feels wrong most of the time. After all, you probably didn’t need me to tell you that it is much more natural to say thin metal rod than it is to say metal thin rod.

Like I mentioned though, we only follow these rules most of the time. I’m sure you have all heard of the big bad wolf, right? Well, this is a perfect example of being able to break the order of these adjectives and have it be totally fine. Big is a size adjective, while bad is an opinion adjective. Based on the order provided above it should be bad big wolf, but that just doesn’t seem to fit the story at all!

We can also get around this restriction by creating a compound word with an adjective and a noun. Having something like a blue prison referred to as the blue big-house is fine as well.

These are just two examples of ways that English can, once again, break the rules of it’s own language. So like I sad, this is not the most interesting or satisfying answer in the world. This is just something we all universally agree upon because it feels better this way.

While we are on the topic of adjectives and weirdness though, I’d like to talk about a classic sentence in linguistic research that comes up again and again:

Colourless green ideas sleep furiously.

Colourless Green Ideas – By: Mikael Parkvall

This sentence is weird, right? But not because it is breaking any of the adjective rules that I mentioned above. How can something be green and colourless? How can an idea be green? Can ideas sleep? And if they can sleep, how do they sleep furiously?

This sentence follows all of the word ordering constraints of English, but is still incredibly uncomfortable to parse because it just doesn’t make any sense.

This sentence first appeared in a book by Noam Chomsky in 1957 and was used as evidence that no division of linguistics can stand on its own. Division of linguistics refers to all of the areas of language that make up sentences when we are creating them. The divisions of linguistics that we care about in this instance are syntax, which determines the order that words go in, and semantics, which cares about the meaning of the individual words and how they combine to form a meaning for a sentence.

In other words, just because the word order of the sentence is correct, that does not make it a good sentence. For this same reason, we would be able to make sense of something like metal unusual cooking round blue Dutch old metal U-shaped thin utensil at the end of the day, but it would still not make a good sentence.

This emphasizes the fact that all the areas of linguistics need to be involved in the generation of a sentence in order to be sure that we are creating grammatically sound sentences.

Thank you for reading folks! I hope this was informative and interesting to you. Be sure to come back next week for more interesting linguistic insights. If you have any topics that you want to know more about, please reach out and I will do my best to write about them. In the meantime, remember to speak up and give linguists more data.