Photo
(posted with tweetshots.com)

(posted with tweetshots.com)

Photo
(posted with tweetshots.com)

(posted with tweetshots.com)

Text

Class Schedule

Follow-up to my state of affairs post: Registered for classes one hour ago. Anxiety over having slim pickings subsided as I got to play the optimization game of course scheduling.

Constraints to satisfy:

  • 3-mile bike/bus commute: avoid unnecessary early-morning awakenings and trips to campus.
  • Fridays need to be open for trips to endocrinologist and in-network phlebotomists. Bonus: extended weekend visits to Chicago/Mary.
  • Need courses to make me more marketable in work-place or grad school. Split the difference: two mathy and two linguisticky classes.

The schedule is thus:

  1. Communicative Disorders 240: Language Development in Children
  2. Linguistics 561: Intro to Experimental Phonetics
  3. Computer Science 520: Intro to Theory of Computing (woot!)
  4. Math 320: Linear Algebra & Differential Equations

All courses scheduled between 11am and 5pm Tuesdays and Thursdays, except for 50 minutes on Wednesday. That’s a four-day weekend, though I will give up Mondays/Wednesdays for employment. This is a very light course load for me, and I find math courses are very calming when I’m diligent. Mission One remains graduate school.

In other news: I bought a cheap, refurbed police-auctioned bike for my commute. I still need to pack, pack, pack, and move—I hate packing and moving more than anything else in world. I have to ask my mom for haircut.

Tags: school
Text

Knowledge Bomb

Latin dissimilates liquids (r’s and l’s), and by virtue of our partitioned lexicon of classical and Germanic roots and affixes, we English speakers also dissimilate liquids and we’ve gone our whole lives without knowing it.

Exhibit A: The normal -al affix

  • radial, regional, hysterical, labial, digital, lateral, tribal

Exhibit B: The -ar words

  • polar (pole), solar, angular, particular, vehicular, uvular, popular, cellular

What unites the B set is the fact that -lal is a funny way to end those words, so -lar is chosen instead. This is what is meant by dissimilated liquids: -lar is chosen over -lal. In contrast, the A set (once sufficiently extrapolated) will contain all and only the -al endings that are not -lal. There might be a couple exceptions—lunar is an analogy of the dissimilated solar—but if you encounter a random Latinish word that ends in /l/ and you need to make a quick adjective from it, you will instinctively choose -ar over -al.

Text

yes! a phonological argument against behaviorism!

As part of my project of condensing and archiving my phonology notes into one super awesome moleskine, I reviewed the arguments for the dual-level hypothesis in Kenstowicz 1994, and I found a token mention of the disutility of extreme behaviorist empiricism in phonology. YES! I exclaimed. Most of generative linguistics’ anti-behaviorist (hence psychologically meaningful) arguments come from syntax, as though the field is the discipline’s golden child. Which is a shame because the study of sound systems is the only part of linguistics that deals with real things located in reality! Words and sentences are abstract bundles of—wait for it—acoustic sound (or sign) made by the gestures of the human vocal tract (or hands)!

The argument is as follows: Strict behaviorism posits that there is no such thing as mental states and that the only thing we can know or discuss is overt human behavior. So a behaviorist theory of speech sounds can only refer to observable phonetic detail. For the untrained speaker, observably distinct sounds may all be perceived as being the same sound. /t/ has seven distinct realizations in American English: s[t]em, [tʰ]in, a[ɾ]om, in[ɾⁿ]ernet (bad IPA, I know), ro[ʔ]en, ten[]s (tents). We perceive sounds that are not actually present or at all similar. Moreover, /t/ is not the only sound with a null allophone—e.g. ten[]s (tends)—so there is no unique intersecting acoustic characterization of the /t/ category because (1) it overlaps with other categories and (2) has a null realization (the category intersects the empty set!). These facts cannot be adequately explained by strict behaviorism; instead we need to talk about /t/ not as a mere set of sounds but also as a mental construct that behaves accordingly to rules and information in the speaker’s mind. Our instincts allow us to recover neutralized distinctions (/t/ and /d/ can be [ɾ] or []) with categorical certainty, and the strongest theory of phonology requires abstract mental states and objects in addition to observable phonetic detail. Another nail in the coffin!

Photo
kened:

probably the most famous sentence ever written by noam chomsky. he wrote it to show that an utterance can be linguistically, at least syntactically, valid, yet utterly meaningless. is it.
if i’ve parsed the phrase structure rules incorrectly let me know (for example:are Det1 and Det2 co-terminous? or is Det1 super-ordinate to Det2? - is it the ‘green-ness’ which is without colour or the ideas?)

In NP[ DP[ A[colourless] A[green] ]DPN[ideas] ]NP—what you have pictured—neither colourless nor green strictly modify the other, so I gather a conjunctive reading as in “colourless and green ideas” or “green and colourless ideas” (ordering does not matter in a conjunction).
That is an okay reading, but I’ve always parsed it with green ideas being modified, i.e. NP[ A[colourless] NP1[ A[green] N[ideas] ]NP1 ]NP. The difference here is that the green ideas are colourless.

kened:

probably the most famous sentence ever written by noam chomsky. he wrote it to show that an utterance can be linguistically, at least syntactically, valid, yet utterly meaningless. is it.

if i’ve parsed the phrase structure rules incorrectly let me know (for example:are Det1 and Det2 co-terminous? or is Det1 super-ordinate to Det2? - is it the ‘green-ness’ which is without colour or the ideas?)

In NP[ DP[ A[colourless] A[green] ]DPN[ideas] ]NP—what you have pictured—neither colourless nor green strictly modify the other, so I gather a conjunctive reading as in “colourless and green ideas” or “green and colourless ideas” (ordering does not matter in a conjunction).

That is an okay reading, but I’ve always parsed it with green ideas being modified, i.e. NP[ A[colourless] NP1[ A[green] N[ideas] ]NP1 ]NP. The difference here is that the green ideas are colourless.

Text

The Ragbag

It’s a testament to his larger-than-life je ne sais quoi—honestly how would you describe it?—that Raynor Ganan managed to infiltrate the typically community- and group-oriented Tumblr Tuesday recommendations. I know nothing about the person behind the persona, but I’ve long enjoyed his work: an almanac in the obscure and quixotic, a Consumer Reports of marginalia and all things colophonic and accidental, a precisely curated museum of things you do not know and do not need to know but will very much enjoy coming to know. I’ve seen many language enthusiasts on Tumblr but most are know-it-all, unprincipled, self-indulgent grammar-nazis—you took Latin in high school and know Strunk & White! congratufuckinglations you literate functional monolingual!!—but Raynor, he’s not afraid to get his hands dirty in the rich mess of the English language. He’s not intimidated by linguistic technicalities or logical gymnastics, and he can even crack a few good jokes about the International Phonetic Alphabet. Raynor is one of the few to whom I grant my Linguistic Seal of Approval.

Photo
Thumbnail: “Four Forms of Information” (2003)Terrence C. Stewart and Andrew BrookCarleton University Cognitive Science Technical Reports
The foundational assumption of cognitive science is that the mind is an information-processing machine. This simple stipulation enables discussion among psychologists, computer scientists, linguists, and philosophers, since all these people can say smart things about information-processors. Despite this productive common ground, some ambiguity remains as to what exactly “information” is—(‽) which makes some of the reading quite annoying and tautological!—and this paper by Stewart & Brook outlines four specific types of information (summarized in above diagram).
Type 1: Reduction In Local Uncertainty
This is the mathematical sense of “information” from Information Theory:

This definition is that the amount of information (measured in bits) gained by observing an event is the negative log (base two) of the probability of that event occurring. So, if you flip a coin and discover that it has landed on heads, then you have gained one bit of information. If you flip three coins, and discover that they land heads, tails, heads, in that order, then you have gained three bits of information. If you roll a six-sided die and discover that it landed on a value less than 10, then you gain zero bits of information (Cover and Thomas, 1999).

This definition requires knowledge of event-probabilities, and it refers to the amount of information in a signal. A natural line of research in cognition is visual processing, since our eyes receive an unpredictable stream of photons.

The reason for calling this usage a “reduction in local uncertainty” is to highlight the fact that we are talking about reducing uncertainty in the signal itself. We have something very specific and immediate (the signal), and we do not know what the signal is going to be until it arrives.

This is why some messages may give zero “information”.
(‽) This might be why we phonologists are interested in what is unpredictable in a language sound system. What is not predictable is contrastive and thus meaningful. It’s Information!
Type 2: Exploitable Regularity

The next usage of ‘information’ deals with the idea that while most processes in nature are concerned mostly with the flow of energy (rain, gravity, friction, etc.), living creatures, and human beings in particular, are controlled by the flow of ‘information’. This idea is grounded in the physics of entropy. The laws of thermodynamics imply that everything in the universe tends towards decay and breakdown, yet living creatures are able to maintain a stable internal structure. The reason they are able to do this is by exploiting regularities in the environment. So, in order to maintain your structure, you must obtain ordered materials from your environment (by eating and drinking).

Information here refers to the amount of order—patterns and regularities—in the external world that allows organisms to extract material from it’s environment. For us to do very cool human things requires a world stable enough for those cool things to take place. The Game of Life is a helpful model:

It is this sense of information that is used by Chris Langton in his original Game of Life analysis. By using simple simulated universes with different “physical laws”, he showed that complex “life-like” behaviour arose only in situations where the physical laws led to reasonably stable (regular) environments. However, if the laws made things too regular, then the complex structures would be unable to extract the order from their environments in order to maintain themselves.

(‽) The “too regular” problem reminds me of the problem of unmarkedness in Optimality Theory, whereby the simplest conceivable language—3 consonants, 2 vowels, CV syllables—is not optimal at all! Factorially, we have six possible syllables, meaning there are only 1296 possible four syllable words. At least 40,000 words are required for a language to talk about the things that humans need to talk about, so the majority of words in this hyper-regular language will have six syllables! The absolute absence of complexity is counterproductive for communication, and with respect to information, total unmarkedness provides us information-processors with too little order for us to extract and exploit the order needed to maintain our own complexity. (I hope I got that right!)
(‽) The principle of regular patterns comes up in game design as well. Meaningful play says that a game needs to respond to players in a discernible and predictable manner. A game that responds unpredictably to player actions—this button does x some of the time, does y some of the time, but unpredictably in the same context—is a broken game. Without the regularity of the system or clear rules of cause-and-effect, we cannot extract the information from the system that we need in order to have the kind of faith in our actions that makes our actions meaningful.
The paper contrasts this type of information with the mathematical type:

Here, creatures make use of predictable patterns in the environment. Thus, a regularly patterned environment is more useful to an organism than a random one, since there is more order in it. However, by the first definition of information, the random input would contain more information.

And it would really suck if life were like a bad videogame.
Type 3: Data to be Manipulated and Transformed
The idea here is that the brain is a computer with different components tasked with various functions. The brain processes the signal. A vibration works on your eardrum, and the Organ of Cortia translates mechanical force into nerve activation for auditory processing. Categorical perception translates the raw signal into discrete features which are packaged into units of sound. Language processing strings the sounds into packets of potential language, comparing the input to known words and storing new forms. This kind of information is data in the computational sense: input, processing, output, and storage.

There is a clear distinction between this sense of information and the first sense. In the “reduction of local uncertainty” definition, information was something contained in the signal. Here, we are using the term ‘information’ to refer to the signal itself. Another term for this would perhaps be ‘data’, as that term stresses the fact that we are interested in the objective truth about the state of the signal. What is being transmitted from one component in the brain to another is an objective physical truth (as much as any other truth about the real world).

(‽) This is how I always thought about the Information of information-processing, but it seems so quaint compared to those two prior definitions!
Type 4: Aboutness
Aboutness connects the processed signal to things and circumstances in the real world.

This is the idea that the signal we are receiving gives us information about something in the external world. For example, the signal entering a person’s eyes gives that person information about the things around them: their colour, their shape, their motion, and so on.

As soon as the signal takes on a meaning—as soon as a blast of vibration becomes a word and that word tells us something about the world—the information is about something.
(‽) To jump into a comfortable frame of reference, aboutness emerges when the signifier (signal) signifies (means something). Which is really fuzzy territory. If we’re going to think about cognition as information-processing done by an information-processing machine, at what point in a computer program does the input become about something?!
In the diagram above, the dotted lines between the representations and the external environment mean two things:
There is no direct line between the real world and it’s representations. This is because aboutness is the only kind of information that may be true or false. We may incorrectly mentally represent the world! The ability for the information processor to be wrong or mistakened is essential for a theory of human cognition.
Representations allow us to cogitate about things we have never sensed firsthand and things that don’t exist in the physical world.

Thumbnail: “Four Forms of Information” (2003)
Terrence C. Stewart and Andrew Brook
Carleton University Cognitive Science Technical Reports

The foundational assumption of cognitive science is that the mind is an information-processing machine. This simple stipulation enables discussion among psychologists, computer scientists, linguists, and philosophers, since all these people can say smart things about information-processors. Despite this productive common ground, some ambiguity remains as to what exactly “information” is—(‽) which makes some of the reading quite annoying and tautological!—and this paper by Stewart & Brook outlines four specific types of information (summarized in above diagram).

Type 1: Reduction In Local Uncertainty

This is the mathematical sense of “information” from Information Theory:

This definition is that the amount of information (measured in bits) gained by observing an event is the negative log (base two) of the probability of that event occurring. So, if you flip a coin and discover that it has landed on heads, then you have gained one bit of information. If you flip three coins, and discover that they land heads, tails, heads, in that order, then you have gained three bits of information. If you roll a six-sided die and discover that it landed on a value less than 10, then you gain zero bits of information (Cover and Thomas, 1999).

This definition requires knowledge of event-probabilities, and it refers to the amount of information in a signal. A natural line of research in cognition is visual processing, since our eyes receive an unpredictable stream of photons.

The reason for calling this usage a “reduction in local uncertainty” is to highlight the fact that we are talking about reducing uncertainty in the signal itself. We have something very specific and immediate (the signal), and we do not know what the signal is going to be until it arrives.

This is why some messages may give zero “information”.

(‽) This might be why we phonologists are interested in what is unpredictable in a language sound system. What is not predictable is contrastive and thus meaningful. It’s Information!

Type 2: Exploitable Regularity

The next usage of ‘information’ deals with the idea that while most processes in nature are concerned mostly with the flow of energy (rain, gravity, friction, etc.), living creatures, and human beings in particular, are controlled by the flow of ‘information’. This idea is grounded in the physics of entropy. The laws of thermodynamics imply that everything in the universe tends towards decay and breakdown, yet living creatures are able to maintain a stable internal structure. The reason they are able to do this is by exploiting regularities in the environment. So, in order to maintain your structure, you must obtain ordered materials from your environment (by eating and drinking).

Information here refers to the amount of order—patterns and regularities—in the external world that allows organisms to extract material from it’s environment. For us to do very cool human things requires a world stable enough for those cool things to take place. The Game of Life is a helpful model:

It is this sense of information that is used by Chris Langton in his original Game of Life analysis. By using simple simulated universes with different “physical laws”, he showed that complex “life-like” behaviour arose only in situations where the physical laws led to reasonably stable (regular) environments. However, if the laws made things too regular, then the complex structures would be unable to extract the order from their environments in order to maintain themselves.

(‽) The “too regular” problem reminds me of the problem of unmarkedness in Optimality Theory, whereby the simplest conceivable language—3 consonants, 2 vowels, CV syllables—is not optimal at all! Factorially, we have six possible syllables, meaning there are only 1296 possible four syllable words. At least 40,000 words are required for a language to talk about the things that humans need to talk about, so the majority of words in this hyper-regular language will have six syllables! The absolute absence of complexity is counterproductive for communication, and with respect to information, total unmarkedness provides us information-processors with too little order for us to extract and exploit the order needed to maintain our own complexity. (I hope I got that right!)

(‽) The principle of regular patterns comes up in game design as well. Meaningful play says that a game needs to respond to players in a discernible and predictable manner. A game that responds unpredictably to player actions—this button does x some of the time, does y some of the time, but unpredictably in the same context—is a broken game. Without the regularity of the system or clear rules of cause-and-effect, we cannot extract the information from the system that we need in order to have the kind of faith in our actions that makes our actions meaningful.

The paper contrasts this type of information with the mathematical type:

Here, creatures make use of predictable patterns in the environment. Thus, a regularly patterned environment is more useful to an organism than a random one, since there is more order in it. However, by the first definition of information, the random input would contain more information.

And it would really suck if life were like a bad videogame.

Type 3: Data to be Manipulated and Transformed

The idea here is that the brain is a computer with different components tasked with various functions. The brain processes the signal. A vibration works on your eardrum, and the Organ of Cortia translates mechanical force into nerve activation for auditory processing. Categorical perception translates the raw signal into discrete features which are packaged into units of sound. Language processing strings the sounds into packets of potential language, comparing the input to known words and storing new forms. This kind of information is data in the computational sense: input, processing, output, and storage.

There is a clear distinction between this sense of information and the first sense. In the “reduction of local uncertainty” definition, information was something contained in the signal. Here, we are using the term ‘information’ to refer to the signal itself. Another term for this would perhaps be ‘data’, as that term stresses the fact that we are interested in the objective truth about the state of the signal. What is being transmitted from one component in the brain to another is an objective physical truth (as much as any other truth about the real world).

(‽) This is how I always thought about the Information of information-processing, but it seems so quaint compared to those two prior definitions!

Type 4: Aboutness

Aboutness connects the processed signal to things and circumstances in the real world.

This is the idea that the signal we are receiving gives us information about something in the external world. For example, the signal entering a person’s eyes gives that person information about the things around them: their colour, their shape, their motion, and so on.

As soon as the signal takes on a meaning—as soon as a blast of vibration becomes a word and that word tells us something about the world—the information is about something.

(‽) To jump into a comfortable frame of reference, aboutness emerges when the signifier (signal) signifies (means something). Which is really fuzzy territory. If we’re going to think about cognition as information-processing done by an information-processing machine, at what point in a computer program does the input become about something?!

In the diagram above, the dotted lines between the representations and the external environment mean two things:

  1. There is no direct line between the real world and it’s representations. This is because aboutness is the only kind of information that may be true or false. We may incorrectly mentally represent the world! The ability for the information processor to be wrong or mistakened is essential for a theory of human cognition.
  2. Representations allow us to cogitate about things we have never sensed firsthand and things that don’t exist in the physical world.
Tags: thumbnail
Quote
"The mathematics of cities was launched in 1949 when George Zipf, a linguist working at Harvard, reported a striking regularity in the size distribution of cities. He noticed that if you tabulate the biggest cities in a given country and rank them according to their populations, the largest city is always about twice as big as the second largest, and three times as big as the third largest, and so on. In other words, the population of a city is, to a good approximation, inversely proportional to its rank. Why this should be true, no one knows."

Math and the City (via iamdanw)

Fun fact: Zipf’s law, named for the selfsame Zipf, says the same thing about corpus linguistics: ”[it] states that given some corpus of natural language utterances, the frequency of any word is inversely proportional to its rank in the frequency table.” Like a great linguist, Zipf was able to find similar formal relationships in disparate objects of study.

Photo
I went out with a bang in field methods using LaTeX and the MIT Working Papers in Linguistics stylesheet.

I went out with a bang in field methods using LaTeX and the MIT Working Papers in Linguistics stylesheet.