Monday, October 22, 2018

Consciousness and Information (or: Why I am a Cartesian Dualist)

I.
One of the major points of confusion I see in many modern theories of consciousness, including Integrated Information Theory (IIT), Global Neuronal Workspace theory (GNW) and others is an unjustified jump from information and the processing thereof to conscious, subjective experience of that information. The strong versions of these theories tend to make the mistake of saying once you have the right type of information processed in the right way, subjective experience will emerge. On these theories, the brain, which processes a lot of information and combines different kinds of information with each other (IIT) or selectively focuses on a particular subset of information (GNW) thus produces consciousness.

I think that underlying these ideas is a fundamental misconception of what information is and what it can do. Information is a mathematical concept, not a physical one. We can use physical systems to represent information, like we can use a pair of gloves to represent the number "two". But that is just a matter of cognitive convenience; if I wanted, the pair of gloves could represent the number "ten" by counting the number of fingers on the gloves. Information is the same in this respect; Information is a property of random variables, not matter.  I can use a coin to represent a random variable by saying "this coin has two states, heads and tails, and flipping the coin assigns a Bernoulli distribution to the random variable with p = 0.5, and the side on which the coin lands after being flipped determines the outcome of the random variable after the experiment." But this is, again, a matter of convention. I could, for example, consider the number of times the coin flips in the air as the random variable, which would then have a different number of states (in principle, the set of all natural numbers) and a different probability distribution. So the coin itself doesn't contain information intrinsically, the information depends on what we, as observers, choose the coin to represent. (Things become a bit more nuanced with subatomic particles, which have physical states that seem to be at least somewhat well-defined and restricted in terms of the information that they convey, and there's also the issue of Landauer's principle which needs to be addressed, but I'll leave those aside for the moment.)

Image result for gloves
A primitive computer

In addition to physical objects being able to represent different information depending on the how the observer chooses to define the state space, information is substrate independent; in other words, you can store the same information in a variety of physical media and it will be identical from a mathematical standpoint. Two rocks and two socks can both convey the number "two". Let's take another example: the string "Hello" is equivalent to the binary string 01001000 01100101 01101100 01101100 01101111 in ASCII encoding, where every letter in the English letter is assigned an 8-bit binary code. A printer can convert the above binary string stored in transistors on your computer to the English string written in ink on a piece of paper. Different physical media, same information.

Let us posit, for the sake of argument, that ASCII was never invented. In other words, no one ever created a mapping between English and 8 digit-numbers. Does 01001000 01100101 01101100 01101100 01101111 still mean the same thing as "Hello"? Well, this is kind of a trick question. In information theory, entropy, the standard measure of information, doesn't answer questions of the form "does A mean the same thing as B". Instead, entropy measures the amount of information in a probability distribution, but it doesn't tell you anything about "meaning." So, for example, the average letter in English language has about 2-3 bits of entropy (after compressing via word-frequency and so forth) meaning that if you want to have a binary system that can represent any arbitrary string in English, you'd need, say, 15 bits to encode all 5 letters of hello. So entropy tells us that 01001000 01100101 01101100 01101100 01101111 could represent "Hello" with some bits to spare if we wanted it to; we would just need to create the encoding scheme that performs the appropriate mapping. But there's no (lossless) mapping from the English language to binary that could produce the word "Hello" with a single bit.

II.

Information theory can actually do a bit more for us though. There's a measure called mutual information, which does indeed tell us how much information A contains about B (and vice versa, mutual information happens to be symmetric). However, mutual information requires some additional knowledge about A and B, namely the joint probability, or the probability that in a given experiment A will have value x and B will have value y. So, for example, there is a non-zero mutual information between a person's height and weight, because height is at least somewhat predictive of weight. In this sense, mutual information is similar to correlation, but it is a stronger measure because correlation only captures linear relationships between A and B whereas mutual information tells you the maximum information you can extract from A about B using an optimal function.

If we go back to our ASCII example if we already have a computer that translates binary to English,  we can calculate the joint probability between the binary code stored in its memory and the words that it prints on its screen or on a piece of paper, and from there we can determine that the mutual information between the ASCII code and English is maximal. If we don't already have such a computer, though, then the joint probability between ASCII and English is simply not defined and the mutual information can't be calculated.

Our mutual information measure also doesn't really approach anything resembling "meaning". All we've said is that it is possible to convert from one string of symbols into another string of symbols without losing information. Because I know that my computer uses a well-defined mapping from bits to letters, I can reconstruct text from documents stored on a hard drive. That's great. But if English-speaking humans weren't around to understand the semantics of English, this would be a pointless exercise; a meaningless conversion of one string of symbols to another. The same is true, by the way, when it comes to information processing. By performing a mathematical operation on some data, I'm simply converting one string of symbols to another string of symbols by means of a function (i.e. via a Turing Machine algorithm). I could have a string of symbols a trillion trillion bits long, and I could perform a trillion trillion operations on it (if you want, I can even make the operations behave like a network, because that seems to be something that people think is important), and at the end I'd still be left with...a string of symbols. There is no Turing function of which I am aware which can take a string of symbols, perform a mathematical operation on it, and return something other than a string of symbols.

III.



If you want to get something other than a string of symbols out of another string of symbols, you have to leave the realm of mathematics and return to the world of physics, with particles that bump into each other and that sort of thing. The best example here is from molecular biology. In the classic (extremely simplified) central dogma of molecular biology, DNA is transcribed into RNA which is then translated into a protein. DNA is a system equivalent to binary except with a base-4 system (A,T,G,C) instead of binary's base-2 system or our more commonly used base-10 system. All of these systems are of course basically the same, the differences are merely representational. DNA is translated to RNA, which has the same bases as DNA except T is converted to U. The RNA strand is complementary to the DNA strand, which means that wherever G appears in the DNA strand a C appears in the RNA strand, but the two strands are informationally equivalent, because there is a simple algorithm to reconstruct the DNA string from the RNA string and vice versa. The string of As, T, Gs, and Cs, only matters, though, when the RNA is translated into amino acids, which create proteins, because proteins actually do stuff in the cell. Proteins help to catalyze chemical reactions, transfer materials within and between cells, and so on. So the DNA only has "meaning", once it's converted into a protein. And, for what it's worth, information is actually lost when RNA is translated into a protein, because multiple RNA trigrams can code for the same amino acid, meaning that you can't tell from looking at a protein exactly what RNA sequence created it.

Image result for ribosome video
RNA Translation

If I were to create a DNA strand a trillion trillion sequences long by randomly concatenating base pairs and transfecting it to your genome, it probably wouldn't do anything especially useful (actually there's a good chance it would kill you). And again, same goes for if I would process the information in that strand a trillion trillion times. The amount of information or information processing that occurs to a string of information is not highly correlated with the usefulness and/"meaning" of that information. The important step for "meaning" is the translation of the information to a physical substrate, not the symbolic representation itself. The choice of symbolic represenation is arbitrary; it just has to be long enough to compress whatever physical information you need to "read out".

IV.

So far I've been tossing around the word "meaning" without defining it, because this is where consciousness comes in. When we look at the word "Hello" written on a screen and we see it -- I mean experientially -- that cannot be information processing. Information processing can turn "Hello" into "Cello" or it can translate "Hello" into 01001000 01100101 01101100 01101100 01101111 (in the case of the brain, into the code of firing neuronal action potentials). It can associate "Hello" with other strings of information, such as an image of a waving hand - and by "associate" I mean "perform some mathematical operation whereby the symbolic representation of the waving hand and the symbolic representation of the word "hello" are combined to produce a new string of symbols. But information processing - even in network structures - cannot remove "Hello" from the world of strings of symbols to the world of conscious experience. And if it can, you have to explain how, because such a claim very much looks like a category error.

In my view, if consciousness is anything, it is unlikely to be information or to emerge from information, because we have no examples of things that are not strings of symbols emerging from strings of symbols. Consciousness (or perhaps more precisely, qualia) is a substrate which interacts with information, it is not the information itself. And what is interesting about consciousness is not the information that it reads, but rather the fact that it can read anything at all, even if the information it reads is simplistic. The brain does a lot of processing, but the purpose of this is not to create consciousness, it is simply to prepare the information for consciousness's interaction with it.



Consciousness is not necessarily a physical substrate, though I find the view of a "consciousness particle" a lot more plausible than the idea that consciousness is something that spontaneously emerges from information processing. I also believe that attempts to ground consciousness in the combination of information processing and "causal networks" of physical interactions in which that information is stored (as exists in IIT) are misguided. The best approach, in my view, is to view consciousness as a categorically independent object of inquiry and to distinguish consciousness from the information with which it interacts, the latter lending itself to network computation-related explanations.

Once we divorce consciousness from information, we are left with something very small; a consciousness that does not contain memory, personality, or anything that could be considered persistent. Persistent information is stored in the brain and may be projected to consciousness, but consciousness does not store information for more than a negligible amount of time. This version of consciousness is so small that, in the absence of information, it is indistinguishable (as far as we know) from being absent. To sharpen this point, consider the following question - is a sleeping person actually unconscious or is his brain simply not projecting any information to the conscious substrate? If consciousness is synonymous with information and the processing thereof, the question is meaningless. But if consciousness is a substrate of information; consciousness can be present in the absence of information, just like a computer can be on in the absence of data held in memory. You might not see anything on the screen, but it's still there, awaiting informational input.

Wednesday, October 10, 2018

Netivot Olam: Torah as Teleology

In the last post, we discussed the view of the Maharal of the Torah as a teleological text;  the Torah contains instructions for how the universe and man ought to be. Maharal is careful to distinguish between chochma, or knowledge, and Torah,  which literally means "instruction." The Torah says how things should be; it provides order. It is not necessarily a description of how things are.

To the Maharal, this is not simply an abstract philosophical claim. He believes that there are practical, real-world implications for the Torah being an organizing force in the universe. To this end, he cites the Talmud (Eruvin 54a) which states that if a man walks alone on a road (i.e. between cities, where it is uninhabited) and he is not accompanied by anyone, he should study Torah. Moreover, if a person has pain in his head, throat, internal organs, bones, or even his whole body, he should study Torah, as the Torah will heal him. Rabbi Yehuda the son of Rabbi Chiya adds to this that God is not like man, because when man gives a medical treatment, the treatment might be helpful in one way and harmful in another (i.e. medicine has side effects) but God's treatment, the Torah, is equally good for everything; it heals a person completely.

Maharal first focuses on the statement about walking alone on a road. Rabbinic tradition has it that "all roads are assumed to be dangerous" (Kohelet Rabba, 3:3). On Maharal's view, this isn't just because of the presence of highwaymen and dangerous animals and whatnot. Rather, there is a metaphysical sense in which uninhabited parts of the Earth are incomplete, or not fulfilling their teleological duty. [An aside: one of the interpretations of the story of the Tower of Babel is that the sin of the creators of the Tower of Babel was that they wanted to build a city to unify all of mankind in a single place, instead of spreading out over the entire world and inhabiting it.] The source text for this is Isaiah 45:18, "he did not create [the world] to be empty, but formed it to be inhabited."  Parts of creation that are not in their teleologically optimal state, such as uninhabited roads, have a sort of inherent danger associated with them. A person thus either needs to travel with someone else (when multiple people travel together, they form a social entity, and are thus considered as turning an uninhabited place into an "inhabited" one) or to study Torah, which itself has the power to bring the world to its optimal state.

Image result for road less traveled
Fig 1. Here there be dragons.

The same reasoning applies, says the Maharal, to diseases of the body. The teleological state of the body is a healthy body, and bringing the Torah in contact with a sick body brings the body back to its optimal state. This is easier to understand with pain of the head, which is the seat of the intellect and thus would more naturally attuned to the Torah's healing powers, but it is true for the rest of the body also. In this context, Maharal mentions two kinds of intellect, the analytic intellect (שכל עיון) which resides in the head and the linguistic intellect (שכל דיבור) which is associated with the throat.

Regarding the difference between Godly medicine (the Torah) and man-made medicine, Maharal notes that all man-made medicine has some sort of physical properties, and those physical properties will necessarily be harmful for some things even if they are helpful for others. He gives the example of a "hot" medicine being good for limbs which are "hot" but bad which are for limbs which are supposed to be cold. The Maharal is working with outdated medical science and terminology here, but one can easily think of many examples of drugs with side effects due to the drug's adverse reactions with non-targeted organ systems. The Torah, in the sense of it being an organizing force that brings matter to its teleological state, doesn't have any of the downsides of physical drugs because it is not a physical substance, rather it simply reorients matter to its teleologically optimal state. (Evidence that this occurs and an explanation of the mechanism by which this occurs are conveniently not mentioned.)

Interestingly, Maharal emphasizes that the Torah has the same healing powers for psychological ailments as it does for physical ailments. The particular psychological issues that the Maharal mentions are "jealousy" and "desire", which are associated with the heart and liver, respectively. In other words, all of the body parts that the Talmud mentioned also have some association with a psychological ailment. While this may seem far-fetched, there is growing evidence that the enteric nervous system and the microbiome play an important role in psychological health, so at least in the case of the digestive system, there is an element of truth in the mind-body connection. In any event, it is worthwhile to note that the Maharal considered psychological ailments sufficiently important for them to warrant medical attention in an age where clinical psychology and psychiatry did not exist as medical disciplines.


Tuesday, October 2, 2018

Maharal's Cosmology and the Is-Ought Problem

David Hume, the 18th century Scottish philosopher, famously introduced philosophy to the is-ought problem - the fact that the universe is a certain way can never imply that a person ought to do something. Even a seemingly morally compelling "is" claim like "dolphins are sentient beings" doesn't entail that people shouldn't eat dolphins; you need to start with a moral axiom like "it is wrong to eat sentient beings." Or, perhaps more in line with Hume's example "God exists and says you shouldn't eat dolphins" doesn't imply that you shouldn't eat dolphins unless you accept "it is wrong to disobey God" as a moral axiom.

Hume's position puts moral philosophy in a quandary because if you can't derive moral axioms from fact claims about the world, where exactly can you derive moral axioms from? The simple answer is that "ought" only has meaning in terms of some sort of goal. So if your goal is to get to the supermarket around the corner, it's legitimate can say "I ought to put my pants on." But there are no cosmic goals that can be derived from facts about the universe, and thus no cosmic morality.

Enter the Maharal. The Maharal draws on two rabbinic statements. The first statement is "God looked into the Torah and created the world". The second statement is "All of the work of the creation of the world was suspended until the sixth of the month of Sivan (the date of the holiday of Shavuot, which celebrates the receiving of the Torah); if Israel would accept the Torah, good; if not, the universe would return to formlessness and emptiness."

For the Maharal, the Torah is not simply a book of law for human conduct. It has another hidden facet - it is a blueprint for the physical organization of the universe. The Torah is thus both prescriptive - in terms of telling man how to behave - and descriptive - in terms of describing the universe. Both aspects of the Torah describe teleological necessities - how the universe ought to be and how man ought to act. The former was carried out in a deterministic fashion when God created the world, the latter is subject to the free will of man to observe or not observe, but when he chooses not to observe the Torah, he violates the divine teleology of the Torah. (This is related to the idea of teleology in classical Greek philosophy, where an object's teleological "purpose" is necessary in order for it to move. This is in contrast to the modern view, where objects in motion stay in motion and they change their motion when acted upon by a force.)

Related image

Fig. 1. The universe. Some of it, anyway.


The is-ought problem is thus turned on its head: we do not derive "ought" from "is", rather the "is" derives from "ought". The universe exists because God commanded it via the Torah, and morality is compulsory for the same reason. The Torah rejects the idea that morality derives from truth claims about the world; rather the trueness of the world derives from the Torah, which is a fundamentally moral document.

 Maharal continues this idea, citing another rabbinic passage. "Why was the world created with ten utterances? To teach that the righteous are rewarded for upholding the world that was created with 10 utterances, while the wicked are punished for destroying the world that was created with 10 utterances." (Avot ch. 5)

A naive understanding of this passage would be something to the effect of "God wanted to reward the righteous a lot and punish the wicked a lot, so he spent more effort on creating the universe, which makes sins and good deeds more significant." To the Maharal, this reading of the passage in incorrect and misses a fundamental concept.

The number 10, in the Maharal's view, has a symbolic meaning. In our base-10 number system, the digits represented by the numbers 1-9 are unique and individuated - they are each assigned a separate symbol. Once we arrive at the number 10, the unique, individual numbers are "collected", so to speak, and converted into a 1, moved over by a decimal place. (The Maharal is actually thinking here in terms of the Hebrew Gematria number system, but the same reasoning applies there as well.) 10 thus symbolizes the individuated coming together in an organized collective. The "world created with 10 utterances" therefore means "the world that was organized and collected from disparate units into a single unit."

In the Netivot Olam's cosmology, God initially created the world "like a hammer splitting a stone" (Jeremiah 23). In other words, creation involved an explosive event sending matter hurling everywhere (big bang theory much?).  A force was thus needed to ensure that the matter would unite and become larger, organized objects. In the modern understanding of physics, this role is played are the four fundamental forces of gravity, electromagnetism, and the strong and weak fundamental forces. To the Maharal, the Torah is the force binds matter together, because of the Torah's teleological directive for matter to organize. (A modern Maharal might hold to his view and claim that the four fundamental forces are all manifestations of the Torah's influence on matter.)

The consequence of the above interpretation of reality is that the consequences of moral violations - violations of the Torah - are not limited to man and his immediate surroundings. Rather, violations of the Torah disrupt the very fabric of the universe, because the Torah is the only thing preventing the universe from dissipating into entropy. The wicked destroy the "world that was created with 10 utterances" - that is, the universe which is made of disparate particles that are united by the cosmic force of the Torah - while the righteous uphold it.