The Basis of Language: The Human Mind/brain
We have by now, fairly substantial evidence that one of the
components of the mind/brain is a language faculty, dedicated to
language and its use - where by "language," now, we mean human
language, not various metaphoric extensions of the term.
Noam Chomsky, Language and Thought, 34.(1)
To Chomsky, language use, language and
language acquisition are explained by facts about an individual's
psychology and ultimately by facts about the individual's brain. I
shall present this position of Chomsky in the following order. 1.1 is
concerned with his way of explaining language use, in particular,
meaning. He takes linguistic activities as instances of language
performance, which we can trace back to knowledge of language (or
linguistic competence), and further, to the internal mechanism
(I-language) that is cognised by a language user.(2) In 1.2, I shall
introduce Chomsky's conception of I-language and account for its
intensional character. In his view, having an I-language is reaching
a mature state (SL, or sometimes abbreviated as SS) of the language
faculty of the human mind (which means, an array of capacities
specifically dedicated to language acquisition, mastery of linguistic
principles and rules, and language use.) Then our discussion will
move from SL to the initial state (S0), that is, the innately given
condition of natural language. A child is supposed to be biologically
endowed with a universal grammar (UG), which consists of a limited
number of principles and parameters. We shall consider the rationale
of this principles-and-parameters (P-P) approach in 1.3 and some
hypothetical idea of Chomsky for explaining language variation on that
approach in 1.4. Another consequence of the P-P approach is the
assumption that language growth (or acquisition) is a process of
parameter-setting. This subject matter will be the topic of 1.5.
In these sections, I hope to show that, according to Chomsky, linguistic
phenomena have their origin in the human mind/brain. (In his view,
the mind, of which the language faculty is one component, is a natural
object, in the sense of being an abstract description of the brain.
That is why he uses the sign '/' in the term 'mind/brain'.) Chomsky
says, "The basic explanation must lie in the properties of the
language faculty of the brain."(3) The natural constraints of language,
Chomsky maintains, are biological facts, uniquely pertaining to human
beings.
A query arises as to how I-language is on the one hand taken
by Chomsky as an "abstract entity", but on the other as a set of facts
of biological nature. This does not involve a conceptual problem for
Chomsky. In 1.6, before closing this chapter, I shall give a brief
answer on behalf of Chomsky to clarify his position.
1.1 Explaining the use of language
Our words are amalgams of some sound and meaning, and put in some
order to express our feelings, thoughts, information, beliefs and so
on, for different purposes in life. We also understand others by
interpreting their words, which may involve certain background
knowledge, perspectives, and presuppositions. Chomsky is well aware
that in actual circumstances, language use is guided and affected by
many factors, which he thinks are not amenable to scientific studies.
But in his view, some general properties of meaning and sound can be
identified and explained by internal principles and rules. His major
ideas for adopting this internalist approach to explaining language
use will be the concern of this section.
1.1.1 Common language and common
thought
Many people, philosophers included (4),
tend to think that linguistic
activities are instances of the use of a common language or publicly
accessible set of well-formed sentences, like English, Chinese and
German, etc. It is contended that members of different communities
inherit different systems of language, and that such a shared language
is passed on to the next generation with a "common treasure of
thoughts" . The main reason is that without the same language and
thought, communication is impossible.
Chomsky is sceptical about this view. Above all, he queries
whether the pre-theoretical notions of common language and of
common thought are useful or coherent. There are
intermingling factors that determine our identifying a
language or dialect in the real world: "colors on maps,
oceans, political institutions, and so on ..."(5) At most some
rule of using the word 'language' in various contexts might be
generalised, but, it seems to Chomsky, this lacks any
explanatory value. Moreover, it is not merely that the notion
of a publicly accessible language is exceedingly vague. The
problem is that it is hopeless to abstract such a 'thing',
whose identity is relative to different perspectives and
circumstances: shifting interests, expectations and power
structure. It is just our loose practice, Chomsky contends,
to say that there are Chinese, Dutch, German, Danish, Swedish
and other languages in the world. In his view, we do not
literally mean that these languages exist as some abstract
entities independent of our cognition of them. Chomsky offers
some examples as follows. "We speak of Chinese as 'a
language,' although the various 'Chinese dialects' are as
diverse as the several Romance languages. We speak of Dutch
and German as two separate languages, although some dialects
of German are very close to dialects that we call 'Dutch' and
are not mutually intelligible with others that we call
'German.'"(6) And to say that we have the same thought and
language is no more than to say that we think (very much)
alike and talk (very much) alike. The claim that we possess
something called 'a thought' does not seem to resemble the
claim that we possess a diamond, since our thoughts are much
more subject to various interpretations under different
circumstances.(7) " Communication is a more-or-less matter," says
Chomsky. To him, a fair degree of mutual understanding does
not depend on any set of public meanings and of accurately
determined pronunciations.(8)
Of a piece with the idea of common language and that of common
thought, is the idea of a predetermined framework of reference. The
publicly shared language is supposed to represent the world, and a
linguistic sign is supposed to refer to an object. Chomsky puts into
doubt this conception, which takes reference to be a two-place
relation. Take London as an example. Using the name 'London', we can
talk about its population, life-style, location, the people living
there, the buildings, and the air, etc. Chomsky thinks that there is
no object or thing-in-the-world, which embraces such
"properties of the intricate modes of reference".(9) Another example: the referent of the
word book in the sentence The book which he wrote in his
mind weighs two pounds is simultaneously abstract and physical.
Chomsky asks: Is there any object like this? In his view, natural
language consists of, among other things, nouns, whose function is not
to supply a determined set of references in the external world. For
as a matter of fact, the object names we use in ordinary life are not
the same as those employed in sciences.(10) "In general," Chomsky
suggests, "a linguistic expression provides a complex perspective
from which to think about, talk about, and refer to things, or what we
take to be things" rather than some external references.(11) Take the
word see. People who know English can be certain that the speaker of
the sentence I see the house sees at least the exterior surface of the
house; one would not imagine the speaker standing inside the house and
looking at a corner of it. "If I am inside an airplane, I see it only
if I look out the window and see the surface of the wing, or if there
is a mirror outside that reflects its exterior surface."(12)
Take near.If Mary is inside the house, we cannot say Mary is near it. Nor are we
correct to say that something inside a container is close to it.
Reference, according to Chomsky, is a relation involving, apart from
the sign and referent, the person who uses the sign and the
circumstances under which the sign is used.(13)
1.1.3 Meaning and interpretation
The fact
that Chomsky highlights the actual contexts where a language is used
and where a thought is conveyed does not imply that he abandons the
analytic-synthetic distinction. According to him, it is clear that
words have an internal structure, and are analytically related. For
instance, the word give necessarily involves a subject, object and
theme, and the words persuade and chase necessarily have to do with
intentions.
He admits the contingent and complicated character of interpretations
in real life, where there are no rigid rules to rely on. But it does
not follow that he thinks that there is no explanation of language use
at all. Meaning and interpretation, he affirms, can be studied.
Besides (i) analytic relations between words, Chomsky suggests, we can
also investigate the following aspects of meaning:
(ii) referential dependence (the reference of anaphors like themselves and each other,
and pronominals like them, are dependent on some antecedent word(s) in
a given sentence),
(iii) thematic roles or θ-roles (like the roles of
agent, patient and goal) and thematic relations (like comment, topic
and focus), and
(iv) quantification and scope, (for instance, the
interpretation of this scope-ambiguous sentence Everybody loves
somebody depends on whether somebody lies within the scope of
everybody, which means that the sentence is paraphrased as 'Everybody
has somebody whom he/she loves. If the scope relation is reversed,
then the sentence is to be paraphrased as 'There is someone who is
loved by everybody').
1.1.4 Positing an internal mechanism
The aspects of meaning just mentioned are shared by all natural
languages. One may generalise certain principles and rules to account
for these linguistic phenomena. However, Chomsky argues, they are not
(internalised) conventions; they are not explicit linguistic knowledge
taught to the child by his/her parents or caretakers. Consider the
following sentences (1a, 1b, 2a and 2b)(14), which are similar and yet
have very different semantic properties:
- (1a) John is too stubborn to expect anyone to talk to Bill
- (1b) John is too stubborn to visit anyone who talked to Bill
- (2a) John is too stubborn to expect anyone to talk to
- (2b) *John is too stubborn to visit anyone who talked to
When Bill is deleted, John, in (2a), becomes the object of talking.
(2a) may be paraphrased as: John is too stubborn that one cannot
expect anyone to talk to him. Yet one can hardly suppose that in
(2b), the person to whom someone talked is also John. Unlike (1b),
(2b) is meaningless; one can hardly assign any sense to it.
On the basis of a limited linguistic experience, children are able to
understand these sentences, which are probably new to them, without
having undergone any specific training. So Chomsky thinks that the
only promising way to explain meaning (and interpretation) is to posit
internal principles and rules of derivation and representation. These
principles and rules are to a great extent innate and universal.
1.1.5 Competence and performance
The
production and perception of speech, like those of meaning, are best
explained in terms of mental representations and rules, according to
Chomsky.(16) A speaker/hearer possesses a lexicon, and in addition,
he/she masters a computational procedure of language (CHL). The
lexicon consists of a definite number of lexical entries. Each
includes information of three types: (i) phonological, (ii) semantic
and (iii) formal, for the interpretations of sound, the
interpretations of meaning and the operations of CHL,
respectively.
(17)
The lexicon and CHL constitute the language that the speaker/hearer
has. Chomsky calls it 'I-language'. (18) When the computational system
operates, his/her lexicon is accessed. The products are certain
derived, structured, strings of mental representations of sound and
meaning. The operations of CHL are insulated from any influence of
the individual's belief or other cognitive systems.
A speaker/hearer's mastery of CHL and possession of a lexicon, Chomsky
calls 'competence'. It is regarded as a kind of knowledge, not
practical abilities or dispositions. Linguistic activities are just
manifestation of this knowledge, albeit various factors (memory,
attention, ideas, and social expectations, etc.) are involved in the
use of language in concrete circumstances, which Chomsky calls
'performance'. These are contingent factors, which are relevant in
shedding light on particular cases of language use. But in linguistic
explanation, it seems to Chomsky, competence, rather than performance,
should be given the primary status.(19)
1.2 I-language
'I-language' refers to an inner state of (i) an individual's
mind/brain, which has a certain independence from other elements in
the world. The idea that a linguistic community speaks the same
language is accordingly derivative. It only suggests that a group of
people share more or less the same I-languages. An I-language is
(ii) an internal mechanism, conceived as a function specified in
(iii) intension, which generates an infinite number of, not only
well-formed sentences like (3a), but structural descriptions (SDs)
like (3b) (whose constituents are hierarchically organised as
indicated by labelled bracketing):(20)
- (3a) David bought the house
- (3b) [S [NP [N
David]][VP[V bought] [NP
[DET the] [N house]]]]. (21)
The notion of I-language is meant to be opposed to that of E-language. The
latter notion suggests that first, in describing a given set
of sentences, a number of grammars can work equally well. As
alternative axiomatisations in mathematics clearly show,
different axiomatic systems can generate the same set of
theorems. These grammars or axiomatic systems are said to be
extensionally equivalent. An extensionally equivalent grammar
determines "recursively, the same infinite set of well-formed
... sentences."(22)
Second, a particular (extensionally
equivalent) grammar has no difference from another that
generates the same set of well-formed sentences; what is at
stake for the E-language approach is not grammar but a
delimited extension of well-formed sentences. And third, it
does not make sense to say that one grammar should be better
than another extensionally equivalent one, and that one
grammar is correct while another is not.
Above all, Chomsky points out, advocates of the concept of E-language
take language to be independent of the properties of the human mind or
brain.(23)
As a consequence, there is no determinate way for analysing
a language; all characterisations of language are thus contingent and
relative to certain interests and purposes. It seems to Chomsky that
this arbitrary character of E-language prevents it from being the
object of scientific inquiry. For Chomsky, what a linguist is
concerned with are facts of nature, not artefacts.
We shall highlight Chomsky's major reasons for defending his
intensional concept of language in 1.2.2,
after providing, in 1.2.1,
an essential sketch about the internal mechanism of language (CHL),
assumed in Chomsky's current research program.
As Chomsky remarks, "The rules of the language are not rules of some
infinite set of formal objects or potential actions but are rules that
form or constitute the language, like Articles of the Constitution or
rules of chess (not a set of moves, but a game, a particular rule
system)."(24)
So to Chomsky, linguistic rules or principles are not
subsequent to a set of well-formed expressions as characterisations of
them, but descriptions of the computational system of language, that
is, a function mapping a given set of lexical items to a sound-meaning
pairing (π, λ). These items, each of which is supposed to be a
mixture of formal (for instance, number, tense, gender, case and so
on), phonological (for instance, green is pronounced as /grin/, not
/gren/) and semantic features (for instance, airplane [artefact] and
run [action]), are selected one-by-one, and merged (that is, combined)
in a pairwise and successive manner to form new and larger syntactic
objects, such as the steps involved in (4).
- (4) You will find it.
- i. Select it.
- ii. Select find.
- iii. Merge it and find to form a structural unit (a verb phrase (VP)) find it.
- iv. Select will.
- v. Merge find it with will to form a higher-ordered structural unit will find it.
- vi. Select you.
- vii. Merge you and will find it to form you will find it.(25)
The output of this procedure is a couple of structural
representations. One is phonological, to be interpreted at the
interface level called Phonetic Form (PF) (that is, to be determined
by the rules of the phonological component of the mechanism); another
is semantic, to be interpreted at the interface level called Logical
Form (LF). These interpretations are instructions to the
articulatory-perceptual (A-P) and conceptual-intentional (C-I)
systems, respectively. So it is assumed that there should be a point
at which a chain of computational operations is split off. After this
point (called Spell-Out) semantic operations continue 'covertly' (in a
sense that they have no phonetic or morphological realisation) to
produce the LF representation at LF (like quantifier raising: from
Mary visited everybody to [Everybodyi [Mary
visited xi]).
And at the same time new features and elements (like intonation contour, and
perhaps temporal order), are introduced to the phonological features
of the syntactic construct at PF to produce the PF representation on
another side.
The idea is that each element of an LF or PF representation
should be interpretable, so that it can be accessed by the C-I
or A-P systems which are connected to CHL, respectively. This
is a crucial representational constraint, called the Principle
of Full Interpretation (FI), which is 'optimal' or
'economical', in a sense that it does not allow any
uninterpretable formal features to remain in the final product
of a derivation of CHL. They are supposed to be 'erased'
during a series of feature-checking steps. Compare (5a) and
(5b), the former being a grammatical sentence while the latter
an ungrammatical one:
- (5a) He has come.
- (5b) *Him has came.
The impropriety of (5b) is due to its violating FI. All the
uninterpretable formal features have been 'erased' in the case of
(5a), not in the case of (5b).
Let us consider the interpretable and uninterpretable features of the
words in (5a) first. The number feature of the word He indicates that
what it refers to is only one object, this being an aspect of meaning
and thus interpretable at LF. Whereas its nominative-case feature is
uninterpretable, since there is no difference in meaning between he
and him in She believes he has come and She believes him to have come,
respectively. Different from the word He, the function of the number
feature of the auxiliary verb has is just to agree with the number
feature of its specifier (that is, subject). But the present-tense
feature of has is interpretable, insofar as it indicates time.
Regarding the word come, its being an n-participle (that is, perfect
or passive participle) here is a consequence of the auxiliary verb,
and so is uninterpretable (that is, plays no role in semantic
interpretation) at LF. There are requirements on the auxiliary verb
has that its specifier be in nominative case and a third-person
singular noun phrase. These restrictions are regarded as the
specifier-feature of has, which motivates its checking with the word
He. Their matching renders the nominative-case feature of the word He
and the specifier-feature of the word has - both of which are
uninterpretable - to be erased. The word has also has a
complement-feature, that is, a restriction such that its complement be
in the form of n-participle. So the words has and come match; the
former's uninterpretable complement-feature and the latter's
uninterpretable feature that it is an n-participle form are erased.
The checking breaks down in the case of (5b). The objective-case
feature of the pronoun Him is not compatible with the
specifier-feature of the auxiliary verb has, which restricts its
specifier to be in nominative case. And the feature of the word came
that it is in the past-tense morphological form also does not match
the complement-feature of auxiliary verb has. All these unchecked and
uninterpretable features remain, and consequently (5b) violates
FI. (26)
Feature-checking requires some constituent part of a syntactic object
to move, in order to determine whether some relevant features match or
mismatch. Syntactic movements, it is contended, are in accordance
with certain economy principles of derivation. There are operations
observing the Minimal Link Condition (MLC), according to which
'shorter'(27) syntactic movements are preferred, and thus the 'shortest
move' in a particular derivation is selected. (6a) and (6b)
illustrate this condition.
- (6a) Who do you think will buy what?
- (6b) *What do you think who will buy?
At some stage of a derivation, we have this syntactic object do you
think who will buy what. The next step is the raising of a wh-word to
the front position. Who, being closer to that position, is thus
preferred to what.
Another principle is called Procrastinate. It prefers those
derivations, which take place as late as possible - that means, the
covert movements that occur after Spell-Out. Contrast (7a), which is
well-formed, and (7b), which is not:
- (7a) David sometimes paints the house
- (7b) *David paints sometimes the house
(7b) involves an improper move of the verb paints to check off its tense feature without waiting until after Spell-Out.
Derivations, it is assumed, are also constrained by Suicidal
Greed. The intuition is that N(oun)P(hrase)-movement should not
be done for the 'benefit' of the other, such as checking the features
of another category, but only if a lexical item has a 'selfish'
reason to move (for instance, its Case-feature needs to be deleted),
does it move. (8a), (8b) and (8c) are ungrammatical:
- (8a) *John seems [t is leaving]
- (8b) *John seems to t [that Bill is leaving]
- (8c) *John was said to t [that Bill is leaving](28)
The small letter 't' represents the original position of John, or the
'trace' of John, which moves to the front. These examples are
similar. Let us consider (8a) only. At one point, [John is leaving]
obtains, where John is located in a Case position, and so John's
Case-feature has already been checked. There is no need for John to
move to a higher Case position. Its being attracted, wrongly, to the
higher Case position is due to the offending inflectional features of
seems, which need to be checked off. The fact that (8a) does not have
a correct form is explained by the presence of such an 'altruistic'
operation.
In the foregoing, I provided an outlook of CHL, which is a
structure-building and feature-checking mechanism. In
particular, a representational constraint, FI, and some
principles or conditions of derivation were illustrated.
Their postulation is an outcome of the working hypothesis of
the on-going 'minimalist program' in Chomskyan linguistics
that CHL should operate optimally.
1.2.2 An intensional concept of language
The internal mechanism with which Chomsky's linguistics is concerned
is supposed to enumerate (generate), not only (1) a particular range
of well-formed sentences infinite in number, but (2) the structural
descriptions (SDs) of them. He distinguishes between (1) and (2) by
calling them weak generation and strong generation,
respectively. (29) It
is (2), or strong generation, that characterise his intensional
conception of language. 'Intensional' is a technical term that he
uses. By 'natural language is a function specified in intension', he
means that what is generated is the SDs of well-formed sentences,
rather than an infinite set of well-formed sentences. (30)
An objection is that there is no empirical means for linguists to
attain such an intensional grammar. It is not possible to ask a
person or invent any experiment in order to ascertain the grammar that
really guides him/her to construct sentences in certain structures,
for his/her knowledge of such a grammar is allegedly tacit. Even we
ourselves cannot give definite answers to questions concerning the
constituent structures of the sentences that we speak every day.
Using experiments, at most, we can distinguish different linguistic
dispositions, which are merely true of indefinitely many extensionally
equivalent systems. To achieve a true description of natural
language, it is contended, what we need and can achieve is no more
than an extensionally equivalent theory. (31)
But in Chomsky's view, such an idea is incompatible with our
motivations to distinguish the following kinds of facts:
- (i) When Jones is dropped, he falls
- (ii) After Jones has eaten the sleeping pill, he falls into sleep
- (iii) Jones understands the sentence Everybody loves somebody as 'There is someone
who is loved by everybody' (see 1.1.3)
- (iv) Jones acquires a body of knowledge, in virtue of which he can form and
interpret the meaning of English sentences, on the basis of a certain linguistic
data obtained from his environment.
(Actually Chomsky does not mention (ii) in the place where he
discusses the issue.(32)
(ii) is included here to further distinguish
between physical and neurological facts.) To explain (i) and (ii), we
attribute Jones mass and a brain, and appeal to some mechanical and
neurophysiological laws, respectively. But in order to explain (iii)
(language use) and (iv) (knowledge of language), further structures
need to be imputed to the individual's mind (or brain) and further
abstraction is certainly required. In Chomsky's linguistics, the
steady state (SL) and initial state (S0) of the language faculty are
attributed to Jones in order to account for (iii) and (iv)
respectively. Postulating an extensionally equivalent system of laws
may well be sufficient to deal with (i) and (ii), but not (iii) and
(iv). For in the case of (iii) and (iv), Jones's being guided by
internal rules, not merely Jones's behaviour, is involved. Theories
about the mental structures involved in (iii) and (iv), admittedly,
cannot be straightforwardly tested or be allowed to rest on
behavioural evidence alone. Linguistics, as other scientific
practices, should not be biased and restrictedly depend on behavioural
criteria and evidence, according to Chomsky. Data from other
languages, findings in child language research, discoveries in
neurosciences and so on are recognised as 'empirical evidence' by
scientists in general. Chomsky maintains that whatever conforms to
the standards of rational thinking may contribute to the advancement
of theoretical linguistics.(33)
For instance, studies in agrammatics have given substantial
support to a postulate in Chomsky's linguistics, namely,
'trace'. Many referential phenomena and phenomena concerning
the acceptability of grammatical structure are currently
explained in terms of syntactic movements and trace, such as
(8a) above. Now take the case of wanna-contraction. The
(wh-)sentence (9) is ambiguous. It may induce two different
answers as shown in (9a) and (9b).
- (9) Who do you want to visit?
- (9a) I want to visit Jane.
- (9b) I want Jane to visit.
If the person who makes the question intends an answer like (9a),
he/she is allowed to use the colloquial wanna, like (9c):
- (9c) Who do you wanna visit?
But if an answer like (9b) is intended, the words want and to should
not be replaced by wanna. The reason is that such a contraction is
blocked by a trace, which is left behind by the movement of the word
who to the front, as contrasted by (9d) and (9e):
- (9d) you want to visit who → Who1 do you wanna visit t1
- (9e) you want who to visit → Who1 do you want t1 to visit
It has been found that agrammatics have difficulties particularly in
understanding the ambiguity of sentences like (9). (34) Note that the
problem has nothing to do with the length or the number of arguments
of the sentence. The explanation is that traces are deleted in
agrammatism. (35) As confirmed by more evidence from agrammatic studies,
the theory of 'traces' can be said to be psychologically real. It is
not speculative, but empirically supported and predictive. In
contrast, E-language theorists can only rely on a range of data, that
is, an extension of ambiguous sentences like (9). Based on evidence
of this kind, their theories have nothing to do with psychological
reality and they are no more than generalisation, with much weaker
explanatory and predictive power.
Comparisons between different languages, of course, can bring
to light the topological characteristics of each particular
language, of which most native speakers of this language are
not aware. The data of one natural language may also modify
the understanding of another one. Suppose that some newly
found facts of a particular language undermine the theory of
the initial state of CHL, that is, UG. Then a revised UG
needs to be put forward, to explain particular languages,
which are instances of UG, and consequently, the idea on the
purported character of a language will be updated. This
nativist model of natural language and the relevant arguments
involved will be dealt with in the next two sections.
1.3 Principles and parameters
Every human being shares the initial state (S0) of the language
faculty, which consists inter alia in an innate schema of universal
principles and parameters (or options) constraining the operation of
CHL. Let us first consider the motivation behind this
principles-and-parameters (P-P) approach in this section.
1.3.1 The tension between descriptive and explanatory adequacy
There is always the limitation that the more restrictive a general
linguistic theory (UG) is (so that it maps a unique grammar on the
basis of certain data about the use of a particular language), the
more unpromising such a theory will be (to be able to describe other
languages). But if the theory is not constraining enough, there comes
the problem that the range of possible grammars will become too broad.
If this were true of the initial state of the language faculty, in
normal situations where the given data are meagre the child would not
be able to learn a language as rapidly as what we usually see. (36)
Given a corpus of linguistic data, there will be a restricted number
of grammars, which are capable of assigning structural descriptions to
them. These grammars (or theories), which purport to describe the
tacit knowledge (competence) of language of the native speakers, are
said to be descriptively adequate, if indeed externally justified by
their intuitions. (37) An explanatorily adequate theory is not only
capable of describing the structure of the sentences of a particular
language, but also of revealing the reality about the language faculty
and language acquisition.
1.3.2 The P-P approach: resolving the
tension
In the current framework of Chomsky's linguistics, CHL is assumed to
work optimally. There being no redundant moves is not due to a
restriction imposed by a preferred system of rules.(38) The notion of
evaluative procedure has been abandoned. Non-redundancy in the sense
of either derivation or representation is now regarded as a designed
property of CHL. This optimal character is not contingent upon any
implicit application of an innate language-specific idea of simplicity
in the process of language acquisition on a set of possible grammars
that are equally compatible with a given set of empirical data. A
particular grammar is just an instantiation of the innate UG with
parameter options chosen - and parameters are currently believed to be
limited to morphological or lexical features. (39) According to this P-P
conception, what is learned is restricted to the lexicon; all specific
rules are subsumed under some innate principles tacitly known by the
speaker/listener. Therefore, the tension between the twin goals of
linguistics is better resolved by the P-P approach: accommodating at
the same time the widest range of particular grammars and the fact
that language is rapidly acquired. (40)
1.4 Language variation
The various ways of parameter-setting accounts for language variation.
For instance, a native speaker of English and a native speaker of
Chinese are in fact biologically endowed with the same innate schema;
the mature state (SL) of their language faculty is an instantiation of
this schema, with the values of the parameters fixed in a different
way. Following Otto Jespersen, Chomsky thinks that UG is restricted to
syntax, and that morphology accounts for the variety of natural
languages. (41) The part of the innately given schema which provides
parameters or options is, it is thought, restricted to the lexicon.
The differences of particular languages, accordingly, lies in their
idiosyncratic morphological features, in particular, their
inflectional morphemes or functional elements like Tense (T) and
Case. (42)
The morphology of a language, and its consequences for word order, are
explained in terms of overt or covert feature-checking.(43)
For instance, English has a strong EPP-feature of T (that is, a
feature that a clause must have a subject), which induces the
pre-Spell-Out (overt) movement of the subject. This results in the
subject-first property of the English-type (SVO) languages. The
V(erb)-features of Japanese, which is a typical example of verb-final
(SOV) languages, are supposed to be weak and according to
Procrastinate to be checked covertly. Celtic languages such as Welsh,
which is a VSO language, is supposed to have strong V-features, for
instances:
- (10a) Gwelodd y dyn y ci
- Saw the man the dog
- (10b) Rhoddodd y dyn y ffon i'r ci
- Gave the man the stick to the dog(44)
The notion of strong features may also explain the difference of
sentence structures regarding the relative positions of the main verb
(in particular, of finite sentences) and the adverbial(s) adjoined to
the VP. French and English vary in this respect, among others, though
they are both subject-initial languages. Compare the following pair
of sentences (11a) (that is, (7a)) and (11b):
- (11a) David sometimes paints the house
- (*David quelquefois peint la maison)
- (11b) David peint quelquefois la maison
- (*David paints sometimes the house)
It is explained that unlike English, the V-features of T in French is
strong and thus there must be overt V-raising from within the VP after
the overt subject (nominal)-raising attracted by the strong
EPP-features of T before Spell-Out. In addition, similar to English,
the D(eterminer)-feature of T is weak in French; consequently its
determiner phrases (DPs) do not raise out of the VP overtly. The
outcome is that French sentences have the following ordering:
subject-main verb-adverbial-object.
1.5 Language growth
The P-P approach has also implication for language acquisition, as
shown in 1.5.1. Chomsky's central argument for it is to be discussed
in 1.5.2.
1.5.1 From S0 to SL
According to Chomsky, I-language is the mature state (SL) of an
individual's language faculty. Experiencing only a meagre amount of
data, a child can acquire a rich information system, which enables it
to produce and understand even those sentences it never came across
before. This linguistic issue is regarded by Chomsky as a version of
"Plato's Problem" as presented in the Meno via a thought experiment to
show how geometric principles are acquired. While Plato explains the
geometric knowledge of the untutored slave boy in terms of a cognitive
state of the boy's "pre-existence", Chomsky explains the knowledge of
language by appeal to the language learner's innately given physical
condition, that is, his biological endowment.
Chomsky is not satisfied with the term 'language learning' as it is
usually imbued with the behaviourist connotation that learning a
language is basically a matter of training or 'moulding' by external
factors. To him nature is of far more significance than nurture.
Experiences merely trigger the biological development of the innate
language faculty. So the word 'growth' is, Chomsky suggests, more
appropriate than the word 'learning'. (45)
Let us begin with the 'primary linguistic data' (PLD) relevant to
language acquisition, that is, the experience of a particular language
to which a child is exposed, especially in its pre-school days. One
should not confuse PLD with the data available to the linguists. The
PLD include mainly the limited number of sentences or pseudo-sentences
that a child hears from its parents or caretakers, some being
incomplete, ill-formed, slips of the tongue, false starts, etc. This
set of meaningful, structured, sounds (and gestures) in daily life
situations only play some triggering and shaping roles in the natural
process in which the child attains its language competence. Without
an innate mechanism, that is, the 'language faculty', the language
learner can hardly extrapolate or induce from the impoverished and
degenerate stimuli a language which is so rich and complex in terms of
its form and meaning. (46) Moreover, what enables a child to learn how to
speak is, in Chomsky's view, by no means a certain 'general(ized)
learning mechanism', which includes processes such as trial and error,
conditioning, association, induction, making hypotheses, confirmation,
making analogy and abstraction, etc., exercising 'multi-purpose
learning strategies' on the 'general intelligence' - though such kinds
of mechanisms may also be innate. He firmly believes that the mental
capacities responsible for our acquisition (and use) of a language
must be distinguished from those for learning tasks or activities such
as playing chess, physics and history.
Characteristic of the human species, the 'language faculty'
consists of a unique, built-in, 'language acquisition device'
(LAD), which does not vary across cultures. Almost every
individual is genetically endowed with a LAD, a mechanism
"which effects a transition from the initial state [S0] of the
language faculty to later states, mapping experience to state
attained". (47) UG is concerned with the biologically given and
unique condition of language acquisition and thus is the
"essence of human language". (48) Language acquisition is
understood as fixing these biologically-constrained options
(parameter setting) on the basis of invariant, innate,
principles and of an exposure to a linguistic environment.
1.5.2 Lack of negative evidence
Suppose a child grows up in an English-speaking community. Then the
PLD that he/she receives enable him/her to 'turn on' the head-first
parameter and to 'turn off' the pro-drop parameter. English, unlike
Japanese, requires the verbs, prepositions, adjectives and nouns (that
is, heads) to be put in front of their complements, (49) and unlike
Italian, requires a sentence to have a subject obligatorily, except in
casual speeches or diaries. (50) , (51) It is believed that the parameters
are almost binary ('on' or 'off'), that is, with only two options of
variation. This idea, that UG highly restricts the range of
hypothetical grammars for testing in language acquisition (and yet is
compatible with the properties of all human languages), is sustained
by the fact that what is available for a language learner is
overwhemingly positive evidence. Chomsky distinguishes three kinds of
evidence that the learner might get: (i) positive evidence, (ii)
direct negative evidence, and (iii) indirect negative evidence. By
'positive evidence' Chomsky means the correct expressions pertaining
to a particular language, which indicates, in particular, word order
(like the SVO pattern of English) and irregular verbs, etc. 'Direct
negative evidence' refers to explicit remarks on the incorrectness of
the structure or words employed in linguistic performance, on the
occasions where an adult corrects his/her own or the others',
especially children's, linguistic mistakes. As for 'indirect negative
evidence', it is supposed to be there when, say, a linguistic
principle or rule hypothesised by a child, is not instantiated in even
very simple sentences that he/she hears. (52)
If UG is not so restrictive as the P-P approach assumes, then the
child will much more likely overgenerate. And the 'shrinking' back
from the false grammars that he/she hypothesises to the target
language relies tremendously on either (ii) direct negative evidence
(that is, corrections by adults) or (iii) indirect negative evidence
(that is, the non-occurrence of certain ungrammatical structures).
But (ii) is not prominent in ordinary language-learning setting.
Whereas (iii) is incompatible with the relatively fast and easy
character of first language learning, for this kind of evidence
depends on an exposure of an exceedingly great amount of linguistic
experience - otherwise, the child could not be sure whether the
non-occurrence of a structure is ungrammatical.
The nativist conception of language that Chomsky advocates is strong,
for it suggests that UG is highly restrictive, so restrictive that
mere positive evidence, which depends only on observation of a few
instances, can play a decisive role in attaining some aspects of a
grammar. When the language learner is exposed to a limited number of
head-first expressions, he/she can automatically set the relevant
parameter, since it is innately determined that if a language is
head-first, then it is not head-last - and there are no other
alternatives. (53)
1.6 Natural language: abstract yet biologically determined
I have shown in the previous sections how Chomsky explains the various
linguistic phenomena by reference to the language faculty of the human
mind. It is claimed that every person inherits a human genome
providing 'instructions', which specify the growth of the brain in
such a way that capacities of acquiring, mastery and use of a language
are formed. And UG and I-language should reflect a certain physical
reality of a language learner or user. Regarding the idea that there
should be some biological endowments determining the way linguistic
information is learned, stored and accessed for use, there seems to be
no problem. But can the possession of I-language be identified with
some facts of the mind/brain?
Chomsky is accused of committing a conceptual confusion in taking the
I-language to be an "abstract entity" and yet "some element of the
mind of the person who knows the language". (54) The charge is that
Chomsky conflates what is known with knowing. The former is the
object of knowledge; the object itself is not necessarily mentally
represented or something necessarily related to the cognitive system
of the mind/brain, whereas the latter must be concerned with the
mind/brain, being a state or process of it. In addition, the comment
continues, when we say Jones knows a language, we do not mean that
what he knows are some components of his mind or brain, but something
of an abstract nature whose identity does not depend on any cognition
about it. (55)
But what is this abstract entity? Is it a mathematical object of a
Platonic kind, which is non-spatial and non-temporal, independent of
the mind and the physical world? (56) Chomsky has made it clear that if there were such a
language, there must be some fact about it; yet we cannot find any
linguistic fact that is extra to those concerning the human mind. (57)
Furthermore, human language has its uniqueness. It is distinguished
from other symbolic systems, thanks to its nonredundancy feature:
"for example, FI is not observed in standard notations for
quantification theory that permit vacuous quantifiers in well-formed
expressions, as in [i], which is assigned the same interpretation as
[ii]:
- [i] (Ax) (2+2=4)(for all x, 2+2=4)
- [ii] 2+2=4
But FI is a property of natural language." (58)
Another crucial feature of natural language is that it involves
"displacement". Analysing a sentence's structure, we need to
distinguish two representation levels, namely, LF and PF. A word's
position in the thematic structure of a sentence is usually not
matching its position in the phonological structure of the same
sentence. Wh-questions in English are obvious examples. (59)
In saying that I-language is an abstract entity, Chomsky does not
presuppose a Platonic World. It is abstract because it is not
described in physical (or physiological) terms. A discipline may
characterise or study the human brain at different abstract
levels. Chomsky's linguistics of the computational-representational
approach is an example; another example, with different degree of
abstractness, is neural net theory. (60) Both are theories of the human
mind. There should be no conceptual problems, it seems to Chomsky, in
calling I-language an 'abstract entity' "beyond those familiar in
discourse involving theoretical entities". (61) The mind is composed of
'abstract entities' in this sense, I-language being one of them.
According to Chomsky, the mental under naturalistic inquiry, like the
chemical, electrical, optical and so on, is just a part of
nature. Linguistic theories play an important part in guiding the
neuroscientists' research on the material basis of language. Chomsky
writes, "Just as nineteenth-century science provided essential
guidelines for the physics of the subsequent period, so the study of
mind should serve as a guide for the brain sciences of the future,
exhibiting the properties and conditions that must be satisfied by the
mechanisms of the brain, whatever they turn out to be." (62) Though so
far little has been known concerning the link between theoretical
linguistics and neurosciences, that is, concerning the physical
realisation of a linguistic description or explanation, he believes,
in the long run, these scientific researches can be integrated, in the
same way as the unification of chemistry and physics. (63)
1.7 Concluding remarks
Chomsky highlights the biological constraints of natural language.
Language acquisition is determined by a principles-and-parameters
schema (UG), which is genetically endowed to human beings. This
schema is thus also the basis of linguistic competence, which means
tacit knowledge of (i) a limited number of economy principles of
derivation and representation of CHL and (ii) a lexicon. Linguistic
experience involved in attaining SL consists in to a large measure
mastering (ii), and is in effect a process of parameter-setting. What
is known (that is, I-language) in Chomsky's conception, is not an
infinite number of well-formed sentences conceived as objects
independent of the mind/brain, namely, E-language, but the
internalised mechanism which derives optimal representations of LF and
PF. FI, which requires that there not be any uninterpretable LF and
PF representations, is in fact a legibility condition imposed by the
other two systems in the mind/brain, C-I and A-P, with which CHL
interacts to produce meaning and sound.
Back to Chomsky for Philosophers
Last modified: Wed Mar 27 13:30:13 Eastern Standard Time 2002