From Wikipedia, the free encyclopedia
- The word lemma has a different meaning in
logic and in
mathematics.
In
linguistics a lemma is a form of a word, although its
usage depends on the area of linguistics in which it is used.
In
lexicography "lemma" refers to a headword or heading in any
kind of dictionary, encyclopaedia, or commentary. So, for
example, the lemma of this article is "Lemma (linguistics)". The
term is derived from the practice in Greco-Roman antiquity of
using the word to refer to the headwords of marginal glosses in
scholia; for this reason, the
Ancient Greek plural form is sometimes used, namely
lemmata (Greek λῆμμα, pl. λήμματα).
In
morphology, a lemma is the canonical form of a
lexeme. Lexeme, in this context, refers to the set of
all the forms that have the same meaning, and lemma
refers to the particular form that is chosen by convention to
represent the lexeme. Lemmas have special significance in highly
inflected languages such as
Czech. In this sense, a lemma can also be called a
citation form.
In
psycholinguistics, the terms lemma has a more
restricted use: it is an abstract form of a word that is used in
speech production. In the best accepted psycholinguistic models,
speech production has several stages, and the lemma
occurs after the word has been selected mentally, but before any
information has been accessed about the sounds in it (and thus
before the word can be pronounced). It therefore contains
information concerning only
meaning and the relation of this word to others in the
sentence.
|
Contents
-
1
Morphology
-
1.1
Lemmas in different
languages
-
2
Psycholinguistics
-
3
See further
-
4
References
-
5
External links
|
Morphology
In a dictionary, the lemma "go" represents the inflected
forms "go", "goes", "going", "went", and "gone". The
relationship between an inflected form and its lemma is usually
denoted by an angle bracket, e.g. "went" < "go". The
disadvantage of such simplifications is, of course, the
inability to look up a declined or conjugated form of the word,
although some dictionaries, like
Webster's, will list "went". Multilingual dictionaries vary
in how they deal with this issue: the Langenscheidt dictionary
of German does not list ging (< gehen); the
Cassell does.
The form that is chosen to be the lemma is usually the least
marked form. There are significant exceptions; e.g. in
Finnish, the dictionaries lists verbs not under the verb
root, but under the first infinitive marked with -(t)a,
-(t)δ.
Lemmas are used often in
corpus linguistics for determining word frequency. In such
usage the specific definition of "lemma" is flexible depending
on the task it is being used for.
Lemmas in different languages
In English, the citation form of a
noun
is the
singular: e.g. mouse rather than mice. For
multi-word lexemes which contain
possessive adjectives or
reflexive pronouns, the citation form uses a form of the
indefinite pronoun one: e.g. do one's best,
perjure oneself. In languages with
grammatical gender, the citation form of regular adjectives
and nouns is usually the masculine singular. If the language
additionally has
cases, the citation form is often the masculine singular
nominative.
In many languages, the citation form of a
verb
is the
infinitive: French aller, German gehen. In
English we can use either the bare infinitive go or the
full infinitive to go. In Latin and Greek, however, the
first person singular present tense is normally used, though
occasionally the infinitive may also be seen. (For
contracted verbs in Greek, an uncontracted first person
singular present tense is used to reveal the contract vowel,
e.g. φιλέω philιō for φιλῶ philō "I love implying
affection"; αγαπάω agapαō for αγαπῶ agapō "I love
implying regard").
In Arabic, which has no infinitives, the third person
singular of the past tense is the least-marked form, and is used
for entries in modern dictionaries. In older dictionaries, which
are still commonly used today, the
triliteral of the word, either a verb or a noun, is used.
Hebrew often uses the 3rd person masculine qal perfect,
e.g. ברא bara' create, כפר kaphar cover. For
Korean, -da is attached to the stem.
Some phrases are cited in a sort of lemma, e.g. Carthago
delenda est (Carthage must be destroyed) is a common way of
citing
Cato, although he more often said, Ceterum censeo
Carthaginem esse delendam.
Psycholinguistics
When we produce a word, we are essentially turning our
thoughts into sounds (a process known as
lexicalisation). In many psycholinguistic models this is
considered to be at least a two-stage process. The lemma is thus
intermediate between the
semantic level (where
meaning is specified) and the
phonological level (where the sounds of the word are
specified). It is an abstract form containing
syntactic information (about how the word can be used in a
sentence), but no information about the pronunciation of the
word. In this context, the lexeme is the phonologically
specified form that is selected after the lemma.
This two-staged model is the most widely supported theory of
speech production in psycholinguistics[1],
although it has been recently challenged.[2]
For example, there is some evidence to indicate that the
grammatical gender of a noun is retrieved from the word's
phonological form (the lexeme) rather than from the lemma.[3]
This is easily explained by Caramazza's Independent Network
model, which does not assume a distinct level between the
semantic and the phonological stages (so there is no lemma
representation); in this model, syntactic information about the
word in this model is activated in the semantic or phonological
level (so gender would be activated in the latter).[4]
See further
-
Linguistics
-
Corpus linguistics
-
morphology
-
Psycholinguistics
-
Markedness
-
Principal parts
-
Root (linguistics)
References
- ^
Harley, T. (2005) The Psychology of Language.
Hove; New York: Psychology Press: 359
- ^
e.g. Caramazza, A. (1997) How many levels of processing
are there in lexical access? Cognitive
Neuropsychology, 14, 177-208.
- ^
e.g. Starreveld, P. A. and La Heij, W. (2004)
Phonological facilitation of grammatical gender
retrieval. Language and Cognitive Processes, 19
(6), 677-711.
- ^
Caramazza (1997)
External links
Categories:
Linguistic morphology |
Psycholinguistics