2 - Unit - 1 - Find Structures of Words
2 - Unit - 1 - Find Structures of Words
• For Example :
• The rules and limitations governing how words are structured and
formed based on their pronunciation are explained and defined by
the field of linguistics known as phonology.
• Words are perhaps the most intuitive units of language, yet they
are in general tricky to define.
Finding the Structure of Words
• Knowing how to work with them allows, in particular, the
development of syntactic and semantic abstractions.
• The minimal parts of words that deliver meaning to them are called
morphemes.
• For Example - Will you read the newspaper? Will you read it? I won’t
read it.
• The lexical or syntactic units blur into one compact string of letters
and no longer appear as distinct words.
• By the term word, we often denote word with not just the one
linguistic form in the given context but also as the concept behind
the form and the set of alternative forms that can express it.
• Such sets are called lexemes or lexical items and they constitute
speech.
• for instance, the nouns receiver and reception are derived from the
verb to receive.
• For Example - Did you see him? I didn’t see him. I didn’t see anyone.
• The lexeme to see would be inflected into the form saw to reflect its
grammatical function of expressing positive past tense.
2. Morphemes
• Morphology is the study of the structure of words i.e the way words
are built up with smaller and minimal units of meaning which are
termed as morphemes.
• For Example
• Played = play ed
• Cats = cat s
• The word played has two morphemes: play ‘word’ and ed ‘plural
marker’.
Words and Their Components
• The word cat has two morphemes: cat ‘word’ and s ‘plural marker’.
ii. Suffixes follow the stem: -ed, -s, un-, -ly, etc.
Words and Their Components
• Affixes are called bound morphemes as they cannot occur on their own
and must combine with a root/stem.
• Example
Lemma Singular Plural
cat cat cats
knife knife knives
• The process through which the new words are formed by adding an affix
to an existing word is called derivation. Example – inter+national =
international.
Words and Their Components
• Unlike inflection, derivation often leads to change in the category.
• For Example –
B. Synthetic languages
i. Agglutinative languages
• The phenomenon where certain words or word forms does not follow
regular patterns or rules interms of their morphology or syntax.
• Words forms that can be understood in multiple ways out of the context.
• Word forms that look same but have distinct functions or meaning also
called as homonyms.
• For Example
For Example - John went to the store, and he bought some groceries.
3. Productivity
• From googol, an unknown word google was generated and by using some
productivity rules, a new words are generated like googling, googlish,
googleology etc.
Morphological Models
• There are many possible approaches to designing and
implementing morphological models.
1. Dictionary Lookup
2. Finite-State Morphology
3. Unification-Based Morphology
4. Functional Morphology
Morphological Models
1. Dictionary Lookup
a. Accuracy
b. Transparency
a. Limited Coverage
b. Ambiguity Handling
c. Resource-Intensive
Morphological Models
2. Finite-State Morphology
• The two most popular tools supporting this approach, XFST (Xerox
Finite-State Tool) and Lex Tools.
• In such a network or graph, nodes are also called states, while edges
are called arcs.
Morphological Models
• Traversing the network from the set of initial states to the set
of final states along the arcs is equivalent to reading the
sequences of encountered input symbols and writing the
sequences of corresponding output symbols.
• Following example shows the FST state diagram for the input
words and their corresponding morphological parsed output or
morphological parsing.
Morphological Models
Input Input Morphological parsed
output
•Let us have a relation R, and let us denote by [Σ] the set of all
sequences over some set of symbols Σ.
𝑅 ∷[∑]՜ Σ
𝑅 ∷ 𝑆𝑡𝑟𝑖𝑛𝑔 ՜ 𝑆𝑡𝑟𝑖𝑛𝑔
1. Feature Structures:
• For Example :
For Example :
Word: "cats"
Feature Structures:
{ { base: "cat", pos: "noun", number: "singular"},
{ base: “ ”, pos: "plural marker", number: "plural"}}
Morphological Models
4. Morphological Analysis:
• For Example :
• For Example:
• Morphological Rules:
Apply "-s" to indicate plural.
• Ambiguity Representation:
• Past Tense Verb: {base: "see", pos: "verb", tense: "past"}
• Noun (Tool): {base: "saw", pos: "noun"}
Morphological Models
4. Functional Morphology
• 𝑳 ∷ 𝒄𝒐𝒏𝒕𝒆𝒏𝒕 ՜ 𝒍𝒆𝒙𝒆𝒎𝒆
Morphological Models
•Many functional morphology implementations are embedded in
a general-purpose programming language.
•For instance