Natural Language Processing and Machine Learning Basics
An Introduction to Natural Language Processing and Machine Learning
Karthik Sankar
Department of CSE
NIT Trichy
Computers could be far more useful if they could read our email, do our
library research, and chat with us. All of these things involve dealing with
natural language.
Computers are pretty good at dealing with machine languages that are made for
them, but not so good with human languages.
“Look. The computer just can't deal with the kind of stuff that humans produce,
and how they naturally interact”
We’re exploiting human cleverness rather than working out how to have
computer cleverness.
Definition
NLP is a field of computer science and linguistics concerned with the interactions
between computers and human (natural) languages
Categories
Phonology
Modeling the pronunciation of a word as a string of symbols – PHONES
Articulatory Phonetics: How phones are produced as the various organs in the
mouth, throat and nose modify the airflow from the lungs.
Can
Chair
Coach
Syllables
Morphology
Identification, analysis and description of the structure of words.
Inflections
Number: dog/dogs, goose/geese
Tense: hunt/hunted
Case: he/his, she/hers
Gender
Person
Word Formation
mother-in-law
hot dog
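The inflection examples above can be sketched as a toy analyzer. This is only an illustration, not a real morphological analyzer: the `IRREGULAR` table, the suffix rules and the feature names are assumptions made for this example, and the rules will mis-analyze many English words.

```python
# Toy morphological analyzer (illustrative assumption, not a full system).
# Irregular forms are looked up; regular forms are handled by suffix stripping.
IRREGULAR = {
    "geese": ("goose", "Number=Plural"),
}

def analyze(word):
    """Return (stem, feature) for a single English word form."""
    word = word.lower()
    if word in IRREGULAR:
        return IRREGULAR[word]
    if word.endswith("ed") and len(word) > 3:
        return (word[:-2], "Tense=Past")
    if word.endswith("s") and not word.endswith("ss") and len(word) > 2:
        return (word[:-1], "Number=Plural")
    return (word, "Base")

for form in ["dogs", "geese", "hunted", "dog"]:
    print(form, "->", analyze(form))
```

Irregular forms such as goose/geese have to be listed explicitly, which is why the lookup table is consulted before the suffix rules.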
Syntax
Part of Speech Tagging
Noun
Verb
Adjective …
Semantics
Understanding and representing the meaning
having
Has(Ram, book)
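A minimal sketch of how a sentence might be mapped to a representation such as Has(Ram, book). The input sentence "Ram has a book", the pattern, and the helper names `to_predicate` and `show` are assumptions for illustration; real semantic analysis needs far more than one pattern.

```python
import re

def to_predicate(sentence):
    """Map 'X has a Y' to the tuple ('Has', X, Y); return None otherwise."""
    match = re.fullmatch(r"(\w+) has (?:a|an|the) (\w+)\.?", sentence.strip())
    if match is None:
        return None
    return ("Has", match.group(1), match.group(2))

def show(pred):
    """Render a predicate tuple in the Has(Ram, book) notation."""
    name, *args = pred
    return f"{name}({', '.join(args)})"

print(show(to_predicate("Ram has a book")))  # Has(Ram, book)
```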
Ambiguity
Adjective attachment: which of the two nouns do the adjectives modify?
“pretty little girls' school”
Ambiguity - contd
Fed raises
The raises we received were small
Resolving Ambiguity
Probabilistic Parsing
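One way to picture probabilistic parsing: each grammar rule carries a probability, the score of a parse tree is the product of the probabilities of the rules it uses, and the highest-scoring tree is preferred. The sketch below scores two readings of "Fed raises". The grammar and every number in `RULE_PROB` are made up for illustration; a real parser would estimate them from annotated data.

```python
# Hypothetical rule probabilities (assumed values, for illustration only).
# For each left-hand side the probabilities sum to 1.
RULE_PROB = {
    ("S", ("NP", "VP")): 0.8,
    ("S", ("NP",)): 0.2,
    ("NP", ("N",)): 0.7,
    ("NP", ("N", "N")): 0.3,
    ("VP", ("V",)): 1.0,
    ("N", ("Fed",)): 0.5,
    ("N", ("raises",)): 0.5,
    ("V", ("raises",)): 1.0,
}

def tree_prob(tree):
    """Probability of a tree given as (label, child, ...); leaves are strings."""
    label, *children = tree
    rhs = tuple(c if isinstance(c, str) else c[0] for c in children)
    p = RULE_PROB[(label, rhs)]
    for child in children:
        if not isinstance(child, str):
            p *= tree_prob(child)
    return p

readings = {
    "raises as verb": ("S", ("NP", ("N", "Fed")), ("VP", ("V", "raises"))),
    "raises as noun": ("S", ("NP", ("N", "Fed"), ("N", "raises"))),
}
best = max(readings, key=lambda name: tree_prob(readings[name]))
print(best)
```

With these assumed numbers the verb reading scores higher, so it is the one the sketch selects.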
Perceptions
Perception provides agents with information about the world they inhabit.
A sensor is anything that can record some aspect of the environment and pass it
as input to an agent program.
The sensor could be as simple as a one-bit sensor that detects whether a switch
is on or off or as complex as the retina of the human eye, which contains more
than a hundred million photosensitive elements
Image processing
Computer Vision
Speech recognition
Facial recognition
Object recognition
Applications
Information retrieval & Web Search
Information retrieval (IR) is the science of searching for documents,
for information within documents, and for metadata about documents, as well
as that of searching databases and the World Wide Web.
Information Extraction
Information extraction (IE) is a type of information retrieval whose goal is to
automatically extract structured information, i.e. categorized and contextually
and semantically well-defined data from a certain domain, from
unstructured machine-readable documents
Question Answering
From typing in keywords to asking questions in natural language.
From a list of documents as the response to an extracted or generated answer.
Text Summarization
The process of distilling the most important information from a source to
produce an abridged version.
Machine Translation
Use of computer software to translate text or speech from one natural
language to another.
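As a rough illustration of the information retrieval item above, the sketch below builds an inverted index (term to set of document ids) and answers a query by intersecting the sets of its terms. The three documents, the whitespace tokenization and the function names are assumptions for this example; real search engines add ranking, stemming and much more.

```python
from collections import defaultdict

def build_index(docs):
    """Map each lowercased term to the set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def search(index, query):
    """Return ids of documents that contain every term of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result

# Made-up document collection for the example.
docs = {
    1: "machine translation of natural language",
    2: "speech recognition and synthesis",
    3: "natural language question answering",
}
index = build_index(docs)
print(sorted(search(index, "natural language")))  # [1, 3]
```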
November 11, 2009 Department of CSE, NIT Trichy
Artificial Intelligence
Applications
Speech - recognition & synthesis
Deriving a textual representation of a spoken utterance
Natural Language understanding and generation
An NLG system is like a translator that converts a computer-based
representation into a natural language representation.
Human - Computer Conversation
Dialogue between humans and computers using natural language.
Text Generation
A method for generating sentences from “keywords” or “headwords”.
Handwriting recognition
Ability of a computer to receive and interpret intelligible handwritten input
from sources such as paper documents, photographs, touch-screens and other
devices
Machine Learning
The ability to learn
A system can improve its problem solving accuracy (and possibly efficiency) by
learning how to do something better
Symbolic
Explicitly represented Domain knowledge
Supervised
Labeled training data is available
Unsupervised
Labeled training data is not available; the system finds structure in the data on its own
Reinforcement
Concerned with how an agent ought to take actions in an environment so as to
maximize some notion of long-term reward
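To make the supervised/unsupervised contrast concrete, here is a small sketch on one-dimensional data. The supervised part uses labels to compute one centroid per class; the unsupervised part receives the same points without labels and splits them into two groups (k-means with k = 2). The data, the label names and the function names are assumptions for illustration, and symbolic and reinforcement learning are not covered by this example.

```python
def train_centroids(points, labels):
    """Supervised: average the points belonging to each label."""
    sums, counts = {}, {}
    for x, y in zip(points, labels):
        sums[y] = sums.get(y, 0.0) + x
        counts[y] = counts.get(y, 0) + 1
    return {y: sums[y] / counts[y] for y in sums}

def predict(centroids, x):
    """Assign x to the label whose centroid is nearest."""
    return min(centroids, key=lambda y: abs(centroids[y] - x))

def two_means(points, iterations=10):
    """Unsupervised: split unlabeled points into two clusters (1-D k-means)."""
    a, b = min(points), max(points)
    left, right = list(points), []
    for _ in range(iterations):
        left = [x for x in points if abs(x - a) <= abs(x - b)]
        right = [x for x in points if abs(x - a) > abs(x - b)]
        if not right:
            break
        a, b = sum(left) / len(left), sum(right) / len(right)
    return sorted(left), sorted(right)

# Made-up data: the same points, once with labels and once without.
points = [1.0, 1.5, 2.0, 8.0, 8.5, 9.0]
labels = ["negative"] * 3 + ["positive"] * 3

centroids = train_centroids(points, labels)
print(predict(centroids, 7.0))   # positive
print(two_means(points))
```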
Knowledge acquisition
Analogy
[Figure: positive and negative examples]
Generalization Operations
Color(ball, red)
generalizes to Color(X, red)
Covering
p covers q
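The generalization step above (replacing a constant with a variable) and the covering relation can be sketched as follows. Two assumptions are made here that the slides do not state: arguments starting with an uppercase letter are treated as variables (the Prolog convention), and "p covers q" is taken to mean that q can be obtained from p by binding p's variables consistently. The function names are hypothetical.

```python
def is_var(term):
    """Assumed convention: terms starting with an uppercase letter are variables."""
    return term[:1].isupper()

def generalize(literal, constant, var):
    """Replace every occurrence of a constant with a variable."""
    pred, *args = literal
    return (pred, *[var if a == constant else a for a in args])

def covers(p, q):
    """True if q is an instance of p under a consistent variable binding."""
    if p[0] != q[0] or len(p) != len(q):
        return False
    binding = {}
    for a, b in zip(p[1:], q[1:]):
        if is_var(a):
            if binding.setdefault(a, b) != b:
                return False
        elif a != b:
            return False
    return True

specific = ("Color", "ball", "red")
general = generalize(specific, "ball", "X")
print(general)                     # ('Color', 'X', 'red')
print(covers(general, specific))   # True
print(covers(specific, general))   # False
```

Under these assumptions Color(X, red) covers Color(ball, red) but not the other way round, which matches the direction of generalization.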
Thank you