Natural Language Processing
Natural Language Processing
Processing
By Abhishek Saini
Lecture Outline
• What is Natural Language Processing?
• The purpose of this phase is to draw exact meaning or you say dictionary
meaning from the text.
• Syntax analysis check the text for meaningfulness comparing to the rules
of the grammar.
Syntactical Analysis
Syntactical Analysis
Syntactical Analysis
• Ambiguity problem: one sentence may have many possible parsing
trees
• Vietnamese language processing (VNLP) still lacks accurate syntax
parsers (in my understanding)
− Accuracy about 78 ~ 84%
Approach to Syntactical Analysis
• Top-down parsing
• Bottom-up parsing
• Dynamic programming methods
− CYK algorithm
− Earley algorithm
− Chart parsing
• Probabilistic Context-Free Grammars (PCFG)
• Assign probabilities for derivations
Semantic Analysis
• Two levels
Lexical semantics
-Representing meaning of words
− Word sense disambiguation (e.g., word bank)
• Compositional semantics
− How words combined to form a larger meaning.
Meaning Representations
• First order predicate calculus
• E.g., Maharani serves vegetarian food. => Serves(Maharani,
vegetarian food)
• E.g., I only have five dollars and I don’t have a lot of time =>
Have(Speaker, FiveDollars) ∧ ¬Have(Speaker, LotOfTime)
Syntax-driven Semantic Analysis
Some Applications
• Information Retrieval
• Information Extraction
• Question Answering
• Machine Translation
Information Retrieval
• Query: “list of good sushi restaurants in kyoto?”
Architecture of an ad hoc IR system
Information Extraction
• To extract from unstructured text, information which pre-specified or
pre-defined in templates − Fill a number of slots/attributes
• Example: use template [PERSON, go, LOCATION, TIME] to extract
information about the destination of an individual goes. − “President
Obama went to Hanoi yesterday. − [PERSON = “President Obama”, go,
LOCATION = “Hanoi”, TIME = “yesterday”]
Question Answering
• A system that automatically return answers for an user’s question by
retrieving information from a collected documents.
• Differences from information retrieval system:
• − QA system’s goal is to respond exact answer instead of documents
related to users’ question.
• Q: who did invent the internet?
• A: Robert E. Kahn and Vint Cerf.
• − QA system requires more complicated semantic analysis
Question Answering
Machine Translation
• The use computer to automatic some or all of the process of
translating one language to the other one.
• Fully automatic machine translation is one of the most challenging
and hot topic in NLP.
• Recent advances of Deep Learning raise the trend of Neural Machine
Translation.
Thanks
End of Session!