NLP is an interdisciplinary field that uses computational methods to:
Investigate the properties of written human language and model the cognitive mechanisms underlying the understanding and production of written language.
Develop novel practical applications involving the intelligent processing of written human language by computer.
What is NLP? (cont.)
NLP plays a big part in Machine learning techniques:
automating the construction and adaptation of machine dictionaries
modeling human agents' desires and beliefs
essential component of NLP
closer to AI
We will focus on two main types of NLP:
Human-Computer Dialogue Systems
Machine Translation
Human-Computer Dialogue Systems
Usually with the computer modelling a human dialogue participant
Doesn't work well with domains where people want translation the most:
spontaneous conversations
in person
on the telephone
and on the Internet.
Problems
Computers can't deal with ambiguity, syntactic irregularity, multiple word meanings and the influence of context.
Time flies like an arrow.
Fruit flies like a banana.
Accurate translation requires an understanding of the text, situation, and a lot of facts about the world in general.
The box is in the pen.
Problems
The sign is describing a restaurant (the Chinese text, 餐厅, means "dining hall").
In the process of making the sign, the producers tried to translate Chinese text into English with a machine translation system, but the software didn't work, producing the error message,
Successes
Product knowledge bases need to be translated into multiple languages
Hiring a large multilingual support staff is expensive
Machine translation is cheaper and accurate with predictable texts.