Natural Language Processing

Natural Language
Processing
Frank Bergman
Mike Gallagher
Jason Firestone
Barcelona, October 19th, 2001
Stacked Layers of Language
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Semantics
Syntax
Syntax
Speech
Speech
Infrastructure
Voice
Recognition
Syntax
Parsing
Semantic
Analysis
Translation/
Processing
Generation
Voice
Generation
Ramona
Philips Genie
IBM Via Voice
BabelFish, Systran, other translators
Verbmobil, Train Information Systems
SAM,
AT&T Synthesizer
Verbmobil: Speech2Speech Translation
Verbmobil Video
http://verbmobil.dfki.de/overview-us.html
Speech: Audio signals
„I am
hungry“
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Semantics
Syntax
Syntax
Speech
Speech
“I am hungry”
Audio Signals
Syntax: Sentence structure
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Semantics
Syntax
Syntax
Speech
Speech
text=„I am hungry“
type=noun_phrase;
text=„am“
type=verb;
time=pres;
pers=1st;
infinitive=„be“;
verb=
subject =
text=„I“
type=pronoun;
case=nom;
pers=1st;
adverb =
text=„hungry“
type=adjective;
case=nom;
pers=1st;
Grammar Formalisms: DCG, HPSG, …
Semantics: „Meaning“ and Logics
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Semantics
Syntax
Syntax
Speech
Speech
x.self(x)  hungry(x)
Logic Formalisms: FOL, Lambda Calculus, …
Pragmatics: The „world“ around us
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Person
Semantics
Syntax
Syntax
Speech
Speech
instance
I
has_property
has_part
has_value
State
desire
Body
instance
Body
has_property
State
eat
has_value
full
Food
empty
Frames, Semantic Networks, Description Logics, …
„Thoughts“: What we „feel“
Person A
Person B
„Thoughts“
„Thoughts“
Pragmatics
Pragmatics
Semantics
Semantics
Syntax
Syntax
Speech
Speech
„I am
hungry“
Markets
Speech
Input
Automatic
Translation
Players
• Philips
• IBM
• Lernout & Hauspie
• AT&T
Dialog
Systems
Market Conditions
• High entrance barriers
• Low margin products
Commercial Impact
The generated value is very low
The technology doesn‘t work yet
The Future
Current:
Low margin add-on to operating systems
Automatic translations have bad quality
Next 5 Years:
Low margin add-on for mobile devices
Translation quality will slightly enhance
Dialog systems becoming more common
Next 20 Years:
Speech processing will become ubiquous in
public life, but no „real intelligence“
Translation quality will slightly enhance
Links
Verbmobil: http://verbmobil.dfki.de/
AT&T Demo: http://www.naturalvoices.att.com/demos/
Philips Speech Processing:
http://www.philipsspeechprocessing.com/
Babelfish http://www.babelfish.com/
Lernout & Hauspie http://www.lhsl.com/
Ray Kurzweil AI Net: http://www.kurzweilai.net/
ALICE: http://www.alicebot.org