Why Can't Computers Use Language As Well As People Do? Victoria Muehleisen Associate Professor Tasks which computers do better than humans Calculating numbers: Computers are faster and more accurate than people. Searching and retrieving information: Computers don‘t “forget” where information is. Recognizing patterns (such as “enhancing” a photograph): Computers can compare many pieces of information very quickly. However, when it comes to language, computers leave much to be desired. Three programs in this Powerbook Spell Checking (in TextEdit) Speech Recognition (with Apple Speakable Items) Translation (with Sherlock) You’ll see that each program does well with things, but poorly with others. Spell Checking It‘s a kind of pattern matching. It checks the words you type against words in its dictionary. Problems with Spell Checking It works fine for words in the dictionary, but a lot of words aren‘t in the dictionary! It usually doesn‘t catch a mistake when the mistake is an actual word (just the wrong word). Demonstration Speech Recognition A computer first must recognize the individual sounds, then the words, then the meanings. Problems with Speech Recognition The program can only recognize a certain set of words, but in natural languages, there are many different ways to express the same message. Demonstration More problems with Speech Recognition Different speakers have different voices and different accents. Even one person sounds different at different times (for example, talking quickly, talking while sick with a cold.) Translation This is one of the most complicated things we ask computers to do with language. To translate, a person or a computer must understand individual words, understand the sentences, and understand the intended meaning in one language. Translation Then the translator must find equivalent words and sentences in the other language. Demonstration Problems with Translation There are many things which make translation difficult, both for people and for computers. We’ll see several more examples in the rest of this lecture. Why can’t computers do better? Of course, computers don‘t do anything with language by themselves. They are programmed by people! So the question really is… Why can‘t people program computers to deal with language as well as people can? In order to program computers to use language … 1. We have to understand the “rules” of language. 2. We have to express those rules in a way that computers can use. People working on this problem Linguists: people who study the rules and patterns of language Computer scientists and applied linguists: people creating programs based on the rules and patterns Understanding the Rules: One Simple Rule of English To form the past tense of most verbs, add -ed to the present tense form. live → lived need → needed We need to list the conditions and exceptions. Some verbs are irregular (wrote, saw), and these must be listed individually in the program. Speech production and speech recognition programs would have to know that -ed has different pronunciations: /d/ in lived, /id/ in needed, and /t/ in slipped. Computers can easily “learn” this kind of simple rule, but... …even with a rule so simple there may be variations that are hard to include in the program. She dived into the lake. (Used in many English speaking countries) She dove into the lake. (Used in some parts of North America) Many rules are more complex. English speakers learn the difference between “the” and “a” at an early age, but we never learn explicit rules. Linguists have tried to describe the rules for using “the” and “a”, but there is still no complete description. A similar example in Japanese You learned how to use “wa” and “ga” at an early age, but there is still no complete set of rules which can be used by adults learning Japanese. Many patterns in language are still not completely understood… …and if we don’t understand the patterns, we can’t write computer programs. An example: Input: お湯がありますか?お茶を飲みたいです。 Sherlock’s translation: Is there a hot water? We would like to drink the tea, is. My translation: Is there any hot water? I want some tea. Another example of rules When translating from Japanese to English or English to Japanese, we need rules to describe the differences in basic word order: Japanese: Subject - Object-Verb 私は、本を買った。 English: Subject - Verb - Object I bought a book. Word order rules in a computer program The difference in word order is something we can easily make a rule for and put into a computer program, as long as the sentence is simple. Sherlock’s translation Input: Cats like mice. Output: 猫がマウスを好む。 But not all sentences are this easy… Consider a sentence with more than one clause: Input: I love skiing, so whenever I get a chance, I do it. Sherlock: 私はするそれをスキー、従って私がチ ャンスを得る時はいつでも、私愛する。 A human translation (my husband’s) 私は、スキが好きです。チャンスがあれば、いつ でもスキをやります。 There are many more difficulties for Sherlock For example, English sentences must always have a subject, but in Japanese, the subject is often not mentioned. What is the subject of this sentence? 雨がすきで、雨が降ると雨をみる。 If I tell you that it is from a autobiographical (自叙)essay, you can easily guess. Human translation(mine): Input:雨がすきで、雨が降ると雨をみる。 Output: I like rain, so when it rains, I watch it. I know from the context that the person who likes rain and the person who watches the rain are both “I”. Sherlock’s translation Input:雨がすきで、雨が降ると雨をみる。 Output: Liking the rain, when the rain falls, you see the rain. My translation and Sherlock’s translation are different because I understand the context of the sentence. Another example of the importance of context (Students of Japanese learn this one in class!) In a restaurant: 「私は、うなぎです。」 「私は、うなぎです。」 My translation: “I’ll have eel.” Sherlock’s: “I am the eel.” Possibility for improvement It is possible to put more information about grammar structure and contexts into computer programs. A better spell checker TextEdit is free software, but a more expensive program might catch mistakes like this one: “Can you sea this mistake?” “See” is a verb, but “sea” is a noun. A better translation program Sherlock is a free program, but a more expensive would probably know that we don’t say “a hot water.” お湯がありますか?⇒ Is there any hot water? A better translation program It might even be possible to make a program that would know that in an autobiographical essay, the subject is often “I”. 雨がすきで、雨が降ると雨をみる。 I like rain, so when it rains, I watch it. But there’s a more serious difficulty… In the widest sense, context includes the whole world, including knowledge of social relationships. Humans can easily use this knowledge, but can computers? An example of social context 優美子:宝くじ一等当たったのよ。 貴子:うそ! Sherlock’s translation Yumiko: Lottery first class it hit, is. Takako: Lie! 「うそ!」≠ “lie!” We know that if Yumiko and Takako are friends, Takako is not calling her friend a liar. We can tell this both by the context and by the pronunciation:「うっそ〜」. A better translation Yumiko: I won first prize in the lottery. Takako: No! I don’t believe it! “No! I don’t believe it!” In this case too, the real meaning is not the same as the literal meaning. Takako does believe what Yumiko said, but she is very surprised. This expression shows her feelings. Real meaning vs. literal meaning It’s easy for humans to learn that the real meaning is not always the literal meaning. Although I never learned this use of うそ in Japanese class in the U.S., I quickly learned it when I got to Japan. However, it’s hard to imagine how a computer program could “learn” this kind of thing. Summary If linguists can write a language rule that does not depend on the context, it’s easy for computer scientists to write a computer program using the rule. But if the rule is not understood well, or if the rule depends on the context, it’s very difficult, or maybe impossible, to write a program using it. In the future, we can expect that computers will get better and better at translating “straight forward” texts (e.g., texts that just give information). However, there are some things that computers might never be able to do. If you want to learn more… Go to the web sites on the handout Japanese web sites about linguistics The Linguistic Society of America (check out the videos!) The Linguist Network Classes at the International College of Waseda CO 201 Introduction to Language Studies CO 202 Introduction to Communication CO 203 Introduction to Application of Language Studies GE 111 Introduction to Computer Literacy(情 報処理入門) You can find the web links and this PowerPoint lecture on my web page: <http://www.f.waseda.jp/vicky/comp-lang.html> Any questions?
© Copyright 2024 ExpyDoc