2013-04-09 slides - Goh Kawai

Spoken language corpora
Course overview
goh kawai
2013-04-09 tue1 week1
spoken language corpora
s316
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
goh do this for tue1
l
l
l
l
bring and connect laptop, projector, network,
bluetooth speaker, clicker
arrange desks, chairs
show these slides, my website, glexa
circulate roster sheet
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
make roster
l write
l full name
l furigana
l email address
pass sheet
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
informed consent
l your speech and actions may be recorded,
archived and, without revealing your identity,
used and made public for research and
education purposes
l if you disagree, I will neither record nor retaliate
l 学生の言動を録音し、保存し、匿名としたうえで研
究と教育のために利用したり公開する可能性があ
る
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
contact info
l
l
l
office: office building room s304
email: [email protected]
web: goh.kawai.com
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
goh's website
l
l
http://goh.kawai.com/
http://goh.cll.hokudai.ac.jp/
l
l
identical content
hokudai site may be faster
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
instructor
l
Goh Kawai (河合 剛 かわい ごう)
l born in Tokyo, raised in Toronto
l came to Sapporo in 2003-04
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
goh’s academic background
l
l
l
l
Univ of Tokyo
l BA linguistics, 1984
ICU
l MA educational technology, 1986
Stanford Univ
l linguistics (dropout)
Univ of Tokyo
l PhD information and communication
engineering, 1999
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
goh’s vocational background
l
l
l
l
l
l
Xerox Palo Alto Research Center
Palo Alto, CA
SRI International
Menlo Park, CA
University of Tokyo
Tokyo, Japan
University of California Santa Cruz
Santa Cruz, CA
Oregon Health & Science University
Beaverton, OR
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
goh’s interests
l
l
research
l spoken and written language
processing technology applied to
language learning
personal interests
l flying, kayaking, cycling, snowshoeing,
amateur radio, sado (way of tea)
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
office hours
l
l
drop-in or email for appointment
l no phone calls
off campus
l see my website
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
class periods
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
grad school catalog blurb
担当分野/マルチメディア言語情報処理論
研究領域、学歴(言語学学士、教育学修士、電子情報工学博士)
、職歴(研究所2社、大学4校)、業績一覧、所属学会、授業資料、
教え子の匿名コメント(全ての学部授業)などをwebに掲載。メー
ルで面会予約。電話不可。私の評価を元指導生に直接たずねる
とよい。
l
言語情報処理、教育工学☆領域 言語学と情報処理技術を利用
した非母語学習。☆手法 学習システムや教材を制作し、学習効
果を定量的に評価する。☆指導方法 協同プロジェクトを共著論
文にまとめる。☆修士条件 査読のある国際会議で論文発表。☆
博士条件 後進の研究指導。☆指導生の発表先 音響学会、音声
学会、教育工学会、ASA, AAAL, Calico, Eurocall, Interspeech
など。03:20 utc [email protected] http://goh.kawai.com/
updated 2013-04-07
l
l
alumni
l
平野宏子
l
歌代崇史
l
l
l
三角美樹
壽崎尚美
片桐徳昭
updated 2013-04-07 03:20 utc
東京大学 博士(科学)
東北師範大学
東京工業大学 博士(工学)
北海学園大学
札幌開成高校
北海道立高校
札幌開成高校、博士(学術)見込
[email protected]
http://goh.kawai.com/
undergraduate education
l
english language for freshmen
l online course
l instructor-led courses
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
english online
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
instructor-led course
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
pronunciation lunch
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
spoken language corpora course
l acquire a specific practical skill
l not theory
l lots of out-of-class work
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
objectives
l
l
re: spoken language corpora, explain:
l basic concepts (definitions, features)
l uses (analysis, engineering, learning)
l design and development strategies
re: speech analysis, perform:
l design and collect corpus
l label and analyze speech
l interpret analyses
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
prerequisites
l phonetics and phonology
l sound system of English and/or Japanese
l IPA desirable
l audio input and output using computers
l bring your laptop (Linux, Windows, Mac)
l statistics
l mean, standard deviation
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
format of each class period
l
l
l
l
explain concepts and theory
collect and analyze speech
l learn software tools
l transcribe and analyze
l design corpus
learn about research and academia
explain next week's assignment
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
grading
l
l
discussion and project
100%
essential
l participate in discussion during class
l propose and report your project
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
schedule
date
activity
date
1
2013-04-09
install software
9
2
2013-04-16
transcribe speech
10
2013-0618
propose project
3
2013-04-23
record read speech
11
report progress
4
2013-05-07
record spontaneous
speech
2013-0625
12
2013-0626
report progress
5
2013-05-14
design L1 script
6
2013-05-21
design L1 script
7
2013-05-28
design L2 script
8
2013-06-04
activity
wk
wk
13
2013-0702
report project
14
2013-0709
report project
15
2013-0716
critique
design L2 script
16
updated 2013-04-07 03:20 utc
2013-06-11 propose project
2013-0723
probably no class
(make up day)
attendance mandatory
[email protected]
http://goh.kawai.com/
courseware
l
l
l
l
everything online
l reading material
l lecture notes (including this presentation)
http://goh.kawai.com/
http://goh.cll.hokudai.ac.jp/
hokudai library catalog of our course's
textbooks
l view online course offering (シラバス)
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
Praat
l
l
l
l
l
l
http://www.praat.org/
built by researchers and engineers in
linguistics and speech processing
updated frequently
good support base
Windows, Mac, Linux
free
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
what can Praat do?
l record and play speech
l display waveforms, spectrograms, pitch and
more
l label speech at various levels
l phone, mora, syllable, word, phrase and
utterance levels
l SIL fonts
l Praat in action
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
demo
l
l
view praat
l time waveform
l spectogram
l spectral slice
sound sources show praat
l vowels
l consonants
l pure tones (sinusoids)
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
readings
l
Jurafsky et al (2000) chapter 4
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
next week
l
l
install Praat
TIMIT sentences
l download from my website
l extract speech files from archive
l read files into Praat
l play speech
l view waveforms and spectograms
l label at the word level
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
slideshow
l
if there's time
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
one-stop website
l
http://goh.kawai.com/
l
l
l
link to glexa
course material (these slides)
contact form
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
see you next week!
updated 2013-04-07 03:20 utc
[email protected]
http://goh.kawai.com/
mailto:[email protected]
http://goh.kawai.com/