2012-04-10 course overview

Spoken Language Corpora
2012 spring semester, tue1
elective course for IMCTS graduate students
ITE-SE → S316
make roster
write
full name
furigana
email address
pass sheet
updated 2011-04-11 11:30 utc goh kawai
informed consent
your speech and actions may be recorded,
archived and, without revealing your identity,
used and made public for research and
education purposes
if you disagree, I will neither record nor
retaliate
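(Japanese-language version of the notice above: students' speech and actions
may be recorded, archived and, once anonymized, used or made public for
research and education)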
welcome to boot camp
[figure: dojo-style nameplates; legible terms include 師範 (shihan, master instructor), 師範代 (shihan-dai, assistant instructor), 道場 (dojo) and 剛 (Goh)]
contact info
office: often in jou-kyouiku-kan 3rd floor server room, or staff and student office building S316
email: [email protected]
web: goh.kawai.com
goh's website
http://goh.kawai.com/
http://goh.cll.hokudai.ac.jp/
identical content
hokudai site may be faster
alumni
平野宏子
歌代崇史
三角美樹
壽崎尚美
片桐徳昭
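University of Tokyo, PhD (science)
Jilin Huaqiao Foreign Languages Institute
Tokyo Institute of Technology, PhD (engineering)
Hokkai-Gakuen University
Sapporo Kaisei High School
teaching materials development
Sapporo Kaisei High School, entering a doctoral program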
undergraduate education
english language for freshmen
online course
instructor-led courses
english online
instructor-led course
pronunciation lunch
spoken language corpora course
acquire a specific practical skill
not theory
lots of out-of-class work
objectives
re: spoken language corpora, explain:
basic concepts (definitions, features)
uses (analysis, engineering, learning)
design and development strategies
re: speech analysis, perform:
design and collect corpus
label and analyze speech
interpret analyses
prerequisites
phonetics and phonology
sound system of English and/or Japanese
IPA desirable
audio input and output using computers
bring your laptop (Linux, Windows, Mac)
statistics
mean, standard deviation
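
As a reminder of the statistics prerequisite, here is a minimal Python sketch that computes the mean and standard deviation of a set of segment durations. The duration values are made up for illustration and are not course data. The sketch uses the sample standard deviation (dividing by n - 1), which is one common convention.

```python
import statistics

# Hypothetical vowel durations in milliseconds (made-up numbers, not course data).
durations_ms = [82.0, 95.0, 110.0, 74.0, 89.0]

# Arithmetic mean of the durations.
mean_ms = statistics.mean(durations_ms)

# Sample standard deviation (divides by n - 1).
# Use statistics.pstdev instead for the population standard deviation.
sd_ms = statistics.stdev(durations_ms)

print(f"mean = {mean_ms:.1f} ms, sd = {sd_ms:.1f} ms")
```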
format of each class period
explain concepts and theory
collect and analyze speech
learn software tools
transcribe and analyze
design corpus
learn about research and academia
explain next week's assignment
grading
discussion and project
100%
essential
classroom participation
project
schedule
wk  date        activity
1   2012-04-10  install software
2   2012-04-17  transcribe speech
3   2012-04-24  record read speech
-   2012-05-01  no class
4   2012-05-08  record spontaneous speech
5   2012-05-15  record read speech
6   2012-05-22  design L1 script
7   2012-05-29  design L1 script
8   2012-06-05  no class
9   2012-06-12  design L1 script
10  2012-06-19  design L2 script
11  2012-06-26  design L2 script
12  2012-07-03  project report
13  2012-07-10  project report
14  2012-07-17  critique
15  2012-07-24  probably no class (make up day)
purple means assignment
attendance mandatory
courseware
everything online
reading material
lecture notes (including this presentation)
etc
http://goh.kawai.com/
http://goh.cll.hokudai.ac.jp/
(inprog)
Praat
http://www.praat.org/
built by researchers and engineers in linguistics and speech processing
updated frequently
good support base
Windows, Mac, Linux
free
what can Praat do?
record and play speech
display waveforms, spectrograms, pitch and more
label speech at various levels
phone, mora, syllable, word, phrase and utterance levels
SIL fonts
Praat in action
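
Praat stores labels in TextGrid files, with one tier per labeling level. The Python sketch below writes a one-tier, word-level TextGrid in Praat's long text format. The helper name `textgrid_text`, the output file name, and the time stamps are all assumptions made for illustration; the words come from a well-known TIMIT sentence, but the times are invented. In class the labels are created interactively in Praat, so this only shows roughly what the resulting file looks like.

```python
def textgrid_text(tier_name, intervals):
    """Return a one-tier TextGrid (Praat long text format) as a string.

    intervals: list of (start_s, end_s, label), sorted and contiguous.
    This helper is a hypothetical illustration, not part of Praat.
    """
    xmin, xmax = intervals[0][0], intervals[-1][1]
    lines = [
        'File type = "ooTextFile"',
        'Object class = "TextGrid"',
        "",
        f"xmin = {xmin}",
        f"xmax = {xmax}",
        "tiers? <exists>",
        "size = 1",
        "item []:",
        "    item [1]:",
        '        class = "IntervalTier"',
        f'        name = "{tier_name}"',
        f"        xmin = {xmin}",
        f"        xmax = {xmax}",
        f"        intervals: size = {len(intervals)}",
    ]
    for i, (start, end, label) in enumerate(intervals, start=1):
        # Double quotes inside a label are written as two double quotes.
        safe = label.replace('"', '""')
        lines += [
            f"        intervals [{i}]:",
            f"            xmin = {start}",
            f"            xmax = {end}",
            f'            text = "{safe}"',
        ]
    return "\n".join(lines) + "\n"


# Made-up word boundaries in seconds; an empty label marks silence.
words = [(0.0, 0.15, ""), (0.15, 0.42, "she"), (0.42, 0.66, "had"), (0.66, 0.9, "your")]

# Hypothetical output file name.
with open("example_words.TextGrid", "w", encoding="utf-8") as f:
    f.write(textgrid_text("word", words))
```

Additional levels (phone, syllable, phrase) would each become another tier in the same file.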
demo
view praat
time waveform
spectrogram
spectral slice
sound sources show praat
vowels
consonants
pure tones (sinusoids)
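
To have a pure tone on hand for viewing in Praat, the following Python sketch writes a sinusoid to a WAV file using only the standard library. The function name `write_sine_wav`, the 440 Hz frequency, the 16 kHz sampling rate, and the file name are assumptions chosen for illustration. Praat can also create tones itself, so this is just one possible route.

```python
import math
import struct
import wave


def write_sine_wav(path, freq_hz=440.0, duration_s=1.0, rate_hz=16000, amplitude=0.5):
    """Write a mono 16-bit PCM WAV file containing one sinusoid.

    Returns the number of samples written.
    """
    n_samples = int(duration_s * rate_hz)
    frames = bytearray()
    for n in range(n_samples):
        # x[n] = A * sin(2 * pi * f * n / fs)
        sample = amplitude * math.sin(2.0 * math.pi * freq_hz * n / rate_hz)
        # Scale to the signed 16-bit range and pack as little-endian.
        frames += struct.pack("<h", int(round(sample * 32767)))
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)      # mono
        wav.setsampwidth(2)      # 2 bytes = 16 bits per sample
        wav.setframerate(rate_hz)
        wav.writeframes(bytes(frames))
    return n_samples


# Hypothetical file name; open the result in Praat to inspect the tone.
n_written = write_sine_wav("tone_440hz.wav")
```

In a spectrogram or spectral slice, a pure tone should show energy concentrated at a single frequency, in contrast to vowels and consonants.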
readings
Jurafsky and Martin (2000) chapter 4
next week
install Praat
TIMIT sentences
download from my website
extract speech files from archive
read files into Praat
play speech
view waveforms and spectrograms
label at the word level
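
The extraction step above can also be done from a script. The Python sketch below assumes the archive is a zip file containing RIFF WAV files; the actual archive format and file names on the course website are not stated here, so `timit_sentences.zip`, `extract_wavs` and `describe_wav` are hypothetical names. It extracts the WAV files and prints each file's sampling rate and duration, a quick sanity check before reading the files into Praat.

```python
import wave
import zipfile
from pathlib import Path


def extract_wavs(archive_path, out_dir):
    """Extract every .wav member of a zip archive and return the extracted paths."""
    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(archive_path) as archive:
        names = [n for n in archive.namelist() if n.lower().endswith(".wav")]
        archive.extractall(out_dir, members=names)
    return [out_dir / n for n in names]


def describe_wav(path):
    """Return (sampling rate in Hz, duration in seconds) of a PCM WAV file."""
    with wave.open(str(path), "rb") as wav:
        rate = wav.getframerate()
        return rate, wav.getnframes() / rate


# Hypothetical archive name; this part is skipped if the file is not present.
archive = Path("timit_sentences.zip")
if archive.exists():
    for wav_path in extract_wavs(archive, "timit_wav"):
        rate, seconds = describe_wav(wav_path)
        print(f"{wav_path.name}: {rate} Hz, {seconds:.2f} s")
```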
slideshow
if there's time
see you next week!