The 'quote' column tells whether the beginning and ending utterences
listed in the documentation, were
present in the audio file. This allows us to check whether we have
the entire audio file. The possibilities are 'begin', 'end', 'both', and
'neither'.
The corpus will be released in parts as indicated in the part column.
The 'DATA' column shows which data types we have and where they were
generated. Namely: 'w' - 22050K wave files, 'a' - ascii transcripts,
't' - timestamped transcripts.
Lower case letters indicate that Santa Barbara provided the data in
this format; letters in capitals indicate that the LDC created
the data or converted it into this format.
The 'check' column shows if each file has been reaudited to confirm the difficulty and comment fields.
The 'diff' column is the estimated difficulty of transcription, with 1 being easiest, and 4 being most difficult.
The 'comment' column includes observation on the sound quality
and subject matter of the recording.
| file | quote | part | DATA | check | diff | comment |
|---|---|---|---|---|---|---|
| csae0403 | both | 2 | waT | y | 2 | multiple speakers(2+), informal discussion |
| csae0404 | both | 3 | W | y | 2 | multiple speakers(2+), informal discussion |
| csae0408 | both | 1 | wat | y | 2 | multiple speakers(2+),question and answer session, medical topic(s) |
| csae0504 | both | 4 | W | y | 3 | multiple speakers(2+), informal discussion (background noise, music, tv) |
| csae0513 | both | 1 | wat | y | 3 | multiple speakers(3+), informal discussion (background music) |
| csae0514 | both | 1 | wat | y | 2 | multiple speakers(4+), informal talk while preparing meal (background noise, overlapping speech) |
| csae0518 | both | 2 | WaT | y | 2 | multiple speakers(2+), mock(?)retail sale (tape decks) |
| csae0523 | both | 1 | wat | y | 3 | multiple speakers(4+), breakfast table talk (background noise) |
| csae0525 | both | 2 | WaT | y | 1 | multiple speakers(2+), interview (doctor/patient?) |
| csae0527 | both | 1 | wat | y | 1 | multiple speakers(2), talk about a book |
| csae0532 | both | 1 | wat | y | 2 | multiple speakers(2), two sisters, family talk |
| csae0533 | both | 2 | WaT | y | 1 | multiple speakers(2+), vet's office comings and goings (background noise, dogs barking etc.) |
| csae0543 | end | NEVER | W | y | 3 | multiple speakers(2+), musicians talk and music |
| csae0548 | both | 2 | waT | y | 3 | multiple speakers(3+), group talk, family (parents/children, home schooling(?), background noise) |
| csae0549 | both | 3 | W | y | 2 | multiple speakers(3+), family opening presents, (some music in BG) |
| csae0553 | both | 4 | W | y | 3 | multiple speakers(3+), group playing scrabble |
| csae0584 | both | 2 | waT | y | 1 | one primary speaker (male), religous sermon |
| csae0588 | both | 2 | waT | y | 1 | one primary speaker (male), religous sermon (some group response) |
| csae0590 | end | NEVER | Wa | y | 2 | multiple speakers(3+),religous missionary meeting/Q&A (some group response) |
| csae0593 | both | 1 | wat | y | 2 | multiple speakers(2+), counseling session |
| csae0595 | both | 2 | WaT | y | 2 | multiple speakers(2), job related talk (airport) |
| csae0661 | both | 4 | W | y | 3 | multiple speakers(3+), talk outside, family |
| csae0700 | both | 1 | wat | y | 3 | multiple speakers(4+), lawyer prep talk / exposure case |
| csae0713 | both | 1 | wat | y | 2 | multiple speakers(2), couple (southern accent) doing math homework (HS?) |
| csae0714 | both | 4 | W | y | 3 | multiple speakers (3+), dinner table setting/cleanup |
| csae0719 | both | 4 | W | y | 3 | multiple speakers (3+), talk about politics et.al. |
| csae0728 | both | 1 | wat | y | 2 | multiple speakers(2), counseling session(?) |
| csae0729 | both | 1 | wat | y | 1 | multiple speakers(2), senior citizens |
| csae0730 | both | 2 | WaT | y | 1 | one primary speakerlecture (minor class noise) |
| csae0735 | begin | NEVER | W- | y | 3 | multiple speakers talk about boys and punishment (summer camp) |
| csae0751 | - | - | - | - | ### no speech file ### | |
| csae0754 | both | 4 | W- | y | 3 | group dinner, seniors talk about war and work (drinking alcohol) |
| csae0759 | both | 4 | W- | y | 3 | family talk |
| csae0761 | both | 2 | WaT | y | 2 | man and woman (couple) playing computer game |
| csae0762 | both | 4 | W | y | 3 | friends at college talking |
| csae0767 | both | 3 | W | y | 2 | man and woman talk, on the quiet side, segments of no dialogue |
| csae0771 | both | 4 | W | y | 4 | family and friends dinner, w/ talk of SATs and college, much overlap |
| csae0780 | both | 1 | wat | y | 2 | lecture w/ some class participation |
| csae0784 | both | 1 | wt | y | 3 | group (women and men) talking and eating at a birthday party |
| csae0796 | both | 2 | WaT | y | 1 | lecture on Martin Luther |
| csae0901 | both | 3 | W | y | 3 | classroom (HS) reports on Whitman & Dickinson , quiet |
| csae0906 | both | 1 | wat | y | 2 | bank, loan talk |
| csae0912 | both | 4 | W | y | 3 | family dinner w/ visiting friend |
| csae0942 | both | 4 | W | y | 3 | group talk about baby |
| csae0952 | both | 3 | W | y | 2 | man and woman talk on phone, speakers on other end quiet |
| csae0953 | begin | NEVER | W | y | 2 | tour of an historic building (Woodrow Wilson) , some crowd noise |
| csae0959 | end | NEVER | W | y | 3 | wedding ceremonies, some quiet speakers |
| csae0960 | end | NEVER | W | y | 3 | friends talking, watching TV |
| csae0962 | both | 5 | W | y | 3 | family speaking while cooking , w/spanish , loud blender |
| csae0971 | both | 5 | W | y | 3 | friends talking sports , background speakers cause overlap |
| csae0972 | both | 3 | W | y | 2 | tour , some speakers in crowd too quiet |
| csae0987 | both | 2 | WaT | y | 2 | civic meeting multiple speakers |
| csae0988 | begin | NEVER | W | y | 2 | legal proceedings between tenants and landlords |
| csae1000 | both | 3 | W | y | 3 | group talk, baby talk, multiple speakers talk about the baby |
| csae1002 | neither | NEVER | W | y | 2 | info speaker at pawnee indian lodge |
| csae1003 | both | 3 | W | y | 2 | zoo training - multi speaker |
| csae1005 | both | 3 | W | y | 2 | two men talking about a business |
| csae1007 | both | 4 | W | y | 3 | comedy/poetry, crowd noise |
| csae1008 | both | 4 | W | y | 2 | group (male and female) talking about and playing music |
| csae1011 | both | 3 | W | y | 2 | storyteller |
| csae1016 | begin | NEVER | wa | y | 1 | science program |
| csae1019 | both | 2 | WatT | y | 1 | phone conversation |
| csae1024 | both | 3 | W | y | 2 | Beatrice Wood speech, brit moderator |
| csae1034 | both | 3 | W | y | 2 | classroom lecture, paper response |
| csae1036 | both | 3 | W | y | 2 | woman talks to people about horses |
| csae1039 | both | 4 | W | y | 3 | a class, students quiet / overlap |
| csae1043 | both | 5 | W | y | 3 | horse show, crowd too quiet |
| csae1047 | both | 3 | W | y | 2 | discussion with doctor |
| csae1057 | both | 5 | W | y | 2 | talk about social security |
| csae1059 | both | 3 | W | y | 2 | dinner conversation about travelling and doctors |
| csae1079 | both | 5 | W | y | 3 | karate lesson |
| csae1107 | both | 5 | W | y | 3 | work talk |
| csae1188 | both | 5 | W | y | 3 | play practice and direction |
| csae1362 | both | 5 | W | y | 4 | parent child arguing, quiet |
| csae1441 | begin | NEVER | w | y | 2 | women talk, quiet at times |
| csae1444 | both | 5 | W | y | 2 | parent child talk, kitchen noise |
| csae1447 | both | 5 | W | y | 3 | conversation two males, quiet |
| csae1456 | both | 5 | W | y | 3 | parent child talk about drawing |
| csae1457 | both | 5 | w | y | 2 | sexuality talk |
| csae1461 | both | 5 | W | y | 2 | marriage ceremony |
| csae1470 | both | 5 | W | y | 3 | health lecture |
| csae1489 | both | 5 | W | y | 2 | winston foundation experimental session on preventive diplomacy |
| csae1505 | both | 5 | W | y | 2 | discussion between two friends, dirty |
| csae1540 | both | 5 | W | y | 2 | man telling story |
| csae1550 | both | 5 | W | y | 2 | doctor's visit |
| csae1556 | end | NEVER | W | y | 3 | domestic/family conversation/argument |