Speechocean Confidential
Guidelines for performing the task, in English
8. Unintelligible Words
Words or stretches of speech that are completely unintelligible are transcribed as “**”. The “**” marker is separated from neighboring intelligible words by spaces.
E.g.
“Stop the mu… the music.” = “stop the ** the music.”
“Play ??? on Spotify.” = “play ** on Spotify.”

9. Filler Words
Filler words are “words” that speakers use to indicate hesitation or to maintain control of a conversation while thinking of what to say next. Each language has a limited set of filler words that speakers can use. The spelling of a filler word should not be altered to reflect how the speaker pronounces it, and each filler word should be preceded by a hashtag (#).
E.g.
“but #um I like it.”
“#hmm perhaps you’re right.”
“#ah I’ve got it.”

10. Multiple Languages (code-switching)
If a speaker switches languages, place the tag […] languages begins and […] unambiguously recognized by the annotator, replace “foreign” with the ISO 639 language code (usually a two-letter, lowercase abbreviation) of the new language. There should be a space before and after each foreign language tag.
E.g.
“I don’t know, amigo.” = “I don’t know, […]

11. Non-Speech Acoustic Events
Five categories of non-speech acoustic events must be transcribed. Events will only be transcribed if they are clearly distinguishable. Very low-level (i.e. non-intrusive) events will be ignored. The event will be transcribed at the place of occurrence, using the defined symbols in angle brackets. For noise events that occur over a span of one or more words, the transcription should indicate the beginning of the noise, just before the first word it affects.
The first category of acoustic events originate from another source. Sounds originating from the speaker usually do not overlap with the target speech, while sounds originating from other sources could of course occur simultaneously with the speech.

Speechocean_transcription_guidelines_en_v2.3_20210701
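
The conventions for the “**” marker (section 8) and for filler words (section 9) lend themselves to an automated check. The following is a minimal sketch in Python, not part of the guidelines themselves: the function name `check_transcript` and the `FILLERS` set are hypothetical, and the set contains only the three filler words that appear in the examples above, since the full per-language inventory is not given here.

```python
# Hypothetical filler inventory (assumption): the guidelines say each language
# has a limited set, but only "um", "hmm" and "ah" appear in the examples.
FILLERS = {"um", "hmm", "ah"}


def check_transcript(text: str) -> list[str]:
    """Return the convention problems found in one transcript line."""
    problems = []
    for token in text.split():
        # Drop surrounding punctuation so that "#um," is read as "#um".
        core = token.strip(".,?!\"'")
        if "**" in core and core != "**":
            # The marker must stand alone, separated by spaces.
            problems.append(f"'**' not separated by spaces: {token}")
        elif core.startswith("#"):
            # A tagged filler must use one of the fixed spellings.
            if core[1:] not in FILLERS:
                problems.append(f"unknown filler word: {token}")
        elif core.lower() in FILLERS:
            # A known filler without the leading hashtag.
            problems.append(f"filler word missing '#': {token}")
    return problems


if __name__ == "__main__":
    for line in ["stop the ** the music.", "but um I like it.", "play** on Spotify."]:
        print(line, "->", check_transcript(line))
```

A check like this only catches formatting slips. Whether a stretch of speech is truly unintelligible, or whether a sound is a filler word at all, remains the annotator's judgment.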