Available at


Download 1.62 Mb.
Pdf ko'rish
bet24/61
Sana18.06.2023
Hajmi1.62 Mb.
#1559231
1   ...   20   21   22   23   24   25   26   27   ...   61
Bog'liq
bbbb

3.3 Corpus description 
This study relies on a multiple parallel corpus, defined as “bidirectional translation 
corpora” (Johansson, 2007: 9-11) or, in other words, a set of texts in language A 
and their translations in language B. The corpus is
named LOCOSSI and is made 
up of 26 speeches, i.e. 4 source speeches (the English subcorpus) and 22 
interpretations (the French subcorpus). It contains 36,154 tokens. The corpus 
used in the present study is parallel, with source texts in English and their 
interpretations in French. It is thus bilingual (two languages represented) and also 
unidirectional (only one interpreting direction, from English into French). It can 
also be termed a multiple parallel corpus in the sense that it contains several 
interpretations of the same source speeches. Table 3 shows the total number of 
words per text. The grey row shows the total number of words of the English 
source texts. 
Table 3 – Total number of running words (source and target texts) 
IN01 
1,378
IN02 
1,343
IN03 
1,487
IN04 
1,414
Total IN 
5,622
STU01 
1,451 
1,460 
1,632 
1,429 
5,972 
STU02 
1,367 
1,322 
1,505 
1,400 
5,594 
STU03 

1,457 
1,458 

2,915 
STU04 
1,170 

1,436 
1,313 
3,919 
STU05 


1,641 
1,463 
3,104 
STU06 

1,086 
1,375 
1,205 
3,666 
STU07 
1,524 
1,334 

1,216 
4,074 
STU08 
1,267 



1,267 
Total STU 
6,779 
6,659 
9,047 
8,026 
30,511 


Data and methodology 
 
page 44 
The four source speeches are part of the pedagogical material used by a lecturer 
in English-to-French interpreting. The students who interpreted these texts into 
French were all interpreter trainees in their second year of master’s degree in 
conference interpreting. 
The first source text (ST) included in LOCOSSI, which was used on November 3
rd

2016, contains 1,378 words (which corresponds to 14:04 minutes). It is the first 
part of a much longer speech, which contains 3,610 words in total. It is taken from 
a BBC Radio 4 programme called the Reith Lectures. The text is about 
synaesthesia, which is a perceptual condition in which the simulation of one sense 
triggers an automatic, involuntary experience in another sense. For example, 
synesthetes, people who have synaesthesia, may see sounds, taste words or feel 
a sensation on their skin when they smell certain scents. The source speech is 
scientific and accordingly, contains specialized terminology. Here are some 
examples: 
(33) angular gyrus 
(34) sickle cell anemia
(35) cross-modal synesthetic abstraction 
As ST01 contains specialized terminology, it is a specialized text. A specialized 
text is a production of non-literary pragmatic content designed to be used in a 
specific field or discipline, such as science, technology, healthcare, business, or 
tourism. The purpose of a specialized text is mostly informative. 
The second ST was used on November 10
th
, 2016 and contains 1,343 words 
(duration: 13:37 minutes). As the first ST, it is only a sample of a longer speech 
made up of 3,464 words. This ST is also from the Reith Lectures BBC programme. 
The text is about the brain and its complex structure and is a scientific text and can 
then also be considered as a specialized text (with specialized terminology). 
(36) Capgras Syndrome 
(37) fusiform gyrus 
(38) Darwinian revolution


Data and methodology 
 
page 45 
The third ST, used on November 17
th
, 2016, is made of 1,487 words (duration: 
13:45 minutes). The full speech contains 5,471 words and is also based on Reith 
Lectures. It deals with the evolution of art and how people over the world perceive 
art. The text contains references to the famous Chola bronze representing 
goddess Parvati in Southern India. This text does not contain any specialized 
terminology. 
Finally, the last ST was used on December 8
th
, 2016 and contains 1,414 words 
(duration: 12:52 minutes). It is the transcript of a TED talk given by Juan Enriquez 
in November 2016: “What will humans look like in 100 years?” (total length: 3,115 
words). The talk is about the evolution of the human body and the ethics behind 
that evolution. The speaker mentions new kinds of prosthetics, such as hearing 
aids. ST03 is a specialized text (scientific) and therefore contains specialized 
terminology. 
(39) 
gene code 
(40) 
hereditary diseases 
(41) 
radiation
The four ST were POS-tagged with the TreeTagger, using Sketch Engine. 
Download 1.62 Mb.

Do'stlaringiz bilan baham:
1   ...   20   21   22   23   24   25   26   27   ...   61




Ma'lumotlar bazasi mualliflik huquqi bilan himoyalangan ©fayllar.org 2024
ma'muriyatiga murojaat qiling