Translation Tribulations: Corpus terminology workshop in the Netherlands

May 13, 2013

Corpus terminology workshop in the Netherlands

The professional translators' group Stridonium is organizing a networking event on June 17, 2013 in Holten (NL) to teach translators in legal, financial and other domains the effective use of text collections (corpora) for identifying important terminology.

The NIFTY corpus methodology uses specialized texts compiled by translators themselves to find appropriate terms in the target language, in particular types of text (such as joint venture agreements, offering circulars, divorce decrees or any other). The methodology applies to all language pairs and has been developed for efficiency (requiring on average 30 minutes) to meet the needs of working translators.

Further details on the workshop and registration are available here.

6 comments:

Michael BeijerMay 16, 2013 6:33 PM
Hi Kevin,

Maybe a little off topic, but I just came across an interesting corpus search website which searches 29 different corpora!

http://www.lextutor.ca/concordancers/concord_e.html

It searches the following corpora:

1k Graded Corpus (530,000)
2000 List Corous (240,000)
2k Graded Corpus (920,000)
AA Academic Abstracts
Academic Abstracts (174,000)
BNC Commerce (3.8 million)
BNC Humanities (3.3 million)
***BNC Law (2.2 million)***
BNC Med (1.4 million)
BNC speech (10 million)
BNC Spoken (1 million)
BNC Written (1 million)
Brown (1 million wds)
Brown + BNC Written (2+ m)
Call of the Wild (24,000)
Focus on Vocab (82,300)
JPU Learner (300,000)
NNS-Ts in Korea (123,000)
NS-Ts in Korea (124,000)
Presidential speeches (1.98 million)
RAC Academic (103,000)
RAC Research Articles Corpus (HK, 132,000 wds)
TC Learner (Student) (150,000)
TC Learner (Teacher) (61,000)
TESL Prog (3,400)
Univ. Word List (550,000)
US TV Talk (2 million)
V - Marlise
Yenny Korean EFL teachers corpus

Michael
ReplyDelete
Replies
Kevin LossnerMay 16, 2013 7:37 PM
Michael, what I particularly like about the method taught in this workshop is its focus on careful text selection with manageable scope in a specific specialist area. These large "bucket" corpora are more general in scope and likely less suited to making the sort of distinctions we would need.
ReplyDelete
Replies
Kevin LossnerMay 16, 2013 8:52 PM
I poked around a bit in the legal and medical corpora - not bad for examples of general vocabulary - and then I discovered the link to bilingual dictionaries at the top of the concordance hitlist. Dangerous stuff in the hands of the ignorant. The English>German dictionary search for a medical term I was looking at pulled up hits that mostly had to do with traffic :-) It seems ill-advised to link dictionaries with no consideration of context.
ReplyDelete
Replies
ChristinaMay 16, 2013 9:07 PM
Hi Kevin,
Yes, having attended Juliette's workshop at the Legal Translators' Conference in Portugal, I can confirm that this method promotes "developing *specific* domain terminologies in an efficient manner".
Christina
ReplyDelete
Replies
Michael BeijerMay 17, 2013 3:22 PM
Hi Kevin,

Yes, that is of course always a danger of letting someone else build your corpus. I am currently trying to build a few of my own with tlCorpus (which btw now accepts PDFs!), but my time is limited and these ready-made online ones sure are a lot easier to set up;)

Incidentally, I couldn't find those links to bilingual dictionaries you mentioned. Where exactly did you see them?

Michael
ReplyDelete
Replies

Add comment

Notice to spammers: your locations are being traced and fed to the recreational target list for my new line of chemical weapon drones :-)