Since my first explorations of editing video caption files in a text editor last week, I've learned quite a few ways to improve the process. I found a free, cross-platform Open Source tool, Aegisub, for editing the captions. It is particularly helpful when the timing needs to be adjusted, and its use is fairly intuitive. It beats working in Notepad or Microsoft Word by a long shot.
For translating caption files I also discovered a useful resource on Kilgray's Language Terminal: a Regex text filter designed to filter out the segment numbers and time codes in the caption files. Useful exclusion rules to configure for this are as follows:
The resource file for the filter settings (MQRES) and some sample cation files in English can be downloaded here.
Here is an example (preview) of how text is filtered:
To ensure that the correct filter settings are used for the captions text file, use Import with options... in memoQ and set the file type to "All files (*.*)" for the likely case that the file extension is not recognized by memoQ:
If you want to change the text breaks in a given time segment, use the Join function to combine segments, and place the tag for the line break wherever it makes sense to do so:
An exploration of language technologies, translation education, practice and politics, ethical market strategies, workflow optimization, resource reviews, controversies, coffee and other topics of possible interest to the language services community and those who associate with it. Service hours: Thursdays, GMT 09:00 to 13:00.
Jul 5, 2013
Subscribe to:
Post Comments (Atom)
OK, but there is no Video captions filter configuration in memoQ. I downloaded VideoCaptions.zip, and it really has a RegexTextConverter#VideoCaptions.mqres file, which I suppose is the memoQ filter, but how to use it, or make it seen by memoQ?
ReplyDeleteImport the filter configuration, then import the file to memoQ, set the Regex text filter and the particular saved configuration you imported for that.
DeleteHow is possible to import filter configuration? I tried several ways, but unsuccessfully.
ReplyDeleteUse the relevant section in the Resource Console.
Deletecan you help me import file srt on notepad in detail ?
DeleteI'm sorry, I do not understand what you are asking.
DeleteIn the MemoQ menu there is a resource console option. There, in filter configuration,you have to import the mqres file. You can also upload it into a MemoQ server if you work in collaborative environment. Then, when you have the filter in place, you have to import files through the 'Import with options' link. There you can configure filters to be used with the particular files to be imported.
ReplyDeleteI have some text like this:
ReplyDelete!_awa="WA"
%s. Limits.="%s. Limits."
_***="United States Territorial Waters"
_a**="Australia"
_act="Australian Capital Territory"
_AFG="Afghanistan"
_AHO="Netherlands Antilles"
_AIA="Anguilla"
How to make a filter to show me what is between apostrophes only?
In this particular case I would use the JSON filter.
DeleteBTW, those are not apostrophes, they are quotation marks.
DeleteI am following the rule that you presented above, but my .srt uses a "comma" instead of a "period" after the timecode and I don't know how to change the rule so that it would work in memoQ. The subtitles below are some examples. Can you help me?
ReplyDelete1
00:00:06,236 --> 00:00:07,379
- Hey, welcome back to
"Right on the Money,"
2
00:00:07,379 --> 00:00:09,101
the show that features financial advisors,
It works! Very useful workaround, thank you Kevin!
ReplyDeleteRegards