Aligning the content of two folders with source and target documents; automatic pairing by name |
The second important difference with alignment in memoQ is that it is really not necessary to feed the aligned content to a translation memory. memoQ LiveDocs alignments essentially function as a translation memory in the LiveDocs corpus, with one important difference: by right-clicking matches in the translation results pane or a concordance hit list, the aligned document can be opened directly and the full context of the content match can be read. A match or concordance hit found in a traditional translation memory is an isolated segment, divorced from its original context, which can be critical to understanding that translated segment. LiveDocs overcomes this problem.
A third advantage of alignment in memoQ is that, unlike environments in which aligned content can only be used after it is fed to a translation memory, a great deal of time can be saved by not “improving” the alignment unless its content has been determined to be relevant to a new source text for translation. If an analysis shows that there are significant matches to be found in a crude/bulk alignment, the specific relevant alignments can be determined and the contents of these finalized while leaving irrelevant aligned documents in an unimproved state. Should these unimproved alignments in fact contain relevant vocabulary for concordance searches, and if a concordance hit from them appears to be misaligned, opening the document via the context menu usually reveals the desired target text in a nearby segment.
I still wonder why PDF files are not imported by default as plain text. It requires 4 additional clicks every time you want to import two PDF files. Import as docx with hard returns is useless for alignment purposes. I made a suggestion to change this but nothing ever happened.
ReplyDeleteKR
Noe
Noe, all of the PDF import filters for memoQ are total crap. If you want to extract and align PDF content I suggest using the XML export from Iceni InFix.
DeleteHi Kevin, thanks for your reply. Don't know if I am willig to pay, even it would be just some cents, for every alignment passing through Iceni.
DeleteWhat are you talking about Noe? pay for what? Using the XML extract function of iceni InFix is free, just like the TransPDF service they have for extracting XLIFFs for translation. In fact, AFAIK, the only function in InFix which is actually crippled if you don't have a license is the useful "story export" feature, which is rather nice for pulling PDF content out as a relatively clean MS Word file. To do alignments or stick content into LiveDocs for reference you are not subject to any fees for removing watermarks from PDF files created with InFix (which is actually not relevant). Another nice thing about the iceni tool is that it ignores password protection on a PDF file, which can also be useful.
DeleteHi Kevin, I just know upgraded to 8.1.4 so I'll give the Iceni XLIFF extraction a try. I thought it's a payable service. ;)
DeleteThe XLIFF is for tranalation, not alignment. And in some cases people may run afoul of their NDAs using that online service with the latest version of memoQ. But for alignment and confidentiality situations, the older form of Iceni translatable export can be used: the XML files. That is done directly in InFix. I think I wrote an article about this some years ago, but the menus have changed since then, and I intend to write a short post soon with the new details. It's a bit hard to find the relevant menu commands, especially if one is averse to RTFM.
Delete