This site uses cookies.
Some of these cookies are essential to the operation of the site,
while others help to improve your experience by providing insights into how the site is being used.
For more information, please see the ProZ.com privacy policy.
Gary Hess Local time: 00:01 German to English + ...
May 21, 2022
A client that normally works with SDL Trados just sent me a Transit NXT project for the 1st time. I received a source file (Word) as well as a PPF file (Transit NXT package).
I opened the Transit NXT project and started translating. Unfortunately, it turns out that the source window in Transit NXT contains thousands of seemingly superfluous tags. For example, I have already found a single sentence with 30 words that has 100 tags! Each single letter is contained within a start tag an... See more
A client that normally works with SDL Trados just sent me a Transit NXT project for the 1st time. I received a source file (Word) as well as a PPF file (Transit NXT package).
I opened the Transit NXT project and started translating. Unfortunately, it turns out that the source window in Transit NXT contains thousands of seemingly superfluous tags. For example, I have already found a single sentence with 30 words that has 100 tags! Each single letter is contained within a start tag and end tag.
I looked at the Word document, but I don't see anything that corresponds to all of these tags.
What I am wondering is whether anyone else has encountered a situation like this? Any thoughts on how to proceed? I am estimating that the doc contains 16000 extra tags (10 per segment). If it takes 10 seconds to handle each tag, it will take 44 hours just to deal with the tags.
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Hans Lenting Netherlands Member (2006) German to Dutch
Create a new project
May 21, 2022
You or your client has to create a new project, after cleaning the Ms Word document first.
Use CodeZapper or TransTools to clean.
Stepan Konev
Jorge Payan
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Stepan Konev Russian Federation Local time: 02:01 English to Russian
Wrong OCR algorythm
May 21, 2022
The reason why that happens is described here for example. In addition to Code Zapper and TransTools, you can also try to make uniform formatting by adjusting the font settings in your MS Word file. Open the file, select all (this step is important), press Ctrl+D, navigate to the 'Advanced' tab, set 'Scale' to 100%, 'Spacing' to Normal ... See more
The reason why that happens is described here for example. In addition to Code Zapper and TransTools, you can also try to make uniform formatting by adjusting the font settings in your MS Word file. Open the file, select all (this step is important), press Ctrl+D, navigate to the 'Advanced' tab, set 'Scale' to 100%, 'Spacing' to Normal and 'Position' to Normal too. This will remove most tags.
However you can only do it with your source file (MS Word). But you still have to contact your client for them to re-create their Transit package for you. ▲ Collapse
Jorge Payan
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Hans Lenting Netherlands Member (2006) German to Dutch
Re-import the source document
May 22, 2022
Here's an example of a project with tag soup:
I created a new project. And clicked on Options:
Then I chose these settings:
Now the project looks much better:
Sergey Afanasev
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Gary Hess Local time: 00:01 German to English + ...
TOPIC STARTER
OCR
May 23, 2022
Thanks for the suggestions! I hadn't thought of OCR, but that would explain this disaster. I am writing with the client now to figure out a plan.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.
Manage your TMs and Terms ... and boost your translation business
Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.