Measuring PO files - word counts, etc. Thread poster: Carl Carter
| Carl Carter Germany Local time: 01:58 German to English + ...
Dear colleagues,
I've received over 70 PO files to translate and have been asked to make a quote for localising them now. My usual CAT tool DXV is able to import and count them, which I have also done, but I'd like to ask anyone who has already translated files like these if they can recommend any other tools for doing a word count.
I don't want to do any "workarounds" or file conversions in this case, but just use a tool that includes this type of counting functionali... See more Dear colleagues,
I've received over 70 PO files to translate and have been asked to make a quote for localising them now. My usual CAT tool DXV is able to import and count them, which I have also done, but I'd like to ask anyone who has already translated files like these if they can recommend any other tools for doing a word count.
I don't want to do any "workarounds" or file conversions in this case, but just use a tool that includes this type of counting functionality. POedit is a great little application for translating POs with, but it doesn't have a counting facility, for example. My OS is Windows Vista (Business). What I'd like to count is the no. of words that have to be translated and then obviously use this figure to calculate the cost of the project for the customer.
The reason for me asking is that I'd like to verify DXV's word count before I make my quote as the project's quite a big one and I want the quotation to be accurate. I don't trust DVX's usual word counts for Microsoft Office formats as they differ from other standard word count tools' a fair bit, and I've just discovered that DXV's line and word counts for some of the PO files are higher than the no. of lines and words I've counted manually while they were open in POedit.
Any suggestions, please?
Thanks in advance.
Carl
Amper Translation Service
Carl Carter
Bahnhofstr. 2
D-82256 Fürstenfeldbruck
Tel. 08141/36379-65 // Fax 08141/36379-63
E-Mail: [email protected] // Web: http://www.ampertrans.de
Mitglied des deutschen Übersetzerverbandes BDÜ, www.bdue.de
Proz profile: http://www.proz.com/profile/774269
================================ ▲ Collapse | | | Didier Briel France Local time: 01:58 English to French + ...
Carl Carter wrote:
I've received over 70 PO files to translate and have been asked to make a quote for localising them now. My usual CAT tool DXV is able to import and count them, which I have also done, but I'd like to ask anyone who has already translated files like these if they can recommend any other tools for doing a word count.
OmegaT can load PO files, and then provide statistics.
Didier | | | | Samuel Murray Netherlands Local time: 01:58 Member (2006) English to Afrikaans + ... Working with PO files | Mar 18, 2010 |
Carl Carter wrote:
I've received over 70 PO files to translate and have been asked to make a quote for localising them now.
The Translate Toolkit has a tool for counting words and strings in PO files. This is a command line tool but works on directories also. So if your PO files are in the folder "clientfiles", you'd issue this command:
pocount.exe clientfiles > clientfiles_count.txt
...and it will create a report per file and for the total files. The report includes the number of words and number of strings (both source and target counts), of the translated, fuzzy and untranslated (and total) string counts.
I don't want to do any "workarounds" or file conversions in this case, but just use a tool that includes this type of counting functionality.
There is a deliberate bug in the Translate Toolkit in that it refuses to read UTF-8 files that have a byte order mark. So if your PO files have it, you have to remove it first. You can do so in bulk using Rainbow from Okapi tools:
Select the files, go to the Output tab, deselect the option to add a new file extension, and then go Utilities > Byte-order-mark conversion.
You can also use OmegaT to do the counting, as Didier had indicated. OmegaT's support for PO is not 100% up to scratch (no support for plurals, no support for msgctxt, etc), but it is quite good nonetheless. | |
|
|
Didier Briel France Local time: 01:58 English to French + ... Simple plurals are supported | Mar 18, 2010 |
Samuel Murray wrote:
OmegaT's support for PO is not 100% up to scratch (no support for plurals, no support for msgctxt, etc), but it is quite good nonetheless.
Simple (i.e., one plural only) plurals are supported. E.g.
msgid "Delete File"
msgid_plural "Delete Files"
msgstr[0] "Supprimer le fichier"
msgstr[1] "Supprimer les fichiers"
Didier | | | Use a software localization tool for software files | Mar 19, 2010 |
Hello Carl,
I would recommend to use a real software localization tool that supports PO files. The benefit will be that you can automatically align the files if they already contain translations and you can easily update the localization project if your client is sending a new drop.
Update features in software localization tools are able to detect which file has changed. Changes and new strings will be incorporated into the project. So no need to restart translation wit... See more Hello Carl,
I would recommend to use a real software localization tool that supports PO files. The benefit will be that you can automatically align the files if they already contain translations and you can easily update the localization project if your client is sending a new drop.
Update features in software localization tools are able to detect which file has changed. Changes and new strings will be incorporated into the project. So no need to restart translation with your TM, only translate new and changed strings.
Our own tool SDL Passolo has a special PO add-in that also supports plural cases.
Hope this helps.
Achim Herrmann ▲ Collapse | | | Carl Carter Germany Local time: 01:58 German to English + ... TOPIC STARTER Everyone's suggestions | Mar 19, 2010 |
Thanks very much for all your suggestions and advice! That was a very quick response with lots of useful information.)
Regards
Carl
Amper Translation Service
Carl Carter
Bahnhofstr. 2
D-82256 Fürstenfeldbruck
Tel. 08141/36379-65 // Fax 08141/36379-63
E-Mail: [email protected] // ... See more Thanks very much for all your suggestions and advice! That was a very quick response with lots of useful information.)
Regards
Carl
Amper Translation Service
Carl Carter
Bahnhofstr. 2
D-82256 Fürstenfeldbruck
Tel. 08141/36379-65 // Fax 08141/36379-63
E-Mail: [email protected] // Web: http://www.ampertrans.de
Mitglied des deutschen Übersetzerverbandes BDÜ, www.bdue.de
Proz profile: http://www.proz.com/profile/774269
================================ ▲ Collapse | | | PO files with Trados Studio? | Mar 19, 2010 |
Does anyone know whether or not it is possible to translate PO files using Trados Studio 2009? | |
|
|
Carl Carter Germany Local time: 01:58 German to English + ... TOPIC STARTER What's a "real" software localisation tool? | Mar 19, 2010 |
Achim Herrmann wrote:
Hello Carl,
I would recommend to use a real software localization tool that supports PO files.
Hi Achim,
I appreciate your comments, but I think it should be said in all fairness that tools like POedit ARE "real software localisation tools"; they may not be as elaborate or feature-rich as Passolo, but they still work well and include a TM component as well.
Our own tool SDL Passolo has a special PO add-in that also supports plural cases.
Actually, all I need the tool for is for making a reliable word count. The actual translation is going to be done by someone else using either POedit or LocFactory.
You didn't say what sort of counting is possible using Passolo. Does it measure "msgstr" as well as "msgid"? I need to find out the no. of words in each "msgstr" line, which is for the translation. The reason is that the files have already been translated once (into German) and the customer wants us to translate the German version into French. Can Passolo do this? Déjà Vu X can.
Regards
Carl
P.S. Just as an aside, why isn't it possible to download a trial version of Passolo without having to fill in a form first of all? I don't like the thought of a sales rep contacting me at some point as a result of me disclosing my personal data. As I've said, all I want to do is count the words in our PO files. Perhaps your firm should reconsider its sales approach. After all, people who want to buy the product will get in touch with you at some point anyway, won't they?
====================== | | | Carl Carter Germany Local time: 01:58 German to English + ... TOPIC STARTER The Translate Toolkit | Mar 19, 2010 |
Hi Eric,
Thanks again for your helpful suggestion. I took a look at this today, but decided not to proceed with it any further because the PO counting tool is basically just a small utility in a very big software package; it seems you (now) have to download three applications as a bundle to get The Translate Toolkit. Installing the programs and then having to read the documentation to use the count tool via the Windows command line - which I'm not very familiar with either - seems like a lot of work to me. A colleague of mine did a word count using LocFactory (for Apple Macs), which worked, and I managed to do one pretty easily using OmegaT today, which I haven't used before; the results were OK, I guess, but it doesn't show you which strings have been measured - I expect it counted "msgid", but I need to count "msgstr" (as I've explained in my reply to Achim's message).
Anyhow, although I haven't got much further with my problem, I have learnt a bit more about the programs used for this type of work.
Regards
Carl
Amper Translation Service
Carl Carter
Bahnhofstr. 2
D-82256 Fürstenfeldbruck
============ | | | Didier Briel France Local time: 01:58 English to French + ... OmegaT count both msgid and msgstr | Mar 19, 2010 |
Carl Carter wrote:
I managed to do one pretty easily using OmegaT today, which I haven't used before; the results were OK, I guess, but it doesn't show you which strings have been measured - I expect it counted "msgid", but I need to count "msgstr" (as I've explained in my reply to Achim's message).
Since 2.0.4, OmegaT reads existing translations in PO files.
That means that both msgid and msgstr are counted.
Didier | | | Juan Pagola Argentina Local time: 19:58 English to Spanish + ...
Hi Didier,
Here's an easy way to count words on a .PO file.
1. Download POedit at: http://www.poedit.net/download.php (it's free).
2. Open your .PO file and then go to File>Export as HTML.
3. Open the exported HTML file and copy all the text to translate.
4. Paste the text in a new Word document and count the words as you would usually d... See more Hi Didier,
Here's an easy way to count words on a .PO file.
1. Download POedit at: http://www.poedit.net/download.php (it's free).
2. Open your .PO file and then go to File>Export as HTML.
3. Open the exported HTML file and copy all the text to translate.
4. Paste the text in a new Word document and count the words as you would usually do ▲ Collapse | |
|
|
vslavik (X) Czech Republic Re: Use Poedit | Jan 11, 2014 |
Juan Pagola wrote:
1. Download POedit at: http://www.poedit.net/download.php (it's free).
2. Open your .PO file and then go to File>Export as HTML.
3. Open the exported HTML file and copy all the text to translate.
4. Paste the text in a new Word document and count the words as you would usually do
Hi Juan, Poedit developer here. Let me chime in with a brief remark -- you don't have to go through these hops anymore, Poedit now (finally!) has word count statistics in its Pro version. Sorry it took me so long to fix this. | | | To report site rules violations or get help, contact a site moderator: You can also contact site staff by submitting a support request » Measuring PO files - word counts, etc. Pastey | Your smart companion app
Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.
Find out more » |
| Trados Studio 2022 Freelance | The leading translation software used by over 270,000 translators.
Designed with your feedback in mind, Trados Studio 2022 delivers an unrivalled, powerful desktop
and cloud solution, empowering you to work in the most efficient and cost-effective way.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |