Translation - Μετάφραση

General => Announcements => Topic started by: spiros on 19 Jan, 2015, 14:06:40

Title: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: spiros on 19 Jan, 2015, 14:06:40
Convert online Excel files (xls, xlsx) to TMX (https://translatum.gr/cgi-bin/excel-to-tmx.pl)

xlsx/xls/tsv > tmx (https://translatum.gr/cgi-bin/excel-to-tmx.pl) is part of a series CAT conversion tools developed by Translatum:

— tmx > text (https://www.translatum.gr/cgi-bin/tmx-to-text.pl) (read Help (https://www.translatum.gr/forum/index.php?topic=388540.0))
— xlsx/xls/tsv > tmx (https://translatum.gr/cgi-bin/excel-to-tmx.pl) (read Help (https://www.translatum.gr/forum/index.php?topic=388541.0))
— xlsx/xls/tsv > MultiTerm xml (https://translatum.gr/cgi-bin/excel-to-multiterm-xml.pl) (read Help (https://www.translatum.gr/forum/index.php?topic=388542.0)).

This tool converts the following types of files to a tmx memory file. It accepts as input the following types of files:

— xlsx file
— xls file
— tab separated text file (it must be in UTF-8 format)

You can better prepare your files for this conversion by following some specific steps.
Make sure only plain LATIN characters (without accents) are contained in the file name. [Important!]
Make sure there is no header row. If there is one, then delete it. A header row could contain descriptive information about the column content, i.e. language name.
Make sure there are no leading, trailing or multiple spaces (You can use the free ASAP Utilities to fix that in Excel by going to ASAP Utilities (http://www.asap-utilities.com/) > Text > Delete leading, trailing and excessive spaces).
Make sure there are no empty rows (You can use the free ASAP Utilities to fix that in Excel by going to ASAP Utilities > Columns and rows > Delete all empty rows).
Make sure there are no empty cells (i.e. cells where source or target text is missing). To fix that, Sort text first on Column A, scroll down to check on any orphan entries, and then repeat with Column B.

A typical file structure is with source language in Column A and target language in Column B (in Sheet 1). You may have different notes in each cell of Column C if you want to further specify your translation units. If you have a note that applies to the whole file, just enter it in the tool in the Add A Note (optional) box. I.e. it could be “Medical” or you could have the customer name there. If you have different values in Column C for each translation unit check the Last Column check box.

In the Enter Column 1 Language check box enter the column 1 language code. It could be EN, EN-US, FR, FR-FR, etc. Please see a list of these codes (http://msdn.microsoft.com/en-us/goglobal/bb896001.aspx) (you need the code listed in the Culture Name column). Make sure that the codes match the ones used by the translation memory in which you would like to import the tmx file. If they do not match, you may need to open the tmx file with a text editor like Notepad++ (http://notepad-plus-plus.org/) and run a find/replace to fix the language code. In the Enter Column 2 Language check box enter the column 2 language code.

(Note that you can add more languages by clicking on Add new column, thus creating a multilingual tmx file).

In the Creator box enter your first name (in Latin characters) or leave it blank (Translatum TMX maker will be used).

Click Submit to upload the file and wait until the conversion is completed and a download link is displayed. Right click on the download link and save the file in the desired location. Unzip it and then import it to your CAT tool. Please read how to import a tmx file in various CAT tools (https://www.translatum.gr/forum/index.php?topic=11363.0).

Maximum file size limit is 10 MB. In case your file is larger than that you can a) split your file into smaller ones b) if your file is xls save it first as xlsx (the file size is reduced in xlsx format).

Files are deleted hourly and they are not used for any purpose.

The tool was based on a free script by Hungarian translator András Farkas.

The video below illustrates the procedure of conversion of an Excel file and its import to SDL-Studio 2014:

Create tmx from Excel online with Translatum's converter - YouTube (https://youtu.be/bpPqvBNuvHo)
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: UFZ on 20 Jun, 2017, 10:49:59
Hi, I wonder if this program still works?
I was using it 2 weeks ago, went on 2 week vacation, came back and can't convert any file.
Instead of opening a page this link: Convert Excel files (xls, xlsx) and tab delimited txt to TMX (https://translatum.gr/cgi-bin/excel-to-tmx.pl) keeps saving a file on my computer, it happens each and every time.
Could you help me out, am I doing sth wrong or has it stopped working?


Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: spiros on 20 Jun, 2017, 12:23:01
This is being investigated; will let you know when I have news.
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: UFZ on 21 Jun, 2017, 08:57:44
Thank you
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: spiros on 21 Jun, 2017, 16:01:40
Should be fixed now.
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: UFZ on 03 Aug, 2017, 10:30:38
It is, thank you!
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: mk_loca on 28 Nov, 2017, 16:02:42
Hi, this tool looks like it could really save me a lot of time as I have to import a high volume of translations (Korean to English) from xls into my CAT program (Deja Vu x3).

However, despite following the guide above to the letter, I am still not getting the desired results. In sum, my column B that contains our translations does not import alongside the contents of column A (containing the source text in Korean).

imgur.com/a/GM7RL (http://imgur.com/a/GM7RL)

I have come close to success when I tried pasting the contents of my sheet to a text file (not tab delimited, see below). I ran this text file through your tool, imported the resulting TMX into DJV and, instead of translating the source text into English, the quotation marks around each phrase were translated. If I remove the quotations from the text file, I end up with no translation at all, just as with my xls > TMX attempt.

imgur.com/a/iRtJj (http://imgur.com/a/iRtJj)

As I said, this text file was not Tab Delimited, because whenever I try to export my sheet in this format, the resulting text file does not display Korean characters, only a litany of question marks, no matter what encoding option I use (perhaps I am using the incorrect encoding for ko_KR).

If anyone has any ideas where I might be going wrong your help would be much appreciated. You will most likely preserve my sanity and prevent me from getting RSI.

Thanks in advance.

Marc
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: spiros on 28 Nov, 2017, 16:26:50
Did you try using Xbench (https://www.xbench.net/index.php/download)?
Did you try exporting into tmx a Korean TM that works fine on your system and compare the language codes/encoding used with the one from this tool?
Alternatively, e-mail me a sample Excel file for testing.
https://www.translatum.gr/contact.htm
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: mk_loca on 28 Nov, 2017, 19:08:36
Hi Spiros,

Thanks for the info. I will give all these things a try tomorrow, just leaving the office now.

Will be in touch.

Thanks again,

M
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: mk_loca on 29 Nov, 2017, 15:51:40
Hi Spiros,

Message sent to the address on your contact page.

Best,

M
Title: Re: Convert Excel files (xls, xlsx) to TMX (free online tool)
Post by: spiros on 29 Nov, 2017, 17:18:05
Your xls had notes in it. You may have to change language codes in the tmx file (open and change with Notepad++) to match your TM. I used ko-KR and EN-US.

I used copy/paste from Excel to a txt Editor (I use EmEditor, Notepad++ is just fine).