Author Topic: Linearize tmx on a translation unit basis  (Read 483 times)

spiros
« on: 20 Feb, 2019, 14:06:23 »

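1.
Find (the search string must start with all the white space that precedes the <tu and end with one extra space after it)
Code:
    <tu 
Replace with (<tu followed by one space)
Code:
<tu 
This moves every opening <tu tag to the start of its line, so the regular expression in step 2 leaves the line break in front of it alone. The trailing space keeps the search from matching <tuv.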

2.
Find (with regular expressions on)
Code:
\n\s+
Replace with nothing.

The result is a list of one-liners, with each line holding a full TU.
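
For anyone who would rather script it, the two find/replace steps above can be sketched in Python roughly as follows. This is only an illustration, not something from my EmEditor workflow: the function name linearize_tmx is made up, and the CRLF normalization is an assumption for Windows files.

```python
import re

def linearize_tmx(text: str) -> str:
    """Mirror the two find/replace steps: one <tu ...>...</tu> per line."""
    # Assumption: normalize Windows line endings so "\n" matches everywhere.
    text = text.replace("\r\n", "\n")
    # Step 1: strip the indentation in front of every opening "<tu " tag.
    # The trailing space keeps this from matching "<tuv".
    text = re.sub(r"(?m)^[ \t]+<tu ", "<tu ", text)
    # Step 2: delete every newline that is followed by white space.
    # Lines that now start with "<tu " keep their line break.
    text = re.sub(r"\n\s+", "", text)
    return text

sample = (
    '<tmx version="1.4">\n'
    '  <body>\n'
    '    <tu tuid="1">\n'
    '      <tuv xml:lang="en">\n'
    '        <seg>Hello</seg>\n'
    '      </tuv>\n'
    '    </tu>\n'
    '    <tu tuid="2">\n'
    '      <tuv xml:lang="en">\n'
    '        <seg>World</seg>\n'
    '      </tuv>\n'
    '    </tu>\n'
    '  </body>\n'
    '</tmx>\n'
)

if __name__ == "__main__":
    print(linearize_tmx(sample))
```

Note that an indented closing </body> ends up glued to the end of the last TU line, and that a <tu> written without any attributes (no space after <tu) is not matched by step 1.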

Why do I do that? Because most TM/TMX editors cannot handle big files. I use EmEditor instead, where, for example, I can search for invalid/corrupt characters, bookmark those TUs and then batch delete them.
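
The same batch delete can be sketched in Python once the file has one TU per line. Again, this is a rough illustration rather than what EmEditor does: the name drop_corrupt_tus is hypothetical, and "invalid" is assumed here to mean the U+FFFD replacement character or a control character that XML 1.0 does not allow.

```python
import re

# Assumption: "invalid/corrupt" means U+FFFD or a control character
# that XML 1.0 forbids (everything below U+0020 except tab, LF, CR).
INVALID = re.compile(r"[\uFFFD\x00-\x08\x0B\x0C\x0E-\x1F]")

def drop_corrupt_tus(linearized: str) -> str:
    """Remove every one-line TU that contains an invalid character."""
    kept = [
        line
        for line in linearized.split("\n")
        # Only TU lines are candidates for deletion; header lines stay.
        if not (line.startswith("<tu ") and INVALID.search(line))
    ]
    return "\n".join(kept)
```

One caveat: if the last TU line also carries the closing </body> (as happens when that tag was indented), deleting that line removes the tag too, so it has to be put back by hand.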
« Last Edit: 20 Feb, 2019, 16:45:45 by spiros »