edit distance -> απόσταση επεξεργασίας, εύρος επεξεργασίας, εύρος αναθεώρησης, απόσταση διόρθωσης, απόσταση μετασχηµατισµού


  • Administrator
  • Hero Member
  • *****
    • Posts: 813938
    • Gender:Male
  • point d’amour
edit distance -> απόσταση επεξεργασίας

In computational linguistics and computer science, edit distance is a way of quantifying how dissimilar two strings (e.g., words) are to one another by counting the minimum number of operations required to transform one string into the other. Edit distances find applications in natural language processing, where automatic spelling correction can determine candidate corrections for a misspelled word by selecting words from a dictionary that have a low distance to the word in question. In bioinformatics, it can be used to quantify the similarity of DNA sequences, which can be viewed as strings of the letters A, C, G and T.

This time, however, he gave a different kind of talk in which he suggested a new kind of pricing model. I'm probably not going to do it complete justice in this short article, but let me try to rephrase his main points. He argued that since there is no discernible and reliable differentiator between fuzzy translation memory matches and machine translation suggestions (assuming that you are using a specifically trained machine translation engine for a text type that is suited for that kind of translation), we should stop differentiating them in their pricing. Instead, they should all be paid by edit distance. ("Edit distance" is the now widely used approach to evaluating the number of changes the editor or translator had to make to an MT suggestion before delivering it.) Doing this, according to Jay, will protect the translator from poor-quality machine translation (because the edit distance -- or rewrite from scratch --will in that case be large enough for 100% payment) as well as from bad translation memories (same reason). Also, he suggests payment for MT suggestions with no edit distance, i.e., suggestions where no edits were deemed necessary (20% of the word price) at a rate twice as high as a 100% TM match (10%) to compensate for the effort to evaluate their accuracy. He also suggests a 110% rate for an edit distance of 91-100%, taking into account the larger effort needed to "correct" something that was rather useless in the first place.
Jost Zetzsche, The 318th Tool Box Journal
« Last Edit: 30 Oct, 2020, 03:05:48 by spiros »


Search Tools