Movie subtitling program eliminates letters with diacritical markings

Diacritics are symbols added above, below or alongside letters to indicate correct pronunciation. Viewers of films in languages employing diacritics may notice that some words in the subtitles have missing letters. The problem occurs because software used to generate subtitles simply deletes any letter carrying a diacritical mark. This confusing flaw is most common with personal names, which generally do not "translate". For example: Šaran becomes aran,  Šimáček becomes imaek and Čížek becomes iek. In a multicultural educational environment, especially film-related courses where English-speaking students are expected to identify characters by name, participate in discussion and write essays on the subject matter, losing as many as half the letters in someone's name constitutes a technologically imposed handicap. This problem is unlikely to be resolved anytime soon because oddly enough, as one online commentator put it, “Most subtitle programs do not work correctly with Unicode files unless the Unicode subtitles file is in English.”  

 

Details

Article ID: 10634
Created
Fri 8/11/23 2:55 PM
Modified
Fri 3/8/24 12:06 PM