Collaborative Text Translation with DotSUB

Posted on April 5, 2009 by Claude Almansi

By Claude Almansi
Editor, Accessibility Issues

In a discussion about Uwe Müller’s dissertation regarding open access journals (see abstract with download link) on the A2k (access to knowledge) mailing list, Arif Jinha wrote that it would be great to translate it collaboratively into English. Great idea, especially for a 269-page dissertation.

The way Arif Jinha intends to collaboratively translate scholarly texts is based on the hypothesis that if two specialists thoroughly know each other’s subject, specialist B, even if he does not know specialist A’s language, is better able to understand – and render in his own language – specialist A’s work on the basis of even a dubious computer translation than a generic translator who masters both languages would be. However, generic bilingual translators could be of use for checking possible mistakes in details.

This is very true. For instance, the best translation of a poem by Seferis into French was done by the French poet Yves Bonnefoy – who didn’t know Greek – on the basis of several English translations, in collaboration with Seferis who told him what he liked and disliked in these translations. And the same possibly extends to other fields of specialisation.

Collaborative Text Translation Tools

However, being just a generic translator, I have to translate the other way round, from the small end as it were. So Arif Jinha’s suggestion got me thinking about collaborative translation tools. There are such tools for software, such as Pootle, which split the interface into short strings presented in a table: a volunteer starts translating some, then another volunteer goes on. You can navigate by untranslated and “fuzzy” strings.

Problem 1: the strings are presented in alphabetical order, with only some coded indications of where the strings come from, and it takes some time to start understanding them. And one-word strings can be tricky: is “post” a noun or a verb, and if a verb, should we use the infinitive or the imperative, and if the imperative, the polite or the familiar form (in languages where both exist)?
Problem 2: you need a server on which to install this kind of tool.

Collaborative Text Translation with DotSUB

And then I remembered DotSUB. It is normally used for collaboratively captioning videos, but its interface is very similar to one of the software translation tools that I covered in Three Video Captioning Tools. And you can have longer strings, in the order you decide – in the order of a text too…

But I needed a video as a pretext first. So I made one by inserting a 4k black JPEG file in a video editor.

I timed it for 10 minutes and exported the video in the lowest possible resolution. Then I uploaded it into DotSUB and inserted some text from my blog post Making Web Multimedia Accessible Needn’t Be Boring, sentence by sentence.

DotSUB Transcription Tool

I left the default 3-second timing for each string in the “Add a transcription line” box and paid no attention to the black pretext video. Each transcribed string moves to the top-right table when you hit return and is automatically saved. When that was done, I clicked on “Mark this transcription complete” (bottom left) and moved to the DotSUB Translation Tool.
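To make the arithmetic concrete, here is a minimal sketch of how a text split into sentences maps onto consecutive 3-second slots of a 10-minute video (600 / 3 = 200 strings at most). The function names are hypothetical, the sentence splitter is deliberately naive, and the SubRip-style output is only an illustration: whether DotSUB would accept such a file is not something I have checked.

```python
import re

SECONDS_PER_STRING = 3      # the default timing mentioned above
VIDEO_LENGTH_SECONDS = 600  # the 10-minute black video


def split_sentences(text):
    """Naive split after '.', '!' or '?' followed by whitespace (hypothetical helper)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [part for part in parts if part]


def timestamp(seconds):
    """Format whole seconds as a SubRip-style timestamp, HH:MM:SS,mmm."""
    hours, rest = divmod(seconds, 3600)
    minutes, secs = divmod(rest, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},000"


def to_cues(sentences, step=SECONDS_PER_STRING, limit=VIDEO_LENGTH_SECONDS):
    """Give each sentence its own consecutive slot of `step` seconds."""
    if len(sentences) * step > limit:
        raise ValueError("the text needs a longer pretext video")
    cues = []
    for index, sentence in enumerate(sentences):
        start = index * step
        cues.append(
            f"{index + 1}\n{timestamp(start)} --> {timestamp(start + step)}\n{sentence}"
        )
    return "\n\n".join(cues)


if __name__ == "__main__":
    print(to_cues(split_sentences("First sentence. Second one! A third?")))
```

With these assumptions, a text of more than 200 sentences would need a longer video or a longer pretext image sequence.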

DotSUB Translation Tool

I clicked on the links to translate each string. (Actually, I only translated the text into French, but I forgot to take a screenshot first, so I made one of the interface for translating into Italian instead.)

When you choose a language for the captions in the video player of the resulting Collaborative translation DotSUB page, you get the translation in the corresponding language as a drop-down list under Video Transcription. To get rid of the list markings, just copy-paste it into the “source” or “html view” of a web editor. Here is the almost unedited result (I just made the subtitle a separate paragraph and bolded it, and I put the rest in italics):
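For anyone who would rather script that clean-up than paste into a web editor, here is a minimal sketch using only Python's standard library. It assumes the copied transcription arrives as HTML with one `<li>` element per string, which I have not verified against DotSUB's actual markup; the class and function names are hypothetical.

```python
from html.parser import HTMLParser


class ListItemText(HTMLParser):
    """Collect the text found inside each <li> element (assumed markup)."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_item = False

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_item = True
            self.items.append("")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_item = False

    def handle_data(self, data):
        if self._in_item:
            self.items[-1] += data


def list_to_paragraph(html):
    """Join the list items into one running paragraph, dropping the list markup."""
    parser = ListItemText()
    parser.feed(html)
    parser.close()
    cleaned = (" ".join(item.split()) for item in parser.items)
    return " ".join(text for text in cleaned if text)


if __name__ == "__main__":
    print(list_to_paragraph("<ul><li>Bonjour.</li><li> Au revoir. </li></ul>"))
```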

Certains pensent que l’obligation légale de se conformer aux règles d’accessibilité des contenus Web – celles du W3C ou, aux USA, la “section 508” mène forcément à des pages ennuyeuses, rien qu’en texte En fait, ces règles n’excluent pas l’utilisation du multimédia sur le web, mais imposent de le rendre accessible en “offrant des alternatives équivalentes pour des contenus auditifs ou visuels et en particulier: “Pour toute présentation multimédia à base temporelle (p. ex. film ou animation), il faut offrir des alternatives équivalentes (p.ex. sous-titres ou descriptions audios de la piste visuelle) avec la présentation [Priorité 1]” [1] Ce n’est pas une corvée aussi terrible qu’il ne semble, et elle peut être partagée entre plusieurs personnes, même si elles ne sont pas expertes en technologie et n’ont pas d’instruments perfectionnés.

Sous-titrage avec DotSUB.com

Exemple: Phishing Scams in Plain English de Lee LeFever, en http://dotsub.com/view/41ffcc22-6609-4780-bf9d-5bcf88d3197d [2] Ici, la vidéo a été téléchargée dans DotSUB.com, et plusieurs volontaires l’ont sous-titrée en diverses langues. Le résultat peut être insérer dans un blog, un wiki ou une page web. Les sous-titres apparaissent aussi comme texte copiable sous “Video Transcription”: commode si des gens veulent citer des passages dans une discussion de la vidéo. En outre, une transcription d’une vidéo tend aussi à améliorer sa position dans les moteurs de recherche, qui indexent principalement les textes. Le seul problème est que les sous-titres couvrent une partie substantielle de la vidéo

Summing up so far:

Of course, I attempted this alone. But it would also work with several people collaborating in the translation. In theory, even the transcription, sentence by sentence, of the original text could be shared, but I haven’t checked yet if a collaborator could decree that a transcription is finished when it isn’t, thus blocking the transcription.

In the case of a longish text that must be translated into several languages (hopefully in collaboration with many people), this way of using DotSUB might prove useful because it is easy to toggle between the different versions from the main page.



Ben Brumfield, on April 12, 2009 at 11:42 am said:

That’s pretty neat. How do you expect that this would handle revisioning? I’ve often thought you’d want a language expert collaborating with a subject matter expert, revising each other’s translation in turn.

Claude Almansi, on April 13, 2009 at 7:26 am said:

Good observation: DotSUB does not really handle revisioning, though it is possible to export the “subtitles” (here the translation of a text) at any stage – which of course is a safeguard in case someone badly messes up, deliberately or by mistake. But that’s not the same as the revision feature in a word processor or the history of versions in a wiki.
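The export safeguard could be stretched a little: if successive exports are kept, comparing two of them gives a rough picture of what changed in between. Below is a minimal sketch with Python's standard `difflib`, assuming each export has been saved as plain text with one string per line; the function name is hypothetical, and this remains a workaround, not a real revision history.

```python
import difflib


def changed_strings(earlier_export, later_export):
    """Return the lines removed ('- ') or added ('+ ') between two exports."""
    diff = difflib.ndiff(earlier_export.splitlines(), later_export.splitlines())
    # ndiff also yields unchanged ('  ') and hint ('? ') lines; keep only changes.
    return [line for line in diff if line.startswith(("- ", "+ "))]


if __name__ == "__main__":
    before = "Bonjour.\nAu revoir."
    after = "Bonjour.\nSalut tout le monde."
    for line in changed_strings(before, after):
        print(line)
```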

I should have added that I thought of hijacking DotSUB in this way for the first complete draft of translations, and then moving the translations elsewhere for fine-tuning. One interesting platform for that would be David Lebow’s Hylighter: see hylighter-edu (1), where the comments appear as marginalia, and then get integrated by the document manager. This would work great for the collaboration between subject matter expert and language expert, I think.

Another, more obvious solution: a wiki. But the possibility of first using marginal comments for suggestions is appealing.

(1) the “normal” hylighter.com would work just as well, but the verbal explanations on hylighter-edu are more complete.

Ben, on April 15, 2009 at 4:28 am said:

Thanks for the link to Hylighter — I hadn’t seen it before. Reminds me a bit of CommentPress and eComma.

One of the things I’ve been mulling over is the need to “fork” a translation for different target languages. Do you think that the best approach would be through multiple, independent translation projects, or to link/combine them in some way?
