Meedan

The Meedan Blog Archive

Meedan Releases the World’s First Open Access Arabic/English Translation Memory

Today Meedan is releasing the first publicly available open licensed Arabic/English Translation Memory (TM). This release comes as a part of Meedan’s ongoing commitment to improving the state of Arabic language machine processing research and supporting translators and machine translation researchers.

The new translation memory, called MeedanMemory, contains around one million words in aligned Arabic and English sentence pairs. It constitutes an invaluable resource for translation professionals, Arabic linguistic researchers, academics and volunteer communities, and is the first translation memory of such size and sophistication to be released under an open access license.

MeedanMemory is packaged as a standards-compliant Translation Memory Exchange (TMX) file, which facilitates its integration with most computer assisted translation tools and translation memory managers.

Professional translators can utilize MeedanMemory to support their translation work and improving its efficiency, and to increase consistency and quality in the translation outcome. Researchers can use this resource in designing and implementing their research activities into Arabic computer linguistics and machine language processing, which will lead to the development of better text search engines and document analysis tools and applications. And community organizations, non-profits and volunteer communities can leverage MeedanMemory to facilitate translation between Arabic and English of the content they produce for lower costs, lower turnaround times and higher quality.

The MeedanMemory is released under the Open Database License, which permits the use, copy and distribution of the translation memory, produce work using it, and to modify, transform and build upon the translation memory.

Meedan is pleased to announce this first release of MeedanMemory, and will continue to release data generated from the Meedan translation community into this open repository. We also hope that other organizations will contribute to the growth of this unique and valuable resource for the Arabic language and Arabic digital content that holds great potential for supporting Arabic content projects and initiatives.

To download your copy of MeedanMemory, please visit: http://github.com/anastaw/Meedan-Memory

UPDATE (2013): After a user reported the above link broken, please use the following URL to access MeedanMemory: https://github.com/meedan/Meedan-Memory

For more information about MeedanMemory, please contact: info@meedan.net Meedan Cross-language Arabic-English Social Networking: beta.meedan.net

Comments on this post

2009-11-09 20:58:08 -0800
Hi Mohsen, Thank you very much for your feedback. To download the file, you need to go to: http://github.com/anastaw/Meedan-Memory Click on the "Download" button. A pop-up window will appear asking you which compression format you prefer. You can choose either ZIP or RAR based on the compression software you use. Once the file is downloaded, you can extract it to your computer hard drive. To be able to use the TMX file, you need a computer aided translation software (CAT). For more information about how to use MeedanMemory, please check the following post: http://blog.meedan.net/2009/11/10/how-to-use-me... I hope that helps.
2009-11-11 21:20:35 -0800
Wow almost 500 000 words of data! Congratulations
2010-03-05 23:37:23 -0800
This is fantastic. Greate initiative, I will forward this link to everyone I know. Which brings me to the question of how come there is no 'Sharing' option? Or is there and I just haven't found it? :)
2010-03-06 07:37:23 -0800
This is fantastic. Greate initiative, I will forward this link to everyone I know. Which brings me to the question of how come there is no 'Sharing' option? Or is there and I just haven't found it? :)
2010-04-16 19:47:16 -0700
I only wish to thank you for such a helpful sevrice.
2010-04-30 21:43:40 -0700
The link to MeedanMemory doesn't work. Has it changed?
2010-04-30 23:53:39 -0700
Looks to be working fine Gingi - try here http://github.com/anastaw/Meedan-Memory
2010-06-20 11:20:33 -0700
عظيم أن تكبر الفكرة أما صاحبها
2010-08-24 15:09:28 -0700
Very helpful tool, thanks for sharing!
2011-05-23 07:12:25 -0700
شكرا لكم على هذا المجهود الرائع ونتمنى منكم المزيد
2011-07-01 07:17:46 -0700
محدش فاهم حاجة خالص
2011-07-02 08:04:38 -0700
أشير بالتقدير إلى قرار معالي مدير الجامعة رقم (18840) وتاريخ17/7/1432 القاضي بالموافقة على تعيين الدكتورة/حصه بنت فهد نصير الشريف البركاتي .. على وظيفة (أستاذ مساعد) بقسم الكيمايء . عليه نرفق لسعادتكم كامل المعاملة للتكرم بموافاتنا برقم الوظيفة المراد تعيين سعادتها عليها وتسليمها الكتب الخاصة بها ومن ثم إعادة الماملة ليتسنى لنا إكمال اللازم .وتقبلوا خالص تحياتي وتقديري
2012-05-15 09:46:24 -0700
Nicely explained. Truly impressive and informative tips. Thanks for sharing.
2013-08-27 06:26:10 -0700
Thank you very much for all the effort and time you put into this, but the link which is supposed to show how to get the TM to work is broken. would you please look into that. Regards
2013-08-27 22:48:14 -0700
Thanks for pointing that out, I've updated the post and the working link is: https://github.com/meedan/Meedan-Memory
2013-08-29 07:24:54 -0700
Hi there, First of all I would like to thank you for the TM file you have put online, that is a great service for Arab readers. I have downloaded the file and unzipped it. I tired to open it using SDL Trados studio, but I could not and in order to do that I needed to change the format or do what they call upgrade MT. I did that but it came back with an error message. The error message is [Culture name '*all* is not supported. Is there an easy fix for that? Thank you very much,