English - Modern Greek XDXF/FreeDict dictionary

Converted by: Piotr Bański
Edition: 0.1
Size: 20984 headwords
Published by: FreeDict, 09 November 2009
at: http://freedict.org/

Availability: Available under the terms of the GNU General Public Licence, version 2.0 or later, OR under the terms of Creative Commons (CC) BY-SA license, version 3.0. You have to make the choice before you distribute a modified version of this dictionary.

Notes:
* Database Status: big enough to be useful

Source(s):
Full title of the source version: "English-Greek dictionary"
Original description: "Copyright: http://www.freelang.net/dictionary/; Version: 1.0 [FreeDict clean-up, phase II]"
Converted from version 1.0 distributed by {the XDXF project}, part of Sourceforge.net.

The Project:
This dictionary comes to you through nice people making it available for free and for good. It is part of the FreeDict project, http://www.freedict.org. This project aims to make many translating dictionaries available for free. Your contributions are welcome!

Changelog:
* 2009-11-09 #bansp: Version 0.1, 09 November 2009. Compiler/converter: Piotr Bański
* Created version 0.1 on the basis of the English-Greek dictionary distributed by the XDXF project
* This involved a clean-up of the original and conversion to TEI P5 simplified markup, according to the ideas sketched in a paper entitled "A repository of free lexical resources for African languages: the project and the method" by Piotr Bański and Beata Wójtowicz, accepted for the AfLaT-2009 (Language Technologies for African Languages) workshop, part of the 12th meeting of the European Association for Computational Linguistics (EACL). The paper is available from {the ACL web archive}.
* What follows is a copy of my notes written down during the clean-up stage and placed in the transitory file included in the first distribution of this dictionary (this is not entered in the CVS; sorry about the CDATA and rendering):

This glossary has apparently been put together from two others that used different headword conventions and different diacritics, and it was at least in part prepared by non-expert lexicographers. In part, it's a phrase book. It also appears to contain some specialized zoological terminology. It comes from the XDXF.sourceforge.net project.

Changes in the XDXF file (included in the distribution, not in the CVS):
  * some homophones are not disambiguated by means of POS, and some are collapsed into a single entry - I didn't fix such cases.
  * braces inside became dtrn/"dom"; inside : dtrn/"hint"
  * threw away over 200 duplicated entries, merged some of them, split a few others, especially around slashes; removed stray diacritics, fixed capitalization (to some extent only)
  * introduced instead of brackets or braces (needed a -like element)
  * tried to avoid touching equivalents, added Αθήνα to Athens (hmm), threw away stuff like burst out into tears ξεσπώ σε δάκρυα, ξεσπώ σε γέλια (there weren't many such cases, most of them involved duplicated headwords or equivalents; I only fixed those equivalents that I noticed in the process of pruning [some!] duplicate headwords)
  * the ordering of entries here is changed from the original (I wanted to have identical headwords next to each other, then I re-sorted); threw away lots of whitespace
  * changed "sea sick" into "I am sea-sick", according to the translation - it could be "be sea-sick" as well; in general, there are lots of entries in "I", "be", "a" or "the", etc.; sometimes the only difference is in the handling of compounds ("seasick" vs. "sea-sick" vs. "sea sick")
  * this dictionary (well, glossary) still needs LOTS of lexicographic work

is just an abused element here; according to the XDXF spec, dtrn had the @type attribute, but the DTD (also included) wouldn't allow it, so I added:

Values:
  "hint" for further explanations/examples in Greek,
  "infl" for inflections of the headword (used inconsistently in the source),
  "pos" (inconsistent) for POS values of headwords in cases that I noticed during the clean-up phase,
  "dom" for some domain restrictions, etc., provided in English,
  "colloc" for broadly understood collocates of the headword or sense restrictions,
  "variant" (inconsistent) for spelling variants (instal(l)ment) but also dialectal variants (autumn/fall) and abbreviations,
  "editor" for internal notes.

Before the transformation into TEI, it had 20973 entries, some of which were still duplicated, and some of which should be split (e.g. "Norwegian", into adj and noun, etc.). The XDXF original had 21237 elements.

Piotr Bański, November 2009, http://freedict.org/