Sun 04 Jan 2015 00:10:00 UTC
* Release 0.4 (with change in the language codes apertium-es-an --> apertium-spa-arg)
* Main changes since version 0.3:
   * Adoption of three-letter codes: spa (old es) and arg (old an).
   * Use monolingual packages apertium-spa and apertium-arg (separated monolingual and bilingual data).
   * Trimming of the monolingual dictionary (in both directions, although only the spa monodix is really trimmed     since arg monodix has grown in paralell to the bidix).
   * Added lexical selection support (translate-to-default-equivalent.xsl not used anymore).
   * Improvement in coverage and morphology of Aragonese monolingual dictionary.
     > 552 paradigms,
     > 21271 lemmae (26103 entries, of which, 8150 proper nouns), including 2518 multi-words.
     Nave coverage: from 89.8% in an.wikipedia to 92.4% in a corpus of narrative texts.
   * Some improvements an fixes in verbal morphology.
   * Several agrammatical or rare combinations of enclitic pronouns are now ignored.
   * Some changes in the preferred (generated form): 'pa' (instead of 'ta') to translate 'para', 'cantatz' (instead of 'cantat') as p2.pl imperative, 'coixeyo' rather than 'coixeo'...
   * Extended support for dialectal verbal morphology analysis ('anatz', 'veden', 'anirem', 'quisto', benasques strong infinitives)
   * new paradigms ("coch/n__n", "troz__n", "restaur/ant__n", "contpa__pr")
   * duplicated paradigms eliminated (e.g. "tapi/z__n")
* Main pending Issues:
   * Still room to improve on coverage.
   * Still much to work on transfer.
   * Develop lexical selection rules

Tue 29 Dec 2012 9:00:00 UTC
* Release 0.3
* Improvements since version 0.2:
   * Improved tagger (PoS desambiguator) for Aragonese. Changes in the tagger
   rules (...an.tsx) and unsupervised training in the Wikipedia corpus. 
   * Improvement in coverage and morphology of Aragonese monolingual dictionary.
     > 546 paradigms, 
     > 21073 lemmae (of which, 8047 proper nouns), including 1921 multi-words. 
     ~568 000 different analyzable surface forms, 
     ~117 500 different generated surface forms. 
     Nave coverage: from 88.4% in an.wikipedia to 94.0% in a corpus of narrative texts.
   * Some paradigms fixed (e.g. seguir)
   * Some rules fixed (e.g. prep+el+que --> prep+o+que)
   * Some rules added (vase --> se veiga, laselo --> se lo leiga, behaviour of
   buen, mal before a noun)
   * Improved distinction beteen a<pr> and a<det> (a2) and two meanings of "ta" so
   that the postgenerator can apply different rules to them.
   * Use of apocopated posesives mi, tu, su is dealt with in analysis.
   * Periphrastic past dealt with in analysis (synthetic forms are used for
   generation, although rules for using periphrastic instead are written and
   commented).
* Main pending Issues:
   * Still room to improve on coverage.
   * Still much to work on transfer.

Tue 6 Sept 2011 9:00:00 UTC
* Release 0.2.1
* Solved postgeneration of a (preposition) + a (article f sg) using double
label <a/><a/> for a (article f sg).
* See ver 0.2 for main pending issues.

Sun 10 Jul 2011 19:20:14 IST
* Release 0.2
* First operative bidirectional release.
* Improvements:
   * Aragonese monodix and bidix in the initial release (0.1.0) were checked 
     and corrected.
   * Huge improvement in the completeness of verbal and nominal morphology. 
     Wide paradigms analysing most dialectal forms and orthographic variations 
     (see Note on orthography below).
   * Closed classes completed.
   * Treatment of enclitics (up to combinations of two enclitics), including 
     dialectal non-standard combinations.
   * Treatment of apostrophation (analysis and (post)-generation).
   * Orthographic variation concentrated on monodix.
   * Dialectal variation concentrated (when possible) on monodix.
   * Possesives fixed.
   * Impersonal haber-ie dealt with.
   * Important increase in the dictionary size and coverage. 
     > 510 paradigms, 
     > 17500 lemmae (of which, 8025 proper nouns), including 1786 multi-words. 
     ~500 000 analyzable surface forms, 
     ~120 000 generated surface forms. 
     86.3% nave coverage in wikipedia.
   * es monodix trimmed to bidix using ignore label (i="yes").
* Main pending Issues:
   * Still room to improve on coverage.
   * Still much to work on transfer (current version based on a reduced and 
     tuned subset of rules from es-ca).
   * Current POS disambiguator is directly taken from es-ca.
     Note: Orthography is taken from "Academia de l'Aragons - Estudio de 
     Filolocha Aragonesa", a board created in the II Congreso de l'Aragons.  
     http://www.academiadelaragones.org/biblio/EDACAR7_2.pdf . 
     It is the orthography used in Wikipedia in Aragonese 
     (http://an.wikipedia.org). 
     There is  partial compatibility in analysis with other previously used 
     orthographies.

Sun Sep 26 15:42:44 IST 2010
 * Initial release (0.1.0)

