Hoppa fram till innehållet

Ändringar

View changes from to


9 augusti 2022 15.58.31 UTC, Gravatar Alp Öktem:
  • Ändrade titel till Synthetic parallel corpora LAD-EN, TR (tidigare Synthetic parallel corpora LAD-EN, TR, ES)


  • Uppdaterade beskrivningen av Synthetic parallel corpora LAD-EN, TR från

    Synthetically produced parallel data using rule-based Spanish-Ladino translation. Sizes: Ladino-Spanish: 10,322,033 sentences Ladino-Turkish: 4,574,021 sentences Ladino-English: 5,748,012 sentences Paper: https://arxiv.org/abs/2205.15599 This dataset is created as part of project "Judeo-Spanish: Connecting the two ends of the Mediterranean" carried out by Col·lectivaT and Sephardic Center of Istanbul within the framework of the “Grant Scheme for Common Cultural Heritage: Preservation and Dialogue between Turkey and the EU–II (CCH-II)” implemented by the Ministry of Culture and Tourism of the Republic of Turkey with the financial support of the European Union. The content of this website is the sole responsibility of Col·lectivaT and does not necessarily reflect the views of the European Union.
    till
    Synthetically produced parallel data using rule-based Spanish-Ladino translation. Sizes: Ladino-Turkish: 4,574,021 sentences Ladino-English: 5,748,012 sentences Total Ladino-Spanish: 10,322,033 sentences (This is basically combination of the two corpora) Paper: https://arxiv.org/abs/2205.15599 This dataset is created as part of project "Judeo-Spanish: Connecting the two ends of the Mediterranean" carried out by Col·lectivaT and Sephardic Center of Istanbul within the framework of the “Grant Scheme for Common Cultural Heritage: Preservation and Dialogue between Turkey and the EU–II (CCH-II)” implemented by the Ministry of Culture and Tourism of the Republic of Turkey with the financial support of the European Union. The content of this website is the sole responsibility of Col·lectivaT and does not necessarily reflect the views of the European Union.