Title European Union Language Resources in Sketch Engine
Authors Vít Baisa, Jan Michelfeit, Marek Medveď and Milos Jakubicek
Abstract Several parallel corpora built from European Union language resources are presented here. They were processed by state-of-the-art tools and made available to researchers in the corpus manager Sketch Engine. A completely new resource is introduced: the EUR-Lex Corpus, one of the largest parallel corpora available at the moment, containing 840 million English tokens. Its largest language pair, English-French, has more than 25 million aligned segments (paragraphs).
Topics Acquisition, Multilinguality, Corpus (Creation, Annotation, etc.)
Full paper European Union Language Resources in Sketch Engine
Bibtex @InProceedings{BAISA16.572,
  author = {Vít Baisa and Jan Michelfeit and Marek Medveď and Milos Jakubicek},
  title = {European Union Language Resources in Sketch Engine},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
}
