Data distribution

  • Parsing: test and training data can be downloaded from the dedicated web page. Contact: Cristina Bosco, bosco[at]di.unito.it
  • Domain Adaptation: – Test data are available [4/10/2011] – Training data can be downloaded from the dedicated web page. Contact: Simonetta Montemagni, simonetta.montemagni[AT]ilc.cnr.it
  • Named Entity Recognition on Transcribed Broadcast News: test data sent to participants [04/10/2011] Please note that training and test data are available for research purposes upon acceptance of a license agreement: – If you work for a non-profit research organization, you can obtain an unlimited Research License I-CAB is also available as part of the training data: – If you work for a non-profit research organization, you can obtain an unlimited Research License Contact: Manuela Speranza, manspera[at]fbk.eu
  • Cross-document Coreference Resolution: The CRIPCO corpus is freely available for research purposes upon acceptance of a license agreement.
  • Anaphora Resolution: Test data are available [04/10/2011] Training data are downloadable [12/09/2011]. This version incorporates several rounds of corrections. It also contains annotations of minimal spans that will be used for mention alignment (cf. guidelines). Minimal spans have been added to all the labels in columns 17 and 18, using the BIO notation (“B-MIN”, “I-MIN” and “O”). Minimal spans are encoded as the last parts of composite labels. Please note that no adjusted minimal spans are provided for documents 53, 69 and 68 (for these documents, minimal spans are equal to mention boundaries). OLD version of the training data is still available Contact: Olga Uryupina, uryupina[at]gmail.com
  • Super Sense Tagging: test data are available from the dedicated web page [04/10/2011] Training data can be downloaded from the dedicated web page Contact: Maria Simi, simi[at]di.unipi.it
  • Frame Labeling over Italian Texts: test data will be available from 12 October: for more information, please visit the dedicated web page [05/10/2011] – A subset of development can be downloaded from the dedicated web page. – The full training set is available from the dedicated web page. Contact: Roberto Basili, basili[at]info.uniroma2.it
  • Lemmatisation: for test and development data please contact the task organizer. Contact: Fabio Tamburini, fabio.tamburini[at]unibo.it
  • Automatic Speech Recognition – Large Vocabulary Transcription: test data are available [04/10/2011] To obtain training data please contact: Marco Matassoni, matasso[at]fbk.eu
  • Forced Alignment on Spontaneous Speech: test data are available together with a description [04/10/2011] Training data are downloadable [13/09/2011] Contact: Antonio Origlia, antonio.origlia[at]unina.it