Here you find the resources which we offer for download.
For further questions contact julielab@listserv.uni-jena.de
GeneReg - abstracts dealing with the regulation of gene expression in the model organism E. coli
FSU-PRGE - The FSU PRotein GEne corpus
JSynCC - Jena Synthetic Clinical Corpus
JCoRe NLP tool suite - for the application purposes of semantic search, information extraction and text mining
UIMA type system - a annotation type system which covers formal document structure and document meta information as well as the linguistic levels of morphology, syntax and semantics
GeNo - system for gene name normalization
Most models that we provide are integrated in our JCoRe tool set. However, some of them might be usable outside of JCoRe.
Please find the respective JCoRe repository on our GitHub page.
The appropriate models are then stored under src/main/resources/de/julielab/jcore/ae/…/model/ if you want to download them separately.
German clinical models trained on FraMed for the LREC2014 paper titled Disclose Models, Hide the Data - How to Make Use of Confidential Corpora without Seeing Sensitive Raw Data:
In February 2009, Fredrik Olsson (SICS, Schweden) and Katrin Tomanek arranged a web survey to analyze and understand the extend to which Active Learning has been used to support the annotation of textual data in context of natural language processing. Moreover, we aimed at finding out about the reasons to why or why not AL has been found applicable to a specific task.
The results of the survey were presented at the Workshop on Active Learning for NLP at NAACL, June 2009 and are published in the workshop proceedings:
Katrin Tomanek and Fredrik Olsson. A Web Survey on the Use of Active Learning to support Annotation of Text Data. Proc. Workshop on Active Learning for NLP at NAACL 2009, 2009. [bib] [pdf]
The survey data can be downloaded here:
If you are interested in the raw survey data (which is in some kind of XLS format), please send a mail to the JULIE Lab staff.
Visit us on GitHub
© 2022 JULIE Lab