Publications
2009
[23] Erik Fäßler, Rico Landefeld, Katrin Tomanek, and Udo Hahn. LuCas - A Lucene CAS Indexer. In Proceedings of the 2nd UIMA@GSCL Workshop, 2009 [to appear]
[22] Katrin Tomanek and Udo Hahn. Reducing Class Imbalance During Active Learning for Named Entity Annotation. K-CAP'09 --- Proceedings of the 5th International Conference on Knowledge Capture, 2009 [paper won KCAP's Best Paper Award] [pdf]
[21] Katrin Tomanek and Udo Hahn. Semi-Supervised Active Learning for Sequence Labeling. In ACL'09 -- Proceedings of the 47th Annual Meeting of the Association of Computational Linguistics, 2009 [ bib ] [ pdf ]
[20] Katrin Tomanek and Udo Hahn. Timed Annotations --- Enhancing MUC7 Metadata by the Time It Takes to Annotate Named Entities. In The LAW at ACL/IJCNLP 2009 - Proceedings of the Third Linguistic Annotation Workshop, 2009 [ bib ] [ pdf ]
[19] Fredrik Olsson and Katrin Tomanek. An Intrinsic Stopping Criterion for Committee-Based Active Learning. In CoNLL '09 -- Proceedings of the Conference on Natural Language Learning, 2009 [ bib ] [ pdf ]
[18] Udo Hahn, Katrin Tomanek, Ekaterina Buyko, Jung Jae Kim, and Dietrich Rebholz-Schuhmann. How Feasible and Robust is the Automatic Extraction of Gene Regulation Events ? A Cross-Method Evaluation under Lab and Real-Life Conditions. In Proceedings of the NAACL workshop on BioNLP 2009, 2009. [ bib ] [ pdf ]
[17] Katrin Tomanek, Florian Laws, Udo Hahn, and Hinrich Schütze. On Proper Unit Selection in Active Learning: Co-Selection Effects for Named Entity Recognition. In Proc. Workshop on Active Learning for NLP at NAACL 2009, 2009. [ bib ] [ pdf ]
[16] Katrin Tomanek and Fredrik Olsson. A Web Survey on the Use of Active Learning to support Annotation of Text Data. Proc. Workshop on Active Learning for NLP at NAACL 2009, 2009. [ bib ] [ pdf ]
[15] Joachim Wermter, Katrin Tomanek , and Udo Hahn. High-Performance Gene Name Normalization with GENO. Bioinformatics, 2009. [ article at bioinformatics ]
2008
[14] Roi Reichart, Katrin Tomanek , Udo Hahn, and Ari Rappoport. Multi-task active learning for linguistic annotations. In ACL'08 - Proceedings of the 46th Annual Meeting of the Association of Computational Linguistics . Association for Computational Linguistics, 2008. [ bib ] [ pdf ]
[13] Katrin Tomanek and Udo Hahn. Approximating learning curves for active-learning-driven annotation. In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08) , Marrakech, Morocco, May 2008. [ bib ] [ pdf ]
- [12] Udo Hahn, Ekaterina Buyko, Rico Landefeld, Matthias Mühlhausen, Michael Poprat, Katrin Tomanek , and Joachim Wermter. An overview of JCoRe, the JULIE lab UIMA component repository . In Proceedings of the LREC'08 Workshop `Towards Enhanced Interoperability for Large HLT Systems: UIMA for NLP` , pages 1-7, Marrakech, Morocco, May 2008. [ bib ] [ pdf ]
-
- [11] Udo Hahn, Elena Beisswanger, Ekaterina Buyko, Michael Poprat, Katrin Tomanek , and Joachim Wermter. Semantic annotations for biology - a corpus development initiative at the jena university language & information engineering (JULIE) Lab. In Proceedings of the Sixth International Language Resources and Evaluation (LREC'08) , May 2008. [ bib ] [ pdf ]
2007
- [10] Roman Klinger and Katrin Tomanek . Classical Probabilistic Models and Conditional Random Fields. Technical Report TR07-2-013, Department of Computer Science, Dortmund University of Technology, December 2007. ISSN 1864-4503. [ bib ] [ pdf ]
- [9] Katrin Tomanek , Joachim Wermter, and Udo Hahn. Sentence and token splitting based on conditional random fields. In PACLING 2007 - Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics , pages 49-57. Melbourne, Australia, September 19-21, 2007. Melbourne: Pacific Association for Computational Linguistics, 2007. [ bib ] [ pdf ]
-
- [8] Ekaterina Buyko, Katrin Tomanek , and Udo Hahn. Resolution of coordination ellipses in biological named entities using conditional random fields. In PACLING 2007 - Proceedings of the 10th Conference of the Pacific Association for Computational Linguistics , pages 163-171. Melbourne, Australia, September 19-21, 2007. Melbourne: Pacific Association for Computational Linguistics, 2007. [ bib ] [ pdf ]
-
- [7] Ekaterina Buyko, Scott Piao, Yoshimasa Tsuruoka, Katrin Tomanek , Jin-Dong Kim, John McNaught, Udo Hahn, Jian Su, and Sophia Ananiadou. Bootstrep annotation scheme: Encoding information for text mining. In Corpus Linguistics 2007 - Proceedings of the 4th Corpus Linguistics Conference . Birmingham, England, U.K., July 27-30, 2007, 2007. [ bib ] [ pdf ]
-
- [6] Udo Hahn, Ekaterina Buyko, Katrin Tomanek , Scott Piao, John McNaught, Yoshimasa Tsuruoka, and Sophia Ananiadou. An annotation type system for a data-driven NLP pipeline. In The LAW at ACL 2007 - Proceedings of the Linguistic Annotation Workshop , pages 33-40. Prague, Czech Republic, June 28-29, 2007. Stroudsburg, PA: Association for Computational Linguistics, 2007. [ bib ]
-
- [5] Katrin Tomanek , Joachim Wermter, and Udo Hahn. An approach to text corpus construction which cuts annotation costs and maintains corpus reusability of annotated data. In EMNLP-CoNLL 2007 - Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 486-495. Prague, Czech Republic, June 28-30, 2007. Stroudsburg, PA: Association for Computational Linguistics, 2007. [ bib ] [ pdf ]
-
- [4] Katrin Tomanek , Joachim Wermter, and Udo Hahn. Efficient annotation with the Jena ANnotation Environment (JANE). In The LAW at ACL 2007 - Proceedings of the Linguistic Annotation Workshop , pages 9-16. Prague, Czech Republic, June 28-29, 2007. Stroudsburg, PA: Association for Computational Linguistics, 2007. [ bib ] [ pdf ]
2006
- [3] Joachim Wermter, Katrin Tomanek , and Felix Balzer. Automatische Erkennung und effiziente Annotation von anonymisierungsrelevanten Begriffen in klinischen Freitexten. In GMDS 2006 - Tagungsband der 51. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie , pages 151-152. Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e.V. (gmds), 2006. [ bib ] [ pdf ]
2005
- [2] Katrin Tomanek . Ontology-driven classification of named entities based on a machine learning approach. Diplomarbeit, Institute of Applied Informatics and Formal Description Methods (AIFB), University of Karlsruhe (TH), Germany, 2005. [ bib ]
2004
- [1] Katrin Tomanek . Implementierung und Evaluierung eines hybriden Overlays auf Basis von CAN und Chord. Studienarbeit, Institute of Telematics, University of Karlsruhe (TH), Germany, November 2004. [ bib ]