Dr. Thomas Schmidt

Foto von Thomas Schmidt
© Thomas Schmidt

Leibniz-Institut für Deutsche Sprache
R 5, 6-13
D-68161 Mannheim

E-Mail: thomas(at)linguisticbits.de

Weitere Websites:

Dr. Thomas Schmidt

Functions:

CV:

  • 1992 - 1998: Studies in General Linguistics / English Linguistics / Romance philology and Mathematics / Computer Science at the Universities of Kaiserslautern and Mainz
  • 1996 - 1997: Studies in Linguistics and Artificial Intelligence at the University of Edinburgh, Scotland
  • 1998 - 1999: Language Resource Engineer at Philips Speech Processing, Aachen
  • 1999 - 2000: Studies "European Master of Linguistics" at the Free University Berlin and the Université Paris VIII
  • 2000 - 2011: Project Assistant / Principal investigator in the project "Computer-assisted creation and analysis of multilingual data", SFB 538 Multilingualism, University of Hamburg
  • 2004: PhD in German linguistics (text technology) from the University of Dortmund on Thesis title: Computergestützte Transkription – Modellierung und Visualisierung gesprochener Sprache mit texttechnologischen Mitteln
  • 2005 - 2006: DAAD Post-Doc-Researcher at the International Computer Science Institute, Berkeley
  • 2007 - 2008: Project assistant in the Digital Dictionary of the German Language, Berlin-Brandenburg Academy of Science
  • 2012 - 2021: Researcher at the Institute for the German Language
  • September 2021 - August 2022: Director of "Research and Infrastructure Support (RISE)" at the University of Basel

Developer of the EXMARaLDA system and FOLKER, author of Kicktionary, founding Managing Director of the HSZK (Hamburg Centre for Speech Corpora), Member of the Academic Network of Internet Lexicography

Areas of Research:

Oral corpora, corpus linguistics, text technology, computational lexicography

Recent English Publications:

  • Schnabel, Eva-Luisa/Wahl, Hans-Werner/Streib, Christina/Schmidt, Thomas (2020): Elderspeak in Acute Hospitals? The Role of Context, Cognitive and Functional Impairment. In: Research on Aging.
  • Batinić, Dolores / Schmidt, Thomas (2018): Reconstruction of separable particle verbs in a corpus of spoken German. In: Rehm, Georg / Declerck, Thierry (eds.): Language technologies for the challenges of the digital age. 27th International Conference, GSCL 2017 Berlin, Germany, September 13–14, 2017. Proceedings. Cham, Switzerland: Springer, 2018. pp. 3-10. PDF
  • Cassidy, Steve / Schmidt, Thomas (2017): Tools for Multimodal Annotation. In: Ide, Nancy / Pustejovsky, James (eds.): Handbook of Linguistic Annotation. Springer, Dordrecht, pp. 209-227.
  • Schmidt, Thomas / Hedeland, Hanna / Jettka, Daniel (2017): Conversion and Annotation Web Services for Spoken Language Data in CLARIN. In: Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure, by Borin, Lars (ed.), Linköping Electronic Conference Proceedings, pp. 113-130. PDF
  • Schmidt, Thomas (2016): Construction and Dissemination of a Corpus of Spoken Interaction - Tools and Workflows in the FOLK project. In: Corpus Linguistic Software Tools, Journal for Language Technology and Computational Linguistics (JLCL 31/1), by Kupietz, Marc & Geyken, Alexander (eds.), pp. 127-154. PDF
  • Schmidt, Thomas (2016): Good practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German. In: Compilation, transcription, markup and annotation of spoken corpora, by Kirk, John M. and Gisle Andersen (eds.), Special Issue of the International Journal of Corpus Linguistics [IJCL 21:3], pp. 396-418.
  • Westpfahl, Swantje / Schmidt, Thomas (2016): FOLK-Gold – A GOLD standard for Part-of-Speech-Tagging of Spoken German. In: Proceedings of the Tenth Conference on International Language Resources and Evaluation (LREC’16), Portorož, Slovenia. Paris: European Language Resources Association (ELRA), pp. 1493-1499. PDF
  • Fandrych, Christian / Frick, Elena / Hedeland, Hanna / Iliash, Anna / Jettka, Daniel / Meißner, Cordula / Schmidt, Thomas / Wallner, Franziska / Weigert, Kathrin / Westpfahl, Swantje (2016): User, who art thou? User Profiling for Oral Corpus Platforms. In: Proceedings of the Tenth Conference on International Language Resources and Evaluation (LREC’16), Portorož, Slovenia. Paris: European Language Resources Association (ELRA), pp. 280-287. PDF
  • Reimer, Eva / Trevisan, Bianka / Eraßme, Denise / Schmidt, Thomas / Jakobs, Eva-Maria (2015): Annotating Modality Interdependencies. In: Proceedings of the Int. Conference of the German Society for Computational Linguistics and Language Technology, University of Duisburg-Essen, Germany, Sep 30–Oct 2 2015, pp. 110-11. PDF
  • Herzog, Gottfried / Heid, Ulrich / Trippel, Thorsten / Bański, Piotr / Romary, Laurent / Schmidt, Thomas / Witt, Andreas / Eckart, Kerstin (2015): Recent Initiatives towards New Standards for Language Resources. In: Proceedings of the Int. Conference of the German Society for Computational Linguistics and Language Technology, University of Duisburg-Essen, Germany, Sep 30–Oct 2 2015, pp. 154–156. PDF
  • Kupietz, Marc / Schmidt, Thomas (2015): Schriftliche und mündliche Korpora am IDS als Grundlage für die empirische Forschung. In: Eichinger, Ludwig M. (ed.): Sprachwissenschaft im Fokus. Positionsbestimmungen und Perspektiven. pp. 297-322 - Berlin/Boston: de Gruyter, 2015. (Jahrbuch des Instituts für Deutsche Sprache 2014)
  • Ruhi, Şükriye / Haugh, Michael / Schmidt, Thomas / Wörner, Kai (eds.) (2014): Best Practices for Spoken Corpora in Linguistic Research. Newcastle: Cambridge Scholars Publishing.
  • Thomas Schmidt (2014): Gesprächskorpora und Gesprächsdatenbanken am Beispiel von FOLK und DGD. In: Gesprächsforschung - Online-Zeitschrift zur verbalen Interaktion 15, pp. 196-233. PDF
  • Schmidt, Thomas / Wörner, Kai (2014): EXMARaLDA. In: Jacques Durand, Ulrike Gut, and Gjert Kristoffersen (eds.): The Oxford Handbook of Corpus Phonology. Oxford: OUP 2014, pp. 402-419.
  • Schmidt, Thomas (2014): The Database for Spoken German - DGD2. In: Proceedings of the Ninth International conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA). PDF
  • Schmidt, Thomas (2014): The Research and Teaching Corpus of Spoken German - FOLK. In: Proceedings of the Ninth International conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA). PDF
  • Deppermann, Arnulf / Schmidt, Thomas (2014): Gesprächsdatenbanken als methodisches Instrument der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus FOLK in der Datenbank für Gesprochenes Deutsch (DGD2). In: Domke, Christine & Gansel, Christa (eds.): Korpora in der Linguistik - Perspektiven und Positionen zu Daten und Datenerhebung [= Mitteilungen des Deutschen Germanistenverbandes 1/2014], pp. 4-17. PDF
  • Stift, Ulf-Michael / Schmidt, Thomas (2014): Mündliche Korpora am IDS: Vom Deutschen Spracharchiv zur Datenbank für Gesprochenes Deutsch. In: Institut für Deutsche Sprache (ed.): Ansichten und Einsichten. 50 Jahre Institut für Deutsche Sprache. Redaktion: Melanie Steine, Franz Josef Berens. pp. 360-375 - Mannheim: Institut für Deutsche Sprache, 2014.
  • Westpfahl, Swantje / Schmidt, Thomas (2013): POS für(s) FOLK – Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch. In: Journal for Language Technology and Computational Linguistics, iss. 1, pp. 139-156. PDF
  • Schmidt, Thomas / Dickgießer, Sylvia / Gasch, Joachim (2013): Die Datenbank für Gesprochenes Deutsch - DGD2. Mannheim: Institut für Deutsche Sprache. PDF

Publications: