Publications – Avatarmin

Below, you will find a complete list of my scientific publications. The majority of them are in English and peer-reviewed if not otherwise stated. If you find articles non disponible through the links here provided, feel free to contact me @ info @ this_site.

Monographs

A. Hoenen. 2018. “Tools, evaluation and preprocessing for stemmatology”. PhD thesis. University Library Johann Christian Senckenberg.

A. Hoenen. 2011. „Der Fremdschrifterwerb. Eine Analyse verschiedener Sprachlehrbücher und Sprachkombinationen.“ Grin.

Chapters

A. Hoenen. 2020. “The stemma as a computational model“ In: Handbook of Stemmatology – History, Methodology, Digital Approaches. Mouton, De Gruyter.

A. Hoenen. 2020. “History of computer-assisted stemmatology“. In: Handbook of Stemmatology – History, Methodology, Digital Approaches. Mouton, De Gruyter.

A. Hoenen. 2020. “Software Tools“ In: Handbook of Stemmatology – History, Methodology, Digital Approaches. Mouton, De Gruyter.

A. Hoenen. 2020. “Evolutionary models in other disciplines“ Chapter Introduction. In: Handbook of Stemmatology – History, Methodology, Digital Approaches. Mouton, De Gruyter.

A. Hoenen. 2018. “Recurrence Analysis Function, a Dynamic Heatmap for the Visualization of Verse Text and Beyond” in: Bubenhofer, Noah, & Marc Kupietz, eds. Visualisierung sprachlicher Daten: Visual Linguistics – Praxis – Tools. Heidelberg University Publishing. 149-166.

A. Hoenen and F. Mader, 2015. “A New LMF Schema Application by Example of an Austrian Lexicon Applied to the Historical Corpus of the Writer Hugo von Hofmannsthal,” in Historical Corpora, Narr.

A. Hoenen. 2014. “Simulation of Scribal Letter Substitution,” in: Andrews, T. & Macé, C., eds. Analysis of Ancient and Medieval Texts and Manuscripts: Digital Approaches, Lectio 1, Brepols. 119-139.

Conference Papers

Armin Hoenen, Cemre Koc, Julian Hasche. 2021. Creating an artificial language for mitigating the Internet Information Retrieval recall problem. EADH 2021, Krasnoyarsk.

Armin Hoenen, Marc D. Rahn. 2021. Migration of Small and Endangered Languages into the Wikipedia. in Proceedings of the Workshop on Computational Methods for Endangered Languages (ComputEL4). University of Hawaii.

[Video]

Philipp Büch, Mortimer Drach, Jolanta Gelumbeckaitė, Armin Hoenen, Adriano Cerri, Vanessa Wadlinger. 2021. Both Digital Edition and Corpus Archive: The works of Kristijonas Donelaitis. in Book of Abstracts of the annual conference of the AIUCD 2021, Pisa, AIUCD.

[Poster]

A.Hoenen, C. Koc, M. Rahn. 2020. „Two LRL & Distractor Corpora from Web Information Retrieval and a Small Case Study in Language Identification without Training Corpora.“ In Proceedings of the 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), pages 28–35 Language Resources and Evaluation Conference (LREC 2020). European Language Resources Association (ELRA).

A. Hoenen 2019. „NJ-networks, turning greedy into all possibilities.“ German Conference on Bio-Informatics GCB 2019.

A. Hoenen, G. Brüning. 2019. „Zur Stemmatologie neuerer Überlieferungen.“ DARIAH-DE Working Papers Nr. 29. Göttingen: DARIAH-DE, 2019. (Proceedings of the workshop Graphentechnologien at the Academy of Sciences Mainz).

A. Hoenen. 2019. „Rooting through Direction – New and Old Approaches.“ In: Book of Abstracts of the DHd 2019 Mainz-Frankfurt. 342-345.

A. Hoenen. 2019. „Interpreting and Post-Correcting the Minimum Spanning Tree.“ In: Computer-linguistics poster session at DGfS 2019. Bremen.

A. Hoenen. 2019. „eLearning the URLCoFi – Digital Didactics for Humanists.“ In: Book of abstracts of the AIUCD 2019 Udine/Gorizia. 187-189.

A. Hoenen. 2018. “Annotated Timelines and Stacked Area Plots for Visualization in Lexicography” in Proceedings of the Elexis workshop, collocated with EADH Gallway.

A. Hoenen. 2018. “Wikipedia Mention Graphs by Example” in: Proceedings of the 1^st EADH conference. Gallway.

A. Hoenen & Lela Samushia 2018. “Principles Aiding in Reading Abbreviations in Old Georgian and Latin ,” in Book of Abstracts DHd Köln, 2018. 290-294.

A. Hoenen. 2018. “Multi Modal Distance – An Approach to Stemma Generation With Weighting,” in Proceedings of the 11th International Conference on Language Resources and Evaluation, 2018. 2105-2112.

A. Hoenen, N. Schenk. 2018. “Knowing the Author by the Company His Words Keep,” in Proceedings of the 11th International Conference on Language Resources and Evaluation, 2018. 521-528.

A. Hoenen. 2018. “From Manuscripts to Archetypes through Iterative Clustering,” in Proceedings of the 11th International Conference on Language Resources and Evaluation, 2018. 712-718.

A. Hoenen. 2018. “Attempts at Visualization of Etymological Information” in Proceedings of the Globalex workshop of the 11th International Conference on Language Resources and Evaluation, 2018.

A. Hoenen, S. Eger, and R. Gehrke. 2017. “How Many Stemmata with Root Degree k?,” in Proceedings of the 15th Meeting on the Mathematics of Language, 2017, pp. 11-21.

A. Hoenen. 2017. “Using Word Embeddings for Computing Distances Between Texts and for Authorship Attribution,” in International Conference on Applications of Natural Language to Information Systems, 2017, pp. 274-277.

A. Hoenen. 2017. “Beyond the tree – a theoretical model of contamination and a software to generate multilingual stemmata,” in Book of Abstracts of the annual conference of the AIUCD 2017, Sapienza, Rome, AIUCD, 2017. 155-159.

S. Eger, A. Hoenen, and A. Mehler. 2016. “Language classification from bilingual word embedding graphs,” in Proceedings of COLING 2016, 2016. 3507-3518.

A. Hoenen. 2016. “Silva Portentosissima – Computer-Assisted Reflections on Bifurcativity in Stemmas,” in Digital Humanities 2016: Conference Abstracts. Jagiellonian University & Pedagogical University, Kraków, pp. 557-560.

A. Lücking, A. Hoenen, and A. Mehler. 2016. “TGermaCorp — A (Digital) Humanities Resource for (Computational) Linguistics,” in Proceedings of the 10th International Conference on Language Resources and Evaluation, 2016. 4271-4277.

A. Hoenen. 2016. “Wikipedia Titles As Noun Tag Predictors,” in Proceedings of the 10th International Conference on Language Resources and Evaluation, 2016. 2114-2118.

A. Hoenen. 2016. “Das erste dynamische Stemma, Pionier des digitalen Zeitalters?,” in Proceedings of the Jahrestagung der Digital Humanities im deutschsprachigen Raum, 2016. 328-330.

A. Hoenen. 2015. “Das artifizielle Manuskriptkorpus TASCFE,” in Proceedings of the Jahrestagung der Digital Humanities im deutschsprachigen Raum, 2015.

A. Hoenen. 2015. “Lachmannian Archetype Reconstruction for Ancient Manuscript Corpora,” in Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT), 2015. 1209-1214.

A. Hoenen. 2015. “Simulating Misreading,” in Proceedings of the 20^th international conference on applications of natural language to information systems (NLDB), 2015.

A. Hoenen. 2014. “Stemmatology, an interdisciplinary endeavour,” in Book of Abstracts zum DHd Workshop Informatik und die Digital Humanities – DHd.

M. Z. Islam and A. Hoenen. 2013. “Source and Translation Classifiction using Most Frequent Words,” in Proceedings of the 6^th International Joint Conference on Natural Language Processing (IJCNLP), 2013. 1299-1305.

M. Sukhareva, M. Z. Islam, A. Hoenen, and A. Mehler. 2012. “A Three-step Model of Language Detection in Multilingual Ancient Texts,” in Proceedings of Workshop on Annotation of Corpora for Research in the Humanities, Heidelberg, Germany, 2012.

R. Gleim, A. Hoenen, N. Diewald, A. Mehler, and A. Ernst, “Modeling, Building and Maintaining Lexica for Corpus Linguistic Studies by Example of Late Latin,” in Corpus Linguistics 2011, 20-22 July, Birmingham, 2011.

Journal Papers

A. Hoenen, C. Koc, M. Rahn. 2020. “A Manual for Web Corpus Crawling of Low Resource Languages“ umanisticadigitale 8.

R. Gleim, S. Eger, A. Mehler, T. Uslu, W. Hemati, A. Lücking, A. Henlein, S. Kahlsdorf, and A. Hoenen, 2019. Practitioner’s view: A comparison and a survey of lemmatization and morphological tagging in German and Latin,” Journal of Language Modeling 7(1). 1-52.

A. Hoenen. 2019. „An open problem in computational stemmatology – a model for contamination.“ umanisticadigitale 5.

A. Hoenen, A. Mehler, and J. Gippert. 2016. “Corpora and Resources for (Historical) Low Resource Languages. Editorial“ in Corpora and Resources for (Historical) Low Resource Languages,” JLCL 31(2), p. iii–iv.

A. Hoenen and L. Samushia. 2016. “Gepi: An Epigraphic Corpus for Old Georgian and a Tool Sketch for Aiding Reconstruction,” JLCL, vol. 31, iss. 2, pp. 25-38, 2016.

N. Dundua, A. Hoenen, and L. Samushia. 2015. “A Parallel Corpus of the Old Georgian Gospel Manuscripts and their Stemmatology,” The Georgian Journal for Language Logic Computation, vol. IV, pp. 176-185.

A. Hoenen and T. Jügel. 2012. “Altüberlieferte Sprachen als Gegenstand der Texttechnologie — Ancient Languages as the Object of Text Technology. Editorial ” in A. Hoenen and T. Jügel, eds. Altüberlieferte Sprachen als Gegenstand der Texttechnologie — Ancient Languages as the Object of Text Technology, JLCL 27.

A. Hoenen, “Measuring Repetitiveness in Texts, a Preliminary Investigation,” Sprache und Datenverarbeitung. International Journal for Language Data Processing, vol. 36, iss. 2, pp. 93-104, 2012.

A. Hoenen, 2025. “LLM-Mining Pre-Stemmatological Philological
Literature”. magazén, 6(2), 215-232.

Recensions

Armin Hoenen. Hanne Martine Eckhoff, Silvia Luraghi u. Marco Passarotti (Hgg.): Diachronic Treebanks for Historical Linguistics, Benjamins Current Topics (BCT), Amsterdam u. Philadelphia: John Benjamins Publishing Company 2020, 113, 154 S. Neuauflage von Diachronica 35:3 (2018). In: Beiträge zur Geschichte der deutschen Sprache und Literatur Volume 143 Issue 2. De Gruyter.

Miscellaneous (Whitepapers, Blogarticles etc.)

A. Hoenen. 2020. Einige neue explorative Visualisierungsformen in der digitalen, historischen Lexikologie und Lexikographie. ZHistLex-Papiere.

Jolanta Gelumbeckaitė, Armin Hoenen. 2020. Richtlinien für die Alignierung der litauischen Texte von Kristijonas Donelaitis mit ihren deutschen Übersetzungen (Projekt CorDon). CorDon Projektpublikationen.

Armin Hoenen. 2019. Einreichungen zur DHd 2019 – II. DHd-Blog.

Armin Hoenen. 2019. Interview mit C. Schöch anlässlich der DHd 2019. DHd-Blog.

Armin Hoenen. 2010. Under Resourced Language Content Finder. Interaktiver Online Kurs. Lernbar Uni Frankfurt. 1 2 3 4 5 6 7 X Y

Preprints (arxive.org etc.)

Armin Hoenen. 2023. Encryption by using base-n systems with many characters.

Armin Hoenen. 2022. Can the Language of the Collation be Translated into the Language of the Stemma? Using Machine Translation for Witness Localization.