Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

Please note that all times are shown in the time zone of the conference. The current conference time is: 14th June 2025, 07:46:18pm WEST

 
Filter by Session Topic 
Only Sessions at Location/Venue 
Only Sessions at Date / Time 
 
 
Session Overview
Date: Thursday, 17/July/2025
9:00am - 10:30amLP-17
Location: Aud B2 (TB)
Session Chair: Susan Schreibman, Maastricht University, DARIAH
 

Breaking the Unicode Barrier with Niv Louie: Advancing Digital Accessibility through Innovative Screen Reading and Braille Translation Technologies

Matthew Yeater1, Luis D. Sáenz Santos2, Shai Gordin1

1Department of the Land of Israel Studies and Archaeology, Ariel University, Israel; 2Saxon Academy of Sciences and Humanities in Leipzig, Germany

This research examines a critical gap in current digital text infrastructure for individuals with print disabilities: the systematic inaccessibility of specialized Unicode-encoded characters essential for academic discourse. Our investigation showcases the development and implementation of Niv Louie, an innovative software solution designed to facilitate access to previously inaccessible digital content.



Bridging Accessibility and Innovation: An NLP-Powered Writing Assistant for Easy and Plain Texts in Italian

Floriana Carlotta Sciumbata1, Luca Tringali2

1Università di Trieste, Italy; 2Independent researcher

The presentation introduces a Writing Assistant System designed to simplify and enhance the creation of easy and plain language texts in Italian. Addressing communication barriers in public administration and supporting inclusivity, WAS provides real-time feedback and educational tools, combining AI-driven suggestions with human oversight to improve accessibility and writing skills.



Mastering Ideas, Not Keystrokes: Digital (3D) Literacy through Digital Humanities Praxis-based Pedagogy

Susan Schreibman2, Costas Papadopoulos1, Kelly Gilikin Schoueri1

1Maastricht University, Netherlands, The; 2Maastricht University, Netherlands, The, DARIAH-EU

This presentation examines critical digital literacy as a multifaceted competency. Drawing on a Master’s-level course where students create 3D scholarly editions of toys within an authentic learning environment, we demonstrate how they develop skills, including critical and creative making and collaborative problem-solving –transcending discipline-specific knowledge to prepare for the digital and creative economy.

 
9:00am - 10:30amPanel 03
Location: Aud B3 (TB)
Session Chair: Till Grallert, Humboldt-Universität zu Berlin
 

The global state of digital history: Establishing data culture(s) in uncertain times

Till Grallert1, Torsten Hiltmann1, Andrew Flinn6, Min-Woo Lee4, Ian Marino5, Ian Milligan2, Julianne Nyhan3,6

1Humboldt-Universität zu Berlin; 2University of Waterloo; 3Technische Universität Darmstadt; 4Andong National University; 5Leibniz-Institut für Europäische Geschichte; 6University College London

The panel discussion addresses the need for a new data culture in history and beyond with the aims to understand the fundamental epistemological affordances of the post-digital moment; to develop the necessary quotidian practices and disciplinary protocols; and to negotiate new understandings of history as a discipline of societal relevance.

 
9:00am - 10:30amSP-20
Location: Aud C1 (EC)
Session Chair: Sarah Lang, Universität Graz
 

Contexts, Diversity, Poetry: Topic Modelling the Poetess

Kiera Obbard

University of Guelph, Canada

This short paper asks: what can be discovered about the poetess tradition of nineteenth-century Britain–a mode of writing initially lost to literary history and recovered by feminist literary scholars–when using topic modelling to conduct a distant reading of primary, critical, and bio-critical materials?



How Is Gender Portrayed on Preschool Children’s Book Covers? An Analysis of the Chinese National Library Catalogue between 2012-2022

Yi Li1, Yongning Li2

1University of Edinburgh, United Kingdom; 2Te Shi Liangcai School of Journalism and Communication, Zhejiang Sci-Tech University, Hangzhou, China

Picture book covers carry abundant visual information about the story. They also indicate gender information through titles and illustrations, which might further impact preschool children’s gender perception. This paper will investigate how ChatGPT infer gender on covers of 6,629 preschool children’s books from the 2012-2022 National Library of China catalogue.



Reading Spanish NovEllas through an Antiracist, Inclusive, and Feminist Text Encoding Framework

Sarah Revilla-Sanchez, Elizabeth Lagresa-González

University of British Columbia, Canada

This presentation will introduce the work conducted by “The Adaptive Text Encoding Initiative Network,” an interdisciplinary research group of PhD students, faculty, and DH librarians in Canada. We will report on our progress as a team and provide insights into our proposed antiracist, decolonial, and inclusive TEI guidelines.



Exploring Gender Differences in Gaming Culture: A Comparative Analysis of Male and Female Streamers’ Live Chat Interactions on Twitch.tv

Greta Pfältzer, Michael Achmann-Denkler, Christian Wolff

University of Regensburg, Germany

This study examines gendered communication in German-speaking Twitch gaming chats using BERTopic and qualitative analysis. Male streamers’ chats focus on gaming, while female streamers evoke social and emotional messages. No evidence of objectification was found, highlighting shifts in behavior or effective moderation. Findings underscore gender’s role in shaping digital interactions.



Documenting datasets as a tool for change

Sarah Lang1,2

1Max Planck Institute for the History of Science, Germany; 2University of Graz, Austria

This talk explores how documentation can serve as a powerful tool for positive change by making our research and datasets more transparent. Detailed documentation not only facilitates the effective and responsible reuse of datasets and algorithms but also promotes more inclusive scholarship and ethical research outcomes.

 
9:00am - 10:30amLP-13
Location: B207 (TB)
Session Chair: Isabel Galina Russell, Universidad Nacional Autónoma de México (UNAM)
 

Modernização da infraestrutura do portal da “edição digital de Fernando Pessoa projetos e publicações” em parceria com o consórcio Text+

Fernanda Alvares Freire2,3, Ulrike Henny-Krahmer1, Erik Renz1

1Universität Rostock; 2Technische Universität Darmstadt; 3Berlin-Brandenburgische Akademie der Wissenschaft

Esta apresentação se concentrará no trabalho realizado durante o projeto de colaboração Text+ "Pessoa Digital", discutindo a importância das mudanças implementadas para garantir a sustentabilidade e modernizar a infraestrutura digital da edição acadêmica digital “Fernando Pessoa. Projects and Publications” e como elas se encaixam na infraestrutura do consórcio Text+.



Las Bibliotecas Públicas de Bogotá como escenarios de co-creación de narrativas digitales de historia pública (2016–2024)

Juan Pablo Angarita Bernal

University of Edinburgh, United Kingdom

Se analiza a las Bibliotecas Públicas de Bogotá como plataformas de co-creación de narrativas digitales de historia pública entre 2018 y 2021. A través del análisis de Los bogotanos del Bogotazo y Marca de agua se destaca cómo la mediación digital desafía narrativas oficiales y democratiza el conocimiento histórico.



Inteligência Artificial nas Humanidades Digitais: questões críticas e desafios éticos

Sara Carvalho1, Maria Manuel Borges2, Renato Rocha Souza3,4

1Universidade de Coimbra, FLUC; 2Universidade de Coimbra, CEIS20, FLUC; 3Fundação Getulio Vargas - CPDOC; 4Universidade de Viena - Department of European and Comparative Literature and Language Studies

Este artigo apresenta uma revisão crítica da literatura dos princípios éticos no universo da Inteligência Artificial (IA), investigando a sua conexão com as Humanidades Digitais (HD). Tem como objetivo refletir sobre o campo epistémico e identificar os desafios, práticas e recomendações, através de uma abordagem qualitativa.



Empowering Peripheral Voices: Data Sovereignty and Low-Tech Solutions for Art Galleries Data Preservation

Nuria Rodríguez Ortega1, Bárbara Romero Ferrón2, Martín Salvachúa3

1University of Malaga; 2Leuphana University; 3University of Malaga

Small-to-medium galleries face significant challenges in preserving and sharing their contributions, relying on spreadsheets or paper records.This issue is exacerbated by proprietary data platforms, which reduce data sovereignty. This paper presents a low-tech solution developed to normalize, preserve, and share gallery data, emphasizing sustainability, inclusivity, and resistance to digital monopolies.

 
9:00am - 10:30amLP-14
Location: B210 (TB)
Session Chair: Florian Kessler, Friedrich-Alexander University Erlangen Nuremberg
 

Collation of Multilingual Versions of a Text: Necessity, Approach, Challenges

Sandra Balck1, Janis Dähne2, Fabian Etling1, Frank Fischer1, Steffen Frenzel3, Brigitte Grote1, Sascha Hesse2, Paul Molitor2, Marcus Pöckelmann2, Jörg Ritter2, Yashee Singh1, Manfred Stede3

1Freie Universität Berlin, Germany; 2Martin Luther University Halle-Wittenberg, Germany; 3Universität Potsdam, Germany

With the project ‘Collation of Multilingual Text’ (COMUTE), we tackle the problem of aligning multilingual versions of text that are not literal translations, but which have undergone a major revision process including extensive additions, deletions and rephrasings. Our approach based on a hierarchical alignment solves issues of current state-of-the-art methods.



NLP-basierte Analysen von marginalisierter serieller Frauenliteratur im 19. Jahrhundert. Ein Vergleich von Frauenzeitschriften im deutschsprachigen Raum (NLP-based Analyses of Marginalized Women's Literature in the 19th Century)

Alexa Silke Lucke1, Hermann Johannes2

1Universität Siegen, Germany; 2Fachhochschule Südwestfalen in Hagen, Germany

Der Beitrag widmet sich der digitalen Modellierung von Dekanonisierungsprozessen serieller Frauenliteratur im 19. Jahrhundert und ihrer theoretischen Reflexion. Hierzu untersuchen wir Relationen zwischen Texten in Frauenzeitungen und historischen Kontexten. In multivariaten NLP-basierten Analysen werden Grade der vielfältigen Einflussfaktoren gemessen und Bedingungen der Evolution serieller Literatur in deutschsprachigen Frauenzeitschriften miteinander verglichen.



What is a Term in Chinese Mathematics? A Digital Exploration of Glossaries in Relation to the Language of the Original Texts

Florian Kessler

Friedrich-Alexander-Universität Erlangen-Nürnberg

Glossaries appended to monographic treatments of historical mathematical Chinese texts are one of the most important references available for technical terms in that type of language. In this study, digital methods are used to explore such glossaries, with an focus on understanding what phenomena from the original texts are included.

 
9:00am - 10:30amLP-15
Location: B302 (TB)
Session Chair: Martina Scholger, University of Graz
 

404 not found - Strategies for Ensuring the Sustainable Management of Living Resources in the Digital Humanities

Patrick Helling

Data Center for the Humanities (DCH), University of Cologne, Germany

I will discuss the challenges of handling living resources and relate them to already existing strategies and their vulnerabilities. I will present an approach for managing living resources by considering the responsibilities of different stakeholders – researchers, funding institutions and data centers/libraries – and argue for their orchestration.



Excavating memory: Computer vision and LLM-assisted Classification workflow for a Digitized Archive

Sinai Rusinek1, Yael Netzer1, Keren Shuster3, Sharon Kurant2, Adam Alyagon Dar1

1Haifa University; 2Technion; 3Independent Scholar

We describes workflows, complexities and challenges in a digital pilot project on the Archive of German Speaking Jews in Israel, concentrating specifically on two complementing classification workflows: image classification of the digitized scans as a preliminary step for document analysis, and LLM assisted parsing and classification of folder level catalog entries.

 
9:00am - 10:30amLP-12
Location: B304 (TB)
Session Chair: Kim Martin, University of Guelph
 

The ReFa Reader- A visual makeover for your semantic data

Linda Freyberg1, Giacomo Nanni2

1DIPF | Leibniz Institute for Research and Inf. in Education, Germany; 2metaLAB (at) FU Berlin, Germany

In this long paper a visual interface, the "ReFa Reader", which combines narrations and the exploration of semantic graph structures is presented. After a brief introduction of the functionalities of the Reader, three workflows on how to reuse this Open Source Prototype by any project with semantic data are elaborated.



Reading between the letters. Exploring biases, gaps, and context in historical correspondence data

Aline JE Deicke1,2, Elena Suárez Cronauer1

1Academy of Sciences and Literature | Mainz; 2Philipps-University Marburg, Germany

On a dataset containing correspondences from the literary period of Early Romanticism, this submission aims to uncover gaps and biases in the data, investigate their impact, and mitigate their effect. For this purpose, it develops a mixed methods-approach including the use of knowledge graphs, data visualisation, and historical network research.



Keeping it in Context: Serendipity, Linked Data, and User Experience at LINCS

Kim Martin

University of Guelph, Canada

The Linked Infrastructure for Networked Cultural Scholarship (LINCS) mobilizes Canadian scholarship through making, disseminating, and promoting the use of linked open data (LOD). This paper will reflect on the user-experience (UX) work done at LINCS, focussing on the roles of context and agency in supporting serendipity in a linked-data environment.

 
9:00am - 10:30amLP-16
Location: B309 (TB)
Session Chair: Caio Mello, University of Luxembourg
 

Castle at the Crossroads: A Machine Learning Approach to Generic Mixture in the Nineteenth-Century Gothic Novel

Mark Andrew Algee-Hewitt, Jessica Monaco

Stanford University, United States of America

Our project introduces a BERT-based machine learning model of generic mixture for classifying individual passages from nineteenth-century Gothic novels by their relationship to other, non-Gothic, contemporaneous genres. Our method offers not only important insights into individual novels, but also a new way to track generic transformation over time.



Cultures of textual reuse: comparing American and Hebrew journalistic networks in the nineteenth century

Zef Segal

College of Management Academic Studies, Israel

This paper uses plagiarism detection software and social network analysis methodologies to compare nineteenth-century American and Hebrew journalistic networks. Results reveal stark differences in frequency, context, and style of textual reuse. The differences reflect distinct cultural roles of the media in shaping public knowledge and identity.



Capturando o silêncio: estratégias para identificação do não-dito, ao combinar-se métodos computacionais e análise do discurso

Caio Mello

Luxembourg Centre for Contemporary and Digital History (C2DH)

Este trabalho traz uma reflexão crítica, embasada em um estudo de caso, sobre a tendência dos métodos computacionais de análise textual priorizarem o que estatisticamente se repete, dificultando o acesso do analista do discurso a certas nuances. Argumento pelo uso de métodos mistos para acessar o 'não-dito' pelos jornais.

 
10:30am - 11:00amCoffee-break (17th morning)
Location: B007 (TB)
11:00am - 12:30pmSP-21
Location: Aud B2 (TB)
Session Chair: Mikhail Biriuchinskii, Sorbonne Université
 

Visualizing the 'New Woman': Analyzing Visual Content in The Delineator Using CLIP.

Luana Moraes Costa

University of Göttingen, Germany

This study explores how the American magazine The Delineator reflects the evolving representation of the 'New Woman' from 1894 to 1914 through its visual content. By employing artificial intelligence techniques, particularly CLIP, the research shifts focus from text to visual analysis, revealing insights into societal perceptions of femininity.



Using ChatGPT for generating SKOS thesauri from handwritten sketches

Felix Kraus, Nicolas Blumenröhr

Karlsruhe Institute of Technology (KIT), Germany

This paper demonstrates how ChatGPT simplifies SKOS thesauri creation from hand-drawn sketches or digital drafts, improving efficiency over traditional editors. Testing with DH and fictional taxonomies reveals high accuracy but minor errors. While less suited for large thesauri, this method promotes FAIR data practices and facilitates SKOS thesauri development.



Towards an automatic transcription of Catalan notarial manuscripts from the Late Middle Ages

Mariona Coll Ardanuy, Ramon Sarobe, Joan Giner-Miguelez, Felipe Gómez, Paolo Marangio, Mercè Crosas, Coral Cuadrada

Barcelona Supercomputing Center (BSC)

This paper introduces an interdisciplinary pilot project centered on the automatic transcription of Catalan manuscripts from the Late Middle Ages, focusing on notarial documentation. We describe the creation of a new dataset for our initial experiments. The resulting datasets, models, and code will be made publicly available.



Progress of The New Spain Fleets Project: accurate Handwritten Text Recognition models for 16th-17th century Spanish calligraphies.

Rodrigo Vega-Sánchez1, Edna Brito-Ramos2, Francisco Cruz-Ríos3, Fryda Montiel-Alejos4, Andrea González-Aceves2, Abril Hernández-Ronquillo2, Martín Díaz-Vázquez2, Ricardo Valadez-Vázquez5, Lidia Camacho-Gamez6, Guillaume Candela7, Mariana Favila-Vázquez8, Flor Trejo-Rivera9, Alexander Sánchez-Díaz10, Patricia Murrieta-Flores1

1Lancaster University, United Kingdom; 2Escuela Nacional de Antropología e Historia, México; 3Independent researcher; 4Archivo General de la Nación, México; 5Universidad Nacional Autónoma de México; 6Universidad de Guadalajara, México; 7University of Leeds, United Kingdom; 8Centro de Investigaciones y Estudios Superiores en Antropología Social, México; 9Subdirección de Arqueología Subacuática-INAH, México; 10Universidad de Alicante, España

We describe advances and results in developing four accurate Handwritten Text Recognition models for the automatic transcription of Itálica cursiva, Procesal simple, Redonda, and Procesal encadenada calligraphies, the most prevalent in 16th-17th-century Spanish American historical documents.



Using LLMs for post-OCR correction on historical French texts: A case study using synthetic data

Mikhail Biriuchinskii, Motasem Alrahabi, Glenn Roe

ObTIC, Sorbonne University

This study explores the use of large language models (LLMs) for correcting OCR errors in 19th-century French texts. Despite its advanced capabilities, fine-tuned models faced challenges with generalization, increasing error rates. The findings highlight limitations of LLMs in character-level OCR corrections and point to future research directions.

 
11:00am - 12:30pmSP-27
Location: Aud B3 (TB)
Session Chair: Cindarella Petz, Leibniz Institute of European History Mainz (IEG)
 

Bridging Critical AI Frameworks with Data Storage Practices: the AIAI Data Collective

Nia Judelson1, Em Nordling2

1Emory University, United States of America; 2Emory University, United States of America

The AIAI Data Collective applies Critical AI frameworks to latent questions of AI data storage. We are developing a digital tool that guides users through a decision-making process for ethical AI data management, resulting in recommended practices that address critical and ethical concerns such as labor, privacy, and bias.



Critical Digital Humanities in Generative AI: Enhancing Critical Thinking in Education

Paolo Casani

Formerly at University College London, United Kingdom

This research proposal explores the intersection of Critical Digital Humanities and Generative AI, aiming to enhance critical thinking in education. Through a mixed-methods approach, it will develop practical guidelines for educators, addressing challenges such as bias and transparency while fostering thoughtful engagement with AI technologies.



Conceptualising Inclusive Access: Lessons and Critical Reflections on the Challenges of Access to Digital Archives and Collections

Sharika Parmar

FLAME University, India

This paper, though examining discourses on access to digital archives (particularly community digital archives) and discussion on building care in access from the Stories on Contested Histories International Programme 2024, argues that studying challenges of access to digital archives and collections can help in conceptualising frameworks for inclusive access.



Digital Access: AltNarrative, a multilingual digital repository, and a Comics Studies Lab for born-digital comics

Natasa Thoudam

Indian Institute of Technology Jodhpur, India

The paper invokes three narratives of disruptive digital projects that make comics accessible through AltNarrative, a multilingual digital repository, or the inception of a lab to create a born-digital and an inclusive comic of the future. The issue of web or digital accessibility is evaluating with respect to WCAG2.2 compliance.



LLMs as Analysis Tool: A Framework for Implementation, Evaluation and Critical Assessment

Sarah Oberbichler, Cindarella Petz

Leibniz Institute of European History, Germany

This paper presents a framework for integrating LLMs into critical research work flows, addressing legal, ethical, and methodological challenges. Drawing on projects analyzing historical newspapers and court records, it emphasizes aligning LLM use with established hermeneutical practices to navigate automation responsibly and set standards for DH research and beyond.

 
11:00am - 12:30pmSP-24
Location: Aud C1 (EC)
Session Chair: Joana Casenave, Université de Lille
 

"Towards the Tolstoy Digital Metaverse: Integrating Testimonies into a Digital Chronicle of Tolstoy's Life and Works"

Anastasia Bonch-Osmolovskaya1,2,4, Fekla Tolstaya2, Youlya Vronskaya2,3, Timofei Lukashevski2

1DH CLOUD; 2Tolstoy Digital; 3Peredelkino Creative Residence; 4CultTech Association

The project marks the second phase of digitizing Tolstoy’s legacy, expanding beyond the 90-volume collection. It focuses on seven sources, each reflecting the writer’s life and works, connected by common temporal points. We discuss the preparation, markup, and presentation of over 32,000 documents through an interactive interface.



Auden in Austria Digital: Formalizing <interp>retation in TEI/XML through RDFa

Massimiliano Carloni, Timo Frühwirth, Sandra Mayer

Austrian Centre for Digital Humanities and Cultural Heritage, Austrian Academy of Sciences

The project Auden in Austria Digital explores novel methods of formalizing interpretation in scholarly digital editions by embedding RDFa within TEI/XML. By transforming editorial argumentation into machine-readable data, the project seeks to make interpretation, interpretational responsibility, and scholarly uncertainty accessible to computational processing.



Uncovering Editors' Intentions and Implicit Historical Perspectives through TEI Markup: Case Study on Dai Nihon Shiryo

Ayano Kokaze, Satoru Nakamura, Taizo Yamada

Historiographical Institute The University of Tokyo, Japan

This project reevaluates the historiographical practices of Dai Nihon Shiryo, focusing on the functional representation of Bouchū (marginal notes) and Warichū (interlinear annotations) using TEI. By addressing their embedded temporal semantics and editorial distinctions, this study uncovers implicit historical perspectives and contributes to advancing TEI applications in digital scholarly editing.



Moving towards a semantic archival edition: the PAVES-e project

Laura Mazzagufo1, Salvatore Cristofaro1, Christian D'Agata2, Angelo Mario Del Grosso3, Pietro Sichera4, Antonio Sichera2, Daria Spampinato1

1CNR-ISTC, Italy; 2University of Catania, Italy; 3CNR-ILC, Italy; 4CNR-ILIESI, Italy

This paper presents the open-access semantic edition-archive of Cesare Pavese’s literary and documentary heritage, developed within the PAVES-e project. It employs the CHROMA model, incorporating XML-TEI encoding, ontology-driven semantic organization, and interactive visualization tools to facilitate enhanced accessibility and semantic and lexicographic analysis of Pavese’s works.



Digital Critical Edition of the Isopet 1-Avionnet Aesopic Fable Collection : Issues and Perspectives

Joana Casenave1,2

1University of Lille, Geriico Lab., Department of Information Science, France; 2Biblissima+ Funding

This paper focuses on the modeling of the digital critical edition of a collection of Aesopic fables entitled Isopet 1-Avionnet, dating from the late 13th and early 14th centuries. It will notably involve translating, into the XML/TEI encoding of this collection, the textual, rhetorical, narrative and intertextual structures of this text.

 
11:00am - 12:30pmSP-25
Location: B207 (TB)
Session Chair: Paul Girard, OuestWare
 

Inferring Semantic Social Networks from Scientific Texts: The Case of Astrobiology

Christophe Malaterre1,2, Francis Lareau1,2,3

1University of Québec in Montréal (UQAM), Canada; 2CIRST, Canada; 3Sherbrooke University, Canada

We present a method for inferring semantic social networks from textual data by analyzing terminological similarities using topic modeling. Applied to a corpus of 3,698 scientific articles in astrobiology, this approach identifies "hidden communities of interest" (HCoIs)—groups with shared semantic content—and enables diachronic analysis of community evolution.



Plato’s Presence and Beyond: Co-Occurrence Networks in Ancient Greek and Latin Literature

Evelien de Graaf

KU Leuven, Belgium

This study employs Network Theory to investigates the similarities and differences in co-occurrences within Ancient Greek and Latin texts from pre-Christianity and during the early Christian period. The case study focusses on mentions surrounding mentions of Plato in these texts.



A mixed-methods approach to study discourses on Twitter about the German anti-hate speech law NetzDG

Jens Pohlmann1, Caio Mello2, Karin León Henneberg3

1UC Davis, United States of America; 2Luxembourg Centre for Contemporary and Digital History (C2DH), Luxembourg; 3University of Bremen, Germany

This paper examines debates on a German anti-hate speech law called Netzwerkdurchsetzungsgesetz (NetzDG) on Twitter/X. By applying network analysis, NLP methods, and close reading, it investigates content published by the most retweeted accounts mentioning the law, considering their potentially high influence in shaping the discussion on the platform.



NHS, CDC, and WHO Twitter Health Communication: A Preliminary Shiny App

Katherine Ireland

University of Georgia, United States of America

This work discusses ongoing tests and development of an R Shiny Web Application to visualize a dataset of National Health Service, Centers for Disease Control and Prevention, and World Health Organization Tweets during 2020. Interactive visualizations display different text analytics using tidytext and quanteda.



Gephi Lite: a lighter web based version of Gephi

Paul Girard1, Alexis Jacomy1, Benoît Simard1, Mathieu Jacomy2

1OuestWare, France; 2Aalborg University, Denmark

Gephi Lite, a web based and lighter version of Gephi, aims at pursuing Gephi original ambition: democratizing network visualisation edition through visual means. In this paper we present what the web does to network visualisation edition while presenting Gephi Lite main features.

 
11:00am - 12:30pmSP-22
Location: B210 (TB)
Session Chair: Takehiro Hashimoto, Chuo University
 

Tracing Transformation: editorial shifts in the Grimm brothers’ tales

Anastasia Glawion1, Dhara Lechner2

1FAU Erlangen Nürnberg, Germany; 2FAU Erlangen Nürnberg, Germany

The presentation showcases first results of a project aimed at representing the editorial transformations of the Children's and Household Tales of the Grimm brothers and demonstrates the variety of changes on several selected fairy tales. It further proposes a first insight into a scalable framework to visualize the editorial changes.



Writing the Routledge Guide to Canadian Literature and Digital Humanities

Paul Barrett

University of Guelph, Canada

In 2024 I coauthored a forthcoming book, The Routledge Guide to Digital Humanities and Canadian Literature. This book presented a number of insights and challegns relevant to anyone attempting to write about the history and practice digital huamnities for a non-specialist audience. In this paper I will present lessons learned.



A Quantitative Approach to Bodily Sensations: Modernist and Realist Authors in Colonial Korea

Jae-Yon Lee1, Hae-in Ji2

1Ulsan National Institute of Science and Technology, Korea, Republic of (South Korea); 2Academy of Korean Studies, Korea, Republic of (South Korea)

This presentation employs computational stylistics to analyze Yi Sang and Yŏm Sang-sŏp's works from Korea's colonial period. Using a custom sensory classification model, it quantifies stylistic and aesthetic differences with a focus on the body, revealing broader affective politics through characters' physical and psychological responses, extending beyond conventional emotion analysis.



How does war affect Romantic literature? Topic modeling Romantic documents

Takehiro Hashimoto

Chuo University, Japan

This paper examined the relationship between the social situation of war and war writings in the British Romantic period by reviewing the topic modeling results and examining the visualized changes in war topics in books published between 1740 and 1840.

 
11:00am - 12:30pmSP-26
Location: B302 (TB)
Session Chair: Aleksandra Rykowska, Jagiellonian University in Kraków
 

Diversidad en los programas de fomento a la traducción editorial en Iberoamérica: construcción de un dataset sobre traducciones subvencionadas (2001-2022)

Laura Fólica1, Diana Roig-Sanz2, Lucia Campanella2, Elizabete Manterola3, Ventsislav Ikoff2

1Instituto de Lengua, Literatura y Antropología, CSIC; 2IN3, Universitat Oberta de Catalunya; 3Universidad del País Vasco/ Eusak Herrika Universitatea

Presentación de la etapa inicial del proyecto de investigación Trad-Divers, consistente en la construcción de un dataset, en acceso abierto, modelando datos de 8 programas de ayuda a la traducción editorial en Iberoamérica (2001-2022), con el objetivo de incidir en la discusión y diseño de este tipo de políticas culturales.



A Digital Humanities Approach to Parallel Corpus Construction and Translation Network Analysis of Japanese and Ryukyuan Bible Translations from the 19th to Early 20th Century

So Miyagawa1,2, Takanori Ito3, Kaho Ohsaki1

1University of Tsukuba, Japan; 2National Institute for Japanese Language and Linguistics (NINJAL), Japan; 3Institute of Science Tokyo, Japan

This research uses Omeka S to construct and analyze a parallel corpus of 19th-early 20th century Japanese and Ryukyuan Bible translations. Using TEI encoding, stylometric analysis, and LOD principles, it reveals translation networks and linguistic patterns among Protestant, Orthodox, Catholic, and regional language translations, preserving linguistic heritage through digital methods.



Extracting Information from Differences in Comics of Multi-Language Editions: Focusing on Dialogues, Onomatopoeia, and Annotations

Teru Agata1, Mari Agata2, Akiko Hashizume3, Masaki Eto4, Yasuharu Otani5

1Asia University, Japan; 2Keio University, Japan; 3Jissen Women's University, Japan; 4Gakushuin Women’s College, Japan; 5Aoyama Gakuin University, Japan

We proposes a new method for extracting text in speech bubbles, onomatopoeia, and annotations from manga by identifying differential regions in multiple language versions. A preliminary experiment demonstrates the effectiveness of our method, achieving high accuracy in extracting speech bubbles and other areas.



A Context-Sensitive Parser for Semitic Languages

Zhan Chen

Institute for Advanced Studies of Beijing Normal University - Hongkong Baptist University United College

This project develops a context-sensitive parser for Syriac using an encoder-only transformer model within ETCBC’s Qoroyo platform, addressing Semitic text challenges. It benchmarks LLMs for ancient languages, aiming for high accuracy and a quantifiable morphological analysis evaluation like GLUE or BLEU.



Is stylometry still able to distinguish between literary human and machine translation?

Aleksandra Rykowska

Jagiellonian University in Kraków, Poland

The study aims to test whether the well known stylometric methods are able to distinguish machine and human literary translation. The analysed corpus consists of novels translated into English, French and Polish to test whether the inflectionality of language plays a role in the quality of translation.

 
11:00am - 12:30pmSP-23
Location: B304 (TB)
Session Chair: Peter Boot, Huygens Institute for the History and Culture of the Netherlands
 

In-depth analysis of social networks of translations of literary narratives

Menno van Zaanen

South African Centre for Digital Language Resources, South Africa

Previous work showed major differences in social networks of main characters and their relations of translations of a narrative. Here, we investigate reasons why this is the case. Quality of named entity recognition has the largest impact, while differences in language preferences do not have a major impact.



Locative narratives: an open access to the renewal of place and self

Varvara Chatzi

NATIONAL AND KAPODISTRIAN UNIVERSITY OF ATHENS, Greece

This submission will focus on a recent locative narrative in the context of contemporary Greek literary production (2021-2022)- Ismini Gatos' america2 -and will reveal that space, time and body, in physical or digital terms can recontextualize the relationship of any user with the local environment and also with themselves.



Research on the Construction of a Digital Narrative Model for Chinese Historical Classics

Xinyi Yuan, Chengxi Yan, Min Yu

Renmin University of China, China, People's Republic of

This study constructs a digital narrative model for Chinese historical classics, supporting nonlinear, interactive storytelling and deep knowledge exploration through multidimensional narrative pathways.



11:00am - 11:10am

One tree to Yule them all? Reflexions on intertextuality and text transmission

Jean-Baptiste Camps, Kelly Christensen, Ulysse Godreau, Théo Moins

École nationale des chartes, Université PSL, France

This study explores the role of intertextuality in manuscript transmission using a Yule process model, extending previous birth-death approaches. Analysis of three major sets of medieval texts suggests that including speciation events better represents the heavy-tailed distribution of surviving witnesses per text.

 
11:00am - 12:30pmSP-28
Location: B309 (TB)
Session Chair: Marie Theresa O'Connor, Johns Hopkins University
 

Librarians Critical Digital Literacy Guide to Smart Software Selections

Joshua Chalifour1, Mona Elayyan2

1Concordia University, Canada; 2Wilfrid Laurier University, Canada

This presentation introduces a scaffolded pedagogical guide to teach the practice of discovering, understanding, and evaluating DH tools and techniques that align with research objectives. It is founded on critical digital literacy skills for search processes and the context in which researchers develop new information.



Open Science and Digital Humanities: Ethical Challenges of Informed Consent in the Era of Transparency and Privacy

Jonas Ferrigolo Melo1, Moises Rockembach2

1University of Porto, Portugal; 2University of Coimbra, Portugal

This study examines ethical challenges in open science within Digital Humanities, focusing on balancing transparency, privacy, and compliance with data protection laws. By analyzing two COVID-19-related projects in Brazil and Portugal, it highlights informed consent practices, proposing adaptive governance models and strategies to harmonize openness with individual rights.



Is Open Data Really Open? The Hansard Parliamentary Data Case Study

Lucia Michielin, Jessica Witte, Kenneth Fordyce

University of Edinburgh, United Kingdom

This paper presents a case study of web scraping for interdisciplinary research, highlighting both the practical implementation of these methods and strategies for overcoming similar challenges. We position web scraping as a crucial tool for ensuring data is accessible, not as a commodity, but in line with FAIR principles.



Preserving AI Voices

Marie Theresa O'Connor

Johns Hopkins University, United States of America

Some big emerging questions are about humans conversing ever more with AIs. For instance, how will we be affected? Yet, despite the volume of human/AI conversations, few public archives exist to preserve them. My paper introduces Preserving AI Voices, a public digital humanities project that creates such an archive.

 
12:30pm - 2:00pmLunch - 17th (see restaurants on website)
12:30pm - 2:00pmKADH meeting
Location: Aud B2 (TB)
12:30pm - 2:00pmPoster (17th)
Location: B007 (TB)
 

keylog.js: An Open Source Pedagogical Tool for DH and Data Studies

Taylor ARNOLD

University of Richmond, United States of America

Presents the pedagogical tool keylog.js, a minimal javascript-based tool that provides privacy-focused, client-side keylogging software served through a static website to address questions about the ethics, privacy, and accessibility of technologies and algorithms.



HTR of a historical manuscript with multiple languages, scripts, and hands

Martina Scholger1, Elisabeth Steiner1, Melanie Frauendorfer1, Sabrina Strutz1, Hans-Jörg Döhla2, Henning Klöter3

1University of Graz, Austria; 2University of Tübingen, Germany; 3Humboldt-Universität zu Berlin, Germany

This contribution investigates the application of Handwritten Text Recognition (HTR) to the automated transcription of a multilingual historical manuscript.



DoTS: FAIRly publishing your textual data with the DTS API

Philippe Pons, Vincent Jolivet, Jean-Victor Boby, Lucas Terriel

École des chartes - PSL, France

This poster aims to present DoTS, a comprehensive and functional suite of tools for publishing corpora in compliance with the DTS specification, integrating backend, API responses, and frontend for the creation of adaptable websites.



User Experience and Accessiblity in Digital Humanities Projects: A Survey

Kathie Gossett1, Liza Potts2

1Brigham Young University, United States of America; 2Michigan State University, United States of America

This poster investigates how digital humanities (DH) scholars world-wide implement user experience (UX) practices in their projects. Through a survey, the authors will explore the integration of UX methodologies, identify barriers to adoption, and aim to promote more accessible, user-centered digital tools, ultimately broadening DH’s reach and engagement.



Trauma Writing and Climate Migration Narratives

Parham Aledavood

Université de Montréal, Canada

This research examines a corpus of contemporary migration novels to explore trauma. Using a mixed methodology, it investigates how these narratives depict human and non-human migrations, challenging anthropocentric environmental discourse while revealing the cultural imagination of climate change through recurring motifs, emotional arcs, and anticipatory memory.



Beyond the Rugged Consumer: Enabling Communal Experiences in Digital Cultural Heritage

Jonatan Jalle Steller

Academy of Sciences and Literature Mainz, Germany

The poster presents five strategies to produce more communal experiences in cultural-heritage software, exemplified using the Cultural Heritage Framework. The strategies are developed against the background of experts or 'rugged consumers' being the de-facto target audience of many editions, dictionaries, repositories and similar offerings.



Development of a Commentary Generation System for Western Classical Texts

Ikko Tanaka1, Jun Ogawa2, Naoya Iwata3

1J.F. Oberlin University; 2National Institute of Informatics; 3Nagoya University

We present Humanitext Antiqua, a system employing Retrieval-Augmented Generation (RAG) and advanced large language models (LLMs) to generate scholarly interpretations of classical texts. Using Plato’s Republic as a case study, the system integrates primary texts, commentaries, and secondary literature, addressing challenges in traditional referencing and text segmentation for enhanced academic research.



Oracle Bone Reassembly Based on Diffusion Model

Guang Yang

BNU-HKBU United International College, China, People's Republic of

This paper introduces a machine learning approach to reassemble fragmented oracle bones, which are important materials for understanding early Chinese history. Specifically, we propose a model based on the Diffusion Model, a generative deep learning framework that has demonstrated remarkable performance in computer vision tasks in recent years.



Which chatbot generated the most racial and ethnic stereotypes?

Aleksandra Rykowska

Jagiellonian University in Kraków, Poland

This study proposes a comparison between three most popular chatbots: ChatGPT, Claude AI and Google Bard. Racial, ethnic and gender stereotypes were researched in the generated short stories. The stylo package for R and its function oppose() as well as the method of topic modelling were used in the study.



Webs of Cruelty: Network Analysis of Carceral Institutions for Girls and Women in 19th Century Indiana

Brianna Jean McLaughlin

Indiana University, United States of America

I have created a network that I have used to track influence, cashflow, and cruelty across 8 carceral institutions over approximately 50 years. In doing so, I can prove that there was an intricately weaved web of custodial cruelty among "fallen" girls and women in 19th Century Indiana.



Nature versus Artefacts: Places and Objects in Nineteenth-Century Novels from Spain and Latin America

Ulrike Henny-Krahmer, Caroline Müller

Universität Rostock, Germany

Space, places, and spatial objects have been of interest for literary historical research for a long time. We take up this state of research by analyzing natural and artificial spaces, places, and spatial objects in nineteenth-century novels from Spain and Latin America.



Towards the “Model Building in the Humanities through Data-Driven Problem Solving” based around the Japanese Literary Studies

Nobuhiko Kikuchi

National Institute of Japanese Literature, Japan

This paper introduces a large-scale DH research project in Japan that the National Institute of Japanese Literature is undertaking over the next ten years. The aim of this project is to construct big data on Japanese pre-modern texts and to promote data-driven humanities.



Programming Pedagogies: Exploring GitHub as a Platform for Coding Training in DH

Owen Monroe, Zoe LeBlanc

University of Illinois Urbana-Champaign, United States of America

This poster examines GitHub as a pedagogical platform in DH, analyzing its role in fostering coding literacy. By identifying pedagogical activities and practices, we explore how GitHub data can address gaps in training and inform the development of inclusive and effective programming pedagogies for the field.



The Social Sciences and Humanities Open Marketplace: contextualising digital resources in a registry

Clara Boavida2, Elena Battaner Moro3, Laure Barbot1, Michael Kurzmeier1

1DARIAH, Germany; 2Iscte-Instituto Universitário de Lisboa; 3Universidad Rey Juan Carlos

The SSH Open Marketplace is a discovery portal which pools and contextualises resources for Social Sciences and Humanities research communities: tools, services, training materials, datasets, publications and workflows. This poster presents how this service can provide insights into the use of tools, methods and standards in the DH research communities.



Controlled Vocabularies for a Knowledge Graph on Open Educational Resources

Petra C. Steiner1, Jonathan Geiger2, Frank Lange3, Abdelmoneim Amer Desouki1

1Technical University of Darmstadt; 2Academy of Sciences and Literature Mainz; 3RWTH Aachen University

The DALIA project aims to make open educational resources (OER) on data literacy accessible and interoperable. A knowledge graph is developed to link the materials, using the DALIA Interchange Format (DIF) to ensure transparency and interoperability. This poster focuses on picklists for DIF and invites feedback from the professional community.



Scholarly Navigation on an Open Science Platform: A Computational Study of OpenEdition’s Server Logs

Mohsine Aabid1,2, Simon Dumas Primbault1, Patrice Bellot2

1OpenEdition (CNRS / AMU), France; 2Laboratoire d'informatique et des Systèmes (LIS), France

This study analyzes OpenEdition’s server logs to uncover user navigation patterns across its platforms. Using methods like transition analysis, clustering, and topological modeling, it reveals platform fidelity, distinct user profiles, and shared interests. Future work aims to expand the scope with action-based analysis for deeper insights.



Mapping Collaborations in Performing Arts: Building the Festival d’Avignon Digital Corpus

Nicolas Foucault, Jeanne Fras, Clarisse Bardiot

Université Rennes 2, France

This poster presents the p2AFA corpus, a digital resource of Festival d’Avignon programs and playbills (1947–2024) for studying performing arts collaborations. Combining OCR, machine learning, and diplomatic transcription, it enables network visualization and historiographical analysis.



Intangible and Tangible heritage data integration. Models for management, visualization and knowledge. [INTHEDATA]

Patricia Ferreira-Lopes, Francisco Pinto-Puerto, Elena González-Gracia

Departamento de Expresión Gráfica Arquitectónica, Universidad de Sevilla, Spain

In this poster we will present the INTHEDATA project, in particular, the current state of knowledge and best practice in the area of cultural heritage and semantic knowledge model, the objectives of the project, its methodology by implementing the CIDOC-CRM standard and the first results.



Mirror, Mirror on the Wall: Enabling Computational Research on Beauty Ideals

Tim Gollub1, Pierre Achkar2,3, Martin Potthast4, Benno Stein1

1Bauhaus-Universität Weimar, Germany; 2Leipzig University; 3Fraunhofer Institute Leipzig; 4University of Kassel, hessian.AI, and ScaDS.AI

We present our work on the development of machine learning classifiers trained to assess whether a given input image aligns with a specific beauty ideal. The work is part of our effort toward enabling large-scale computational research on beauty ideals, a subject that is both culturally significant and socially impactful.



Ghost City:Augmented Reality Restoration of Two Hundred Lost Mosques in Belgrade

Uliana Pyadushkina

Faculty of Liberal Arts and Sciences Montenegro, Russian Federation

This project aims to recover the cityscape of lost Muslim heritage in Belgrade by superimposing 200 destroyed mosques onto the modern cityscape at their original locations, using 3D-models in augmented reality, with textures based on the restored appearances of the mosques derived from old photographs, documents and sketches from travelogues.



Development and Evaluation of the Information Retrieval System for Humanities Archives using LLM

Kenshin Kobayashi1, Koki Itagaki2, Tomoaki Tsutsumi2, Atsushi Matsumura2, Norihiko Uda2

1GLOBAL SECURITY EXPERTS Inc., Japan; 2University of Tsukuba, Japan

This study aims to establish an effective information-provision method for humanities research, and as part of this effort, we developed an information retrieval system utilizing recently prominent technologies, LLM (Large Language Models) and RAG (Retrieval-Augmented Generation). This paper describes the developed system and its performance evaluation.



Minimal Computing Meets Public History: The Stadt.Geschichte.Basel Approach to Open Research Data with CollectionBuilder

Moritz Twente1, Moritz Mähr1,2

1Universität Basel, Switzerland; 2Universität Bern, Switzerland

This poster highlights how Stadt.Geschichte.Basel created an Open Research Data Platform using CollectionBuilder. By applying minimal computing principles, the platform addresses challenges of accessibility, sustainability, and inclusivity in digital history. It provides adaptable, FAIR solutions that enhance interdisciplinary research, support marginalized perspectives, and foster long-term usability of historical data.



CLARIAH-ES: A Distributed Research Infrastructure for the Digital Humanities

Elena Battaner Moro1, Ainara Estarrona Ibarloza2, Aritz Farwell2

1Universidad Rey Juan Carlos, Spain (URJC); 2Euskal Herriko Unibertsitatea (UPV/EHU)

CLARIAH-ES is a Spanish national research infrastructure that strengthens research and facilitates innovative approaches within the digital humanities. By integrating language technologies, text analysis, cultural heritage, and multilingual resources, CLARIAH-ES offers a unique ecosystem for scholars interested in exploring the Spanish, Catalan, Galician, and Basque languages and cultures.



Romani Language in Google Translate: Ethical Considerations

Olga Shablykina, Leonardo Melis, Murad Mustafayev, Shayan Ahmed Shariff

IDMC, Université de Lorraine, France

Google including Romani in their MT engine raises ethical concerns regarding linguistic preservation and cultural respect. Lack of transparency, poor translation quality, possible negative implications for language speakers are among the issues. It appears that the BigTech companies prioritize quality over quantity when it comes to support of lower-resource languages.



READ-COOP and Transkribus: cooperative approaches to sustainable and responsible digital infrastructure

Melissa Terras1, Bettina Anzinger2, Guenter Muehlberger3, C. Annemieke Romein4, Andy Stauder2, Florian Stauder2

1University of Edinburgh, United Kingdom; 2READ-COOP, Innsbruck, Austria; 3Leopold Franzens Universität für Innsbruck, Austria; 4University of Twente, the Netherlands

How can we sustainably build digital scholarship infrastructures that best serve their communities, encouraging co-ownership and input into their development? This poster examines the cooperative business model underpinning READ-COOP (https://readcoop.eu) and Transkribus (https://transkribus.org), an Automated Text Recognition platform, providing a blue-print for the establishment of responsible, democratic, cooperative digital infrastructures.



Engaging Researchers for Improving Services and Training: Insights from the ATRIUM Survey and Researcher Forum

Tomasz Umerle1, Tiziana Lombardo2, Iulianna van der Lek3, Maria Ilvanidou4, Carol Delmazo5

1Digital Humanities Centre IBL PAN; 2Net7; 3CLARIN ERIC; 4Athens University of Economics and Business; 5OPERAS

The ATRIUM project enhances access to digital research infrastructures in Arts, Humanities, and Social Sciences by improving services and creating a tailored curriculum for the research community. The poster showcases how, through a survey and workshops, ATRIUM integrates community feedback to bridge skills gaps and deliver impactful open training resources.



Longevity, Accessibility, and Multilingual Micro-editions at Scholarly Editing: A Multimedia, Open-access Journal for Recovery Practitioners

Raffaele Viglianti1, Noelle A. Baker2

1University of Maryland, United States of America; 2Independent Scholar

Scholarly Editing is an open-access, peer-reviewed journal that welcomes contributions that feature rare or marginal texts and small-scale editions for the discoverability of underrepresented stories and artifacts. This poster will introduce the journal’s purpose and present the journal’s strategies to ensure the longevity of its digital content.



O multilinguismo da produção científica em Humanidades Digitais nos últimos 5 anos: uma análise a partir da Web of Science Core Collection

Maria Filipa Torres1, Maria Manuel Borges2

1Univ. Coimbra, FLUC; 2Univ Coimbra, CEIS20, FLUC

O multilinguismo deveria afirmar-se nas Humanidades Digitais (HD). O objetivo deste trabalho é analisar se a produção científica em HD na Web of Science Core Collection o reflete. É um estudo bibliométrico com um corte transversal retrospetivo (2020-2024). Conclui-se que o inglês predomina, mas existe espaço para outros idiomas.



Memory of 518: A Web-Based Platform Connecting Literature, Archival Records, and User-Generated Data

Chaeyeon Jeong, Moonui Kim, Jihyo Jeon

Korea University, Korea, Republic of (South Korea)

This project builds a web-based literary tour platform called ‘Memory of 518’, integrating literary works, factual records, and user-generated data related to the Gwangju May 18 Democratic Uprising. Using maps, 360-degree images, and user contributions, it documents and visualizes the fictional, historical, and everyday aspects of 518 Gwangju.



Geo-Databases on Paper - Structured Data from Historical Maps

Anastasia Bauch, Klaus Stein, Carmen Enss

UrbanMetaMappingTransfer, University of Bamberg

The proposed poster introduces a workflow for data extraction from historical maps into a structured format by manually digitising scanned maps with the OpenSource GIS software QGIS. We present our work in progress on a set of maps from our research in the UrbanMetaMapping project.



Bootstrapping Corpora Building of Low-Resourced Language Texts Using the Universal Declaration of Human Rights

David Bainbridge1, Sulhan Algee1, J. Stephen Downie2, Hemi Whanga3

1University of Waikato, New Zealand; 2University of Illinois, United States of America; 3University of Massey, New Zealand

Digtal Humanities scholars need NLP tools to create new corpora of low-resourced languages, but such tools need to be trained on “non-existent” corpora creating a classic boot-strapping problem. We use the text from the Universal Declaration of Human Rights, along with a lexicon-based interative search strategy, to overcome this problem.



Visualising Africa in Chinese Media: A Preliminary Computer-Assisted Study of 1950s-1980s Representation in Journal Illustrations and Book Covers

Jodie Yuzhou Sun (co-first author)1, Fudie Zhao (co-first author)2, Qilin Hu1

1Fudan University, China; 2University of Oxford, United Kingdom

This study explores the visual representation of Africa in Chinese media (1950s-1980s), creating a digital archive and applying AI tools, including large language models and Contrastive Language-Image Pre-training (CLIP), for bidirectional text-image retrieval, offering fresh insights into Sino-African relations and cross-cultural visual studies.



Customizing Omeka S for Linguistic Linked Open Data: A Case Study of the NINDA Language Resource Archive

So Miyagawa1,2, Yifan Wang1,3, Takanori Ito4, Tomokazu Takada1

1National Institute for Japanese Language and Linguistics (NINJAL), Japan; 2University of Tsukuba, Japan; 3University of Tokyo, Japan; 4Institute of Science Tokyo, Japan

NINDA (NINJAL Digital Archive) adapts Omeka S to manage linguistic resources, particularly for Japonic languages. It implements IIIF for multimedia content and OntoLex Lemon for lexical data structuring, supporting FAIR principles. The system handles annotated recordings, interlinear texts, and lexical databases, making linguistic resources more accessible to researchers and communities.



Integrity in Digital Scholarly Editing: The GreekSchools Case

Simone Zenzaro1, Angelo Mario Del Grosso1, Federico Boschetti1, Graziano Ranocchia2

1Istituto di Linguistica Computazionale "A. Zampolli" - CNR, Italy; 2Università di Pisa

Textual scholarship aims to reconstruct and publish texts through critical apparatuses. The DSL-based Digital Scholarly Editions (DSE) method merges traditional editing with computational techniques, enhancing workflows and adhering to open science principles. The GreekSchools project exemplifies this approach, and the CoPhiEditor implements it as a software solution.



Quil2Vec: A Tool for Vector Manipulation of Medieval Latin Script

Herman Gerrit Makkink

University of Vienna, Austria

This poster will present a tool currently being developed for doing image vectorization of medieval script, called “Quil2Vec”. This tool to intended to expedite the production of image vectors as ground truth for multiple different text-based machine learning research applications.



Enhancing Open Science through the SCIROS Project

Gabriela Manista, Maciej Maryl, Tomasz Umerle, Cezary Rosiński, Marta Świetlik, Magdalena Wnuk, Mateusz Franczak, Piotr Wciślik

Institute of Literary Research Polish Academy of Science, Poland

The SCIROS project aims to enhance Open Science in the humanities and social sciences by tackling theoretical, practical, and infrastructural challenges with 6 international partners. By fostering interdisciplinary collaboration and sharing insights via the blog, the project supports the widespread adoption of OS practices.



Building a Peer Review Framework for Non-Traditional Research Outputs

Françoise Gouzi1, Anne Baillot1, Sarah Bénière2, Carol Delmazo3, Toma Tasovac1

1DARIAH-EU; 2INRIA; 3OPERAS

This poster aims to present our ongoing work on developing the evaluation framework for open peer review assessment of non-traditional research outputs as a contribution toward maximising the quality and impact of Arts and Humanities research in Europe in the context of the Coalition for Advancing Research Assessment (CoARA).



Disputes over Cultural Power in Digital Repatriation: Insufficient Interpretations of Cultural Objects in Cross-cultural Contexts

yujue wang, jingya fan, hanying wen

Wuhan University, the People's Republic of China

After digital repatriation, cultural institutions often still remain digital replicas. This study compares metadata records of ancient Chinese paintings across various museums, revealing that interpretations in cross-cultural contexts are influenced by cultural backgrounds, and finally suggests improving original communities' control over digital replicas in legal, ethical, and technical aspects.



Privatbriefe als marginalisiertes Kulturgut

Debby Trzeciak1,2

1TU Darmstadt, Germany; 2Hochschule Darmstadt, Germany

Die Sammlung und Bewahrung von marginalisierten Kulturgütern wie Privatbriefen unterliegen bisher keiner einheitlichen Archivierung und Standards. Das Dissertationsprojekt adressiert die Frage, wie die nachhaltige und dauerhafte Erschließung nach internationalen Standards im Spannungsfeld der FAIR-, CARE- und Open-Prinzipien gelingen und die maschinenlesbare und interoperable Digitalisierung des Kulturguts ermöglicht werden kann.



“HumAInities: Exploring the Impact of AI on Humanities disciplines”

Michael Sinatra1, Dominic Forest1, Jean-Philippe Magué2

1Université de Montréal, Canada; 2ENS Lyon

Our poster will present the partnership development grant “HumAInities: Exploring the Impact of AI on Humanities disciplines”, its goals and expected results. Our project seeks to understand the changes brought about by the impact of AI on the production and dissemination of knowledge within the humanities.



Vedic Sanskrit OCR as a Bridge between Text and Image Platforms

Yuzuki Tsukagoshi, Ikki Ohmukai

The University of Tokyo, Japan

This study develops a Vedic Sanskrit OCR model to bridge the gap between text and image platforms.We fine-tunes TrOCR on Vedic, aligning images with texts using eScriptorium as a tool for creating groundtruth, suggesting a cyclic process to create text and image correspondences and to impove the performance of OCR.



A Multimodal Approach to Historical Sources in the 18th–19th Century Balkans

Kristiyan Sergeev Simeonov1, Maria Baramova2

1Sofia University "St. Kliment Ohridski", Bulgaria; 2Sofia University "St. Kliment Ohridski", Bulgaria

This poster proposes a multimodal approach to historical research, utilizing HTR, NLP pipelines, and GIS in a user-friendly manner. By integrating advanced computational methods with traditional humanities research, we aim to create a model that can be replicated for other underrepresented regions and languages.



From Late-Antique Text to 21st Century Literature Database: Babylonian Talmud Stories as a Case Study

Itay Marienberg-Milikowsky

Ben-Gurion University of the Negev, Israel

This poster explores the challenges and opportunities of digitizing late-antique literature, focusing on the Babylonian Talmud. By creating a comprehensive database of Talmudic stories, it aims to expand computational literary studies. The poster will discuss the methodological challenges involved in building this database, including text extraction, annotation, and modeling.



Detecting divergent language use in Russian Media during the Russo-Ukrainian War: Steps towards interpretable propaganda detection and analysis

Anastasiia Vestel, Stefania Degaetano-Ortlieb

Saarland University, Germany

This study examines divergent language use in Russian state-controlled media and social media during the Russo-Ukrainian war using the WarMM-2022 corpus and Kullback-Leibler Divergence (KLD). KLD offers interpretability advantages over more opaque machine learning techniques allowing a deeper understanding of how propaganda techniques are linguistically construed and evolve over time.



O compromisso com a Ciência Aberta: a Gestão de Acervos da Fiocruz

Mônica Garcia1, Maria Manuel Borges2, Maria Cristina Soares Guimarães3

1Univ. Coimbra, FLUC, Portugal; 2Univ. Coimbra, CEIS20, FLUC, Portugal; 3Fundação Oswaldo Cruz, Brasil

A proposta visa desenvolver um modelo de gestão de acervos científicos alinhado com as diretrizes internacionais de Acesso Aberto (AA), especialmente considerando o Plano S e as transformações no sistema de comunicação científica.



Creating Open Source, Multilingual DH Tools with Rust

Ian Patrick Goodale

University of Texas at Austin, United States of America

This poster highlights three open source software packages I created in the programming language Rust. The packages include lemmatizing, readability, and stylometry algorithms, and were intentionally designed to create new resources to facilitate analysis of and engagement with multilingual and non-English languages in the Rust ecosystem.



Doing Literature: A Multimedial Index of Research Outputs

Stefanie Messner1, Viktor J. Illmer2, Mark Schwindt2

1fortext lab, Technische Universität Darmstadt, Germany; 2EXC 2020 ‘Temporal Communities’, Freie Universität Berlin, Germany

Doing Literature is a web portal designed to collect and curate research contributions of the humanities in multimedia formats. It aims to develop an innovative framework that engages diverse audiences, thereby enhancing Digital Public Humanities and emphasising their collaborative character as well as their potential in knowledge creation.



Making cultural heritage open: a semantic portal for Germanic Cultural Heritage in Veneto

Chiara De Bastiani

Università Ca' Foscari Venezia, Italy

This poster presents a user interface developed within the OntoVE project. The poster focuses on the search interface, built with the Sampo Model (Ikkala et al. 2022), and its search perspectives, which allow users to explore data through the faceted search paradigm (Tunkelang 2009).



Computer-Assisted Hermeneutics of Philip K. Dick's Corpus: Constructing a Personal Knowledge Base with SpaCy and Obsidian for Literary Analysis

Yann Audin

Université de Montréal, Canada

This proposition showcases a Python library designed to interface with the text editor Obsidian to create a literary database of a corpus. We use Philip K. Dick's science-fiction as the exemplatory corpus, and showcase how classical Natural Language Processing can be used in computer-assisted literary hermeneutics.

 
2:00pm - 3:30pmSP-30
Location: Aud B2 (TB)
Session Chair: Anabela dos Santos Fernandes, University of Coimbra
 

An OIE Pipeline for the Identification and Production of Missing Biographical Knowledge

Jonah Lubin1, Marco Antonio Stranisci2

1Harvard University, United States of America; 2University of Turin, Italy

We present an Open Information Extraction pipeline to identify and address knowledge gaps in Wikidata for underrepresented writers, using the Leksikon Fun Der Nayer Yidisher Literatur as a case study. Our approach benchmarks representation, assesses property alignment, and introduces resources to enhance digital humanities research on marginalized literatures.



Making GLAM resources more accessible and reusable: a FAIR case study on European Literary Bibliography

Gustavo Candela1, Cezary Rosiński2, Arkadiusz Margraf3

1University of Alicante, Spain; 2Institute of Literary Research of the Polish Academy of Sciences; 3Institute of Bioorganic Chemistry of the Polish Academy of Sciences

This study presents a reproducible framework for publishing and reusing bibliographic metadata from GLAM, focusing on the European Literary Bibliography. It emphasizes Linked Open Data transformation, metadata enrichment, and computational reuse via Jupyter Notebooks. Key contributions include a framework, DH research scenarios, and tools enabling scholarly exploration of bibliographic collections.



Improving access to interchanges between material and immaterial cultural heritage through semantic modeling

Sofia Baroncini1, Melissa Macaluso2,3, Charles van den Heuvel4,5

1Leibniz Institute of European History (IEG), Mainz, Germany; 2La Sapienza University of Rome, Italy; 3University of Turin, Italy; 4Huygens Institute, Amsterdam, Netherlands; 5University of Amsterdam, Netherlands

Semantic modeling can play an important role in enhancing accessibility to the immaterial culture related to artifacts. To this end, we examine whether the domain standards CIDOC-CRM and LRMoo can express the interactions of an artwork with the contexts it traverses through a case study of XVII Century integrative restoration.



Preserving Musical Ephemera : A Digital Archive Framework for Classical Vocal Music

Minji Kim, Eunsoo Lee

Seoul National Univeristy, Korea, Republic of (South Korea)

This study introduces a domain-specific ontology and digital archive for classical vocal music ephemera in South Korea. Addressing data fragmentation and inconsistent formats, it integrates Linked Open Data principles and visualization tools to ensure accessibility, cultural preservation, and analytical exploration across a decade of performance ephemera.



Historical Wine Labels of the German Mosel Region: Enabling Insights into Visual Cultural Heritage using Linked Open Data

Christof Schöch1,2, Maria Hinzmann1, Veronica Wassermayr1, Joëlle Weis1, Achim Rettinger2

1Trier Center for Digital Humanities, Trier University, Germany; 2Computational Linguistics and Digital Humanities, Trier University, Germany

This paper presents a project undertaking the digitisation, enrichment, modeling and publication of modern and historical wine labels from the German Mosel region as witnesses of local cultural history using manual annotations and multimodal Large Language Models for enrichment and Linked Open Data for data modeling.

 
2:00pm - 3:30pmPanel 04
Location: Aud B3 (TB)
Session Chair: Sylvia Arlene Fernandez, University of Texas San Antonio
 

Data Advocacy for All: Working and Teaching with Data for Social Change

Laurie Gries1, Cameron Blevins2, Sylvia Fernandez Quintanilla3

1University of Colorado-Boulder, United States of America; 2University of Colorado-Denver, United States of America; 3University of Texas at San Antonio, United States of America

This panel of rhetoric, history, and Hispanic studies scholars aims to invigorate data advocacy research and education in the digital humanities by presenting and discussing the challenges and rewards of their work with three data-driven public humanities projects--a digital hate-tracking project, an online educational toolkit, and an online data repository.

 
2:00pm - 3:30pmLP-22
Location: Aud C1 (EC)
Session Chair: Ulrike Henny-Krahmer, Universität Rostock
 

Urban spatial narratives of Guangzhou in Zhu Zhi Ci (Bamboo Branch Poetry):a Phonotextual Perspective and Literature Cartographical Approach

Yinglin Wang, Xiaochuan Pan, Jingqing Lv, Jie He

Harbin Institude Of Tecnology (shenzhen), China, People's Republic of

This study utilizes phonotextual and cartographical perspectives to analyze Guangzhou Bamboo Branch Poetry, exploring emotional expressions and cultural landscapes. By examining textual features and Cantonese phonetics, we reveal the interplay of history, landscape, and local customs, highlighting the genre's significance in documenting urban life and cultural evolution.



Scene Change Detection in 20th-Century US-American Romance Fiction

Svenja Simone Guhr1,2, Huijun Mao2, Fengyi Lin2, Alexander J. Sherman2, Mark Algee-Hewitt2

1Technical University of Darmstadt, Germany; 2Literary Lab, Stanford University, USA

This study explores scene change detection in 20th-century US-American romance fiction using manual annotations and automated methods. Manually annotated novels build the training data for fine-tuning an English BERT USE model, yielding promising preliminary results for automated text segmentation in computational literary studies.



New approaches to understanding perceptions of distance and landscape in historical travel writing: The changing geographies of picturesque and wild in the English Lake District

Ian Gregory1, Ignatius Ezeani1, Erum Haris2, Joanna Taylor3

1Lancaster University, United Kingdom; 2University of Leeds, United Kingdom; 3University of Manchester, United Kingdom

This paper explores ways of representing the complex ways that landscapes can be described and how this changes over time drawing on the concepts of ‘picturesque’ and ‘wild’ in the English Lake District. It evaluates a range of approaches to landscape description and perceived nearness and how these changed over time.

 
2:00pm - 3:30pmLP-21
Location: B207 (TB)
Session Chair: Luisa Ripoll-Alberola, Leipzig University
 

Text Mining Gender Depictions in Epitaphs Verses from Northern Wei (386–539 C.E.) China

Wenyi Shang1, Seiko Ochi2

1University of Missouri, United States of America; 2Meijo University, Japan

Using Transformer-based models, this study contributes to an inclusive approach to understanding history, examining how males and females were depicted differently in medieval Chinese epitaph verses and which classical texts these verses resemble. The findings highlight a discourse shaped by patriarchal privileges, echoing the assertion that "the subaltern cannot speak."



Rewriting Tradition: Quantifying Change in Lady Gregory’s Irish Legends

Rachel McCarthy, Rasika Edirisinghe, James O'Sullivan, Clíona Ó Gallchoir, Rosane Minghim, Órla Murphy

University College Cork, Ireland

This paper uses computational methods to analyse how Lady Augusta Gregory's translations of traditional Irish legends reimagined the original works for the purposes of aligning them with cultural nationalism and the Irish Literary Revivalist perspectives.



Tracing Antiquity: References to Greco-Roman Authors in Modern Academic Discourse

Luisa Ripoll-Alberola1, Leonardo D'Addario2, Manuel Burghardt1, Monica Berti2,1, Mark Depauw3

1Computational Humanities, Leipzig University, Germany; 2Ancient History, Leipzig University, Germany; 3Ancient History, KU Leuven, Belgium

This paper examines references to Greco-Roman authors across disciplines in a corpus of 56,116 academic articles. Using digital methods, it identifies citation patterns, compares rankings with L’Année Philologique (bibliographic database of classical studies), and explores disciplinary differences.

 
2:00pm - 3:30pmLP-18
Location: B210 (TB)
Session Chair: Miguel Escobar Varela, National University of Singapore
 

Connecting Threads: Creating a Participatory and Globally Accessible Platform for the Study of Checked Indian Cotton Textiles

Deepthi Murali, Jason Heppler

George Mason University, United States of America

Connecting Threads explores connections between South Indian weavers and Caribbean consumers by linking small and large textile collections and archives to enhance access to global fashion histories. Featuring a PostgreSQL database and interactive visualizations, the paper details its technical development, collaborative methodology, and impact on equity and accessibility in DH.



Centering Civic Engagement with Open Scholarship: The Revolutionary City as a Model for Fostering Public Use of Digital Cultural Heritage

David Ragnar Nelson, Bayard L. Miller

American Philosophical Society, United States of America

The paper presents a two-pronged approach for fostering access to and use of digital archival holdings. This approach combines public use of HTR technologies and public involvement in producing interpretative digital scholarship. The framework presented seeks to encourage civic engagement and dialogue around the holdings.



Advancing OCR and Word Sense Disambiguation for the Jawi Script using LLMs and VLMs

Miguel Escobar Varela, Stephane Bressan, Faizah Zakaria, Ganesh Neelalkanta Iyer, Guo Quan Seng, Pratik Karmakar

National University of Singapore, Singapore

We introduce novel datasets and fine-tuned VLM and LLM models for OCR and word-sense disambiguation for Jawi (a writing system used historically for Malay). Our OCR system that outperforms previous solutions with a Character Error Rate (CER) of 8.66%, and a context-aware word sense disambiguation model that achieves 99.2% accuracy.

 
2:00pm - 3:30pmLP-19
Location: B302 (TB)
Session Chair: Elisa Cugliana, Universität zu Köln
 

Augmenting a Maquette of Bergen-Belsen Concentration Camp with Prisoner Artwork

Aliisa Råmark1, Stephanie Billib2, Héctor López-Carral3, Luca Verschure4,5, Pedro Fernandez Gomez3, Stefan Jänicke6, Tobias Ebbrecht-Hartmann7, Chris Hall8, Paul Verschure3,9,10

1Radboud University, Netherlands, The; 2Bergen-Belsen Memorial, Germany; 3Eodyne Systems, Spain; 4Sapiens5 Culture, The Netherlands; 5University of Twente, The Netherlands; 6University of Southern Denmark, Denmark; 7The Hebrew University of Jerusalem, Israel; 8Chris Hall Design, Denmark; 9Universidad Miguel Hernández de Elche, Spain; 10Future Memory Foundation, The Netherlands

Most Nazi persecution memorials use physical maquettes for informing the historical site’s spatial organisation to visitors. In this paper we present the Future Memory Maquette Explorer from MEMORISE exhibition at Bergen-Belsen Memorial. It uses Augmented Reality technology to allow users to explore prisoner artworks, conveying the human dimension of history.



Exploring the “Great Unseen” in Medieval Manuscripts: Instance-Level Labeling of Legacy Image Collections with Zero-Shot Models

Christofer Meinecke1,2, Estelle Guéville3, David Joseph Wrisley4

1Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Leipzig University, Germany; 2Image and Signal Processing Group, Leipzig University, Germany; 3Medieval Studies, Yale University, New Haven, USA; 4Arts & Humanities, New York University Abu Dhabi, Abu Dhabi, United Arab Emirate

We aim to theorize the medieval manuscript page and its contents more holistically, using state of the art techniques to segment and describe the entire manuscript folio, for the purpose of creating richer training data for computer vision techniques, namely instance segmentation, and multimodal models for medieval-specific visual content.



Playing the Past, Predicting the Future: Sortes Texts in Virtual Reality

Elisa Cugliana, Øyvind Eide, Lukas Wilkens, Nadjim Noori, Pascale Boisvert, Julia Haschke

Universität zu Köln, Germany

Our project reimagines medieval sortes texts through virtual reality, combining textual scholarship with performative modeling. By situating these divinatory texts in immersive settings—monastic libraries, astrologers' laboratories, and taverns—we simulate their ritualistic and interactive nature. This approach bridges philology, media studies, and media archaeology, offering new insights into multimodal historical textuality.

 
2:00pm - 3:30pmSP-29
Location: B304 (TB)
Session Chair: Joana Vieira Paulino, Institute of Contemporary History, NOVA FCSH / IN2PAST
 

A "Cathedral of Digital Data". An Application for the Medieval Registers of Notre-Dame

Vincent Jolivet, Lucas Terriel

École des chartes, France

Thanks to its microservices architecture, the eNDP application is designed as an access point to all the medieval resources available on Notre-Dame de Paris. We aim to present a method for valorizing and editorializing resources scattered across different research data repositories repositories, which takes advantage of their open APIs.



Digital Mapping Tools for Australian History and Cultural Heritage

Catharine Coleborne1, Penny Edmonds2, Andrew May3, Hugh Craig1, Bill Pascoe3, Paul Longley Arthur4

1University of Newcastle, Australia; 2Flinders University, Australia; 3University of Melbourne, Australia; 4Edith Cowan University, Australia

This paper reports on the outcomes of experimental work by the team involved with the Time Layered Cultural map (TLCMap) project funded by two Australian Research Council Linkage Infrastructure, Equipment and Facilities grants. It underlines the potential for digital mapping and digital humanities methodologies in historical and cultural heritage research.



Ethnobotany of the Tambov Region According to Historical Sources: Aims, First Results, and Perspectives

Kira Kovalenko1, Tatiana Makhracheva2

1European University at St. Petersburg, Russian Federation; 2Tambov State University named after G.R. Derzhavin

The aim of the paper is to present the project “Ethnobotany of the Tambov region according to historical sources”. This is an ethnolinguistic and ethnobotanical study of the Tambov region traditional culture on the material of the lexical-semantic field “Plants”, with the use of the PhytoLex database.



Framework for AI-Driven Heritage Research at Silahtarağa Archive

Doruk Şen1, Amed Gökçen2,3, Başak Koşanay2,4, Ece Balkan2,5

1Department of Industrial Engineering, Istanbul Bilgi University, Turkey; 2Silahtarağa Archive, Istanbul Bilgi University, Turkey; 3Institute for Area Studies, Leiden University, The Netherlands; 4Department of Political Science, Istanbul Bilgi University, Turkey; 5Department of Information and Document Management, Marmara University, Turkey

This paper presents a framework for AI-driven heritage research at Silahtarağa Archive, a repository of historical documents and maps related to Istanbul's urban development. The authors outline their approach to digitization, data preparation, and AI methodology, enabling researchers to analyze and explore the evolution of Istanbul's urban landscape.

 
2:00pm - 3:30pmLP-20
Location: B309 (TB)
Session Chair: Silvio Peroni, University of Bologna
 

Kalpana—Reimagining Museums in the Age of Digitality

Sayan Sanyal

Public Arts Trust of India

Kalpana explores the transformative role of digital technologies in reimagining museum spaces, focusing on immersive storytelling and experimental museology. Highlighting Global South perspectives, the project integrates AI-generated visuals and multimedia narratives to address inclusivity, accessibility, and decolonial aesthetics, offering innovative frameworks for cultural preservation and engagement in the digital age.



Enhancing Accessibility and Inclusivity at the University: Case Studies from the Virtual Campus and ARTEST Projects

Maria Sotomayor Chicote1, Elisabeth Reuhl1, Øyvind Eide1,2

1Department of Digital Humanities, University of Cologne, Germany; 2Center for Data and Simulation Science, University of Cologne, Germany

Accessibility and inclusivity are central to two University of Cologne projects compared here. Virtual Campus employs VR and AR for accessible campus navigation and cultural heritage engagement. ARTEST advances DH education and collaboration internationally. Both local and global initiatives for accessible, inclusive education are needed and benefit from mutual exchange.



Towards a Critical Ontology-based Knowledge Representation of Archipelagic Performance Histories

Hedren Sum, Alvin Eng Hui Lim, Kyueun Kim

National University of Singapore, Singapore

This paper develops a critical ontological framework to map 19th-20th century performance histories in Asia's archipelagic regions. Introducing Archipelagic Performance Histories Ontology (APHon), the domain ontology captures fluid geo-social relations and touring practices, challenging nation-centric narratives through a structured yet flexible framework that represents complex cultural and historical data.



Leveraging virtual technologies to enhance museums and art collections: insights from project CHANGES

Gianluca Genovese1, Ivan Heibi2, Silvio Peroni2, Sofia Pescarin3

1University of Suor Orsola Benincasa, Naples, Italy; 2University of Bologna, Bologna, Italy; 3Italian National Research Council, Florence, Italy

We investigated the use of virtual technologies to digitise and enhance cultural heritage (CH), aligning with Open Science and FAIR principles. Through case studies in museums, we developed reproducible workflows, 3D models, and tools fostering accessibility, inclusivity, and sustainability of CH. Applications include interdisciplinary research, educational innovation, and CH preservation.

 
3:30pm - 4:00pmCoffee-break (17th afternoon)
Location: B007 (TB)
4:00pm - 5:30pmSP-32
Location: Aud B2 (TB)
Session Chair: Nicholas Y. H. Wong, The University of Hong Kong
 

Historicizing Controlled Vocabularies in Digital Humanities: A Lightweight Context-Indexed Extension for Vocabulary Systems

Tsz-Kin Chau, Sarah Kenderdine

Laboratory for Experimental Museology, EPFL, Switzerland

This paper shows the necessity and motivation behind historicizing the power/knowledge embedded in LOD vocabulary systems. By utilizing CRMaaa, this paper presents a lightweight data model as a “quick fix” to augment existing vocabulary systems. This paper uses a particular case from a 19th c. painted panorama in Switzerland.



Radically inclusive software development for digital cultural heritage

Mia Ridge, Lanie Okorodudu, Saira Akhter, James Misson, Erin Burnand

British Library, United Kingdom

Sustaining open source software can be challenging. We discuss collaboration on the Universal Viewer (UV), software designed to display cultural heritage collections. We highlight methods including innovative, inclusive and multi-institution sprints. We showcase UV’s evolution, including accessibility and user experience enhancements, future plans and ways for others to contribute.



Local Contexts, Global Conversations: Digital History in Central Asia

Dinara Gagarina

University of Erlangen-Nuremberg, Germany

This study explores the emergence of digital history in Kazakhstan, Kyrgyzstan, and Uzbekistan, highlighting innovative projects, thematic focuses, and methodological shifts. Integrating literature reviews, interviews, and community events, it reveals infrastructural challenges, underscores postcolonial dimensions, and suggests that diverse, region-specific approaches can enrich global digital humanities discourse in meaningful ways.



A Conceptual History of Humanism in a Post-WWII Chinese-language Literary Journal via Word Vector Spaces

Nicholas Y. H. Wong

The University of Hong Kong, Hong Kong S.A.R. (China)

This paper uses Chinese word vectors to develop a conceptual history of humanism and related keywords in a post-1945 modernist literary journal from Malaysia, and contributes to scholarship on digital multilingual practices, by asking how to accurately represent semantic and syntactic information from languages of non-Latin script in geometric spaces.



From Draft to Model: Semi-Automated Parametric Extraction of Historical Ship Designs

Giovanni Maria Pala1, Marco Mercuri2, Gian Maria Santi3, Lisandra Costiner4

1University of Oxford, United Kingdom; 2Bologna, Italy; 3University of Bologna, Italy; 4Utrecht University, Netherland

Using a historically informed approach, this contribution proposes a way to reconstruct historical ship 3D models, starting from their 2D drawings. It offers a study of the way ships were drawn, and uses this to charactyerise them as a parametrised problem.

 
4:00pm - 5:30pmSP-33
Location: Aud B3 (TB)
Session Chair: Yutong Yang, Shanghai Jiao Tong University
 

Towards an Evaluation Framework for Assessing Large Language Models in Text Encoding

Sabrina Strutz, Georg Vogeler

University of Graz, Austria

This contribution proposes a multifaceted evaluation framework for assessing the performance of Large Language Models (LLMs) in encoding historical letters according to the TEI Guidelines, using the Joseph von Hammer-Purgstall correspondence edition as a case study.



Investigating Conceptual Plasticity: On Detecting a Re-Conceptualization of Focalization with Large Language Models

Axel Pichler1, Janis Pagel2

1University of Vienna, Austria; 2University of Cologne, Germany

We investigate the extent to which LLMs are able to learn a redefinition of a concept from literary studies, focalization, and apply it adequately to text examples. It shows that, with one exception, there are no statistically significant differences between the LLM output for prompts with and without the redefinition.



Automated Extraction of Character Features in Fiction: Comparing Bert-based Models and Large Language Models on Fanfiction in English and Chinese

Xiaoyan Yang, Federico Pianzola

University of Groningen, Netherlands, The

Aiming to study cross-cultural narrative patterns, this research develops a computational framework for extracting character features from English and Chinese fanfiction. By evaluating traditional Bert-based models and LLMs on tasks including character recognition, coreference resolution, dialogue and trait extraction, it provides insights into NLP tools' performance in characterization analysis.



Automatic Tagging of Word Senses for a Large-Scale Historical Japanese Corpus

Soma Asada1, Kanako Komiya1, Masayuki Asahara2

1Tokyo University of Agriculature and Technology, Japan; 2NINJAL, Japan

We developed a system to automatically assign word sense tags to all content words in a substantial historical Japanese corpus, comprising over 20 million words. Our approach leverages a system based on Bidirectional Encoder Representations from Transformers (BERT), achieving an accuracy of 88.57%.



Leveraging Human Expertise for LLM-Assisted Dialogue Character Extraction and Attribution in Classic Chinese Novels

Yutong Yang1, Yuhan Guo2, Xiaoju Dong1, Xiaoru Yuan2

1Shanghai Jiao Tong University, People's Republic of China; 2Peking University, People's Republic of China

In this work, we propose a framework for extracting, annotating, attributing and visualizing dialogue characters in classic Chinese novels. We leverage interactive workflows to incorporate expert’s knowledge in the dialogue character extraction and attribution process.

 
4:00pm - 5:30pmSP-38
Location: Aud C1 (EC)
Session Chair: Suzanne Mpouli, Université Paris Cité
 

Triplet Extraction from Art-historical Texts for Knowledge Graph Creation

Julian Stalter1, Matthias Springstein2, Max Kristen1, Eric Müller-Budack2, Stefanie Schneider1, Elias Entrup2, Hubertus Kohle1, Ralph Ewerth2

1Ludwig-Maximilians-Universität München, Germany; 2Leibniz-Informationszentrum Technik und Naturwissenschaften, Hannover, Germany

This paper focuses on improving art-historical image search engines by combining Vision-language Models (VLMs) with knowledge graphs. The approach intends to enhance the interpretability and accuracy of search results by using triplets extracted from domain-specific knowledge with generative models. Providing this information to the user thus also increases the transparency of AI methods.



L’art public sous la loupe des citoyen·ne·s : modeler une interface pour la recherche avec les données MONA

Camila De Oliveira Savoi1,2, Lena Krause1,2, Corélie Godefroid1,2, Simon Janssen1,2, Barbara Marche2

1Université de Montréal, Canada; 2Maison MONA, Canada

Cette communication présente une interface de recherche pour étudier la réception de l'art public au Québec avec les données générées par les utilisateur·rice·s de l'application MONA. L’outil développé optimise l'exploration et l’analyse des expériences artistiques recueillies pour contribuer de nouvelles perspectives sur l'interaction citoyenne avec l'art public.



An analysis of symbolic associations in the Arts based on open data

Sofia Baroncini1, Bruno Sartini2, Marilena Daquino3

1Leibniz Institute of European History (IEG), Mainz, Germany; 2Ludwig-Maximilians-Universität (LMU), Munich, Germany; 3University of Bologna, Italy

In this study, we leverage two open datasets, respectively representing a dictionary of art symbols and scholars’ interpretations of ca. 400 artworks, to analyse how symbols and meanings vary in the art history hermeneutic discourse. Results show that the majority of scholar’s interpretations that could be aligned use conventional symbolism.



Semi/automated methods for digitising bomb damage from historical maps of the 2nd world war

S. Alvanides1, A. Bauch1, C.M. Enss1, K. Stein1, C. Ludwig2

1Otto-Friedrich-Universität Bamberg, Germany; 2Universität des Saarlandes, Germany

Our contribution examines methods for capturing spatial information from historical thematic maps depicting level of destruction during the second world war, focusing on the German city of Nuremberg (Nürnberg). We demonstrate three ways of capturing information from historical thematic maps, ranging from manual to semi-automated methods.



The Romance Genre from 1910 to 1949 and the Place of Women Screenwriters: A Quantitative Analysis

Suzanne Mpouli

Université Paris Cité, France

Using freely available data, this presentation tries to characterise the romance genre in the first half of the 20th century and to map the part women screenwriters played in its evolution.

 
4:00pm - 5:30pmSP-31
Location: B207 (TB)
Session Chair: Walter Scholger, University of Graz
 

Exploring Pan-ecologicalness: A Distant Reading of Ecological Discourse in 20th Century US Novel

Jiying Kang2, Wei Zhao1, Yufeng Han2

1Institute of Literature, Chinese Academy of Social Sciences, China; 2Department of Chinese Language and Literature, Tsinghua University, China

This study analyzed reception of ecological discourse in 20th century US novel through computational criticism. We discovered a lexical family resemblance defined as “Pan-ecologicalness”, and implemented ecological discourse as a held-out example of genre changes throughout 20th century.



Ecological Codes: Constructing Nature in Literature

Mareike Katharina Schumacher1, Marie Flüh2, Felix Lempp3

1University of Regensburg, Germany; 2University of Hamburg, Germany; 3Universität Bern, Switzerland

This study presents an approach focused on natural habitats, plants and animals in German-language literature. To find out more about the aesthetic design, representation and distribution of ‘ecological codes’ we develop a classifier for animals, plants, and habitats in literary texts and apply it to 682 texts.



Greening your database of literary works: How to avoid reinventing vocabularies, in favor of sustainable, reusable models

Kelly Christensen, Jean-Baptiste Camps

École nationale des chartes | Université PSL, France

In a multilingual database of literary works, users will want to find a story's various versions. Therefore, we must conceptualize the threshold between narrative content (story) and its expression in language. While specially designed for evolving narrative traditions, our solution is grounded in the Functional Requirements for Bibliographic Records model.



A Version Assist for Digital Scholarly Editions

Martina Bürgermeister

Österreichische Nationalbibliothek, Austria

To facilitate the description of versions and the creation of a version history, this contribution proposes a version assist system for digital editions. This system returns automatically generated change descriptions of changed resources that are comprehensible because each change is described as a purposeful, rule-based and contextualised action.



Rethinking the Publishing System: A Proposal for the Evaluation and Editing of Digital Academic Objects

Jonathan Girón Palau

Universidad Nacional Autónoma de México

This proposal discusses the evaluation and editing of digital academic objects in digital humanities, emphasizing their epistemological value. It proposes a model based on Bhaskar’s publishing theory, focusing on academic rigor and technical precision. The goal is to enhance DH’s recognition and create a more accessible academic publications.

 
4:00pm - 5:30pmSP-35
Location: B210 (TB)
Session Chair: Jonah Lubin, Harvard University
 

Towards a Verb Class-based Semantic Analysis of German Literary Texts

Hans Ole Hatzel2, Haimo Stiemer1, Chris Biemann2, Evelyn Gius1

1Technical University of Darmstadt, Germany; 2Universität Hamburg

The contribution proposes a verb-class-based approach for the coarse-grained semantic classification of literary texts. Our annotations classify verbal phrases based on the semantic class of their main verb. Despite potential quality issues at the micro level, we demonstrate that this approach can yield valuable insights at the story level.



Word Frequency in Poetry: Computational Insights into Groot Verseboek and the Formation of the Afrikaans Literary Canon

Mathilda Smit, Trudie Strauss

University of the Free State, South Africa

This study uses statistical word frequency analysis to explore the Groot Verseboek anthology of Afrikaans poetry, examining dominant themes, stylistic trends, and shifts in socio-historical context. By combining Digital Humanities and literary analysis, it reveals how canon formation reflects cultural and ideological values, offering new perspectives on Afrikaans literature.



Computational Intellectual History? Tracing the Influence of the Ancient Wisdom Tradition on Kepler, Galileo, and Descartes using the Text Matching and Semantic Matching Tools of the VERITRACE project

Jeffrey Wolf

Vrije Universiteit Brussel

This presentation showcases a case study from VERITRACE, an ERC project using digital tools to identify the influence of ancient wisdom traditions on early modern science, highlighting connections between ancient texts and the works of Kepler, Galileo, and Descartes through text matching and semantic analysis, revealing multilingual traces of influence.



The Contribution of the Project "From Parchment to Computer: Editing Manuscripts in the Digital Age" to Training in Digital Humanities

Elena Lombardo1, Maria Inês Monteiro Bico1, Catarina Coelho2

1Centro de Linguística da Universidade de Lisboa, Portugal; 2Centro de História da Universidade de Lisboa, Portugal

The project From Parchment to Computer offers courses on Textual Criticism and digital scholarly editing, blending theory and practice. It aims to train participants in creating digital editions, promote Digital Philology, enhance critical understanding of digital technologies, and contribute to democratizing access to these tools while advancing DH in Portugal.

 
4:00pm - 5:30pmSP-36
Location: B302 (TB)
Session Chair: Henny Sluyter-Gäthje, University of Potsdam
 

Digital Humanities Meets Language Technology: Empirical Insights from a Broadly Stratified Media Resource

Roman Friedrich Schneider

Leibniz Institute for the German Language, Germany

This contribution discusses an innovatively stratified collection of German language data, ranging from informal spoken interactions to formal written texts. It highlights methods for analyzing linguistic patterns using natural language processing, with a particular focus on discourse markers and a machine learning model for identifying them across diverse communicative contexts.



4:00pm - 4:10pm

Infrastructure as a Trope of Reality

Maciej Maryl

Institute of Literary Research of the Polish Academy of Sciences, Poland

Research Infrastructures (RIs) in the humanities actively shape our understanding of the world. Using examples of bibliographies and corpora, this paper examines how methodological choices in building RIs in digital literary studies influence representation, advocating for open infrastructures to ensure inclusivity and a more nuanced understanding of the literary landscape.



Accessible Models for High-Performance Computing in the Humanities

Brad Rittenhouse

Stanford University, United States of America

With the rise of LLM and the increasing computational expense of AI, humanists will increasingly turn to high-performance computing (HPC). This can be an alienating pivot for many researchers. As a Research Data Facilitator with a decade of HPC experience, I will present models for effectively integrating humanists into HPC.



Knowledge as a collective enterprise: Technology for orchestration of complex cultural models in DH

Pietro Sichera, Cristina Marras, Enrico Pasini

Consiglio Nazionale Delle Ricerche, CNR - Istituto per il Lessico Intellettuale Europeo e Storia delle Idee, ILIESI - Italy

The paper examines key features of research infrastructures in humanities and cultural heritage that support open science, focusing on federated RIs as marketplaces connecting diverse networks. It discusses the technological foundation and API orchestration for DH workflows within the H2IOSC MarketPlace, highlighting contributions of the OPERAS node to this project.



4:10pm - 4:20pm

Towards Modularised Open Infrastructures: Enhancing Research Publications in Digital Humanities – “Detecting Small Worlds” as an Example.

Henny Sluyter-Gäthje1, Ingo Börner1, Peer Trilcke1, Evgeniya Ustinova2, Frank Fischer3, Carsten Milling1

1University of Potsdam, Germany; 2Saarland University, Germany; 3Freie Universität Berlin, Germany

As the triad of publishing a paper, data and code poses challenges for the comprehensibility, reproducibility, and accessibility of the research, we present our approach towards a "modularised open infrastructure for research publications” in which a publication is accompanied by modules facilitating e.g.reproduction or result investigation.

 
4:00pm - 5:30pmSP-34
Location: B304 (TB)
Session Chair: Jacek Bąkowski, Institute of Polish Language, Polish Academy of Sciences
 

A Study of Imagery in Franz Kafka’s Novel The Trial Through Illustrated Editions

Carsten Strathausen, Wenyi Shang

University of Missouri, United States of America

We investigated the semantic and rhetoric imagery of Kafka’s novel The Trial through three illustrated editions of the text. Using image analysis techniques and examining the relationship between images and corresponding texts, we found these illustrations more closely associated with sentences than chapters and uncovered their artistic and hermeneutic nuances.



What is Democracy? Scalable Reading Newspapers of the Weimar Republic

Christian Wachter

Bielefeld University, Germany

This ongoing project provides a novel workflow for studying Weimar Germany’s political culture. By integrating text-hermeneutic investigation with quantitative digital analysis techniques, it enables new insights into historical newspaper discourses on democracy. The project, therefore, enhances historical newspaper research and contributes to the understanding of interwar Germany.



Narrative volatility in Dutch novels

Peter Boot1, Angel Daza2

1Huygens Institute for the History and Culture of the Netherlands, The Netherlands; 2Netherlands eScience Center, The Netherlands

We hypothesize narrative volatility (shifts in sentiment between chunks of text) has an effect on appreciation and thus on ratings of fiction. We describe how we compute volatility and show its distribution over genre. We explain how we will use the result to test the hypothesis.



Attitudes towards information technology in Indian English and German novels since 2000

Shanmugapriya T1, Fotis Jannidis2

1Indian Institute of Technology (ISM) Dhanbad; 2Julius-Maximilians-Universität Würzburg, Germany

We analyze how often Indian English and German novels (2000–2024) refer to information technologies (IT), reflecting demographic, cultural, and societal differences. We use a word-list approach and and a large language model. The llm-based approach works well, but the result doesn't confirm our hypothesis that there is a significant difference.



100 DOLLAR REWARD: Exploration of a Historical Crime Journal

Liam Isaac Downs-Tepper

University of Vienna, Austria

This paper showcases layout analysis and OCR to make an under-researched, 120 year old crime journal accessible. It then uses a variety of text analysis tools for distant reading, exploring how crime was addressed at the time.

 
4:00pm - 5:30pmSP-37
Location: B309 (TB)
Session Chair: Lucia Michielin, University of Edinburgh
 

What is Stated but not Evaluated: a Review of Common Objectives and their Evaluation for CH Data Interfaces

Xinyi Ding, Giacomo Alliata, Yuchen Yang, Sarah Kenderdine

EPFL, Switzerland

Our submission reviews 20 digital interfaces for CH data from 2015 to 2024. It finds 6 common objectives stated by the authors of the reviewed use cases but highlights that not all stated objectives are equally well evaluated.



Examining Digital Humanities Projects through the Lens of Technical and Professional Communication

Kerry Ulm

The Ohio State University, United States of America

This short presentation examines the overlaps between technical and professional communication (TPC) and digital humanities (DH) by using TPC content analysis methods to examine the interfaces of 100 DH project websites. It describes common DH web design features and offers insight regarding the development of accessible and sustainable DH projects.



CLARIAH-EUS-gArA: Constructing a Trustworthy Conversational Assistant for Basque News and Research in the Digital Humanities

Xabier Arregi, Telmo Briones, Ainara Estarrona, Aritz Farwell, Joseba Fernandez de Landa, Iker García, Naiara Perez, German Rigau, Oscar Sainz

University of the Basque Country (UPV/EHU)

The CLARIAH-EUS-gArA project aims to enhance DH research by developing a trustworthy conversational assistant for Basque news using RAG and Latxa, a Basque LLM. It integrates AI and LT to address misinformation, verification, and reliability, thereby providing accurate, up-to-date responses in Basque to aid researchers in fact-checking and accessing sources.



Experiments and Preliminary Thoughts on the Use ofGraph RAG in the Humanities

Jun Ogawa1, Naoya Iwata2, Ikko Tanaka3, Ikki Ohmukai1

1The University of Tokyo; 2Nagoya University; 3J. F. Oberlin University

This study evaluates Graph RAG’s applicability to the humanities, focusing on Caesar’s Gallic Wars, volume 1. A knowledge graph was constructed using LLMs, enabling the retrieval of semantically structured data. The results highlight the potential of enhanced knowledge graphs for broader applications, emphasizing evaluation methods and expert-driven graph development.



Mind the Gap! Supporting code-free Computational research through Small Scale Apps

Lucia Michielin

University of Edinburgh, United Kingdom

Non-coding tools have expanded accessibility in digital humanities, empowering researchers without programming skills to perform data-driven analyses. However, there are currently few tools to assist non-coders with converting and cleaning data. This paper presents a Shiny application for data preprocessing positing that similar small-scale solutions could help bridge this gap.

 
7:00pm - 10:00pmBanquet "Cervejaria Trindade"

 
Contact and Legal Notice · Contact Address:
Privacy Statement · Conference: DH2025 Lisbon
Conference Software: ConfTool Pro 2.8.106+TC
© 2001–2025 by Dr. H. Weinreich, Hamburg, Germany