Conference Agenda

Session Overview
Date: Wednesday, 09/Apr/2025
9:00am - 9:40am
REGISTRATION: Web Archiving Conference (WAC)
9:40am - 9:50am
Opening Remarks
Location: Målstova (upstairs)
Streamed to Store Auditorium.
9:50am - 10:45am
Libraries, Copyright, and Language Models
Location: Målstova (upstairs)
Streamed to Store Auditorium.
10:45am - 10:55am
SHORT BREAK
Streaming video from Målstova to Store Auditorium ends. Lightning Talk Session 2 will begin in the Store Auditorium after the break.
10:55am - 11:00am
LIGHTNING TALK SESSION 1: INTRODUCTION
Location: Målstova (upstairs)
LIGHTNING TALK SESSION 2: INTRODUCTION
Location: Store Auditorium (ground floor)
 
11:00am - 11:25am
LIGHTNING TALK SESSION 1
Location: Målstova (upstairs)
 
11:00am - 11:05am

Strategies and Challenges in the Preservation of Mexico’s Web Heritage: First Steps

Carolina Silva Bretón

National Library of Mexico, Mexico



11:05am - 11:10am

Challenges and Strategies in Implementing Web Archiving Legislation in Brazil

Jonas Ferrigolo Melo1, Moisés Rockembach2

1: University of Porto, Portugal; 2: University of Coimbra, Portugal



11:10am - 11:15am

Arquivo.pt Toolkit for Web Archiving

Daniel Gomes

Arquivo.pt, Portugal



11:15am - 11:20am

Tracking the Political Representations of Life: Methodological Challenges of Exploring the BnF Web Archives

Guillaume Levrier1,2, Dorothée Benhamou-Suesser2

1: Centre de recherches politiques de Sciences Po (CEVIPOF, CNRS), France; 2: Bibliothèque nationale de France, France



11:20am - 11:25am

Collaborative Curatorial Approaches of the Czech Web Archive Using the Example of Thematic Literary Collections

Marie Haškovcová

National Library of the Czech Republic, Czech Republic

LIGHTNING TALK SESSION 2
Location: Store Auditorium (ground floor)
Chair: Sawood Alam, Internet Archive
 
11:00am - 11:05am

Modelling Archived Web Objects as Semantic Entities to Manage Contextual and Versioning Issues

Tom Storrar1, Manuela Pallotto Strickland2

1: The National Archives (UK), United Kingdom; 2: King's College London, United Kingdom



11:05am - 11:10am

Modernizing Web Archives: The Bumpy Road Towards a General ARC2WARC Conversion Tool

Pedro Ortiz Suarez, Sebastian Nagel, Thom Vaughan

Common Crawl Foundation, United States of America



11:10am - 11:15am

Poking Around in Podcast Preservation

Jasper Snoeren

Netherlands Institute for Sound and Vision, Netherlands



11:15am - 11:20am

Automatic Clustering of Domains by Industry for Effective Curation

Thomas Smedebøl

Royal Danish Library, Denmark



11:20am - 11:25am

Best Practice of Preserving Posts from Social Media Feeds

Magdalena Sjödahl

Arkiwera wcrify AB, Sweden

 
11:25am - 11:55am
BREAK
11:55am - 1:00pm
PANEL #01: Engaging Audiences
Location: Målstova (upstairs)
Chair: Eveline Vlassenroot, University of Ghent
 

Beyond Preservation: Engaging Audiences and Researchers with Web Archives

Eveline Vlassenroot1, Peter Mechant1, Friedel Geeraert2, Christina Vandendyck2

1: University of Ghent, Belgium; 2: KBR - Royal Library of Belgium, Belgium

SESSION #01: Tools: Under Construction: Lessons Learned (National Library Perspective)
Location: Store Auditorium (ground floor)
Chair: Katherine Boss, National Library of Norway
 
11:55am - 12:15pm

Embedding the Web Archive in an Overall Preservation System

Hansueli Locher

Swiss National Library, Switzerland



12:15pm - 12:35pm

UKWA Rebuild

Gil Hoggarth

British Library, United Kingdom



12:35pm - 12:55pm

Under Construction: Web Archive of the German National Library

Natanael Arndt

German National Library, Germany

WORKSHOP #01: Exploring Dilemmas in the Archiving of Legacy Webportals: An Exercise in Reflective Questioning
Location: Slottsbiblioteket (ground floor)
 

Exploring Dilemmas in the Archiving of Legacy Webportals: An Exercise in Reflective Questioning

Daniel Steinmeier, Sophie Ham

National Library of the Netherlands, Netherlands

1:00pm - 2:00pm
LUNCH
2:05pm - 3:40pm
SESSION #02: Crawling Tools
Location: Målstova (upstairs)
Chair: László Tóth, National Library of Luxembourg
 
2:05pm - 2:25pm

Lessons Learned Building a Crawler From Scratch: The Development and Implementation of Veidemann

Marius André Elsfjordstrand Beck

National Library of Norway, Norway



2:25pm - 2:45pm

Experiences of Using in-House Developed Collecting Tool ELK

Lauri Ojanen

National Library of Finland, Finland



2:45pm - 3:05pm

Better Together: Building a Scalable Multi-Crawler Web Harvesting Toolkit

Alex Dempsey, Adam Miller, Kyrie Whitsett

Internet Archive, United States of America



3:05pm - 3:25pm

Lowering Barriers to Use, Crawling, and Curation: Recent Browsertrix Developments

Tessa Walsh, Ilya Kreymer

Webrecorder, United States of America

SESSION #03: Advocacy & User Engagement
Location: Store Auditorium (ground floor)
Chair: Helena Byrne, British Library
 
2:05pm - 2:25pm

Insufficiency of Human-Centric Ethical Guidelines in the Age of AI: Considering Implications of Making Legacy Web Content Openly Accessible

Gaja Zornada, Boštjan Špetič

Computer History Museum Slovenia (Računalniški muzej), Slovenia



2:25pm - 2:45pm

Web Archives for Music Research

Andreas Lenander Ægidius

Royal Danish Library, Denmark



2:45pm - 3:05pm

IXP History Collection: Recording the Early Development of the Core of the Public Internet

Sharon Healy1, Gerard Best1, Lara Díaz Martínez2

1: Independent Researcher, Ireland; 2: University of Barcelona, Spain



3:05pm - 3:25pm

Lost, but Preserved - A Web Archiving Perspective on the Ephemeral Web

Sawood Alam, Rachel Auslander, Mark Graham

Internet Archive, United States of America

WORKSHOP #02: Web Archive Collections as Data
Location: Slottsbiblioteket (ground floor)
 

Web Archive Collections as Data

Gustavo Candela1, Chase Dooley2, Abbie Grotke2, Olga Holownia3, Jon Carlstedt Tønnessen4

1: University of Alicante, Spain; 2: Library of Congress, United States of America; 3: IIPC, United States of America; 4: National Library of Norway, Norway



Datasheets for Web Archives Toolkit

Emily Maemura1, Helena Byrne2

1: University of Illinois Urbana-Champaign, United States of America; 2: British Library, United Kingdom

3:40pm - 4:10pm
BREAK
4:10pm - 4:40pm
POSTER SLAM
Location: Målstova (upstairs)
Chair: Olga Holownia, IIPC
Streamed to Store Auditorium.
 

‘We Are Now Entering the Pre-election Period’: Experimental Twitter Capture at The National Archives

Jake Bickford

The National Archives (UK), United Kingdom



The BnF DataLab Services and Tools for Researchers Working on Web Archives

Sara Aubry, Dorothée Benhamou-Suesser

Bibliothèque nationale de France, France



Experiences Switching an Archiving Web Crawler to Support HTTP/2

Sebastian Nagel

Common Crawl Foundation, United States of America



Web Scraping in the Hungarian Web Archive

Gyula Kalcsó

National Széchényi Library, Hungary



Arquivo.pt API/Bulk Access and Its Usage

Vasco Rato, Daniel Gomes

Arquivo.pt, Portugal



Politely Downloading Millions of WARC Files Without Burning the Servers Down

Pedro Ortiz Suarez, Thom Vaughan, Greg Lindahl

Common Crawl Foundation, United States of America



Next Steps Towards a Formal Registry of Web Archives for Persistent and Sustainable Identification

Eld Zierau

Royal Danish Library, Denmark



Using Web Archives to Construct the History of an Academic Field

Tegan Pyke

University of Bergen, Norway



Arquivo.pt Annual Awards: A Glimpse

Daniel Gomes

Arquivo.pt, Portugal



Asynchronous and Modular Pipelines for Fast WARC Annotation

Pedro Ortiz Suarez, Thom Vaughan

Common Crawl Foundation, United States of America



Consortium on Electronic Literature (CELL)

Hannah Ackermans

University of Bergen, Norway



Designing Art Student Web Archives

Katherine Martinez

The New School, United States of America



Failed Capture or Playback Woes? A Case Study in Highly Interactive Web Based Experiences

Mari Allison

Smithsonian Libraries and Archives, United States of America



From New Media Archives on Social Media Platforms to Web Archives - Challenges in Preserving Scraped Cultural Materials

Camilla Holm Soelseth

OsloMet, Norway



HAWathon: Participants Experience

Ingeborg Rudomino, Anamarija Ljubek

National and University Library in Zagreb, Croatia



Implementing the e-ARK Standard for Ingest of Social Media Archives: Goals, Opportunities and Challenges

Nastasia Vanderperren, Ellen Van Keer

meemoo, Flemish Institute for Archives, Belgium



Planning Web Archiving Within a Four-Year Scope: Making the New Collection Plan for the Years 2025-2028 in the National Library of Finland

Sanna Haukkala

National Library of Finland, Finland



Redirects Unraveled: From Lost Links to Rickrolls

Kritika Garg1, Sawood Alam2, Michele Weigle1, Michael Nelson1, Mark Graham2, Dietrich Ayala3

1: Old Dominion University, United States of America; 2: Internet Archive, United States of America; 3: Filecoin Foundation, Netherlands



Robots.txt and Crawler Politeness in the Age of Generative AI

Sebastian Nagel, Thom Vaughan

Common Crawl Foundation, United States of America



Solving the Problem of Reference Rot Via Web Archiving: An OA Publisher’s Solution & Future Solutions in Thoth

Miranda Barnes, Gareth Cole

Loughborough University, United Kingdom



Use of Screenshots as a Harvesting Tool for Dynamic Content and Use of AI for Later Data Analysis

Gaja Zornada, Boštjan Špetič

Computer History Museum Slovenia (Računalniški muzej), Slovenia

4:40pm - 6:00pm
POSTER SESSION
Location: Folkestova (upstairs)
7:30pm - 9:30pm
DINNER
Location: Kantine (downstairs)