Conference Agenda

Session Overview
Location: Theatre 2
 
Date: Thursday, 11/May/2023
11:00am - 12:30pm
SES-02: FINDING MEANING IN WEB ARCHIVES
Location: Theatre 2
Chair: Vladimir Tybin, Bibliothèque nationale de France
These presentations will be followed by a 10 min Q&A.
 
11:00am - 11:20am

Leveraging Existing Bibliographic Metadata to Improve Automatic Document Identification in Web Archives.

Mark Phillips1, Cornelia Caragea2, Praneeth Rikka1

1: University of North Texas, United States of America; 2: University of Illinois Chicago, United States of America



11:20am - 11:40am

Conceptual Modeling of the Web Archiving Domain

Illyria Brejchová

Masaryk University, Czech Republic



11:40am - 12:00pm

Web Archives & Machine Learning: Practices, Procedures, Ethics

Jefferson Bailey

Internet Archive, United States of America



12:00pm - 12:20pm

From Small to Scale: Lessons Learned on the Requirements of Coordinated Selective Web Archiving and Its Applications

Balázs Indig1,2, Zsófia Sárközi-Lindner1,2, Mihály Nagy1,2

1: Eötvös Loránd University, Department of Digital Humanities, Budapest, Hungary; 2: National Laboratory for Digital Humanities, Budapest, Hungary

1:30pm - 2:30pm
SES-04 (PANEL): SOLRWAYBACK: BEST PRACTICE, COMMUNITY USAGE & ENGAGEMENT
Location: Theatre 2
Chair: Thomas Langvann, National Library of Norway
 

SolrWayback: Best practice, community usage and engagement

Thomas Egense1, László Tóth2, Youssef Eldakar3, Sara Aubry4, Anders Klindt Myrvoll1

1: Royal Danish Library (KB); 2: National Library of Luxembourg (BnL); 3: Bibliotheca Alexandrina (BA); 4: National Library of France (BnF)

2:40pm - 3:50pm
SES-06: SOCIAL MEDIA & PLAYBACK: COLLABORATIVE APPROACHES
Location: Theatre 2
Chair: Susanne van den Eijkel, KB, National Library of the Netherlands
These presentations will be followed by a 10 min Q&A.
 
2:40pm - 3:00pm

Archiving social media in Flemish cultural or private archives, (how) is it possible

Katrien Weyns1, Ellen Van Keer2

1: KADOC-KU Leuven, Belgium; 2: meemoo, Belgium



3:00pm - 3:20pm

Searching for a Little Help From My Friends: Reporting on the Efforts to Create an (Inter)national Distributed Collaborative Social Media Archiving Structure

Zefi Kavvadia1, Katrien Weyns2, Mirjam Schaap3, Sophie Ham4

1: International Institute of Social History; 2: KADOC Documentation and Research Centre on Religion, Culture, and Society; 3: Amsterdam City Archives; 4: KB, National Library of the Netherlands



3:20pm - 3:40pm

Collaborating On The Cutting Edge: Client Side Playback

Clare Stanton, Matteo Cargnelutti

Library Innovation Lab, United States of America

4:20pm - 5:30pm
SES-08: QUALITY ASSURANCE
Location: Theatre 2
Chair: Arnoud Goos, Netherlands Institute for Sound & Vision
These presentations will be followed by a 10 min Q&A.
 
4:20pm - 4:40pm

The Auto QA process at UK Government Web Archive

Kourosh Feissali, Jake Bickford

The National Archives, United Kingdom



4:40pm - 5:00pm

The Human in the Machine: Sustaining a Quality Assurance Lifecycle at the Library of Congress

Grace Bicho, Meghan Lyon, Amanda Lehman

Library of Congress, United States of America

5:30pm - 6:10pm
POS-2: LIGHTNING & DROP-IN TALKS
Location: Theatre 2
Chair: Martin Klein, Los Alamos National Laboratory
One-minute drop-in talks will immediately follow the lightning talks. After the session ends, lightning talk presenters will be available for questions in the atrium, where their posters will be on display.

Drop-in talk schedule:

Persistent Web IDentifier (PWID) also as URN
Eld Zierau, Royal Danish Library

Crowdsourcing German Twitter
Britta Woldering, German National Library

At the end of the rainbow. Examining the Dutch LGBT+ web archive using NER and hyperlink analyses
Jesper Verhoef, Erasmus University Rotterdam
 

Sunsetting a digital institution: Web archiving and the International Museum of Women

Marie Chant

The Feminist Institute, United States of America



Visualizing web harvests with the WAVA tool

Ben O'Brien1, Frank Lee1, Hanna Koppelaar2, Sophie Ham2

1: National Library of New Zealand, New Zealand; 2: National Library of the Netherlands, Netherlands



WARC validation, why not?

Antal Posthumus, Jacob Takema

Nationaal Archief, The Netherlands


 
Date: Friday, 12/May/2023
8:30am - 10:00am
WKSHP-04: BROWSER-BASED CRAWLING FOR ALL: GETTING STARTED WITH BROWSERTRIX CLOUD
Location: Theatre 2
Pre-registration required for this event.
 

Browser-Based Crawling For All: Getting Started with Browsertrix Cloud

Andrew N. Jackson1, Anders Klindt Myrvoll2, Ilya Kreymer3

1: The British Library, United Kingdom; 2: Royal Danish Library; 3: Webrecorder

10:30am - 12:00pm
SES-13: CRAWLING, PLAYBACK, SUSTAINABILITY
Location: Theatre 2
Chair: Laura Wrubel, Stanford University
These presentations will be followed by a 10 min Q&A.
 
10:30am - 10:50am

Developer Update for Browsertrix Crawler and Browsertrix Cloud

Ilya Kreymer, Tessa Walsh

Webrecorder, United States of America



10:50am - 11:10am

Opportunities and Challenges of Client-Side Playback

Clare Stanton, Matteo Cargnelutti

Library Innovation Lab, United States of America



11:10am - 11:30am

Sustaining pywb through community engagement and renewal: recent roadmapping and development as a case study in open source web archiving tool sustainability

Tessa Walsh, Ilya Kreymer

Webrecorder



11:30am - 11:50am

Addressing the Adverse Impacts of JavaScript on Web Archives

Ayush Goel1, Jingyuan Zhu1, Ravi Netravali2, Harsha V. Madhyastha1

1: University of Michigan, United States of America; 2: Princeton University, United States of America

1:00pm - 2:10pm
SES-15: DATA CONSIDERATIONS
Location: Theatre 2
Chair: Sophie Ham, Koninklijke Bibliotheek
These presentations will be followed by a 10 min Q&A.
 
1:00pm - 1:20pm

What if GitHub disappeared tomorrow?

Emily Escamilla, Michele Weigle, Michael Nelson

Old Dominion University, United States of America



1:20pm - 1:40pm

Web archives and FAIR data: exploring the challenges for Research Data Management (RDM)

Sharon Healy1, Ulrich Karstoft Have2, Sally Chambers3, Ditte Laursen4, Eld Zierau4, Susan Aasman5, Olga Holownia6, Beatrice Cannelli7

1: Maynooth University; 2: NetLab; 3: KBR & Ghent Centre for Digital Humanities; 4: Royal Danish Library; 5: University of Groningen; 6: IIPC; 7: School of Advanced Study, University of London



1:40pm - 2:00pm

Lessons Learned in Hosting the End of Term Web Archive in the Cloud

Mark Phillips1, Sawood Alam2

1: University of North Texas, United States of America; 2: Internet Archive, United States of America

2:20pm - 3:50pm
SES-17: PROGRAM INFRASTRUCTURE
Location: Theatre 2
Chair: René Voorburg, KB, National Library of the Netherlands
These presentations will be followed by a 10 min Q&A.
 
2:20pm - 2:40pm

Maintenance Practices for Web Archives

Ed Summers, Laura Wrubel

Stanford University, United States of America



2:40pm - 3:00pm

Radical incrementalism and the resilience and renewal of the National Library of Australia's web archiving infrastructure

Alex Osborne, Paul Koerbin

National Library of Australia, Australia



3:00pm - 3:20pm

Arquivo.pt behind the curtains

Daniel Gomes

FCT: Arquivo.pt, Portugal



3:20pm - 3:40pm

Implementing access to and management of archived websites at the National Archives of the Netherlands

Antal Posthumus

Nationaal Archief, The Netherlands