Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available). To only see the sessions for 3 May's Online Day, select "Online" for location.
Please note that all times are shown in the time zone of the conference. The current conference time is: 11th May 2024, 12:34:12pm CEST
Thomas Egense, Toke Eskildsen, Jørn Thøgersen, Anders Klindt Myrvoll
Royal Danish Library, Denmark
An in-person, updated, version of the ‘21 WAC workshop Run your own full stack SolrWayback: https://netpreserve.org/event/wac2021-solrwayback-1/
This workshop will
Explain the ecosystem for SolrWayback 4 (https://github.com/netarchivesuite/solrwayback)
Perform a walkthrough of installing and running the SolrWayback bundle. Participants are expected to mirror the process on their own computer and there will be time for solving installation problems
Leave participants with a fully working stack for index, discovery and playback of WARC files
End with open discussion of SolrWayback configuration and features.
Prerequisites:
Participants should have a Linux, Mac or Windows computer with Java 8 or Java 11 installed. To see java is installed type this in a terminal: java -version
Downloading the latest release of SolrWayback Bundle from:https://github.com/netarchivesuite/solrwayback/releases beforehand is recommended.
Having institutional WARC files available is a plus, but sample files can be downloaded from https://archive.org/download/testWARCfiles
A mix of WARC-files from different harvests/years will showcase SolrWaybacks capabilities the best way possible.
Target audience:
Web archivists and researchers with medium knowledge of web archiving and tools for exploring web archives. Basic technical knowledge of starting a program from the command line is required; the SolrWayback bundle is designed for easy deployment. Maximum number of participants 30
Background
SolrWayback 4 (https://github.com/netarchivesuite/solrwayback) is a major rewrite with a strong focus on improving usability. It provides real time full text search, discovery, statistics extraction & visualisation, data export and playback of webarchive material. SolrWayback uses Solr (https://solr.apache.org/) as the underlying search engine. The index is populated using Web Archive Discovery (https://github.com/ukwa/webarchive-discovery). The full stack is open source and freely available. A live demo is available at https://webadmin.oszk.hu/solrwayback/
During the conference there will be focused support for SolrWayback in a dedicated Slack channel by Thomas Egense and Toke Eskildsen.