{"id":2786,"date":"2021-05-11T15:48:56","date_gmt":"2021-05-11T15:48:56","guid":{"rendered":"http:\/\/recordsandarchives.westminster.ac.uk\/?page_id=2786"},"modified":"2022-01-06T11:54:28","modified_gmt":"2022-01-06T11:54:28","slug":"web-archiving-at-westminster","status":"publish","type":"page","link":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/archive-blog\/web-archiving-at-westminster\/","title":{"rendered":"Web archiving at Westminster"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"2786\" class=\"elementor elementor-2786\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-c9ca073 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"c9ca073\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-dc20dcb\" data-id=\"dc20dcb\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6e0b4cf elementor-widget elementor-widget-text-editor\" data-id=\"6e0b4cf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Since its development in the 1990s the web has become a central part of our lives. As historian Ian Milligan argues in his book, <a href=\"https:\/\/library-collections-search.westminster.ac.uk\/permalink\/44WST_INST\/15emkrp\/alma996927420203711\" target=\"_blank\" rel=\"noopener\"><span style=\"text-decoration: underline;\">History in the Age of Abundance?<\/span><\/a>, it is impossible to imagine writing a history of the present day without reference to the web. At the University, our websites form a vital record of our teaching and research activities and how we have shared them with the community. This blog will look at how we have been working to preserve these records so that we can continue to tell the story of the University into the future.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-24f87b9 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"24f87b9\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-dc2e81e\" data-id=\"dc2e81e\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-125b75d elementor-widget elementor-widget-heading\" data-id=\"125b75d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">Why we need to archive the web<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-4ef6777 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"4ef6777\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-3fd8e2a\" data-id=\"3fd8e2a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-b5386b8 elementor-widget elementor-widget-text-editor\" data-id=\"b5386b8\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>Websites may feel permanent but studies have suggested average website lifespans of <span style=\"text-decoration: underline;\"><a href=\"https:\/\/blogs.loc.gov\/thesignal\/2011\/11\/the-average-lifespan-of-a-webpage\/\" target=\"_blank\" rel=\"noopener\">100 days or less<\/a><\/span>. Even where sites stay up, content on the web is frequently moved or overwritten, and <span style=\"text-decoration: underline;\"><a href=\"https:\/\/harvardlawreview.org\/2014\/03\/perma-scoping-and-addressing-the-problem-of-link-and-reference-rot-in-legal-citations\/#_ftn3\" target=\"_blank\" rel=\"noopener\">research by Harvard Law Review<\/a><\/span> found that more than 70% of cited URLs within a selection of legal journals no longer linked to the originally cited information. At the University while key sites like our institutional website are likely to remain up for some time, their content will often be overwritten. Meanwhile smaller sites that reflect a particular research project or activity will often have a much shorter lifespan. Ultimately, the ongoing cost of hosting and maintaining a site means that when a particular project comes to an end or relevant staff move on, an associated site is likely to be shut down. In response to these issues, web archiving aims to capture and preserve, not just the information stored on a site, but also the way in which it was presented and accessed, allowing future researchers to see how it was experienced by users at the time.\u00a0<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-7c9621f elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"7c9621f\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-510b5ae\" data-id=\"510b5ae\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-3137c43 elementor-widget elementor-widget-image\" data-id=\"3137c43\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t<figure class=\"wp-caption\">\n\t\t\t\t\t\t\t\t\t\t<img fetchpriority=\"high\" decoding=\"async\" width=\"496\" height=\"493\" src=\"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_1997-2.png\" class=\"attachment-medium_large size-medium_large wp-image-2788\" alt=\"A screenshot showing the university website in 1997.\" srcset=\"http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_1997-2.png 496w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_1997-2-300x298.png 300w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_1997-2-150x150.png 150w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_1997-2-65x65.png 65w\" sizes=\"(max-width: 496px) 100vw, 496px\" \/>\t\t\t\t\t\t\t\t\t\t\t<figcaption class=\"widget-image-caption wp-caption-text\">The University website as it appeared in 1997, preserved by the Internet Archive<\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-29bb9dd elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"29bb9dd\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-a170e4c\" data-id=\"a170e4c\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-fea0e4e elementor-widget elementor-widget-heading\" data-id=\"fea0e4e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">How web archiving works: the University website<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-59bd031 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"59bd031\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-8079464\" data-id=\"8079464\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-9f07068 elementor-widget elementor-widget-text-editor\" data-id=\"9f07068\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>There are two main approaches to web archiving. Large-scale automatic capture of websites, of the sort done by the <a href=\"https:\/\/archive.org\/web\/\" target=\"_blank\" rel=\"noopener\"><span style=\"text-decoration: underline;\">Internet Archive<\/span><\/a> or the <span style=\"text-decoration: underline;\"><a href=\"https:\/\/www.webarchive.org.uk\/\" target=\"_blank\" rel=\"noopener\">UK Web Archive<\/a><\/span>, uses software agents called crawlers. A crawler visits an initial page \u2013 called a seed \u2013 makes an archival copy of the page and searches for any links. The crawler then follows each of the links, captures the pages it finds, searches for links and repeats the process. As such crawls are potentially infinite, they are usually limited in scope by \u2018domain\u2019 (for example instructing the crawler to only archive pages from westminster.ac.uk and not follow external links), \u2018depth\u2019 (the number of times the crawler continues following links from the original seed), or simply by crawl time or file size.\u00a0<\/p><p>This is the approach we take for our regular captures of the University\u2019s main website and other key sites. Since 2018, our partners <span style=\"text-decoration-line: underline;\"><a href=\"https:\/\/www.mirrorweb.com\/\" target=\"_blank\" rel=\"noopener\">Mirrorweb<\/a><\/span>\u00a0have crawled the site twice a year and transferred the resulting archive files\u00a0for us to look after in the university archive&#8217;s digital repository.\u00a0 Websites are preserved in a standard archival file format called\u00a0<a style=\"text-decoration-line: underline;\" href=\"https:\/\/archive-it.org\/blog\/post\/the-stack-warc-file\/\" target=\"_blank\" rel=\"noopener\">WARC<\/a>.\u00a0With the appropriate software WARC files can then be viewed and interacted with like a normal website.\u00a0<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-295b929 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"295b929\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-8b63d26\" data-id=\"8b63d26\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-02f4ef4 elementor-widget elementor-widget-image\" data-id=\"02f4ef4\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t<figure class=\"wp-caption\">\n\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"476\" height=\"357\" src=\"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-476x357.png\" class=\"attachment-single-project size-single-project wp-image-2801\" alt=\"The university website homepage in 2018\" srcset=\"http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-476x357.png 476w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-300x224.png 300w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-768x574.png 768w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-276x207.png 276w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2-320x240.png 320w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/homepage_2018-2.png 1168w\" sizes=\"(max-width: 476px) 100vw, 476px\" \/>\t\t\t\t\t\t\t\t\t\t\t<figcaption class=\"widget-image-caption wp-caption-text\">The University website in 2018, captured by Mirrorweb and preserved in the University Archive<\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-8d5f5dc elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"8d5f5dc\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-c2f0487\" data-id=\"c2f0487\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-ed66014 elementor-widget elementor-widget-heading\" data-id=\"ed66014\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">How web archiving works: rapid response collecting<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-9fd97b7 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"9fd97b7\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-8264f27\" data-id=\"8264f27\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6f15c0e elementor-widget elementor-widget-text-editor\" data-id=\"6f15c0e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>While crawl-based systems are useful for capturing larger sites, they are technically complex to manage and can be costly.\u00a0Automatic crawls can also be less effective at capturing complex or dynamic content, especially interactive material. For this reason, the digital arts organisation <a href=\"https:\/\/rhizome.org\/\" target=\"_blank\" rel=\"noopener\"><span style=\"text-decoration: underline;\">Rhizome<\/span><\/a>, developed <a href=\"https:\/\/conifer.rhizome.org\/\" target=\"_blank\" rel=\"noopener\"><span style=\"text-decoration: underline;\">Conifer<\/span><\/a>\/<span style=\"text-decoration: underline;\"><a href=\"https:\/\/webrecorder.net\/\" target=\"_blank\" rel=\"noopener\">Webrecorder<\/a><\/span>, as a solution that would both allow for higher fidelity capture and let individuals and smaller organisations create their own web archives.\u00a0 These systems work by allowing the user to click through the website as if they were using it, recording the interactions between the browser and the site in a WARC file.\u00a0<\/p><p>At the University we make use of Conifer\/Webrecorder either where we have experienced technical issues with our automated crawls or, more often, when we want to quickly capture content that is in danger of being overwritten or deleted. For example, this is the approach we have taken towards collecting web pages relating to the University&#8217;s coronavirus response, which, particularly during the early days of the pandemic, were frequently being overwritten.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-0dc3f36 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"0dc3f36\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-0af501c\" data-id=\"0af501c\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-b6c4156 elementor-widget elementor-widget-image\" data-id=\"b6c4156\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t<figure class=\"wp-caption\">\n\t\t\t\t\t\t\t\t\t\t<img decoding=\"async\" width=\"476\" height=\"357\" src=\"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/160320_COVID-2-476x357.png\" class=\"attachment-single-project size-single-project wp-image-2816\" alt=\"Web archive of the University&#039;s coronavirus response page 16 Match 2020\" srcset=\"http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/160320_COVID-2-476x357.png 476w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/160320_COVID-2-276x207.png 276w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/160320_COVID-2-320x240.png 320w\" sizes=\"(max-width: 476px) 100vw, 476px\" \/>\t\t\t\t\t\t\t\t\t\t\t<figcaption class=\"widget-image-caption wp-caption-text\">The University's Coronavirus Reponse page from the 16th of March 2020, captured with Webrecorder<\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-1abe93f elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"1abe93f\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-dde72a9\" data-id=\"dde72a9\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-75102f6 elementor-widget elementor-widget-text-editor\" data-id=\"75102f6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>We also typically\u00a0 use Conifer\/Webrecorder when we need to capture smaller sites that are due to be decommissioned. Like all potential accessions, sites are first appraised to make sure they fit with our <span style=\"text-decoration: underline;\"><a href=\"http:\/\/recordsandarchives.westminster.ac.uk\/home\/collection-policy\/\" target=\"_blank\" rel=\"noopener\">Collection and Acquisition policy<\/a><\/span>. For websites this means that they must be well-used sites that document a key university activity such as teaching or research, or are in themselves of historical interest. Web archives are time consuming to create and use a lot of digital storage so, as with all forms of archiving, it&#8217;s not possible to capture everything. A good example of the kind of sites we collect with Conifer are the pages of the Hypermedia Research Centre which conducted innovative research on internet politics and culture at Westminster in the 1990s and early 2000s. These pages are a particularly interesting example of early internet culture, but they also posed a technical challenge for conventional web archiving as they included some examples of student work using Flash. Flash was a software platform that enabled rich interactive web content but <span style=\"text-decoration: underline;\"><a href=\"https:\/\/www.bbc.co.uk\/news\/technology-55497353\" target=\"_blank\" rel=\"noopener\">which has now been deprecated<\/a><\/span>. Fortunately Conifer provides emulated browsers that support Flash, so we were able to use it to capture the student work along with the rest of the site.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-ba3d1ef elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"ba3d1ef\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-7d76372\" data-id=\"7d76372\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-9c10c2b elementor-widget elementor-widget-image\" data-id=\"9c10c2b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"image.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t\t\t\t<figure class=\"wp-caption\">\n\t\t\t\t\t\t\t\t\t\t<img loading=\"lazy\" decoding=\"async\" width=\"476\" height=\"357\" src=\"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/warchalk_2-2-476x357.png\" class=\"attachment-single-project size-single-project wp-image-2835\" alt=\"An example of flash-based student work from 2003, running in an emulated browser via Conifer\" srcset=\"http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/warchalk_2-2-476x357.png 476w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/warchalk_2-2-276x207.png 276w, http:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-content\/uploads\/sites\/42\/2021\/05\/warchalk_2-2-320x240.png 320w\" sizes=\"(max-width: 476px) 100vw, 476px\" \/>\t\t\t\t\t\t\t\t\t\t\t<figcaption class=\"widget-image-caption wp-caption-text\">An example of flash-based student work from 2003, captured from the pages of the Hypermedia Research Centre using Conifer<\/figcaption>\n\t\t\t\t\t\t\t\t\t\t<\/figure>\n\t\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-1f6f3d6 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"1f6f3d6\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-4b20f31\" data-id=\"4b20f31\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-39a27cd elementor-widget elementor-widget-heading\" data-id=\"39a27cd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h3 class=\"elementor-heading-title elementor-size-default\">Issues with web archives<\/h3>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-98499c8 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"98499c8\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-e59ed58\" data-id=\"e59ed58\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-7be6e80 elementor-widget elementor-widget-text-editor\" data-id=\"7be6e80\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p>As suggested above, one of the first things to recognise is that as with conventional archives, not everything will make it into a web archive. However, even if a site has been archived, researchers also need to keep in mind that no archival copy will be a 100% faithful recreation of the original experience. Websites use a wide variety of underlying technologies, many of which change frequently, making perfect capture and playback impossible. At the University, although we are careful to perform quality assurance on our captures, not all problems can be resolved and you might come across missing images or media and the occasional broken link. Archives will also always, to some extent, be meditated by the software that was used to create them and play them back on the user&#8217;s computer. These issues mean that web archives, like all archival sources, should be approached critically. For more advice on using web archives as a researcher, see our <span style=\"text-decoration-line: underline;\"><a href=\"https:\/\/libguides.westminster.ac.uk\/c.php?g=681142&amp;p=4859367\" target=\"_blank\" rel=\"noopener\">libguide on working with digital archives<\/a><\/span>.\u00a0<\/p><p>If you have your own website or manage one for work, there are steps you can take to make it more &#8216;archivable&#8217;. Fortunately, as web crawlers often work in a similar way to screen readers and other accessibility tools, many of the steps you take to enhance accessibility will also help make your pages easier to archive. The US Library of Congress has a <span style=\"text-decoration-line: underline;\"><a href=\"https:\/\/www.loc.gov\/programs\/web-archiving\/for-site-owners\/creating-preservable-websites\/\" target=\"_blank\" rel=\"noopener\">helpful guide<\/a><\/span> on this subject and you can also use the <span style=\"text-decoration-line: underline;\"><a href=\"http:\/\/archiveready.com\/\" target=\"_blank\" rel=\"noopener\">Archive Ready site checker<\/a><\/span>, to get a quick idea of how compatible your site would be with common web archiving systems.<\/p><p>Providing access to web archives can also be challenging and currently, access to web archives held by the University is provided on request. If you are a researcher interested in accessing our archives,\u00a0<a href=\"http:\/\/recordsandarchives.westminster.ac.uk\/contact\/\" target=\"_blank\" rel=\"noopener\">please\u00a0<span style=\"text-decoration-line: underline;\">contact us<\/span><\/a>.\u00a0\u00a0<\/p><p>Jacob Bickford May 2021<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-35969fe elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"35969fe\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-05f4ff5\" data-id=\"05f4ff5\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-68f19ab elementor-widget elementor-widget-button\" data-id=\"68f19ab\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"button.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<div class=\"elementor-button-wrapper\">\n\t\t\t\t\t<a class=\"elementor-button elementor-button-link elementor-size-sm\" href=\"http:\/\/recordsandarchives.westminster.ac.uk\/archive-blog\">\n\t\t\t\t\t\t<span class=\"elementor-button-content-wrapper\">\n\t\t\t\t\t\t\t\t\t<span class=\"elementor-button-text\">Back to the Blog<\/span>\n\t\t\t\t\t<\/span>\n\t\t\t\t\t<\/a>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Since its development in the 1990s the web has become a central part of our lives. As historian Ian Milligan argues in his book, History in the Age of Abundance?, it is impossible to imagine writing a history of the&#8230;<\/p>\n","protected":false},"author":136,"featured_media":0,"parent":3461,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-2786","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/pages\/2786","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/users\/136"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/comments?post=2786"}],"version-history":[{"count":1,"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/pages\/2786\/revisions"}],"predecessor-version":[{"id":3489,"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/pages\/2786\/revisions\/3489"}],"up":[{"embeddable":true,"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/pages\/3461"}],"wp:attachment":[{"href":"https:\/\/blog.westminster.ac.uk\/recordsandarchives\/wp-json\/wp\/v2\/media?parent=2786"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}