Motherboard: The Geocities Archive Is Bringing the Early Internet to Life. “Geocities was one of the first places your average person could make a website for free. The Geocities Gallery aims to archive these sites and return them to functionality, MIDIs and all.”
Library of Congress: In a Web Archives Frame of Mind: Improving Access and Describing the Collections
Library of Congress: In a Web Archives Frame of Mind: Improving Access and Describing the Collections. “In 2018, the Library of Congress Web Archiving Team embarked on a journey to streamline description of the Library’s voluminous web archives. As part of that continuing effort, the Library of Congress Digital Content Management Section is excited to announce the release of 4,258 new web archives across 97 event and thematic collections!”
Make Tech Easier: How to Find Out When a Webpage Was Published. “When you’re doing research on a topic, it’s vital to ensure your sources are up to date. If you’re writing an academic paper, dates of publication are often required in the citations. The majority of the time, getting the date is easy: simply look on the site and find the ‘published on’ date to find how recent it was. Things get a little more complicated when there is no date listed on the webpage. When this happens, how do you know when the page was published?”
Introspective Digital Archaeology: The Digital Derangement of Archives. “So do digital archives represent a paradigm shift from traditional archives and archival practice, or are they simply a technological development of them?” Not lengthly, but dense and thought-provoking and well-written. Can you compare something to a slice of really good fruitcake without being pejorative?
Ars Technica: Verizon reportedly blocks archivists from Yahoo Groups days before deletion. “An ad-hoc group scrambling to archive as much content as possible from Yahoo Groups ahead of the site’s final demise next week is running into trouble as more than a hundred volunteer archivists say Yahoo’s parent company, Verizon, has banned their accounts.” This is a big steaming pile of you know.
Boing Boing: GIF site Gfycat announces mass deletions, threatens Archive Team with lawsuit. “Gfycat is a site that people upload GIFs to so they can share them with other people reliably. Used most conspicuously to host memes, clips from other media, and animated porn, it announced Wednesday that it was planning to permanently delete old, anonymously-posted images within days. Archive Team, a web preservation initiative coordinated by Jason Scott, set about archiving the site’s soon-to-vanish content. So Gfycat’s CEO, Dan McEleney, threatened it with a lawsuit, describing archival of the memes it hosts as a ‘denial of service attack’ and demanding compensation.”
Reddit: Help us save as many Yahoo! Groups as possible!. “We need volunteers to manually join as many groups as they can, by filling out reCaptchas. We’ve written a Chrome extension that links to a tracker to coordinate efforts. If you’re up for completing a bunch of reCaptcha’s for us, you could really make a difference! If everyone reading this just completed a few, it would be an enormous help!”
Internet Archive: The Wayback Machine’s Save Page Now is New and Improved. “Every day hundreds of millions of web pages are archived to the Internet Archive’s Wayback Machine. Tens of millions of them submitted by users like you using our Save Page Now service. You can now do that in a way that is easier, faster and better than ever before.”
Internet Archive: The Wayback Machine: Fighting Digital Extinction in New Ways. “The average web page might last three months before it’s altered or deleted forever. You never know when access to the information on these web pages is going to be needed. It might be three months from now; it might be three decades. That’s how the Wayback Machine serves—making history by saving history. Now, the Wayback Machine is fighting digital extinction in brand new ways.”
Library of Congress, The Signal: Launching the Digital Collections Management Compendium. “Over the past two years, my colleagues and I in the Digital Content Management section have been working with experts from across many divisions of the Library of Congress to collate and assemble guidance and policy that guide or reflect the practices that the Library uses to manage digital collections. I am excited to share that today the results of that work have launched as the Digital Collections Management Compendium (DCMC).”
Computerworld: National Library launches ‘enormous’ archive of Australia’s Internet. “‘The Australian Web Archive [AWA] is one of the biggest in the world. And when we say big, we mean enormous,’ says director general of the National Library of Australia, Dr Marie-Louise Ayres. The new archive, which launched last week, contains around 600 terabytes of data across 9 billion records. In bookshelf terms; if the records were printed and stacked they would stretch from Canberra to Cairns.”