The Wayback Machine

12/18/2022

Who is archiving the web, and what happens when people ask for information to be ‘un-archived’? The internet found out recently, when a company with a questionable marketing history reportedly asked the world’s best-known web archive to eradicate its information.

The Wayback Machine, which is run by the non-profit Internet Archive, has been quietly archiving as much of the web as it can to create a permanent record of our fast-moving, volatile digital landscape. The archive’s preservation of online data has proven valuable on several occasions. In 2014, Ukrainian separatist leader Igor Girkin bragged on social media about downing a Ukrainian military cargo plane. After that plane was revealed as Malaysia Airlines Flight 17, the post was deleted, but the Wayback Machine still had the original message.

Clearly, archiving information has its benefits. So what happens when someone doesn’t want information about them to stick around? This issue came up recently when Thailand-based FlexiSpy reportedly asked the Internet Archive to delete its webpages from the Wayback Machine. FlexiSpy, which sells software for monitoring phones and desktop computers, used to market its software as a tool to spy on cheating spouses. As Motherboard points out, another archive still maintains images of the company’s site from several years ago. Search the Wayback Machine’s archive for FlexiSpy, however, and it reports that the URL has been excluded.

Does that mean it complied with the request? The Internet Archive did not respond to requests about its policy. However, its terms and conditions say that if asked by an author or publisher, it “may remove that portion of the Collections without notice.” Its FAQ says that site owners can “send an email request for us to review”.

Traditionally, the Archive has based its approach to exclusion requests on a policy created by UC Berkeley (archived version here). Under this policy, archivists should provide a ‘self-service’ approach that site owners can use to remove their materials using robots.txt files. Robots.txt files are instructions left on sites for crawlers, telling them what they should not look at. Under the policy, a site owner could simply add one of these files at the top level of their site with a specific instruction for the Internet Archive, and then submit their site using a form.

That policy had significant implications for the Archive. In 2006, it settled with a firm called Healthcare Advocates, which was in the middle of a trademark dispute with a similarly-named company. Healthcare Advocates had added a robots.txt file to its site to stop crawlers spidering it. Under the Archive’s policy at the time, this should have triggered the site’s complete deletion from the Wayback Machine, but it didn’t.

Since then, the Archive’s policy on crawling has relaxed. In December 2016 it began ignoring robots.txt files on government sites, and then in April 2017 announced that it was “looking to do this more broadly”. However, the ability to request a deletion via email remains, as it always has done.

FlexiSpy’s request isn’t the first that the Archive has received. There are many others, and some have resulted in legal cases. In 2007, it settled with activist Susanne Shell, who had demanded that it take down records of her family rights site after alleging copyright infringement. Internet Archive has no interest in including materials in the Wayback Machine of persons who do not wish to have their Web content archived.

Nevertheless, the Archive doesn’t appear ready to roll over at every request. Nor does it seem to have completely removed robots.txt-based removals. MSNBC host Joy Ann Reid has recently been the subject of controversy after Wayback Machine searches unearthed homophobic comments on her blog. She has said that someone hacked the Wayback Machine, which is an unsubstantiated claim that the Archive denies. The interesting part is that the Archive refused an emailed request from her lawyers to delete the offending posts, due to Reid’s being a journalist (a very high-profile one, at that) and the journalistic nature of the blog archives.

So, the Archive won’t always follow takedown requests.
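For readers curious how the robots.txt exclusion mentioned in this post works in practice, here is a minimal sketch in Python. It uses the standard library's urllib.robotparser to test whether a given crawler may fetch a page. The user-agent name ia_archiver, the example.com URLs, and the is_allowed helper are assumptions made for illustration. This is not the Archive's own code, and it says nothing about how the Archive treats these files today.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that a site owner might place at the top level
# of a site. "ia_archiver" is assumed here as the archive crawler's name.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent in ("ia_archiver", "SomeOtherBot"):
        for url in ("https://example.com/index.html",
                    "https://example.com/private/notes.html"):
            print(agent, url, is_allowed(ROBOTS_TXT, agent, url))
```

A robots.txt file is only a request. A crawler that chooses to ignore it, as the Archive began doing for government sites, is not technically prevented from fetching anything.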
