Archive-It Tuned Into CustomersKristine Hanna, the director of web archiving services at the Internet Archive, has no illusions about the nature of web content. "One of our favorite expressions is, 'the web is a mess,'" she laughed. "It truly is, and it's not getting any better. We have these librarians and archivists who are interested in highly curated collections of born-digital content. So, the question is, how do we get from 'the web is a mess' to highly curated collections?"http://digitalpreservation.gov/news/2010/20100726news_article_archive-it.html-- Library of Congress Digital Preservation Program News, July 26, 2010system2010-09-03 19:01:46+00:00system2012-10-17 22:34:13+00:0052335294Web archiving from a kid's point of view"Could 22nd-century researchers think of the Captain Underpants Web site as source material? They might if kids taking part in the K-12 Web Archiving Program decide they want to preserve it for future generations."School_Library_Journal.jpghttp://www.schoollibraryjournal.com/article/CA6685481.html-- By Lauren Barack, School Library Journal, August 31, 2009system2012-10-18 18:17:35+00:003625129SAA 2006: Research Library Group Roundtable – Internet Archiving"Archive-It puts the tools for managing larges scale web crawling in the hands of archivists." spellbound_blog.gifhttp://www.spellboundblog.com/2006/08/29/saa-2006-research-library-group-roundtable-internet-archiving/-- By Jeanne Kramer-Smyth, Spellbound Blog, August 27, 2006system2012-10-18 21:26:01+00:00412108Internet Archive Launches Archive-It Service"Internet Archive today announced the latest release of Archive-It 1.5, a new subscription-based archiving service geared towards a broad range of institutions at a cost considerably lower than other archive platforms. Archive-It enables subscribers to capture, categorize, and preserve online material from their own institutions websites as well as from the world wide web. Users are able to access and explore and these text-searchable collections, without needing additional technical expertise."
press9.jpghttp://wayback.archive-it.org/2114/20110923040311/http://my.opera.com/LibraryImportant/blog/show.dml/243025-- Brooklyn Library, May 3, 2006system2012-10-18 22:42:21+00:0051109GPO: Web Harvesting Pilot Project "In late 2011, Library Services and Content Management (LSCM) and OAM staff developed a pilot project to test an implementation of the Internet Archive's Heritrix-based Archive-It, which is a subscription-based Web harvesting and archiving service. In developing the pilot project, the project team networked with Web harvesting teams from the Library of Congress, the National Archives and Records Administration, and the University of North Texas Library (a GPO library partner already well-known for establishing the CyberCemetery and its leadership in digital preservation initiatives)."http://beta.fdlp.gov/all-newsletters/featured-articles/1493-web-harvesting-pilot-project-- By the Federal Depository Library Program (FDLP), Featured Articles, March 1, 2013system2013-03-12 20:16:53+00:00system2013-03-12 20:19:08+00:006950544178Archiving and Preserving the Web: Future Directions and Applications"Those not able or ready to step up to web archiving on their own (or who might want to get their feet wet slowly) might be interested in taking a look at Archive-It, a new product of the Internet Archive. Archive-It makes it possible, at a fairly modest price, to get started with web archiving without a lot of technical expertise or investment." press4.jpghttp://hangingtogether.org/?p=114-- By Merrilee Proffitt, Hanging Together, April 29, 2006system2012-10-19 00:07:17+00:00106114The Humanities, DigitizedWhen the largest tsunami in Japan’s recorded history struck in March 2011, wreaking horrific damage up and down the northeastern coast, Folger Fund professor of history Andrew Gordon recalls that he and other Harvard scholars of Japanese culture “had a sense that this was an event that would probably change people’s sense of time” in Japan. A core group of faculty members also felt obligated to try to capture the ephemeral documentation of the crisis that was appearing on the Internet.http://harvardmagazine.com/2012/05/the-humanities-digitized-- By Jonathan Shaw, Harvard Magazine, May-June 2012system2012-04-26 23:27:17+00:00system2012-10-09 23:16:49+00:005843127304Entire Cornell Website to be ArchivedInternet Archive will create an archive of Cornell’s entire web space — approximately eight million documents — by capturing HTML coding, images, PDFs and links to external pages, according to Dean Krafft, Cornell library chief technology strategist, who is overseeing the project.
http://www.cornellsun.com/section/news/content/2011/04/13/entire-cornell-website-be-archived-- By Dennis Liu, The Cornell Daily Sun, April 13, 2011system2011-04-20 00:16:53+00:00system2012-10-17 22:12:05+00:00563723080Archiving the Volunteer State Web: Archive-It and The Tennessee State Library and Archives"The Internet Archive’s Archive-It service continues to debut new permanent archival collections of web pages and sites. Here are a few new ones since we posted our last update. This time the collections all began with the help of the Tennessee State Library and Archives. Here are a few of the many Tennessee collections."resource_shelf_2.gifhttp://www.resourceshelf.com/2007/08/16/archiving-the-volunteer-state-web-archive-it-and-the-tennessee-state-library-and-archives/-- Resource Shelf, August 17, 2007system2012-10-18 18:48:51+00:001818122Smithsonian Now Using Archive-It to Crawl Websites"In September 2012, the Smithsonian Institution Archives began using Archive-It, a service of the Internet Archive, to crawl its almost 250 websites...While Archive-It uses the same software for crawling and viewing websites as we had been using for the past three years, we have been plagued with hardware issues and have not been able to keep our software up-to-date. We now have access to software updates as soon as they are available. The processes of setting up a crawl and reviewing it afterwards are also more user-friendly with Archive-It. In addition, we now have the benefit of support from both the Archive-It staff and the larger Archive-It user community for those times when we just cannot figure out why a crawl is not working."http://siarchives.si.edu/blog/smithsonian-now-using-archive-it-crawl-websites-- By Jennifer Wright, The Smithsonian Bigger Picture Blog, February 26, 2013system2013-02-27 19:14:33+00:00system2013-02-27 19:15:46+00:006849539491A Permanent Collection for a Digital World"Students are encouraged to use the online archives, which are free to anyone with an Internet connection. Once on the site, users can choose from the available collections, including one on Hurricane Katrina. After a collection is selected, the site retrieves links, similar to search engine results, to Web sites that have been cached in the archives."press10.jpghttp://www.schoollibraryjournal.com/article/CA6333210.html-- By Lauren Barack, School Library Journal, May 12, 2006system2012-10-18 22:19:21+00:00128116Archive-It 2: Internet Archive Strives to Ensure Preservation and Accessibility"The Internet Archive's universal approach to the dissemination and access of information is embodied in its Archive-It service that anybody or any organization can use."econtent.gifhttp://www.econtentmag.com/Articles/ArticleReader.aspx?ArticleID=18132-- By Marji McClure, EContent, October 5, 2006system2012-10-18 19:00:48+00:00115105Moran students chronicle their 'worlds' online"Most of the decisions being made about what gets archived have been made by adults," said Cheryl Lederle, educational resources specialist at the Library of Congress. "Student users are arguably one of the largest users of the Internet proportionately, and their voices weren't being heard."students.gifhttp://www.myrecordjournal.com/latestnews/article_c7e0ec84-5802-55f1-aaed-a7570ec9aed3.html-- By Samaia Hernandez, The Record Journal, April 2, 2009system2012-10-18 18:39:29+00:002521125NASA Archiving Social Networking ActivityNASA has partnered with online subscription service Archive-It to store all of the space agency's social media activity and make it accessible from one central location.http://www.informationweek.com/news/government/cloud-saas/showArticle.jhtml?articleID=224200762&subSection=All%2520Stories-- By Elizabeth Montalbano, InformationWeek, March 30, 2010 system2010-04-22 01:29:06+00:00system2012-10-18 18:09:18+00:005032136Readying for Reframing: Reports on Web Archiving "Over the past year, NYARC has surveyed the publishing and web archiving landscape to develop a program for collecting born-digital art research materials." An overview of this project called “Reframing Collection for a Digital Age: A Preparatory Study for Collecting and Preserving Web-Based Art Research Materials,” funded by the Andrew W. Mellon Foundation, can be read online.http://nyarc.org/content/readying-reframing-reports-web-archiving-- By Lily Pregill, New York Art Resources Consortium, April 18, 2013system2013-04-18 20:12:29+00:00system2013-04-18 20:13:18+00:007152622872D-Lib MagazineA focus on web archiving in the March issue of D-Lib Magazine.http://www.dlib.org/dlib/march12/03contents.html-- March, 2012system2012-05-23 20:50:13+00:00system2012-10-09 23:22:37+00:006340133924Archiving an unprecedented disasterThere is an urgent need to collect materials for a digital archive of the catastrophe triggered by the Great East Japan Earthquake, especially because digital resources are uniquely impermanent. http://ajw.asahi.com/article/0311disaster/opinion/AJ201203110017-- By Andrew Gordon, Asahi Shimbun, March 11, 2012system2012-05-23 20:32:45+00:00system2012-10-17 21:57:47+00:006041133921Web Archiving at the National Museum of Women in the ArtsThe National Museum of Women in the Arts has been web archiving art-related online ephemera using the Internet Archive’s Archive-It since November 2011. This case study presents the considerations and challenges of archiving such types of material and provides a foundation for arts institutions to begin more collaborative web archiving.http://archive-it.org/static/files/art_ephemera_nmwa.pdf-- By Heather Slania, Art Documentation: Journal of the Art Libraries Society of North America, Vol. 32, No. 1 (Spring 2013), pp. 112-126system2013-05-07 17:39:58+00:00system2013-05-07 17:39:58+00:007253684030MS Thesis - Visualizing Digital Collections at Archive-It"Visualizing Digital Collections at Archive-It", was the subject of a recent MS thesis by Kalpesh Padia (who is continuing his Ph.D. studies at NC State University) and a JCDL 2012 short paper by Kalpesh Padia, Yasmin AlNoamany, and Michele C. Weigle.
In order to provide a better visual experience to users of Archive-It collections, we implemented six different visualizations (treemap, time cloud, bubble chart, image plot, timeline, and wordle). The work also provides a rule-based categorization of sites in collections that lack a curator-defined grouping.http://ws-dl.blogspot.com/2012/08/2012-08-10-ms-thesis-visualizing.html-- August 15, 2012system2012-08-15 17:31:38+00:00system2012-10-17 21:37:41+00:006647165593Web Archiving: Selection, Capture, Preservation, Marketing #saa12While archiving web pages may not spring to mind as one of the duties of an archivist, organizations are increasingly recognizing the value of keeping records of websites and social media sites to provide snapshots of their public face and context for other assets.http://www.cmswire.com/cms/information-management/web-archiving-selection-capture-preservation-marketing-saa12-016864.php-- By Mimi Dionne, August 8, 2012system2012-08-15 17:04:39+00:00system2012-10-17 21:44:48+00:006546165592AUC Rare Books and Special Collections Library NewsInterested in learning more about the events of January 25th? Explore the resources described below to discover more about demonstrations in Egypt.http://aucrbscl.blogspot.com/2011_02_01_archive.html-- American University in Cairo Rare Books and Special Collections Library, February 27, 2011system2011-03-01 16:27:19+00:00system2012-10-17 22:32:46+00:00543517529Long–term Preservation for fragile Web content: LANIC’s Web archiving ProgramIn this article, Archive-It partner, Kent Norsworthy, talks about how the Latin American Network Information Center at UT Austin uses Archive-It to archive government documents from Latin America, as well as why web archiving is so important for this type of content. The article was published in the 2008-2009 "Portal"(http://lanic.utexas.edu/project/etext/llilas/portal/portal099/), a magazine published annually
since 2006 by the Teresa Lozano Long Institute of Latin American
Studies at The University of Texas at Austin.lanic-article-screenshot.jpghttp://lanic.utexas.edu/project/etext/llilas/portal/portal099/lanic.pdf-- By Kent Norsworthy, Portal, October 29, 2009system2012-10-18 18:13:22+00:004126130Archiving Made Easier"The Internet Archive has released new software, called Archive-It 1.5, that lets colleges and museums create their own searchable catalogs of data and multimedia. The software -- available for an annual fee of $10,000 -- allows institutions to manage digital collections comprising as many as 10 million items as part of the archive's main project, a comprehensive online library showing the evolution of every site on the Web. A number of colleges, including the University of Toronto and Indiana University at Bloomington, have signed up for the service, the archive said. "press8.jpghttp://chronicle.com/wiredcampus/article/1228/archiving-made-easier-- By Brock Read, Chronicle of Higher Education: The Wired Campus, May 3, 2006system2012-10-18 22:15:24+00:00139117Internet Archive's Brewster Kahle Profiled in a New Article"While digitizing content gets tons of press these days, archiving the web is a very important issue for info pros that deserves lots of attention and work. Although many web archiving projects exist around the globe, The Internet Archive is perhaps the most well known."resource_shelf.gifhttp://www.resourceshelf.com/2006/06/23/internet-archives-brewster-kahle-profiled-in-a-new-article/-- Resource Shelf, June 22, 2006system2012-10-18 19:07:40+00:00214106Archiving NASA’s Social MediaOne thing NASA is careful about is archiving material. They are well aware of the importance of the work they’re doing, and public outreach is a critical aspect of it. That’s why I’m happy to see a new effort on the part of the space agency to archive all their social media outlets.Snapshot_2010-03-26_10-55-53.tiffhttp://blogs.discovermagazine.com/badastronomy/2010/03/21/archiving-nasas-social-media/-- By Phil Plait, Discover Magazine, March 21, 2010system2012-10-18 18:10:17+00:004831137Internet Archive Releases Archive-It 1.5"Collaborating with state archivists as well as public and private libraries, Archive-It is working to preserve information found on the Internet. New features and applications of the Archive It 1.5 release include enhancements to the user interface, improved access to collections, and advanced search and reporting capabilities."press2.jpghttp://www.econtentmag.com/Articles/ArticleReader.aspx?ArticleID=15639-- By Michele Manafy, EContent, April 28, 2006system2012-10-18 22:12:54+00:001410118Announcing Partnership with Internet Archive & Archive-It Service"The partnership is not just about committing to preservation, it’s also about shared mission."http://anvilacademic.org/announcing-partnership-with-internet-archive-archive-it-service/-- By Anvil Academic, April 3, 2013system2013-04-03 19:15:40+00:00system2013-04-03 19:15:40+00:007051550912CAPE Students Creating Virtual Time CapsulesEighth-graders at Camarillo Academy of Progressive Education are literally making history. The students and their teacher Camille Kavon are taking part in a K-12 Web Archiving Program sponsored by the Library of Congress, the Internet Archive and California Digital Library. The program is designed to encourage students to think about history by selecting sources for ongoing research use, effectively creating "time capsules" of what represents their current lives.
http://www.vcstar.com/news/2011/mar/10/cape-students-creating-virtual-time-capsules/-- By Rachel McGrath, Ventura County Star, March 10, 2011system2011-03-15 22:06:13+00:00system2012-10-17 22:13:49+00:00553617786Web Archiving and Mainstreaming Special Collections: The Case of the Latin American Government Documents ArchiveWhen historians of the future want to understand Latin American governments, they are going to be thrilled that curators like Kent Norsworthy from University of Texas Libraries have been preserving Latin American government websites.http://blogs.loc.gov/digitalpreservation/2012/06/web-archiving-and-mainstreaming-special-collections-the-case-of-the-latin-american-government-documents-archive/-- By Trevor Owens, The Signal, June 6, 2012system2012-06-06 22:01:39+00:00system2012-10-17 21:45:23+00:006445145597Internet Archive sells extended archiving for organizations"Archive-It 1.5 provides subscription-based service that allows institutions to store, categorize history (at a lower price) from their website and WWW. Users are able to explore and access these text-searchable collections, without needing additional technical expertise."press6.jpghttp://battellemedia.com/archives/002530.php-- By John Batelle, Searchblog, May 2, 2006system2012-10-19 00:03:52+00:00117115Open Folklore + Community Arts NetworkThe Community Arts Network (CAN), Indiana University Bloomington Libraries, and the American Folklore Society are pleased to announce that the CAN Web site has been archived as part of the Open Folklore project. After CAN announced it would be forced to immediately shut down its Web site due to lack of funds, the IU Bloomington Libraries offered to capture the CAN Web site using Archive-It.http://www.openfolklore.org/news/open-folklore-community-arts-network-september-4-2010-- Open Folklore, September 16, 2010system2010-10-14 21:49:00+00:00system2012-10-17 22:31:47+00:00533411149Digital Preservation Is Cultural Literacy While digital preservation might not be the first thing that comes to mind when thinking about K-12 curricula, in fact teaching kids about Web archiving, digitization, media storage, and collection building can foster long-term thinking and serve as a gateway to hands-on learning in science and technology.http://www.huffingtonpost.com/kari-kraus/digital-preservation-is-cultural-literacy_b_1455752.html-- By Kari Kraus, Huffington Post, April 26, 2012system2012-04-26 22:33:55+00:00system2012-10-17 21:56:57+00:005742127303Archiving Human RightsIn 2009, Columbia University Libraries received a grant from the Mellon Foundation to explore web archiving program development. The collection at the center of our web archiving program is the Human Rights Web Archive. Why are we doing this? In brief, to preserve online resources for future researchers and activists. Archiving the sites of human rights organizations ensures, to a certain degree, that the website content will be preserved in the context of the original site, and will be accessible even if the original site becomes unavailable. http://blog.witness.org/2012/01/archiving-human-rights-on-the-web/-- By Tessa Fallon, Witness Blog, January 27, 2012system2012-05-23 20:43:00+00:00system2012-10-17 22:11:00+00:006238133923Archive-It Adds Advanced Search Interface"Unlike another IA project, The Wayback Machine, Archive-It pages can be keyword searched. The service uses Nutch open source search software."rs_adv_search.gifhttp://www.resourceshelf.com/2007/12/27/web-archives-archive-it-adds-advanced-search-interface/-- Resource Shelf, December 27, 2007system2012-10-18 18:47:19+00:002019123La frágil memoria de la informáticaAn article on preserving digital information from the Argentinian newspaper Clarinhttp://www.revistaenie.clarin.com/ideas/tecnologia-comunicacion/La-fragil-memoria-de-la-informatica_0_644335568.html-- By Andres Hax, Clarin, February 10, 2012system2012-05-23 20:38:24+00:00system2012-10-17 21:59:15+00:006139133922Sherwood middle school students participate in nationwide program to archive Web sitesSherwood middle school students are creating a 21st century version of a time capsule, though there won't be any shovels or digging involved. sherwood_middle_school.pnghttp://www.oregonlive.com/washingtoncounty/index.ssf/2010/02/sherwood_middle_school_student.html-- By Melissa Navas, The Oregonian, February 24, 2010system2012-10-18 18:11:01+00:004327131Developing a Health and Medicine Blogs Collection at the U.S. National Library of Medicine"The National Library of Medicine has a mandate to collect, preserve and make accessible the scholarly biomedical literature as well as resources that illustrate a diversity of philosophical and cultural perspectives not found in the technical literature. New forms of publication on the web, such as blogs authored by doctors and patients, illuminate health care thought and practice in the 21st century. In June 2011 the NLM Web Collecting and Archiving Working Group engaged in a pilot project to understand better the processes and challenges of collecting born-digital web content and to expand the Library’s collecting strategy for digital formats."http://blogs.loc.gov/digitalpreservation/2012/10/developing-a-health-and-medicine-blogs-collection-at-the-u-s-national-library-of-medicine/?doing_wp_cron=1349192650-- By Christie Moffatt and Jennifer Marill, The Signal, October 2, 2012system2012-10-02 17:37:11+00:00system2012-10-17 21:36:20+00:006748306473New Web Archives From Archive-It: Chomsky.Info & Many Labor and Political Organizations"The Archive-It Team at The Internet Archive continues to do important work archiving the web. Over the past few weeks we’ve been posting about many new collections that Archive-It has been rolling out and this week will be no different."resource_shelf_1.gifhttp://www.resourceshelf.com/2007/05/12/new-web-archives-chomskyinfo-many-labor-and-political-organizations/-- Resource Shelf, May 12, 2007system2012-10-18 18:58:41+00:001716121Occupy Wall Street: From the Streets to the ArchivesEarlier this week a Times article looked at social scientists who are trying to study Occupy Wall Street in real time. But a group of archivists are also hitting the streets, and the Internet, in an effort to preserve the movement’s traces for scholars of the future.http://artsbeat.blogs.nytimes.com/2012/05/02/occupy-wall-street-from-the-streets-to-the-archives/-- By Jennifer Schuessler, New York Times Arts Beat Blog, May 2, 2012system2012-05-04 15:01:51+00:00system2012-10-17 21:48:58+00:005944130379K12 Web Archiving: "I had never thought of archiving websites...""Student comments [about the web archiving program] included 'choosing the websites was really fun because it let everyone be creative and really think about what teenagers enjoy today,' and 'I had never thought of archiving websites, even though in this day and age we use them as much as and more than books.'"k-12_program.jpghttp://www.digitalpreservation.gov/news/2009/20090805news_article_k-12_archiving_program.html-- Library of Congress, August 5, 2009system2012-10-18 18:38:25+00:002722126Pact to Ensure Uninterrupted Research Access"The DOE E-print Network is the largest collection that a federal institution has undertaken in an on-going effort to preserve their own documents and history through Archive-It."osti_press.gifhttp://www.osti.gov/news/2007/jun/internetarchive-- Office of Scientific and Technical Information (OSTI), June 29, 2007system2012-10-18 18:52:50+00:001617120Biblog (Denmark)"Take the tour to see how it works. It is all up-to-date technology, simple and straightforward. And very effective."
<br/>(translation by InterTran, www.trannexp.com). press5.jpghttp://bib-log.blogspot.com/2006/05/lav-bibliotekets-eget-internetarkiv.html-- By Erik Hy, Biblog, April 29, 2006system2012-10-19 00:04:55+00:0095113 Preserving the Web one group at a time"Subscribers can develop digital collections of their own based on up to 300 "seed" Web sites designated by the institution. The annual service lets subscribers create and manage up to three collections, with as many as 10,000,000 URLs."press_cnet_sm.jpghttp://news.com.com/2061-10802_3-6067173.html-- By Stefanie Olsen, CNet News, May 1, 2006system2012-10-18 19:09:01+00:00313107