Tips for Locating Web Sites That Have Moved
So how do you find these missing genealogy sites? Here are a few tricks that I use:
- Shorten the URL back to the main site or first subdirectory. Many times when the subpages of a site don't work it is because the site has changed its directory structure. If you can get back to the home page of the Web site you may be able to search or browse your way back to the page you were looking for. This trick only works if the main site hasn't moved to a new URL without leaving a forwarding address.
- Search for the file name (this works especially well if the file name is somewhat unique) and a keyword or too. The file name is the last part of the URL with a .htm, .html, .asp or other extension.
- If the page has been online recently, then Google usually offers the most recent cached version. Search for the site in Google and then click on the "cached" link under the page description.
- For sites that have been missing for months or perhaps years, then the Wayback Internet Archive is probably your best bet. This site has cached versions of pages going back for many years. If one of the cached versions doesn't work, then try another one.
As an example (sorry to single you out!), the page for Berrien County, Michigan cemeteries in the MIGenWeb Archives still has an old page that comes up broken in Google search results. In other words, the old page is not redirecting to the new one. When you follow a link to the old page on RootsWeb, you receive a "We're sorry. The page you tried is not available" message.
To locate the new page, you can try several things:
- If you're familiar with the USGenWeb Archives system, then you'll know that this sub-site is part of the larger USGenWeb Archives system. A search for USGenWeb Archives in Google still brings up the old site high on the list, but by visiting this page you are given the link to the new Web address at usgwarchives.org. From there you can browse down to Michigan Table of Contents page -> Michigan Counties Table of Contents -> Berrien -> Cemeteries to find the new page.
- Even if you weren't previously familiar with the USGenWeb Archives project, you can follow the same path by using the "shorten the URL" trick. Shorten it one directory level at a time until you get back to the main USGenWeb Archives page - which doesn't redirect, but does give you a link to the new location of the Web site.
- Another way you can locate the new site is to use the file name search trick. Search Google for "1101cem.htm" (the file name located at the end of the URL) and "berrien" to locate the new page. This trick only works if the new Web site has been around long enough to have been indexed by Google. In this case it has. In my test the new page for USGenWeb Archives - Berrien County Cemeteries came up second in the search results.
- Because this site only moved a few months ago, the cached link on Google still works for the moment, but not all of the links from the cached page work because they, too, have moved. The next step is to visit the Internet Archive, which has cached versions of the site from 2007 back to 2001.
The main theme here is persistance. Most broken links and moved or removed Web sites can be found with a little time and patience!


Comments
Thanks for the suggestions. I can get to a rootsweb website I had used frequently, but can no longer access the index or source data. I hope one of these tricks work. I also hope that when the owners of these non-working sites get set up elsewhere they let us all know through Cindy’s list of new sites.
The best search site for the USGenweb Project is USGenwebSearchUS found here:
http://www.usgenweb-search.us/
It indexes all of the USGenweb sites and you can search by state or by a range of counties within some larger states. I am not sure if it is caught up to date with all of the changing urls yet or not.
Mike