Genealogy Searching on the Internet


Introduction

The Internet is is a huge collection of information and growing daily. For a genealogist, it can be a formidable task to locate valuable information. There are hundreds of Web search engines out there. Most of them do not provide much information of use to genealogists and some have a wealth of information if you know how to find it. You have probably used at least one of these search engines already and found thousands of hits or none. Success depends upon which search engine you use and how you define your search arguments.

The discussion here is limited to those facilities which are most apt to provide information of use to genealogists. A list of other search facilities are given under Other search engines.

The information available to each search engine is determined by several factors such as controls placed in the HTML code for the Robots which collect the data, which search engines the website manager has registered with and what each search engine manager decides is significant data for the users. While the general search engines can provide genealogical information, most successful searches are obtained from the genealogy search engines.

Search engines offer various options for you to control your searches. Understanding and using these options properly is very worthwhile.

General Search Engines

General Search Engines fall into several categories. They are Directories, True Search Engines, Meta Search Engines, FTP searches, people searches and usenet searches. Most of these have two levels of search: simple searching and advanced searching.

Directories

Directories are similar to a card catalog in a library. They are manually updated by the administrator who determines whether a website should be included in the directory. It is quite often necessary to select several levels of the directory to reach the desired category. Most directories provide a search engine but data in their index only includes those web pages included in the directory.

True Search Engines

True search engines are built and maintained by Robots (crawlers, worms, spiders etc.) which scan web pages selecting the information to be included in the index. Some indexes contain only the title of the web page, some include the first several lines of the web page and some contain key words from the entire page. These databases are huge but are the most likely to produce results.

Meta Search Engines

Meta Search Engines have no indexes of their own. They provide the facility (predetermined or selectable) to search several True Search Engines with just one search. The search results indicate on which site the matches were found.

FTP Searches

FTP (File Transer Protocol) sites are the original way to download text and program files. Some of the sites are still accessed by the prefix FTP://. However, most downloading from the Internet is done directly from http:// sites.

People Searches

People Searches include searches for postal and E-mail addresses for living persons. Some of the sites provide other services such as directories of nearby commercial services, area maps and sometimes route maps to the location.

Usenet Searches

Usenet Searches are for locating usenet discussion groups (channels).

Simple searching

Simple searches are controlled mainly by punctuation: + and - (force and ignore), " " (force sequence), and
* and ? (wildcards).

Using + and -

To begin with, you can instruct the search engine to find pages that must contain a certain word, words or phrase. If you enter two words into the search box (for example, bible and faith), the result of this search will include some pages which contain both words, some which will only have bible, and some with only faith. The search engines will find any page that has at least one of your keywords. But if you put a plus sign (+) before any word, then the search engine understands that all pages must include that word (or words).

Or use a minus sign (-) to tell the search engine not to find any pages with that word (or words). Here are some examples:
 

Search for
Result
bible +faith All pages will have "faith"; some will have "bible".
+bible +faith All pages will have "faith" and "bible".
+bible -faith All pages will have "bible"; no pages will have "faith".
bible faith Some pages will have "bible", some "faith", some both.

There are two exceptions to this rule. HotBot has "Match All" as default in its pull-down menu (so you don't need to use a plus sign), and Infoseek doesn't really exclude words with a minus sign from its searches; it only gives these words a lower ranking value, so that pages with these words appear toward the end of the list.

Double Quotes

All major search engines also allow you to search for phrases by using double quotes. For example, if you searched for faithful men, you would get pages which have faithful and/or men; but if you search for  "faithful men", you would only find pages with the exact phrase faithful men. Learn to use this feature and your searches will be much more profitable.

Capital Letters

Most of these search engines treat lower case search phrases as universal, but will perform a case sensitive search if you capitalize any letter. That is, if you search for bible, you'll find pages containing bible or  Bible, but  if you search for Bible, you'll only find pages containing Bible. So pay attention to your capital letters, or you might not find what you're looking for.

Using Wildcard Characters

Altavista, allows you to use wildcard characters. With Altavista you can also put an asterisk (*) at the end of a word. With HotBot, you can use the asterisk at the end or beginning of words, or use a question mark (?) to substitute any letter.

For example, typing faith* will find pages containing  faithful, faithfulness, faithfully. Typing mo?e in HotBot will find pages with more, move, mole, etc.

Advanced Searches

advanced searches are controlled by boolean operands and certain keywords. They are not of much use for genealogy searching but the principles are presented here for those who may want to use them. They are of two main types: Boolean and Field.

Boolean Searches

Except for Infoseek, the major search engines support Boolean searching. Use AND, OR and NOT according to Boolean logic. Search for faith AND bible to find pages which have both words; search for faith OR bible to find pages which have one word or the other; search for faith NOT bible to find pages which have faith but not bible.

You might think you could achieve all this with the plus and minus signs, but Boolean phrases allow you to use multiple parameters. For example, to find any page which speaks about the Bible and about faith, but which does not mention men or women, use the following search phrase: faith AND bible NOT (man OR men OR woman OR women).

Notes: In HotBot you first have to choose "Boolean Search" from the "all the words" menu. In Lycos, you need to search with Lycos Pro. In Yahoo!, select AND or OR from the search options page. And Altavista and Excite require that the NOT command be written as AND NOT.

Field Searches

If your search needs are really specific, you might need to use field searches, which Altavista, HotBot, Yahoo!, Infoseek and Lycos Pro support. You enter a "field name" followed by a colon and then a search term. Valid field names include link (which will only find the search term in links), title (only searches in titles of pages), url (only look in URLs), alt (looks in labels of images), etc.

For example, want to know which sites link to yours? Simply search for link:your_url.

The table at the end of this document lists the valid field names for each search engine.

Keywords

Choosing significant keywords for genealogical searches is important. There should be at least one specific keyword such as a surname, name of a state, county or cemetery. When searching for a specific person, use quotes such as "john smith". Some general keywords to use include:
+genealogy
+family
+history
+index
+born
+died
+married
+record (birth death marriage)
+cemetery

Hints for searching the Internet

1. Define the keywords for the search (importance and number)

2. Add necessary search language codes. (do not mix symbols with Boolean)

3. Use all lower case unless you specifically want upper case.

4. When searching for persons, use the " ". ("john smith")

5. Do not use punctuation in names, ( - ' . )

6. Modify the keywords and try again depending upon what is found on the first try.

7. Use the Edit and Find in Page functions of your browser to find words within the retrieved page.

8. Do the same search in a few months to check for additions to the net.

Start Searching!

So there you are. Start using these simple tips in your favorite search engine, and your searches will be much more meaningful.

Genealogy Search Engines

Genealogy Search Engines differ from the general search engines in that they are designed specifically for names, dates, places etc. They fall into several categories. They are Lists, Databases, Queries, Newsgroups and Files.

Lists

Lists are just that; lists of web sites with a link to them. Like Directories, they are maintained manually and a web site manager or user can ask to have a site listed.

Databases

Genealogy Databases vary in what data they contain. Most database sites have multiple databases online Family Search from LDS has the IGI and Ancestral File online. Family Tree Maker has indexes to Library.com, Genealogy Web sites, Census, the World Family CD's , the Ancestry CD's and the SSDI files. Ancestry has the Ancestry World Tree (GEDCOM), SSDI, Census records. World Connect is a facility of RootsWeb.com. Data is displayed in group sheet format. Geneanet provides a search for others who are researching the same surnames as you. It has a separate form for foreign countries. Kindred Connections provides the same service but in a different format. Genealogy.com provides links to Internet web pages, Personal Home Pages and the SSDI.

Queries

Queries sites are the same as those you find listed in genealogical publications. You can find other researchers who are researching the same families that you are and you can enter your own queries.

Newsgroups

Newsgroups provide information on discussion groups on various topics. They are similar to the message areas on Bulletin Board Systems. You must subsribe to the discussion groups to receive the messages. There are programs such as Free Agent that can be used to search these messages for surnames or subjects of interest to you.

Files

Files sites are places to obtain genealogical text files and software, mostly freeware and shareware.

Other search engines

There are many other search engines available such as Ask Jeeves, HotBot, Infoseek, MSN, Net Center, Snap, Magellan, Google, All In One ($$$$) and Fast. There are also commercial programs available to run on your system to search the Internet. Some of them are: Copernic 99, Super Sleuth, Websleuth, Organizer, Tracker Pro, Net Detective and Alexa. There are also several free search links that are available for use on your system. They are Alta Vista Discovery and Express Search from Go-Network. The search button on the two main browsers will also get you to search engines. Internet Explorer takes you to Go-Network and Netscape Navigator takes you to a list of search engines to choose from.

Navigation Aids:

Back to Home Page
Back to top of this page


Bill Korroch
3806 Churchill Ave.
Lansing, MI 48911-2209
E-Mail: korroch.william@acd.net

Revised - 12 June 1999
url: http://userdata.acd.net/korrochwilliam/tutorial.html