Principles of information search

Building a query

The most essential factor in formulating a query is the skilful combining of search words and using different techniques in order to formulate the subject as clearly and precisely as possible. The search words can be combined using the following techniques.


Boolean operators

The search can be extended or limited by Boolean logical operators AND, OR or NOT.

 
 
rockwell_schrocks_boolean_machine.png
 
 
 

Boolean machine, http://rockwellschrock.com/rbs3k/boolean/index.htm

AND limits the search, finding documents that contain the words connected by AND in the query

Example: television AND violence – presents documents where both television and violence can be found.

OR extends the search, finding documents containing television, as well as those containing violence

Example: violence OR aggression – presents documents containing only violence, only aggression or both of these words.

It is practical to use operator OR to connect

  • synonyms
    student OR pupil
  • words with different spelling
    organization OR organisation
  • related words and phrases, or those of similar meaning
    children OR adolescents
     antisocial behaviour OR deviant behaviour
    antisocial OR criminal

NOT limits the search – the document has to contain the first word, but not the other one, making it possible to exclude the presence of some words in the results.

Example: antisocial NOT criminal – finds documents containing antisocial, but excludes those containing criminal.


Proximity operators

Many databases make use of proximity operators (NEAR, ADJ, etc.) that help to make the search more precise, as they let to specify how near to each other the words have to stand in the text. The likelihood of finding the words you are searching for in one and the same context is, this way, bigger. Often, it is more practical to use proximity operators instead of AND.

Example: media N3 violence – retrieves documents where media and violence stand three words apart from each other independent of their succession, finding both media violence and violence in media.

Proximity operators and their functions may differ in different databases, therefore, while searching a database for the first time, be sure to consult HELP to get familiar with the database.


Phrase search 

Phrase search is used to find documents containing a specific phrase or sentence. The phrase or sentence is usually put into quotation marks.

Example: 
“media violence” – retrieves documents containing precisely this phrase.

If the same meaning may probably be conveyed in different wording or if you cannot form the exact phrase, better use the proximity search with the same words.

Often, several words together are automatically searched as a phrase, even without quotation marks, but in some databases they are connected with the operator AND, meaning that all of them have to be found in the text independent of their location in it. You have to make sure before the search or during the search, which of these versions are working in the database.


Truncation 

Truncation allows searching for words with the same stem, but different endings.

Any number of letters at the end or the beginning of the word are replaced by a symbol, most often by * (asterisk) or ? (question mark).

Example: 
question* – retrieves questionsquestioning, etc.
child* – retrieves childrenchildhood, etc.

This search can replace the connecting of different word forms with OR.

First, decide where it is most practical to interrupt the word to replace its ending by a symbol/meta-sign. If you interrupt the word too early, you may get too many irrelevant results, but if you do it too late, you may miss some important words (adolescent* – retrieves adolescents, but not adolescenceteacher* – misses teaching).

You can also use a symbol instead of a letter inside a word in case different spellings are possible or if you are not sure of the spelling, or if you want to find all possible spellings of the word.

Example: 
organi?ation – both organization and organisation are found;   wom?n – both woman and women are found

The use of these symbols may differ in different databases, be sure to find out about their use before making a search.


Use on parentheses

If you use different operators in one and the same query, you have to keep in view that the expressions connected with AND and NOT are searched first, and those connected with OR, only in the second order. If you wish to change this order, you have to use parentheses. The expressions in parentheses are searched first.

Example: using the table of search words above, we connect synonyms and words of similar meaning with the operator OR, separate these groups with parentheses and then connect them with the operator AND in order to find their intersection. We use truncation to avoid spelling out all words with similar stems.

(aggressi* OR violen*) AND (tv OR television) AND (child* OR adolescen* OR teen*)