Boolean search
The Boolean search is a model for information retrieval in which Document is considered as a bag of words. Any query is posed as a boolean expression of terms where term means word.
The bag of words
Let us consider closely the model by an example.

Term-Document incidence matrix

We have a four documents which contains programming skills of employees.
  1. Yernat
    PHP, Java, Basic
  2. Marat
    Perl, Java, C++
  3. Rinat
    PHP, C++, XSLT
  4. Bekzat
    Java, HTML, PHP
Let us tokenize string of skills and build the equivalent matrix in memory(RAM)
Term-Document incidence matrix

Boolean query processing

Let us try to find employees who skilled with PHP AND C++.
The result is Rinat.
May 3rd, 2011
Back to main
Moi krug - Yernat Assanov
Advertisement
Documentolog
(C) 2010, kseeker
Email: kseeker@yandex.kz
Используются технологии uCoz