ARE
ARE (Anchors and RElations) system extracts information from free text. First, ARE extracts cue phrases (anchors) and, second, ARE evaluates relations between the anchors.
EventSupervisor
The system EventSupervisor is experimental system of structurization of a news web-stream. The basic idea of system consists in statistical classification of documents with use of features inherent in web-stream of news and actually news.
Exactus
Galaktika-Zoom
"Galaktika-Zoom" is a text mining solution working with unstructured data. The system includes proprietary tools for textual data repository creation and management, full-text search, automatic structuring, and data analysis tools based on linguistic, mathematical and statistical methods.
hbc-S
HeadHunter
Explorative search engine based on mix of classic and our own original algorithms. Testing of new ranging formulae is planned during the seminar.
HSVISE
IFM2
IFM2 - experimental system for near-duplicate image detection. The system is combines bio-inspired computational attention principles with interest point detection methods. The main idea is computation of local image regions, salient in terms of attention model. To represent these regions standard scale invariant descriptors, such as SIFT, PCA-SIFT and SURF are used. The image is described by a set of salient regions' feature vectors. Thus the image comparison becomes a comparison of local interest point sets.
JKX
KGCDA
KGCDA is the system of context-dependent annotation based on use of text fragments estimation multifactorial model and parametrical optimisation by means of documents teaching sample.
LISA
In the context of CBIR track solution of a modified task is proposed: build and save textual annotations for all of the images in the task and then search among obtained annotations. For annotation we use the probabilistic methods. In the task of near duplicate detection an improving of the method based on multiscale representation of the image is suggested. The idea is to analyze the signs of the gradient of images for a few scales.
mnoGoSearch
MnoGoSearch is free open source software for Unix-style operating systems to organize search for a Web site or a group of sites. mnoGoSearch is build on the inverted index technology and uses the TF*IDF weight when ranking documents, taking into account various additional parameters such as word distance, section break-down, stemming word forms and synonyms, and others.
RCO
RCO team is focused on research in area of computer linguistics and development of text analysis solutions for full-text databases, data-warehouses and BI systems. In the workshop we are planning to drive several experiments on text categorization and document retrieval tasks.
Search@Mail.ru
Search KM.ru
Information retrieval system, version mod.2.5. The system is based on traditional algorithms combined with our own developments.
SEUS
SKAT
Subject Search Sleuth (SSS)
Subject Search Sleuth (SSS) is a text search and annotation engine based on the fast non-reconsidering full-text fuzzy pattern search algorithm developed by Sergey Kryloff. The SSS algorithm supports cases when search terms are absent, swapped or alternated with other terms in the answer. Being based on notion of Q-Term (instead of word, its canonical form or, stem) SSS is very flexible with regard to supporting multiple languages. Current version supports 40 languages, including Asian ones, Arabic, Indonesian and Hebrew.
UIS RUSSIA
Yandex.Server