ROMIP: Russian Information Retrieval Evaluation Seminar


Legal Classification Track

Overview

The purpose of this track is to evaluate methods of document classification on a collection of legal documents.

This track follows the standard procedure.

Test Collection

The source dataset is the 2007 legal documents collection.

Task Description for Participating Systems

Each participant is granted access to the training set and to a set of documents from the 2007 legal documents collection. The task is to assign one or more topics from the training set to each document in the collection.

The training set is built from a subset of the categories in the topic catalog provided by the Kodeks company.

Participants must classify every document in the collection. The expected result is, for each category, a list of documents sorted in descending order of confidence.
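As an illustration of the expected result, the following minimal sketch turns per-document confidence scores into one ranked document list per category. The function name, document identifiers, category names, and scores are all hypothetical, and the in-memory dictionary layout is an assumption; it is not the track's submission format.

```python
from collections import defaultdict


def rank_documents_by_category(scores):
    """Build a ranked document list for each category.

    scores: {doc_id: {category: confidence}} (assumed layout, not the track format)
    returns: {category: [doc_id, ...]} sorted by descending confidence
    """
    per_category = defaultdict(list)
    for doc_id, category_scores in scores.items():
        for category, confidence in category_scores.items():
            per_category[category].append((confidence, doc_id))

    ranked = {}
    for category, pairs in per_category.items():
        # Highest confidence first; ties are broken by doc_id for determinism.
        pairs.sort(key=lambda pair: (-pair[0], pair[1]))
        ranked[category] = [doc_id for _, doc_id in pairs]
    return ranked


# Hypothetical classifier output for three documents and two categories.
example_scores = {
    "doc1": {"tax_law": 0.9, "labor_law": 0.2},
    "doc2": {"tax_law": 0.4},
    "doc3": {"tax_law": 0.7, "labor_law": 0.8},
}
example_ranking = rank_documents_by_category(example_scores)
```

In this sketch a document that receives no score for a category is simply absent from that category's list; a real system might instead rank every document for every category.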

Evaluation Methodology

  • A random subset of the categories is selected. The full Kodeks catalog, which experts verified manually, is used to evaluate the selected subset.
  • Official metrics:
    • precision
    • recall
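
The two metrics listed above can be sketched for a single category as follows. This is a minimal set-based illustration: the function name and document identifiers are hypothetical, and the source does not state how a ranked list is cut off before scoring, so the sketch assumes the submitted documents for a category are treated as a set.

```python
def precision_recall(retrieved, relevant):
    """Compute set-based precision and recall for one category.

    retrieved: documents a system assigned to the category
    relevant:  documents the reference catalog assigns to the category
    """
    retrieved = set(retrieved)
    relevant = set(relevant)
    true_positives = len(retrieved & relevant)
    # Precision: share of assigned documents that are correct.
    precision = true_positives / len(retrieved) if retrieved else 0.0
    # Recall: share of correct documents that were assigned.
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical run: 3 documents assigned, 4 documents truly in the category,
# 2 of them in common.
example_precision, example_recall = precision_recall(
    ["doc1", "doc3", "doc2"],
    ["doc1", "doc3", "doc4", "doc5"],
)
```

Here two of the three assigned documents are correct, giving a precision of 2/3, and two of the four reference documents are found, giving a recall of 1/2.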

Data Formats