ROMIP: Russian Information Retrieval Evaluation Seminar

 General principles 
 Test collections 
 Relevance tables 


Format of training set for classification tracks

Training set is an XML file with the following structure:
<?xml version="1.0" encoding="windows-1251" ?>
<topic-set xmlns:romip="" collectionId="ROMIP-2004-DMOZ" id="dmoz-training">
                 <!-- for legal documents classification collectionId="ROMIP-Legal2007" id="legal-training" -->
<romip:header xmlns:romip="">
  <romip:license type="public" uri="" /> 
  <romip:description> This file contains definition of taxonomy and training set for...</romip:description> 

<topic id="202" name="Business->Armament_and_Defense">
<topic id="240" name="Sport->Frisbee">