ROMIP: Russian Information Retrieval Evaluation Seminar


Format of results for classification tracks

Results should be returned as one or several XML files. Each XML file contains answers for one or several tasks.

Each XML file must have the following structure:

<?xml version="1.0"?>
<romip:taskresult xmlns:romip="http://www.romip.ru/data/classification" trackId="ROMIP-2007-Web-classification">
<!-- for legal documents classification trackId="ROMIP-2007-Legal-classification" -->
<!-- for web documents classification trackId="ROMIP-2007-Webpage-classification" -->

  <romip:run>
    <romip:systemID>Yellow</romip:systemID>
    <romip:runID>MyRunId</romip:runID>
  </romip:run>
  <result>
    <document name="www.frisbee.ru">
      <topic>240</topic>
    </document>
    <document name="www.ularams.ru">
      <topic>184</topic>
      <topic>202</topic>
      <topic>131</topic>
      <topic>163</topic>
    </document>
  </result>
</romip:taskresult>

The run tag describes the system that produced the result.
  • systemID - the system identifier provided by the ROMIP organizing committee.
  • runID - a run identifier that distinguishes between different runs of the same participant. Its value is chosen by the participant.

The result tag lists the topics into which each item was classified.

For the web site classification track, items (document tags) are sites, and the identifier is the site name. For page classification, items are documents, and the identifier is the value of the document tag in that document's description in the document collection. In both cases the identifier is given as the value of the name attribute of the document tag.

The identifiers of the predicted topics should be given as a set of topic tags, each containing the identifier of one topic. Topic identifiers should match the topic identifiers used in the training set.
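
As an illustration only, a result file with this structure could be produced with Python's standard xml.etree.ElementTree module. The function name build_result and the shape of the predictions mapping are assumptions made for this sketch and are not part of the ROMIP specification. The sample values are the ones from the example above.

```python
import xml.etree.ElementTree as ET

ROMIP_NS = "http://www.romip.ru/data/classification"


def build_result(track_id, system_id, run_id, predictions):
    """Build a taskresult tree from {item identifier: [topic identifiers]}.

    Hypothetical helper for illustration; not provided by ROMIP.
    """
    # Keep the "romip" prefix used in the sample instead of an auto-generated one.
    ET.register_namespace("romip", ROMIP_NS)
    root = ET.Element(f"{{{ROMIP_NS}}}taskresult", {"trackId": track_id})

    run = ET.SubElement(root, f"{{{ROMIP_NS}}}run")
    ET.SubElement(run, f"{{{ROMIP_NS}}}systemID").text = system_id
    ET.SubElement(run, f"{{{ROMIP_NS}}}runID").text = run_id

    # result, document and topic carry no prefix in the sample above.
    result = ET.SubElement(root, "result")
    for name, topics in predictions.items():
        document = ET.SubElement(result, "document", {"name": name})
        for topic in topics:
            ET.SubElement(document, "topic").text = str(topic)
    return ET.ElementTree(root)


tree = build_result(
    "ROMIP-2007-Web-classification",
    "Yellow",
    "MyRunId",
    {"www.frisbee.ru": [240], "www.ularams.ru": [184, 202, 131, 163]},
)
xml_text = ET.tostring(tree.getroot(), encoding="unicode")
```

Calling tree.write("result.xml", encoding="utf-8", xml_declaration=True) would then save the tree to a file with an XML declaration; the file name here is arbitrary.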

IMPORTANT: Up to 5 topics per item are allowed.
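
Before submitting, a participant might want to check this limit locally. The following is a minimal sketch using Python's standard library; the function name over_limit, the item name example.invalid and the topic identifiers 1 to 6 are made up for the illustration, and the check does not reflect how ROMIP itself validates submissions.

```python
import xml.etree.ElementTree as ET

MAX_TOPICS = 5  # limit stated in the format description


def over_limit(xml_text, max_topics=MAX_TOPICS):
    """Return the names of document elements with more than max_topics topics."""
    root = ET.fromstring(xml_text)
    return [
        doc.get("name")
        for doc in root.iter("document")
        if len(doc.findall("topic")) > max_topics
    ]


# Hypothetical input: the second item carries six topics, one over the limit.
sample = """<romip:taskresult xmlns:romip="http://www.romip.ru/data/classification" trackId="ROMIP-2007-Web-classification">
<result>
<document name="www.frisbee.ru"><topic>240</topic></document>
<document name="example.invalid">
<topic>1</topic><topic>2</topic><topic>3</topic><topic>4</topic><topic>5</topic><topic>6</topic>
</document>
</result>
</romip:taskresult>"""

print(over_limit(sample))  # prints ['example.invalid']
```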