Format of results for site classification tracks
Results should be returned as one or several XML files. Each XML file
contains answers for one or several tasks.
Each XML file must have the following structure:
<?xml version="1.0"?>
<romip:taskresult xmlns:romip="http://www.romip.ru/data/classification" trackId="ROMIP-2008-Web-classification">
  <romip:run>
    <romip:systemID>Yellow</romip:systemID>
    <romip:runID>MyRunId</romip:runID>
  </romip:run>
  <result>
    <document name="www.frisbee.ru">
      <topic>240</topic>
    </document>
    <document name="www.ularams.ru">
      <topic>184</topic>
      <topic>202</topic>
      <topic>131</topic>
      <topic>163</topic>
    </document>
  </result>
</romip:taskresult>
The run tag describes the system that produced the result.
- systemID - system identifier provided by the ROMIP organizing
  committee.
- runID - run identifier that distinguishes between different runs by the
  same participant. The value is chosen by the participant.
The result tag describes the topics into which each item was classified.
Items (document tags) are sites, identified by site name. The identifier
should be given as the value of the name attribute of the document tag.
Identifiers of the predicted topics should be stored as a set of topic tags,
where each topic tag contains the identifier of one topic. Topic identifiers
should match the topic identifiers from the training set.
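The requirement that topic identifiers match the training set can be checked before submission. The following is a minimal sketch, assuming the result file has the structure described here and that the training topic identifiers are already available as a collection; the function name unknown_topics and its parameters are hypothetical and not part of any ROMIP tooling.

```python
import xml.etree.ElementTree as ET


def unknown_topics(xml_text, training_topic_ids):
    """Return {site_name: [topic ids that are not in the training set]}.

    Hypothetical helper: xml_text is the content of one result file,
    training_topic_ids is any iterable of known topic identifiers.
    """
    # Compare identifiers as strings, since XML text content is a string.
    known = {str(t) for t in training_topic_ids}
    root = ET.fromstring(xml_text)
    problems = {}
    # In the documented structure, document and topic tags carry no prefix.
    for doc in root.iter("document"):
        ids = [(t.text or "").strip() for t in doc.findall("topic")]
        bad = [i for i in ids if i not in known]
        if bad:
            problems[doc.get("name")] = bad
    return problems
```

An empty dictionary means every topic identifier in the file was found in the supplied set.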
IMPORTANT: Up to 5 topics per item are allowed.
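A result file of this shape can be produced with a few lines of code. The sketch below uses Python's standard xml.etree.ElementTree module and rejects any item with more than 5 topics. The function name build_result, its parameters, and the choice to drop duplicate topics are assumptions made for illustration; only the tag names, the namespace URI, and the trackId value come from the structure described here.

```python
import xml.etree.ElementTree as ET

ROMIP_NS = "http://www.romip.ru/data/classification"  # namespace URI from the documented structure
MAX_TOPICS = 5  # limit stated in this document


def build_result(system_id, run_id, assignments,
                 track_id="ROMIP-2008-Web-classification"):
    """Build a result tree from {site_name: [topic_id, ...]} (hypothetical helper)."""
    # Make the serializer emit the "romip" prefix instead of a generated one.
    ET.register_namespace("romip", ROMIP_NS)
    root = ET.Element(f"{{{ROMIP_NS}}}taskresult", {"trackId": track_id})

    run = ET.SubElement(root, f"{{{ROMIP_NS}}}run")
    ET.SubElement(run, f"{{{ROMIP_NS}}}systemID").text = system_id
    ET.SubElement(run, f"{{{ROMIP_NS}}}runID").text = run_id

    # result, document and topic are written without a prefix, as documented.
    result = ET.SubElement(root, "result")
    for site, topics in assignments.items():
        # Assumption: topics form a set, so duplicates are dropped (order kept).
        unique = list(dict.fromkeys(topics))
        if len(unique) > MAX_TOPICS:
            raise ValueError(
                f"{site}: {len(unique)} topics, at most {MAX_TOPICS} are allowed"
            )
        doc = ET.SubElement(result, "document", {"name": site})
        for topic_id in unique:
            ET.SubElement(doc, "topic").text = str(topic_id)
    return ET.ElementTree(root)
```

For example, build_result("Yellow", "MyRunId", {"www.frisbee.ru": [240]}) returns a tree that can be saved with its write method, passing encoding="utf-8" and xml_declaration=True to include the XML declaration.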