ROMIP: Russian Information Retrieval Evaluation Seminar


Collection of quotes from the news flow with sentiment annotation


This collection contains fragments of news documents. Each fragment is a quote, in either direct or indirect speech, labeled with a polarity score.
The polarity score is one of: positive, so-so, negative, or neutral (no sentiment).

Dataset Parameters
  • Collection size: 2.5 MB
  • Number of quotes: 4,260
  • Encoding: windows-1251
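
Because the file is encoded in windows-1251 rather than UTF-8, the encoding has to be stated explicitly when reading it. The following is a minimal Python sketch of that step only. The sample string and the `read_collection_text` helper (including its `path` argument) are hypothetical and are not part of the collection or its documentation.

```python
# Illustrative Cyrillic text standing in for a quote; it is NOT taken from the collection.
sample = "Привет, мир"

# windows-1251 is a single-byte encoding, so each character becomes exactly one byte.
raw = sample.encode("windows-1251")

# Decoding with the same codec restores the original text.
decoded = raw.decode("windows-1251")


def read_collection_text(path):
    """Read a windows-1251 encoded file into a str (hypothetical helper)."""
    with open(path, encoding="windows-1251") as f:
        return f.read()
```

Opening the file without an explicit encoding would fall back to the platform default, which on many systems is UTF-8 and would fail or garble the Cyrillic text.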

Rights to Use

To access the collection, you must sign the usage agreement.

Data Format

The collection is distributed as an XML file with a fixed format.