StoryDB: Broad Multi-language Narrative Dataset [article]

Alexey Tikhonov and Igor Samenko and Ivan P. Yamshchikov
2021 arXiv pre-print
This paper presents StoryDB, a broad multi-language dataset of narratives. StoryDB is a corpus of texts that includes stories in 42 different languages. Every language includes 500+ stories. ... The corpus shows rich topical and language variation and can serve as a resource for the study of the role of narrative in natural language processing across various languages, including low-resource ones ... StoryDB is the first dataset of narratives that we know of that contains narrative descriptions in various natural languages. This paper presents StoryDB, a broad multi-language dataset of narratives. ...
arXiv:2109.14396v1 fatcat:xvwsqc2jebfrjkwssftm63zrkm

StoryDB: Broad Multi-language Narrative Dataset

Alexey Tikhonov, Igor Samenko, Ivan Yamshchikov
2021 Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems unpublished
This paper presents StoryDB, a broad multi-language dataset of narratives. StoryDB is a corpus of texts that includes stories in 42 different languages. Every language includes 500+ stories. ... The corpus shows rich topical and language variation and can serve as a resource for the study of the role of narrative in natural language processing across various languages, including low-resource ones ... Conclusion: This paper presents StoryDB, a broad multi-language dataset of narratives. ...
doi:10.18653/v1/2021.eval4nlp-1.4 fatcat:gsd6zouyerachfc32wfzox5vwa