Filters








4 Hits in 2.5 sec

D2.1 Libraries and tools for multimodal content analysis

Doukhan; David, Danny Francis, Benoit Huet, Sami Keronen, Mikko Kurimo, Jorma Laaksonen, Tiina Lindh-Knuutila, Bernard Merialdo, Mats Sjöberg, Umut Sulubacak, Jörg Tiedemann, Kim Viljanen
2018 Zenodo  
Finally, five scientific publications are appended to the report, which describe the technological advances related to these components that has been made so far in the project.  ...  The description of the components is divided into the visual and auditory domain, and these are further subdivided into differ- ent themes.  ...  In addition, this work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No 780069.  ... 
doi:10.5281/zenodo.3697989 fatcat:bde5x3yggzb2jk2fh2mu6t5wxy

D2.3 Software and demonstration of human-like content description generation

Doukhan, Guo, Harrando, Kurimo, Laaksonen, Lindgren, Lindh-Knuutila, Lisena, Pehlivan Tort, Reboud, Rouhe, Troncy (+1 others)
2020 Zenodo  
We describe a small-scale dataset used in the experiments and demonstrate and evaluate various multimodal combinations of the unimodal inputs with that data.  ...  This deliverable describes the last development iteration of the joint collection of libraries and tools for multimodal content analysis and description from AALTO, EURECOM, INA, Lingsoft, LLS and Limecraft  ...  [ 3 ] 3 Jorma Laaksonen and Zixin Guo. PicSOM experiments in TRECVID 2020. In Proceedings of the TRECVID 2020 Workshop, Gaithersburg, MD, USA, December 2020.[4] T. J. Park, K. J. Han, M.  ... 
doi:10.5281/zenodo.4964391 fatcat:ertkzz2wbjajjlavw4iljlbmaq

TRECVID 2019: An Evaluation Campaign to Benchmark Video Activity Detection, Video Captioning and Matching, and Video Search Retrieval [article]

George Awad, Asad A. Butt, Keith Curtis, Yooyoung Lee, Jonathan Fiscus, Afzal Godil, Andrew Delgado, Jesse Zhang, Eliot Godard, Lukas Diduch, Alan F. Smeaton, Yvette Graham (+2 others)
2020 arXiv   pre-print
In addition, many organizations and individuals worldwide contribute significant time and effort. TRECVID 2019 represented a continuation of four tasks from TRECVID 2018.  ...  The TREC Video Retrieval Evaluation (TRECVID) 2019 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in research and development of content-based exploitation  ...  We would like to thank Tim Finin and Lushan Han of University of Maryland, Baltimore County for providing access to the semantic similarity metric.  ... 
arXiv:2009.09984v1 fatcat:pmjrd4eyqbgx3gtrvp6l7pa6cq

D2.2 Implementations of methods adapted to enhanced human inputs

Doukhan, Francis, Harrando, Huet, Kaseva, Kurimo, Laaksonen, Lindh-Knuutila, Lisena, Pehlivan Tort, Reboud, Rouhe (+2 others)
2020 Zenodo  
This deliverable describes the second development iteration of the joint collection of libraries and tools for multimodal content analysis from AALTO, EURECOM, INA, Lingsoft, LLS and Limecraft.  ...  in this report.  ...  OCIS [14] is the recently introduced large-scale object categories in indoor scenes dataset. It comprises of 15,324 images spanning more than 1300 commonly encountered indoor object categories.  ... 
doi:10.5281/zenodo.4964298 fatcat:6bbqa7q3xrctnm6nrf5fxh7f3q