Filters








45 Hits in 6.2 sec

Music Demixing Challenge 2021 [article]

Yuki Mitsufuji, Giorgio Fabbro, Stefan Uhlich, Fabian-Robert Stöter
<span title="2021-09-17">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In this paper, we provide the details of the datasets, baselines, evaluation metrics, evaluation results, and technical challenges for future competitions.  ...  Evaluation campaigns such as MIREX or SiSEC connected state-of-the-art models and corresponding papers, which can help researchers integrate the best practices into their models.  ...  To keep scientific music separation research relevant and sustainable, we want to address some of the limitations of current evaluation frameworks by using: • a fully automatic evaluation system that makes  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.13559v2">arXiv:2108.13559v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vdnraihchjds3n4kuni7gjsd6y">fatcat:vdnraihchjds3n4kuni7gjsd6y</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210922120802/https://arxiv.org/pdf/2108.13559v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ea/7a/ea7adc5870d1635c6a15aca2d483473008ddff7a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2108.13559v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

TRECVID 2020: A comprehensive campaign for evaluating video retrieval tasks across multiple application domains [article]

George Awad, Asad A. Butt, Keith Curtis, Jonathan Fiscus, Afzal Godil, Yooyoung Lee, Andrew Delgado, Jesse Zhang, Eliot Godard, Baptiste Chocot, Lukas Diduch, Jeffrey Liu (+5 others)
<span title="2021-04-27">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
The TREC Video Retrieval Evaluation (TRECVID) is a TREC-style video analysis and retrieval evaluation with the goal of promoting progress in research and development of content-based exploitation and retrieval  ...  TRECVID 2020 represented a continuation of four tasks and the addition of two new tasks.  ...  The Video-to-Text work has been partially supported by Science Foundation Ireland (SFI) as a part of the Insight Centre at Dublin City University (12/RC/2289) and grant number 13/RC/2106 (ADAPT Centre  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2104.13473v1">arXiv:2104.13473v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qvxdecwdobfvhgxsgomou6yruu">fatcat:qvxdecwdobfvhgxsgomou6yruu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210430024245/https://arxiv.org/pdf/2104.13473v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/89/72/8972a7208e9ed976c5d99da85c639790c158935b.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2104.13473v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Table of Contents [EDICS]

<span title="">2020</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rut5unc4enborm7fhpkwgeza7m" style="color: black;">IEEE/ACM Transactions on Audio Speech and Language Processing</a> </i> &nbsp;
A. P. Habets 1915 Music Information Retrieval and Music Language Processing Automatic Leaderboard: Evaluation of Singing Quality Without a Standard Reference . . . . C. Gupta, H. Li, and Y.  ...  Lu 225 Quality and Intelligibility Measures Automatic Evaluation of Song Intelligibility Using Singing Adapted STOI and Vocal-Specific Features . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/taslp.2020.3046150">doi:10.1109/taslp.2020.3046150</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/easrxuwl6zdppejsrf4bskxfw4">fatcat:easrxuwl6zdppejsrf4bskxfw4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210429142703/https://ieeexplore.ieee.org/ielx7/6570655/8938144/09311740.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/78/29/782912d11abd247e918f03fcf4fc9fb2e1516942.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/taslp.2020.3046150"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Third DIHARD Challenge Evaluation Plan

Neville Ryant, Kenneth Church, Christopher Cieri, Jun Du, Sriram Ganapathy, Mark Liberman
<span title="2020-06-04">2020</span> <i title="Zenodo"> Zenodo </i> &nbsp;
Evaluation plan for the the third DIHARD challenge.  ...  system outputs (i.e., those displayed on the leaderboards at the end of the evaluation) on Zenodo.  ...  segmentation, or transcription) prior to the end of the evaluation is disallowed. • Participants are allowed to use any automatically derived information (e.g., automatic identification of the domain)  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3877532">doi:10.5281/zenodo.3877532</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6r6wbl73c5eohdqssjuu7poeqq">fatcat:6r6wbl73c5eohdqssjuu7poeqq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200605104955/https://zenodo.org/record/3877533/files/third_dihard_eval_plan_v1.0.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f2/ad/f2ad3aaa3d1e304be20d76830474351afbc9ff32.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3877532"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Third DIHARD Challenge Evaluation Plan [article]

Neville Ryant, Kenneth Church, Christopher Cieri, Jun Du, Sriram Ganapathy, Mark Liberman
<span title="2020-12-02">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
The challenge comprises two tracks evaluating diarization performance when starting from a reference speech segmentation (track 1) and diarization from raw audio scratch (track 2).  ...  This paper introduces the third DIHARD challenge, the third in a series of speaker diarization challenges intended to improve the robustness of diarization systems to variation in recording equipment,  ...  system outputs (i.e., those displayed on the leaderboards at the end of the evaluation) on Zenodo.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2006.05815v3">arXiv:2006.05815v3</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/35hjhvtflza43ceybyufsoubim">fatcat:35hjhvtflza43ceybyufsoubim</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201206035910/https://arxiv.org/pdf/2006.05815v3.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/57/72/5772685fce6706d5d96a6afda0ca567761c49394.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2006.05815v3" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Second DIHARD Challenge Evaluation Plan

Neville Ryant, Kenneth Church, Christopher Cieri, Alejandrina Cristia, Jun Du, Sriram Ganapathy, Mark Liberman
<span title="2019-06-18">2019</span> <i title="Zenodo"> Zenodo </i> &nbsp;
Evaluation plan for the the second DIHARD challenge.  ...  disallowed. • Participants are allowed to use any automatically derived information (e.g., automatic identification of the domain) for the development and evaluation files. • During the evaluation period  ...  SAD conditions Because system performance is strongly influenced by the quality of the speech segmentation used, two different SAD conditions are covered: • Reference SAD -In the reference SAD condition  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3872390">doi:10.5281/zenodo.3872390</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/mnw6ziv7lnejbk6efix6hsxxxa">fatcat:mnw6ziv7lnejbk6efix6hsxxxa</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200603105945/https://zenodo.org/record/3872390/files/second_dihard_eval_plan_v1.2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/12/f4/12f4eb06810de9e7c9a6808694b61270bb21c299.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3872390"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge

Ivan Himawan, Md Hafizur Rahman, Sridha Sridharan, Clinton Fookes, Ahilan Kanagasundaram
<span title="">2018</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/gfqnwwky7rg6bgki4hedb3iimu" style="color: black;">2018 IEEE Spoken Language Technology Workshop (SLT)</a> </i> &nbsp;
by a number of tied triphone states referred as senones [27] .  ...  The DER is computed as, DER = E F A + E miss + E spk (3) where E F A refers to false alarm speech which is the amount of time incorrectly detected as speech divided by the total reference speaker time,  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/slt.2018.8639630">doi:10.1109/slt.2018.8639630</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/slt/HimawanRSFK18.html">dblp:conf/slt/HimawanRSFK18</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5hecj7uukrdirfctzygfmbthbu">fatcat:5hecj7uukrdirfctzygfmbthbu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200505091120/https://eprints.qut.edu.au/123248/1/himawan_1145.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4e/51/4e51ef71f90b50b94322c167884c066a9f83d4d8.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/slt.2018.8639630"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Is Virtual Citizen Science A Game?

Elena Simperl, Neal Reeves, Chris Phethean, Todd Lynes, Ramine Tinati
<span title="2018-06-27">2018</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/5bmfyjg2jjdqppijzcnsge5kqa" style="color: black;">ACM Transactions on Social Computing</a> </i> &nbsp;
Our findings suggest that projects use a range of game elements with points and leaderboards the most popular, particularly in projects that describe themselves as 'games'.  ...  Investigating this phenomenon further, we then present the results of a series of interviews carried out with the EyeWire citizen science project team to understand more about how gamification elements  ...  in the quality of submissions without reducing engagement [Mekler et al. 2013 ].  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3209960">doi:10.1145/3209960</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/squjzu4qsjgkzkuv22otzk4qeq">fatcat:squjzu4qsjgkzkuv22otzk4qeq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190427200357/https://eprints.soton.ac.uk/419313/1/Is_VCS_a_game.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d5/4d/d54dfff6912a1fb08d1b9d3da2d14db5c7be239e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/3209960"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Beyond Language: Learning Commonsense from Images for Reasoning [article]

Wanqing Cui, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng
<span title="2020-10-10">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
This paper proposes a novel approach to learn commonsense from images, instead of limited raw texts or costly constructed knowledge bases, for the commonsense reasoning problem in NLP.  ...  Our approach, namely Loire, consists of two stages.  ...  The task is about resolving a pronoun (represented as a blank line) to one of its two probable co-referents in the sentence.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2010.05001v1">arXiv:2010.05001v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/7mmd4azoxnebrnm7umhiwibsji">fatcat:7mmd4azoxnebrnm7umhiwibsji</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20201014215236/https://arxiv.org/pdf/2010.05001v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2010.05001v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Do Answers to Boolean Questions Need Explanations? Yes [article]

Sara Rosenthal, Mihaela Bornea, Avirup Sil, Radu Florian, Scott McCarley
<span title="2021-12-14">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
However, a one word response is not sufficient for an explainable system. We promote explainability by releasing a new set of annotations marking the evidence in existing TyDi QA and BoolQ datasets.  ...  We also provide further insight into the challenges of answering boolean questions, such as passages containing conflicting YES and NO answers, and varying degrees of relevance of the predicted evidence  ...  Baseline (BASE): A standard MRC system using out-of-the-box training data consisting of both boolean and short answer questions.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2112.07772v1">arXiv:2112.07772v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qqe4sn7pfvhbbi3fkebgyv63um">fatcat:qqe4sn7pfvhbbi3fkebgyv63um</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211217031112/https://arxiv.org/pdf/2112.07772v1.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/f0/9a/f09a3d429c31ab48e7b69f1b4154f26e029fef55.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2112.07772v1" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Document Expansion by Query Prediction [article]

Rodrigo Nogueira, Wei Yang, Jimmy Lin, Kyunghyun Cho
<span title="2019-09-25">2019</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In a latency-critical regime, retrieval results alone (without re-ranking) approach the effectiveness of more computationally expensive neural re-rankers but are much faster.  ...  One technique to improve the retrieval effectiveness of a search engine is to expand documents with terms that are related or representative of the documents' content.From the perspective of a question  ...  Exclud- without a re-ranker (BM25 + Doc2query) adds a ing stop words, which corresponds to 51% of the  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1904.08375v2">arXiv:1904.08375v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/b7o7tq6t5rhq5gxnuyrv5wwdki">fatcat:b7o7tq6t5rhq5gxnuyrv5wwdki</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200925050603/https://arxiv.org/pdf/1904.08375v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/79/3f/793fa0022bcf741f5e39673dae78de6534884739.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1904.08375v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Generation-Augmented Retrieval for Open-domain Question Answering [article]

Yuning Mao, Pengcheng He, Xiaodong Liu, Yelong Shen, Jianfeng Gao, Jiawei Han, Weizhu Chen
<span title="2021-08-06">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
We propose Generation-Augmented Retrieval (GAR) for answering open-domain questions, which augments a query through text generation of heuristically discovered relevant contexts without external resources  ...  We show that generating diverse contexts for a query is beneficial as fusing their results consistently yields better retrieval accuracy.  ...  One can take the relevant sentences in the ground-truth passages (if any) or those in the positive passages of a retriever as the reference, depending on the trade-off between reference quality and diversity  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2009.08553v4">arXiv:2009.08553v4</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/3bcgxkx66fbotloyqp6rugbnze">fatcat:3bcgxkx66fbotloyqp6rugbnze</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210812054432/https://arxiv.org/pdf/2009.08553v4.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/a6/4f/a64fbcf815a4f7e4d8b2d97be86451dca0a10d2f.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2009.08553v4" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

ASOD60K: An Audio-Induced Salient Object Detection Dataset for Panoramic Videos [article]

Yi Zhang
<span title="2021-11-12">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
diversity and quality.  ...  With this goal in mind, we propose PV-SOD, a new task that aims to segment salient objects from panoramic videos.  ...  comprehensive study on 11 representative models, which serves as the first standard leaderboard.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2107.11629v4">arXiv:2107.11629v4</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5gndyferyffc3plnssjatc42pq">fatcat:5gndyferyffc3plnssjatc42pq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211122184314/https://arxiv.org/pdf/2107.11629v4.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/90/d4/90d4c4ff91a5e5f5483b1ab87514f790ce9a8aeb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2107.11629v4" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

D2.1 Libraries and tools for multimodal content analysis

Doukhan; David, Danny Francis, Benoit Huet, Sami Keronen, Mikko Kurimo, Jorma Laaksonen, Tiina Lindh-Knuutila, Bernard Merialdo, Mats Sjöberg, Umut Sulubacak, Jörg Tiedemann, Kim Viljanen
<span title="2018-12-31">2018</span> <i title="Zenodo"> Zenodo </i> &nbsp;
As part of this deliverable, the open source components have been gathered into a joint software collection of tools and libraries publicly available on GitHub.  ...  This deliverable describes a joint collection of libraries and tools for multimodal content analysis created by the MeMAD project partners.  ...  Speech recognition Lingsoft will provide the consortium with transcripts of test material from Yle data set for gold standard evaluation of automatic speech recognition (ASR) and diarisation both in Finnish  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3697989">doi:10.5281/zenodo.3697989</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bde5x3yggzb2jk2fh2mu6t5wxy">fatcat:bde5x3yggzb2jk2fh2mu6t5wxy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200306112439/https://zenodo.org/record/3697989/files/D2.1-Libraries_and_tools_for_multimodal_content_analysis.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/4f/b5/4fb565b251e4bfb5e588691b03ab33db07f72d8e.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.3697989"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> zenodo.org </button> </a>

MultiBench: Multiscale Benchmarks for Multimodal Representation Learning [article]

Paul Pu Liang, Yiwei Lyu, Xiang Fan, Zetian Wu, Yun Cheng, Jason Wu, Leslie Chen, Peter Wu, Michelle A. Lee, Yuke Zhu, Ruslan Salakhutdinov, Louis-Philippe Morency
<span title="2021-11-10">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
To accompany this benchmark, we also provide a standardized implementation of 20 core approaches in multimodal learning.  ...  MultiBench, our standardized code, and leaderboards are publicly available, will be regularly updated, and welcomes inputs from the community.  ...  PPL is supported by a Facebook PhD Fellowship and a Center for Machine Learning and Health Fellowship. RS is supported in part by NSF IIS1763562 and ONR Grant N000141812861.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2107.07502v2">arXiv:2107.07502v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ls47dr7lpfhkbfry4r6dtqjtua">fatcat:ls47dr7lpfhkbfry4r6dtqjtua</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211123030629/https://arxiv.org/pdf/2107.07502v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/af/86/af86df6a0af3226a1b4b5eb27c17c9e45367f896.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2107.07502v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 45 results