4,898 Hits in 6.4 sec

Automatic music video summarization based on audio-visual-text analysis and alignment

Changsheng Xu, Xi Shao, Namunu C. Maddage, Mohan S. Kankanhalli
<span title="">2005</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;05</a> </i> &nbsp;
In this paper, we propose a novel approach for automatic music video summarization based on audio-visual-text analysis and alignment. The music video is separated into the music and video tracks.  ...  The music video summary is generated based on the alignment of boundaries of the detected chorus, shot class and the most repeated lyrics from the music video.  ...  Music-Visual-Text Alignment The purpose of music-visual-text alignment is to synchronize the most salient parts detected from the music track and visual track so as to make the final music video summary  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1076034.1076097">doi:10.1145/1076034.1076097</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/XuSMK05.html">dblp:conf/sigir/XuSMK05</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/cztem66jfvci7mog35q3drxiwu">fatcat:cztem66jfvci7mog35q3drxiwu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809143438/http://www.comp.nus.edu.sg/~mohan/papers/sigir2005.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1076034.1076097">acm.org</a>

Music videos miner

Lalitha Agnihotri, Nevenka Dimitrova, John Kender, John Zimmerman
<span title="">2003</span> <i title="ACM Press"> Proceedings of the eleventh ACM international conference on Multimedia - MULTIMEDIA &#39;03 </i> &nbsp;
Overview of the music summarization system. Music video summarization is based on identification and summarization of individual songs.  ...  The boundary is aligned with the visual color boundaries and the start (or end) of music classification in the audio domain.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/957102.957103">doi:10.1145/957102.957103</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qrccyat6yzef3es5eq2sk2lnfi">fatcat:qrccyat6yzef3es5eq2sk2lnfi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20070418154116/http://www.cs.cmu.edu/~johnz/pubs/2003_MM_MV.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/957102.957103">acm.org</a>

Music videos miner

Lalitha Agnihotri, Nevenka Dimitrova, John Kender, John Zimmerman
<span title="">2003</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/lahlxihmo5fhzpexw7rundu24u" style="color: black;">Proceedings of the eleventh ACM international conference on Multimedia - MULTIMEDIA &#39;03</a> </i> &nbsp;
Overview of the music summarization system. Music video summarization is based on identification and summarization of individual songs.  ...  The boundary is aligned with the visual color boundaries and the start (or end) of music classification in the audio domain.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/957013.957103">doi:10.1145/957013.957103</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/mm/AgnihotriDKZ03.html">dblp:conf/mm/AgnihotriDKZ03</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/u5iuffz7hrcr7fsem3hcljxlqy">fatcat:u5iuffz7hrcr7fsem3hcljxlqy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20070418154116/http://www.cs.cmu.edu/~johnz/pubs/2003_MM_MV.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/957013.957103">acm.org</a>

Sports Video Analysis: Semantics Extraction, Editorial Content Creation and Adaptation

Changsheng Xu, Jian Cheng, Yi Zhang, Yifan Zhang, Hanqing Lu
<span title="2009-04-01">2009</span> <i title="Academy Publisher"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p7gn6vnxj5hqpb5gvaqprypf4u" style="color: black;">Journal of Multimedia</a> </i> &nbsp;
In this paper, we summarize our research achievement on semantics extraction and automatic editorial content creation and adaptation in sports video analysis.  ...  We also discuss emerging applications on editorial content creation and content enhancement/adaptation in sports video analysis, including event detection, sports MTV generation, automatic broadcast video  ...  The system contained four live modules: live text/video capturing, live text analysis, live video analysis, and live text/video alignment.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.4304/jmm.4.2.69-79">doi:10.4304/jmm.4.2.69-79</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/xytusontr5cyxlxpyqgljnhkqu">fatcat:xytusontr5cyxlxpyqgljnhkqu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170808234928/http://nlpr-web.ia.ac.cn/2009papers/gjkw/gk34.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.4304/jmm.4.2.69-79">Publisher / doi.org</a>

Automatic generation of personalized music sports video

Jinjun Wang, Changsheng Xu, Engsiong Chng, Lingyu Duan, Kongwah Wan, Qi Tian
<span title="">2005</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/lahlxihmo5fhzpexw7rundu24u" style="color: black;">Proceedings of the 13th annual ACM international conference on Multimedia - MULTIMEDIA &#39;05</a> </i> &nbsp;
For the first challenge, we propose to use multi-modal (audio, video and text) feature analysis and alignment to detect the semantic of events in sports video.  ...  For the second challenge, we propose video-centric and music-centric music video composition schemes to automatically generate personalized music sports video based on user's preference.  ...  We propose to utilize multimodal (audio, visual and text) feature analysis and alignment to select semantic video content and use video and music semantic analysis for MSV generation.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1101149.1101309">doi:10.1145/1101149.1101309</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/mm/WangXSDWT05.html">dblp:conf/mm/WangXSDWT05</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/gzsimq5rozhmxojmxabknxrj2u">fatcat:gzsimq5rozhmxojmxabknxrj2u</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190217034329/https://static.aminer.org/pdf/PDF/000/329/584/generation_of_personalized_abstract_of_sports_video.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/1101149.1101309">acm.org</a>

A Picture is Worth a Thousand Songs

Alexander Schindler
<span title="">2014</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/kw2apmx5ynfyjf6jhs5gzrrx6e" style="color: black;">Proceedings of the 1st International Workshop on Digital Libraries for Musicology - DLfM &#39;14</a> </i> &nbsp;
Traditionally queries are either based on text input or seed songs. Both are in many cases inadequate or require extensive interaction or knowledge from the user.  ...  Modeling music similarities based on such criteria is in many cases problematic or visionary.  ...  As future work we plan large scale analysis of audio-visual correlations, by applying content based affect recognition methods from the image retrieval domain to album art images and music videos and comparing  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2660168.2660185">doi:10.1145/2660168.2660185</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/jcdl/Schindler14.html">dblp:conf/jcdl/Schindler14</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/5hoblranurhy5hhku6klcq3ksy">fatcat:5hoblranurhy5hhku6klcq3ksy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20160412180332/http://www.ifs.tuwien.ac.at:80/%7Eschindler/pubs/DLFM2014.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/2660168.2660185">acm.org</a>

Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention

Georgios Evangelopoulos, Athanasia Zlatintsi, Alexandros Potamianos, Petros Maragos, Konstantinos Rapantzikos, Georgios Skoumas, Yannis Avrithis
<span title="">2013</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/sbzicoknnzc3tjljn7ifvwpooi" style="color: black;">IEEE transactions on multimedia</a> </i> &nbsp;
Detection of attention-invoking audiovisual segments is formulated in this work on the basis of saliency models for the audio, visual, and textual information conveyed in a video stream.  ...  The produced summaries, based on low-level features and content-independent fusion and selection, are of subjectively high aesthetic and informative quality.  ...  Malandrakis and I. Rodomagoulakis for the additional movie annotations, T.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tmm.2013.2267205">doi:10.1109/tmm.2013.2267205</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/jjt7xmjh5narlm5wr2strvrqza">fatcat:jjt7xmjh5narlm5wr2strvrqza</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809111355/http://cvsp.cs.ntua.gr/publications/jpubl+bchap/EZPMRSA_MultimodalSaliencyFusionMovieSumAVTattention_ieeetMM13.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tmm.2013.2267205">ieee.com</a>

COGNIMUSE: a multimodal video database annotated with saliency, events, semantics and emotion with application to summarization

Athanasia Zlatintsi, Petros Koutras, Georgios Evangelopoulos, Nikolaos Malandrakis, Niki Efthymiou, Katerina Pastra, Alexandros Potamianos, Petros Maragos
<span title="2017-08-07">2017</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/xas42nwk7zhbnkycpwsnwjm3te" style="color: black;">EURASIP Journal on Image and Video Processing</a> </i> &nbsp;
In order to enable comparisons with other computational models, we propose state-of-the-art algorithms, specifically a unified energy-based audio-visual framework and a method for text saliency computation  ...  The purpose of this database is manifold; it can be used for training and evaluation of event detection and summarization algorithms, for classification and recognition of audio-visual and cross-media  ...  Malandrakis and A. Potamianos was performed while they were  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/s13640-017-0194-1">doi:10.1186/s13640-017-0194-1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/afaddslsknhjrktxqnlmgy4mgq">fatcat:afaddslsknhjrktxqnlmgy4mgq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200310025649/https://dspace.mit.edu/bitstream/handle/1721.1/113872/13640_2017_Article_194.pdf;jsessionid=58764F15FA1ED401C528D2B1C456E60E?sequence=1" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1186/s13640-017-0194-1">springer.com</a>

Video event detection and summarization using audio, visual and text saliency

G. Evangelopoulos, A. Zlatintsi, G. Skoumas, K. Rapantzikos, A. Potamianos, P. Maragos, Y. Avrithis
<span title="">2009</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/rc5jnc4ldvhs3dswicq5wk3vsq" style="color: black;">2009 IEEE International Conference on Acoustics, Speech and Signal Processing</a> </i> &nbsp;
Detection of perceptually important video events is formulated here on the basis of saliency models for the audio, visual and textual information conveyed in a video stream.  ...  The algorithm performs favorably for video summarization in terms of informativeness and enjoyability.  ...  AUDIO ANALYSIS The analysis and saliency-modeling of the audio stream is based on strong modulation structures of the signal waveform, using the AM-FM model for audio signals (speech, music, environmental  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2009.4960393">doi:10.1109/icassp.2009.4960393</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/icassp/EvangelopoulosZSRPMA09.html">dblp:conf/icassp/EvangelopoulosZSRPMA09</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hvqgfwvuc5d3pndvvsjbctaknq">fatcat:hvqgfwvuc5d3pndvvsjbctaknq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170921225011/http://www.image.ece.ntua.gr/papers/577.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/icassp.2009.4960393">ieee.com</a>

Reflections on the Development of the Musical Gestures Toolbox for Python

Bálint Laczkó, Alexander Refsum Jensenius
<span title="2021-11-24">2021</span> <i title="Zenodo"> Zenodo </i> &nbsp;
The toolbox also includes basic computer vision methods, and it is designed to integrate well with audio analysis toolboxes.  ...  The toolbox includes video visualization techniques such as creating motion videos, motion history images, and motiongrams.  ...  Acknowledgments Thanks to Frida Furmyr and Marcus Widmer, who developed the first version of MGT for Python, and Bo Zhou, who co-developed MGT for Matlab, which the Python toolbox builds on.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.5724392">doi:10.5281/zenodo.5724392</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bgbvf3zrwrerpnx4x5i55hqop4">fatcat:bgbvf3zrwrerpnx4x5i55hqop4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220208023149/https://zenodo.org/record/5724393/files/Nordic_SMC_2021_paper_38.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5281/zenodo.5724392">zenodo.org</a>

Creating music videos using automatic media analysis

Jonathan Foote, Matthew Cooper, Andreas Girgensohn
<span title="">2002</span> <i title="ACM Press"> Proceedings of the tenth ACM international conference on Multimedia - MULTIMEDIA &#39;02 </i> &nbsp;
Significant audio changes are automatically detected; similarly, the source video is automatically segmented and analyzed for suitability based on camera motion and exposure.  ...  We present methods for automatic and semi-automatic creation of music videos, given an arbitrary audio soundtrack and source video.  ...  AUDIO AND VIDEO ANALYSIS The video clips are then automatically edited by discarding unsuitable portions so that the remaining video is aligned with the audio changes.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/641118.641119">doi:10.1145/641118.641119</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zlvr6ix7wrcbxmtyrqjxgwi5d4">fatcat:zlvr6ix7wrcbxmtyrqjxgwi5d4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20031029130634/http://www.fxpal.com:80/publications/FXPAL-PR-02-175.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/641118.641119">acm.org</a>

Creating music videos using automatic media analysis

Jonathan Foote, Matthew Cooper, Andreas Girgensohn
<span title="">2002</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/lahlxihmo5fhzpexw7rundu24u" style="color: black;">Proceedings of the tenth ACM international conference on Multimedia - MULTIMEDIA &#39;02</a> </i> &nbsp;
Significant audio changes are automatically detected; similarly, the source video is automatically segmented and analyzed for suitability based on camera motion and exposure.  ...  We present methods for automatic and semi-automatic creation of music videos, given an arbitrary audio soundtrack and source video.  ...  AUDIO AND VIDEO ANALYSIS The video clips are then automatically edited by discarding unsuitable portions so that the remaining video is aligned with the audio changes.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/641007.641119">doi:10.1145/641007.641119</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/mm/FooteCG02.html">dblp:conf/mm/FooteCG02</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ymwje4cejffkpfpgbmyuolbgzm">fatcat:ymwje4cejffkpfpgbmyuolbgzm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20031029130634/http://www.fxpal.com:80/publications/FXPAL-PR-02-175.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/641007.641119">acm.org</a>

Generation of Multimedia Artifacts: An Extractive Summarization-based Approach [article]

Paulo Figueiredo and Marta Aparício and David Martins de Matos and Ricardo Ribeiro
<span title="2015-08-13">2015</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
For content selection, we use centrality-based and diversity-based summarization, along with topic analysis.  ...  We use audio and video to present two case studies: generation of film tributes, and lecture-driven science talks.  ...  Other approaches propose the fusion of text, audio, and visual features, with relation to particular topics, for multimedia summarization (Ding et al., 2012).  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1508.03170v1">arXiv:1508.03170v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/stbxcsugz5fhhhwhqluuqtwxvi">fatcat:stbxcsugz5fhhhwhqluuqtwxvi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20191013132504/https://arxiv.org/pdf/1508.03170v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1508.03170v1" title="arxiv.org access">arxiv.org</a>

Audio-Visual Content Analysis in P2P Networks: The SAPIR Approach

Walter Allasia, Fabrizio Falchi, Francesco Gallo, Mouna Kacimi, Aaron Kaplan, Jonathan Mamou, Yosi Mass, Nicola Orio
<span title="">2008</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/mfwemajbqjcy7hji6w3ztzfccu" style="color: black;">2008 19th International Conference on Database and Expert Systems Applications</a> </i> &nbsp;
Content based search in audio-visual collections requires media specific analysis for extracting low level features to be efficiently indexed and searched.  ...  The framework contains splitters of compound objects to simple objects to deal with complex media like videos, using image and speech analyzers.  ...  Introduction Web search for audio-visual content such as images, music, animations, and videos is limited today to associated text and metadata annotations.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/dexa.2008.123">doi:10.1109/dexa.2008.123</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/dexaw/AllasiaFGKKMMO08.html">dblp:conf/dexaw/AllasiaFGKKMMO08</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hjt4y2hvrbdvfciybigxqmby4u">fatcat:hjt4y2hvrbdvfciybigxqmby4u</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170922014414/http://puma.isti.cnr.it/rmydownload.php?filename=cnr.isti/cnr.isti/2008-A2-103/2008-A2-103.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/dexa.2008.123">ieee.com</a>

The XV Symposium New Trends in Audio and Video Technology Wrocław, Poland, September 25 – 27, 2015

<span title="2015-12-01">2015</span> <i title="Walter de Gruyter GmbH"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/tinfbwdufzcfjoorslmtna6qd4" style="color: black;">Archives of Acoustics</a> </i> &nbsp;
The related research usually concentrates on phoneme and word recognition in music sound files and on automatic alignment between lyrics and a melody.  ...  Examining influence of video framerate and audio/video synchronization on audio-visual speech recognition accuracy. Bratoszewski Piotr, bratoszewski@sound.eti.pg.gda.pl, Łopatka Kuba, Czyżewski  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1515/aoa-2015-0062">doi:10.1515/aoa-2015-0062</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ff54lrwy4bhlbbu7inlpuqeho4">fatcat:ff54lrwy4bhlbbu7inlpuqeho4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180720153541/https://content.sciendo.com/downloadpdf/journals/aoa/40/4/article-p621.pdf" title="fulltext PDF download">Web Archive [PDF]</a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1515/aoa-2015-0062">degruyter.com</a>
Showing results 1 &mdash; 15 out of 4,898 results