1,984 Hits in 4.1 sec

A Machine Learning based Framework for Parameter based Multi-Objective Optimisation of Video CODECs

Maryam Al-Barwani, Eran A. Edirisinghe
2017 Advances in Science, Technology and Engineering Systems  
A Multi-objective Optimisation framework based on Genetic Algorithms is thus proposed to optimise the performance of a video codec.  ...  We propose a framework that uses machine learning algorithms to model the performance of a video CODEC based on the significant coding parameters.  ...  Acknowledgment This research is supported by the Ministry of Man Power (MOMP) Muscat, Oman.  ... 
doi:10.25046/aj0203190 fatcat:moaeww5b6ba7tf4zp23vqs2qri

Character index

2011 2011 IEEE International Conference on Multimedia and Expo  
doi:10.1109/icme.2011.6011827 fatcat:wjy7yvkmvbbf3hj4wbyjapx5gu

Auction based optimal subcarrier allocation for H.264 scalable video transmission in 4G OFDMA systems

G Chandra Sekhar, Shreyans Parakh, Aditya K. Jagannatham
2012 2012 Annual IEEE India Conference (INDICON)  
With the aid of these models, we propose a novel auction based framework for revenue maximization of the transmitted video streams in the unicast and multicast 4G scenario.  ...  This yields the optimal OFDMA subcarrier allocation for multi-user scalable video multiplexing.  ...  We use the robust framework of convex optimization to obtain the closed form expression for computation of the optimal coded video parameters, thus leading to codec adaptation.  ... 
doi:10.1109/indcon.2012.6420582 fatcat:hwqepuh225dxvlnrh7r2zj4wga

Video Coding for Machine: Compact Visual Representation Compression for Intelligent Collaborative Analytics [article]

Wenhan Yang, Haofeng Huang, Yueyu Hu, Ling-Yu Duan, Jiaying Liu
2021 arXiv   pre-print
Video Coding for Machines (VCM) is committed to bridging to an extent separate research tracks of video/image compression and feature compression, and attempts to optimize compactness and efficiency jointly  ...  Therefore, we investigate a novel visual information compression for the analytics taxonomy problem to strengthen the capability of compact visual representations extracted from multiple tasks for visual  ...  image/video codecs.  ... 
arXiv:2110.09241v1 fatcat:ju7pxk2fhbf7hnby6po3kq5bqi

Guest Editorial: Special Issue on Multi-Core Enabled Multimedia Applications & Architectures

Yen-Kuang Chen, Lurng-Kuo Liu, Shuvra S. Bhattacharyya
2009 Journal of Signal Processing Systems  
It is critical to understand the complexity of developing a new application or porting an existing application onto a multi-core processor.  ...  "Real-time visual tracker by Stream processing" by Lozano et al. develops a novel GPU-based computer vision system for real-time tracking of objects in video sequences.  ...  In "A Multi-core Architecture based Parallel Framework for H.264/AVC Deblocking Filters," Wang et al. carefully review the deblocking filter algorithm and observe that the results of each deblocking  ... 
doi:10.1007/s11265-008-0331-2 fatcat:gbw7zl7t3zh5jbyk4zzo7gkgqy

A Hybrid Deep Animation Codec for Low-bitrate Video Conferencing [article]

Goluck Konuko and Stéphane Lathuilière and Giuseppe Valenzise
2022 arXiv   pre-print
Specifically, we extend a codec based on facial animation by adding an auxiliary stream consisting of a very low bitrate version of the video, obtained through a conventional video codec (e.g., HEVC).  ...  The animated and auxiliary videos are combined through a novel fusion module.  ...  Conversely, the use of a conventional video codec stream shifts the range of bitrates over which our framework can operate before reaching a saturation point.  ... 
arXiv:2207.13530v1 fatcat:becjsfnizbdrpicc72r4vbej4i

Front Matter: Volume 10396

Andrew G. Tescher
2017 Applications of Digital Image Processing XL  
Publication of record for individual papers is online in the SPIE Digital Library. Paper Numbering: Proceedings of SPIE follow an e-First publication model.  ...  A unique citation identifier (CID) number is assigned to each article at the time of publication.  ...  for the emerging AV1 video codec [10396-15] 10396 0G Novel modes and adaptive block scanning order for intra prediction in AV1 [10396-16] iii Proc. of SPIE Vol. 10396 1039601-3 Display of high  ... 
doi:10.1117/12.2293188 fatcat:4uko25br6rcmzjvi4ti2abf4re

A Coding Framework and Benchmark towards Compressed Video Understanding [article]

Yuan Tian, Guo Lu, Yichao Yan, Guangtao Zhai, Li Chen, Zhiyong Gao
2022 arXiv   pre-print
Our framework also enjoys the best of both two worlds, (1) high efficiency of industrial video codec and (2) flexible coding capability of neural networks (NNs).  ...  The proposed Understanding oriented Video Coding framework UVC consistently demonstrates significantly stronger performances than the baseline industrial codec.  ...  Moreover, we have built a benchmark for this novel problem. Fig. 2 : 2 Fig.2: Dual-PVC Framework includes two bitstreams. The one is the video stream produced by video codecs.  ... 
arXiv:2202.02813v2 fatcat:hf2uew726jaxrfk3ps6ui2dlwm

Self-Conditioned Probabilistic Learning of Video Rescaling [article]

Yuan Tian, Guo Lu, Xiongkuo Min, Zhaohui Che, Guangtao Zhai, Guodong Guo, Zhiyong Gao
2021 arXiv   pre-print
We further extend the framework to a lossy video compression system, in which a gradient estimator for non-differential industrial lossy codecs is proposed for the end-to-end training of the whole system  ...  After optimization, the downscaled video by our framework preserves more meaningful information, which is beneficial for both the upscaling step and the downstream tasks, e.g., video action recognition  ...  Acknowledgement This work was supported by the National Science Foundation of China (61831015, 61527804 and U1908210).  ... 
arXiv:2107.11639v2 fatcat:k7g4ewgwhbatzlyk2z26pub2j4

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis [article]

Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard
2022 arXiv   pre-print
In this paper, we propose a novel audio-visual speech enhancement framework for high-fidelity telecommunications in AR/VR.  ...  Our approach leverages audio-visual speech cues to generate the codes of a neural speech codec, enabling efficient synthesis of clean, realistic speech from noisy signals.  ...  Our main contributions are the following: (1) We propose audio-visual (AV) speech codecs, a novel framework for AV speech enhancement.  ... 
arXiv:2203.17263v1 fatcat:ofntq3unt5hy3bvkawloluyiry

A monolithic programmable Ultra-HD video codec engine

Hetul Sanghvi, Mihir Mody, Niraj Nandan, Mahesh Mehendale, Subrangshu Das, Dipan Kumar Mandal, Vyagrheswarudu Nainala, Vijayavardhan Baireddy, Pavan Shastry
2014 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)  
In this paper, we present a monolithic multi-format video codec engine which achieves Ultra HD performance for H.264 High Profile, reduces the external memory bandwidth requirement by 2X as compared to  ...  its predecessor and takes only 5.9 mm 2 of silicon area in a low power 28nm process.  ...  As described in [6] , this work implements novel approach by defining software based video codec framework (aka VCF) to enable multi-threading the local ARP32 controller in firmware.  ... 
doi:10.1109/icassp.2014.6853827 dblp:conf/icassp/SanghviMNMDMVBS14 fatcat:ljzbwy56p5e2zbxnmr7mm5lcw4

Adaptation and Attention for Neural Video Coding [article]

Nannan Zou, Honglei Zhang, Francesco Cricri, Ramin G. Youvalari, Hamed R. Tavakoli, Jani Lainema, Emre Aksu, Miska Hannuksela, Esa Rahtu
2021 arXiv   pre-print
Neural image coding represents now the state-of-the-art image compression approach. However, a lot of work is still to be done in the video domain.  ...  As one architectural novelty, we propose to train the inter-frame codec model to adapt the motion estimation process based on the resolution of the input video.  ...  After that, the OMPs of the intra-frame decoder are optimized on the first intra frame of each video, and used for all intra frames of that video.  ... 
arXiv:2112.08767v1 fatcat:sa6u33lbxjg7bdf2lnqha6soge

Application-driven cross-layer optimization for mobile multimedia communication using a common application layer quality metric

S. Khan, S. Duhovnikov, E. Steinbach, M. Sgroi, W. Kellerer
2006 Proceeding of the 2006 international conference on Communications and mobile computing - IWCMC '06  
We define a novel optimization scheme based on the Mean Opinion Score (MOS) as the unifying metric.  ...  This paper proposes a cross-layer optimization framework that provides efficient allocation of wireless network resources across multiple types of applications to maximize network capacity and user satisfaction  ...  In Section 3 we give a detailed description of our multi-application cross-layer optimization framework.  ... 
doi:10.1145/1143549.1143593 dblp:conf/iwcmc/KhanDSSK06 fatcat:4cfhedkkczaihoyxy2xasq6z5u

Optimal 4G OFDMA Dynamic Subcarrier and Power Auction-based Allocation towards H.264 Scalable Video Transmission

G. Sekhar, Shreyans Parakh, Aditya Jagannatham
2013 Defence Science Journal  
Further, they also consider a framework for optimal power allocation based on a novel revenue maximization scheme in OFDMA based wireless broadband 4G systems employing auction bidding models.  ...  This yields the optimal OFDMA subcarrier allocation for multi-user scalable video multiplexing.  ...  We use the robust framework of convex optimization to obtain the closed form expression for computation of the optimal coded video parameters, thus leading to codec adaptation.  ... 
doi:10.14429/dsj.63.3759 fatcat:erfpzscsojgz3pagfoxuslmtme

Visual Analysis Motivated Rate-Distortion Model for Image Coding [article]

Zhimeng Huang, Chuanmin Jia, Shanshe Wang, Siwei Ma
2021 arXiv   pre-print
Optimized for pixel fidelity metrics, images compressed by existing image codec are facing systematic challenges when used for visual analysis tasks, especially under low-bitrate coding.  ...  This paper proposes a visual analysis-motivated rate-distortion model for Versatile Video Coding (VVC) intra compression.  ...  In this paper, we propose a novel visual analysismotivated RDO model for VVC intra compression. The framework of the proposed model is shown in Fig. 2 .  ... 
arXiv:2104.10315v1 fatcat:aygp5evp55gchltgoqjpgyynnm
« Previous Showing results 1 — 15 out of 1,984 results