Filters








19 Hits in 3.2 sec

Fast MPEG-CDVS Encoder With GPU-CPU Hybrid Computing

Ling-Yu Duan, Wei Sun, Xinfeng Zhang, Shiqi Wang, Jie Chen, Jianxiong Yin, Simon See, Tiejun Huang, Alex C. Kot, Wen Gao
2018 IEEE Transactions on Image Processing  
In this paper, we revisit the merits of low complexity design of CDVS core techniques and present a very fast CDVS encoder by leveraging the massive parallel execution resources of GPU.  ...  Comprehensive experimental results over benchmarks are evaluated, which has shown that the fast CDVS encoder using GPU-CPU hybrid computing is promising for scalable visual search.  ...  The fast CDVS encoder is implemented based on the latest CDVS reference software TM14.0 using GPU-CPU hybrid computing.  ... 
doi:10.1109/tip.2018.2794203 pmid:29432101 fatcat:2dr3sc66djcwnofwht6be2n7zm

Accelerating Local Feature Extraction Using Two Stage Feature Selection and Partial Gradient Computation [chapter]

Keundong Lee, Seungjae Lee, Weon-Geun Oh
2015 Lecture Notes in Computer Science  
In this paper, we present a fast local feature extraction method, which is our contribution to ongoing MPEG standardization of compact descriptor for visual search (CDVS).  ...  For its efficiency, the proposed method has been integrated into CDVS TM since 107 th MPEG meeting.  ...  Evaluation on MPEG CDVS Framework The proposed method has been integrated into CDVS TM as an fast mode of feature extractor for its efficiency since 107 th meeting.  ... 
doi:10.1007/978-3-319-16634-6_27 fatcat:5xziekzd2jhktlofe5jbhnxsra

Compact Global Descriptors for Visual Search

Vijay Chandrasekhar, Jie Lin, Olivier Morere, Antoine Veillard, Hanlin Goh
2015 2015 Data Compression Conference  
State-of-the-art global descriptors based on Fisher Vectors are represented with tens of thousands of floating point numbers.  ...  Motivated by the remarkable success of deep neural networks in recent literature, we propose a compression scheme based on deeply stacked Restriction Boltzmann Machines (SRBM), which learn lower dimensional  ...  The MPEG-CDVS standard adopted the Scalable Fisher Compressed Vector [6] , which was based on binarization of high-dimensional Fisher Vectors.  ... 
doi:10.1109/dcc.2015.54 dblp:conf/dcc/ChandrasekharLM15 fatcat:mdrrky2pvfd33dvdhc7ih7oage

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing [article]

Zhuo Chen, Weisi Lin, Shiqi Wang, Lingyu Duan, Alex C. Kot
2018 arXiv   pre-print
This strategy enables a good balance among the computational load, transmission load and the generalization ability for cloud servers when deploying the deep neural networks for large scale cloud based  ...  Based on CDVS, MPEG has moved forward to the standardization of Compact Descriptors for Video Analysis (CDVA) [21] since Feb. 2015.  ...  For hand-crafted features, the standards from MPEG including MPEG-CDVS [20] and MPEG-CDVA [21] specify the feature extraction and compression processes.  ... 
arXiv:1809.06196v1 fatcat:qzvvqtyjtvf6zjp2h4xa7r5qbm

Tiny Descriptors for Image Retrieval with Unsupervised Triplet Hashing [article]

Jie Lin, Olivier Morère, Julie Petta, Vijay Chandrasekhar, Antoine Veillard
2015 arXiv   pre-print
A good image descriptor is key to the retrieval pipeline and should reconcile two contradictory requirements: providing recall rates as high as possible and being as compact as possible for fast matching  ...  Then, triplet networks, a rank learning scheme based on weight sharing nets is used to fine-tune the binary embedding functions to retain as much as possible of the useful metric properties of the original  ...  The size of the compressed descriptor in the MPEG-CDVS standard ranges from 256 bytes to several thousand bytes per image, based on the operating points.  ... 
arXiv:1511.03055v1 fatcat:owk7tvr3ibectc6f5u2knokggi

Tiny Descriptors for Image Retrieval with Unsupervised Triplet Hashing

Jie Lin, Olivier Morere, Julie Petta, Vijay Chandrasekhar, Antoine Veillard
2016 2016 Data Compression Conference (DCC)  
Further, it is highly desirable that the global descriptors be binary to enable fast matching through Hamming distances.  ...  For the first step, state-of-the-art schemes are based on comparing global representations of images.  ...  The size of the compressed descriptor in the MPEG-CDVS standard ranges from 256 bytes to several thousand bytes per image, based on the operating points.  ... 
doi:10.1109/dcc.2016.23 dblp:conf/dcc/LinMPCV16 fatcat:qxdp7hxcpbe6ffk3ck5jta45pu

AccMPEG: Optimizing Video Encoding for Video Analytics [article]

Kuntai Du, Qizheng Zhang, Anton Arapin, Haodong Wang, Zhengxu Xia, Junchen Jiang
2022 arXiv   pre-print
This paper presents AccMPEG, a new video encoding and streaming system that meets all the three requirements.  ...  The key is to learn how much the encoding quality at each (16x16) macroblock can influence the server-side DNN accuracy, which we call accuracy gradient.  ...  Fast training of ONLINE ENCODING We now describe AccMPEG's online encoding process, including the architecture of AccM odel and how AccMPEG assigns encoding quality to each macroblock.  ... 
arXiv:2204.12534v1 fatcat:ou73kl7y25agngtrqhjixggrfu

Mobile media communication, processing, and analysis: A review of recent advances

Wen Gao, Ling-Yu Duan, Jun Sun, Junsong Yuan, Yonggang Wen, Yap-Peng Tan, Jianfei Cai, Alex C. Kot
2013 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013)  
To identify the opportunities and challenges in fast growing mobile media computing, we discuss several emerging topics including mobile visual search, retargeting, mobile video streaming, and cloud based  ...  In particular, the ongoing MPEG standardization of compact descriptors for visual search (CDVS) have involved big industry and academia efforts from STMicroelectronics, NEC, Nvidia, Samsung, Nokia, Qualcomm  ...  Hardware acceleration like GPU has been tried out [26] [29] . In [30] , a novel FPGA based architecture was proposed to deal with computational complexity.  ... 
doi:10.1109/iscas.2013.6571985 dblp:conf/iscas/GaoD0YWTCK13 fatcat:mhnjsvlrfvdr3nchgkqhk4i4zi

ACTNET: end-to-end learning of feature activations and multi-stream aggregation for effective instance image retrieval [article]

Syed Sameed Husain, Eng-Jon Ong, Miroslaw Bober
2020 arXiv   pre-print
Recent works on using activation functions to help fast training of CNNs are [30] , [31] , [32] , [33] , [34] .  ...  We overcome the TITAN X GPU memory limitation of 11 GB by processing one triplet at a time and updating the gradients after every 64 triplets.  ...  Miroslaw led the development of ISO MPEG standards for over 20 years, chairing the MPEG-7, CDVS and CVDA groups. He is an inventor of over 80 patents, many deployed in products.  ... 
arXiv:1907.05794v3 fatcat:ukmpjk53wvgq3brok354qqwmim

Feature extraction using MPEG-CDVS and Deep Learning with application to robotic navigation and image classification

Pedro Porto Buarque De Gusmao, Enrico Magli
2017
We first describe a probabilistic approach to loop detection based on the standard's suggested similarity metric.  ...  We then evaluate the performance of CDVS compression modes in terms of matching speed, feature extraction, and storage requirements and compare them with the state of the art SIFT descriptor for five different  ...  This node encapsulates the MPEG-CDVS Test Model's [107] implementation of both CDVS Feature Detector and Extractor. • cdvsMatch: Receives a sequence of MPEG-CDVS bitstreams generated by the cdvsExtract  ... 
doi:10.6092/polito/porto/2665943 fatcat:xe6g2rue5fgphehdetemdhviva

A survey on compact features for visual content analysis

Luca Baroffio, Alessandro E. C. Redondi, Marco Tagliasacchi, Stefano Tubaro
2016 APSIPA Transactions on Signal and Information Processing  
features constitute compact yet effective representations of visual content, and are being exploited in a large number of heterogeneous applications, including augmented reality, image registration, content-based  ...  Besides extraction, a large body of research addressed the problem of ad-hoc feature encoding methods, and a number of networking and transmission protocols enabling distributed visual content analysis  ...  In particular, SIFT is widely regarded as the gold standard in the context of local feature extraction, and has been partially adopted by the MPEG Compact Descriptors for Visual Search (CDVS) [4, 130]  ... 
doi:10.1017/atsip.2016.13 fatcat:lokgfydqrrd6zcvgngcbloopbu

Digital FPGA Circuits Design for Real-Time Video Processing with Reference to Two Application Scenarios

Giorgio Lopez
2015
General purpose CPU or GPU software implementations of these applications are quite simple and widespread, but commonly do not allow high performance because of the high layering that separates high level  ...  The most practised approach nowadays is based on the use of Very-Large-Scale Integrated (VLSI) digital electronic circuits.  ...  MPEG 7: "the bits about the bits" MPEG 7 is a standard for content description of multimedia formats, so it does not deal with the actual encoding of data streams like other MPEG standards (MPEG-2, MPEG  ... 
doi:10.6092/unina/fedoa/10491 fatcat:4bl5re7ci5dyjhbap3ih2bgqzq

Report on Dissemination and Standardisation Activities Y2 [article]

Uwe Riemann
2015
descriptors for visual search (CDVS), which focuses on similarity matching of still images.  ...  on Augmented 360 Degree Video via Gesture-Based Oscillation Compensating Dynamic Adaptive Streaming over HTTP Selecting User Generated Content for Use in Media Productions Interaction A GPU-Accelerated  ... 
doi:10.7800/304icosoled722 fatcat:33zrolhx5rbszp3dpfm5jfmwfe

HPatches: A benchmark and evaluation of handcrafted and learned local descriptors [article]

Vassileios Balntas and Karel Lenc and Andrea Vedaldi and Krystian Mikolajczyk
2017 arXiv   pre-print
We show that a simple normalisation of traditional hand-crafted descriptors can boost their performance to the level of deep learning based descriptors within a realistic benchmarks evaluation.  ...  The CVDS dataset [9] addresses the data diversity issue by extracting patches from five MPEG-CDVS: Graphics, Paintings, Video, Buildings and Common Objects.  ...  on shallow convolutional networks, triplet learning constraints and fast hard negative mining.  ... 
arXiv:1704.05939v1 fatcat:v2hvvhwomvgzzmmj66n3az46uy

HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors

Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk
2017 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)  
We show that a simple normalisation of traditional hand-crafted descriptors can boost their performance to the level of deep learning based descriptors within a realistic benchmarks evaluation.  ...  The CVDS dataset [9] addresses the data diversity issue by extracting patches from five MPEG-CDVS: Graphics, Paintings, Video, Buildings and Common Objects.  ...  Line color encodes dataset and line style a detector.  ... 
doi:10.1109/cvpr.2017.410 dblp:conf/cvpr/BalntasLVM17 fatcat:xj2wict7fjhwjkxubpuoeeplem
« Previous Showing results 1 — 15 out of 19 results