A Survey of Recent View-based 3D Model Retrieval Methods [article]

Qiong Liu
2012 arXiv   pre-print
Then a series of 3D model retrieval methods by using bag-of-visual-words description are surveyed in this paper. At last, we summarize the further research content in view-based 3D model retrieval.  ...  For bag-of-visual-words application in 3D model retrieval, we first briefly review the bag-of-visual-words works on multimedia and computer vision tasks, where the visual dictionary has been detailed introduced  ...  Then the relevance feedback information is employed to capture the semantic class information by using the manifold ranking algorithm.  ... 
arXiv:1208.3670v1 fatcat:liljkz7ilrdqbeua62qmzynx3y

A semi-supervised learning algorithm for relevance feedback and collaborative image retrieval

Daniel Carlos Guimarães Pedronette, Rodrigo T. Calumby, Ricardo da S. Torres
2015 EURASIP Journal on Image and Video Processing  
For large-scale multimedia collections, however, the user efforts required in RF search sessions is considerable.  ...  In this paper, we address this issue by proposing a novel semisupervised approach for implementing RF-based search services.  ...  The Manifold Ranking algorithm [20] was proposed aiming at ranking the objects with respect to the intrinsic data distribution.  ... 
doi:10.1186/s13640-015-0081-6 fatcat:uinx4wos7fab3o2gnwywdlrq7a

An Overview of Cross-media Retrieval: Concepts, Methodologies, Benchmarks and Challenges [article]

Yuxin Peng, Xin Huang, Yunzhen Zhao
2017 arXiv   pre-print
Multimedia retrieval plays an indispensable role in big data utilization. Past efforts mainly focused on single-media retrieval.  ...  Cross-media retrieval is designed for the scenarios where the queries and retrieval results are of different media types.  ...  Introduction WITH the rapid growth of multimedia data such as text, image, video, audio and 3D model, cross-media retrieval is becoming increasingly attractive, through which users can get the results  ... 
arXiv:1704.02223v4 fatcat:z7ez63kodvejpfrodeszdtkccy

A Comprehensive Survey on Cross-modal Retrieval [article]

Kaiye Wang, Qiyue Yin, Wei Wang, Shu Wu, Liang Wang
2016 arXiv   pre-print
In recent years, cross-modal retrieval has drawn much attention due to the rapid growth of multimodal data. It takes one type of data as the query to retrieve relevant data of another type.  ...  For example, a user can use a text to retrieve relevant pictures or videos.  ...  3) rank based methods, and 4) supervised methods.  ... 
arXiv:1607.06215v1 fatcat:jfbmmlvzrvcmtmzezogzuxvvqu

2021 Index IEEE Transactions on Multimedia Vol. 23

2021 IEEE transactions on multimedia  
Liu, S., +, TMM 2021 2188-2198 Content-based retrieval Collaborative Image Relevance Learning for Visual Re-Ranking.  ...  Zheng, Y., +, TMM 2021 3590-3602 Deep Unsupervised Binary Descriptor Learning Through Locality Consistency and Self Distinctiveness.  ...  ., Low-Rank Pairwise Alignment Bilinear Network For Few-Shot Fine-Grained Image Classification; TMM 2021 1666-1680 Huang, H., see 1855-1867 Huang, H., see Jiang, X., TMM 2021 2602-2613 Huang, J.,  ... 
doi:10.1109/tmm.2022.3141947 fatcat:lil2nf3vd5ehbfgtslulu7y3lq

Transductive Multi-View Zero-Shot Learning

Yanwei Fu, Timothy M. Hospedales, Tao Xiang, Shaogang Gong
2015 IEEE Transactions on Pattern Analysis and Machine Intelligence  
To overcome this problem, a novel heterogeneous multi-view hypergraph label propagation method is formulated for zero-shot learning in the transductive embedding space.  ...  It effectively exploits the complementary information offered by different semantic representations and takes advantage of the manifold structures of multiple representation spaces in a coherent manner  ...  We leverage graph-based semi-supervised learning to exploit the manifold structure of the unlabelled data transductively for classification.  ... 
doi:10.1109/tpami.2015.2408354 pmid:26440271 fatcat:eazqbmoc6vholji7ke6yyis5wq

Content-Based Image Retrieval and Feature Extraction: A Comprehensive Review

Afshan Latif, Aqsa Rasheed, Umer Sajid, Jameel Ahmed, Nouman Ali, Naeem Iqbal Ratyal, Bushra Zafar, Saadat Hanif Dar, Muhammad Sajid, Tehmina Khalil
2019 Mathematical Problems in Engineering  
Most of the search engines retrieve images on the basis of traditional text-based approaches that rely on captions and metadata.  ...  In the last two decades, extensive research is reported for content-based image retrieval (CBIR), image classification, and analysis.  ...  Transductive learning image retrieval Hypergraph-based multiexample ranking Yale face dataset 0.65 Wang et al. [68] Retrieval-based face annotation Weak label regularized local coordinate coding  ... 
doi:10.1155/2019/9658350 fatcat:dncplhkm6vcrvfh3q7ifxkrdkq

Transductive Zero-Shot Action Recognition by Word-Vector Embedding [article]

Xun Xu, Timothy Hospedales, Shaogang Gong
2016 arXiv   pre-print
Existing ZSL studies focus primarily on still images, and attribute-based semantic representations.  ...  A distributed hypergraph was adopted to replace the local neighbourhood graph in . 8. Unsupervised Domain Adaptation (UDA).  ...  Overall, due to the nature of a retrieval task which depends on the ranking of testing samples w.r.t. prototypes, the performance of retrieval task is not affected by the two hubness correction methods  ... 
arXiv:1511.04458v2 fatcat:yxfn52pdhjfatedmixz4evtiay

Transductive Zero-Shot Action Recognition by Word-Vector Embedding

Xun Xu, Timothy Hospedales, Shaogang Gong
2017 International Journal of Computer Vision  
A distributed hypergraph was adopted to replace the local neighbourhood graph in . 8. Unsupervised Domain Adaptation (UDA).  ...  Overall, due to the nature of a retrieval task which depends on the ranking of testing samples w.r.t. prototypes, the performance of retrieval task is not affected by the two hubness correction methods  ... 
doi:10.1007/s11263-016-0983-5 fatcat:c6rn4jpg3ff5pbks52ohlcafny

Web mining in soft computing framework: relevance, state of the art and future directions

S.K. Pal, V. Talwar, P. Mitra
2002 IEEE Transactions on Neural Networks  
Index Terms-Artificial neural networks (ANNs), data mining, fuzzy logic (FL), genetic algorithms (GAs), information retrieval (IR), knowledge discovery, pattern recognition, rough sets (RSs), search engines  ...  Other techniques for clustering web data include those using hypergraph-based clustering [58].  ...  A system where RSs have been used for retrieval of multimedia objects is described in [92]. B.  ... 
doi:10.1109/tnn.2002.1031947 pmid:18244512 fatcat:a2ea5nfnczgjlpwsbwe6ebt5hi

Effective Multi-Query Expansions: Collaborative Deep Networks for Robust Landmark Retrieval

Yang Wang, Xuemin Lin, Lin Wu, Wenjie Zhang
2017 IEEE Transactions on Image Processing  
Given a query photo issued by a user (q-user), the landmark retrieval is to return a set of photos with their landmarks similar to those of the query, while the existing studies on the landmark retrieval  ...  Then, motivated by the typical collaborative filtering methods, we propose to learn a collaborative deep networks based semantically, nonlinear and high-level features over the latent factor for landmark  ...  between multi-query set and each photo in the database, leading to the final ranking based retrieval result. query photo is too low.  ... 
doi:10.1109/tip.2017.2655449 pmid:28103558 fatcat:k6wsv2mjszes7kgegyrg4urtsa

Recent Advances in Zero-shot Recognition [article]

Yanwei Fu, Tao Xiang, Yu-Gang Jiang, Xiangyang Xue, Leonid Sigal, and Shaogang Gong
2017 arXiv   pre-print
ACM International Conference on Multimedia Retrieval (ICMR) 2017 published special issue ("multimodal understanding of subjective properties") on the applications of multimedia analysis for subjective  ...  To alleviate this problem, the transductive learning based approaches were proposed, to utilize the manifold information of the instances from unseen classes [115], [120], [121], [122], [123]  ... 
arXiv:1710.04837v1 fatcat:u3mp6dgj2rgqrarjm4dcywegmy

Action Datasets and MHI [chapter]

Md. Atiqur Rahman Ahad
2012 SpringerBriefs in Computer Science  
Wang T, Shum H, Xu Y, Zheng N (2001) Unsupervised analysis of human gestures. IEEE pacific rim conference on multimedia, pp 174-181 376.  ...  Duchenne O, Bach F, Kweon I, Ponce J (2009) A tensor-based algorithm for high-order graph matching. IEEE computer vision and pattern recognition, pp 1980graph and hypergraph matching.  ... 
doi:10.1007/978-1-4471-4730-5_4 fatcat:kvnpmq3zgbeftjrnnscq4gxed4

Survey of Generative Methods for Social Media Analysis [article]

Stan Matwin, Aristides Milios, Paweł Prałat, Amilcar Soares, François Théberge
2021 arXiv   pre-print
In order to produce a low dimensional representation of high dimensional data that preserves relevant structure, the Uniform Manifold Approximation and Projection (UMAP) [169] was used, a novel manifold  ...  A popular example of a rule-based approach for entirely unsupervised sentiment analysis of text is known as VADER [119].  ... 
arXiv:2112.07041v1 fatcat:xgmduwctpbddfo67y6ack5s2um

A Comprehensive Survey of Graph Embedding: Problems, Techniques and Applications [article]

Hongyun Cai, Vincent W. Zheng, Kevin Chen-Chuan Chang
2018 arXiv   pre-print
[88] maps images into a semantic manifold that faithfully grasps users' preferences to facilitate content-based image retrieval.  ...  Multimedia Networks. A multimedia network is a network containing multimedia data, e.g., image, text, etc.  ... 
arXiv:1709.07604v3 fatcat:6w42r4k6rvbodmnuerbqqrxynq