Filters








5,472 Hits in 6.6 sec

Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation [article]

Wanrong Zhu, Xin Eric Wang, Tsu-Jui Fu, An Yan, Pradyumna Narayana, Kazoo Sone, Sugato Basu, William Yang Wang
2021 arXiv   pre-print
One of the most challenging topics in Natural Language Processing (NLP) is visually-grounded language understanding and reasoning.  ...  We first enrich the navigation data by transferring the style of the instructions generated by Google Maps API, then pre-train the navigator with the augmented external outdoor navigation dataset.  ...  The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the sponsor.  ... 
arXiv:2007.00229v3 fatcat:xcl3bzh2kvczla6aumikeqsjjq

Self-supervised Audiovisual Representation Learning for Remote Sensing Data [article]

Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu
2021 arXiv   pre-print
In order to contribute towards the availability of pre-trained backbone networks in remote sensing, we devise a self-supervised approach for pre-training deep neural networks.  ...  In remote sensing, the lack of comparable large annotated datasets and the wide diversity of sensing platforms impedes similar developments.  ...  In the early stage, their relationship was first explored and exploited in audiovisual speech recognition [3] and affect classification [4] , [6] , where the visual and audio modalities are considered  ... 
arXiv:2108.00688v1 fatcat:bhvcwavkibhxfmezayic5yryfe

Positioning yourself in the maze of Neural Text Generation: A Task-Agnostic Survey [article]

Khyathi Raghavi Chandu, Alan W Black
2021 arXiv   pre-print
Thereby, we deliver a one-stop destination for researchers in the field to facilitate a perspective on where to situate their work and how it impacts other closely related generation tasks.  ...  In this context, we present an abstraction of the imperative techniques with respect to learning paradigms, pretraining, modeling approaches, decoding and the key challenges outstanding in the field in  ...  a major surge in interest for pre-training techniques.  ... 
arXiv:2010.07279v2 fatcat:jp76n5vk7zbvnhfexhwa3rludu

Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning [article]

Gustav Larsson
2017 arXiv   pre-print
To address this concern, with the long-term goal of leveraging the abundance of cheap unlabeled data, we explore methods of unsupervised "pre-training."  ...  The success of deep learning in computer vision is rooted in the ability of deep networks to scale up model complexity as demanded by challenging visual tasks.  ...  This makes it easy to visualize and promotes better intuition. The old loss was shift invariant for both student and teacher activations.  ... 
arXiv:1708.05812v1 fatcat:w77w3q3ms5c5fnyzl65mkj4ozy

ECO: Egocentric Cognitive Mapping [article]

Jayant Sharma, Zixing Wang, Alberto Speranzon, Vijay Venkataraman, Hyun Soo Park
2018 arXiv   pre-print
ECO is biologically inspired, by the cognitive map that allows human navigation, and it encodes the surrounding visual semantics with respect to both distance and orientation.  ...  ECO possesses three main properties: (1) reconfigurability: complex semantics and geometry is captured via the synthesis of atomic visual representations (e.g., image patch); (2) robustness: the visual  ...  For instance, you can localize yourself in a new store despite the fact it has a new spatial layout and visual patterns.  ... 
arXiv:1812.00312v1 fatcat:4xmcfbgtc5gqrn2q4jj6w2xbbi

Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text [article]

Felix Hill, Sona Mokra, Nathaniel Wong, Tim Harley
2020 arXiv   pre-print
By applying our method with a state-of-the-art pre-trained text-based language model (BERT), on tasks requiring agents to identify and position everyday objects relative to other objects in a naturalistic  ...  Our approach is a general recipe for training any deep RL-based system to interface with human users, and bridges the gap between two research directions of notable recent success: agent-centric motor  ...  The accuracies for the reference task are presented in Table 3 and for the putting task in Table 4 .  ... 
arXiv:2005.09382v1 fatcat:ftss3t5a3nbvlcmtiemgbqxa6u

Adversarial Audio Synthesis [article]

Chris Donahue, Julian McAuley, Miller Puckette
2019 arXiv   pre-print
Our experiments demonstrate that, without labels, WaveGAN learns to produce intelligible words when trained on a small-vocabulary speech dataset, and can also synthesize audio from other domains such as  ...  In this paper we introduce WaveGAN, a first attempt at applying GANs to unsupervised synthesis of raw-waveform audio.  ...  ACKNOWLEDGMENTS The authors would like to thank Peter Boesman and Colin Raffel for providing training data for this work.  ... 
arXiv:1802.04208v3 fatcat:nhktoh4fqjf25ggbfvkvm6gzfy

Delphi: Towards Machine Ethics and Norms [article]

Liwei Jiang, Jena D. Hwang, Chandra Bhagavatula, Ronan Le Bras, Maxwell Forbes, Jon Borchardt, Jenny Liang, Oren Etzioni, Maarten Sap, Yejin Choi
2021 arXiv   pre-print
This is in stark contrast to the zero-shot performance of GPT-3 of 52.3%, which suggests that massive scale alone does not endow pre-trained neural language models with human values.  ...  In addition to the new resources and baseline performances for future research, our study provides new insights that lead to several important open research questions: differentiating between universal  ...  This research was supported in part by DARPA under the MCS program through NIWC Pacific (N66001-19-2-4031), and the Allen Institute for AI (AI2).  ... 
arXiv:2110.07574v1 fatcat:yi4kplseqfanfccina227nx2zi

Behavioral Methods to Study Learning and Memory in Rats [chapter]

Jorge Alberto Quillfeldt
2016 Rodent Model as Tools in Ethical Biomedical Research  
In the Visual discrimination tasks, animals are trained to to recognize and discriminate between visual cues: one protocol used two different visible platforms, one stable and the other, floating (but  ...  Presently there is great conceptual diversity in this area and the need for such conceptual organization is a matter of debate among memory specialists, but we believe it still has a role, at least for  ... 
doi:10.1007/978-3-319-11578-8_17 fatcat:rwdmgvbedzewpn3sdetcejrejq

Pros and Cons of GAN Evaluation Measures: New Developments [article]

Ali Borji
2021 arXiv   pre-print
These are important areas of concern in the machine learning community today and progress in GAN evaluation can help mitigate them.  ...  still room for improvement.  ...  To obtain a suitable feature video representation, they used a pre-trained network (the Inflated 3D Convnet (Carreira and Zisserman, 2017) pre-trained on Kinetics-400 and Kinetics-600 datasets) that  ... 
arXiv:2103.09396v3 fatcat:shsmleblqfgzfpa3w5mwt7xyma

See far with TPNET: a Tile Processor and a CNN Symbiosis [article]

Andrey Filippov, Oleg Dzhimiev
2018 arXiv   pre-print
In this work, we explore application of TPNET to 3D perception with a narrow-baseline (0.0001-0.0025) quad stereo camera and prove that a trained network provides a disparity prediction from the 2D phase  ...  The TP in turn reduces the dimensions of the input features of the network and provides instrument-invariant and translation-invariant data, making real-time high resolution stereo 3D perception feasible  ...  Restricting attention to these datasets limits the diversity and reach of research in this field.  ... 
arXiv:1811.08032v1 fatcat:cxti6icqjvdvfguyenaffiueby

Men developing emotional intelligence through meditation? Integrating narrative, cognitive and electroencephalography (EEG) evidence

Tim Lomas, Trudi Edginton, Tina Cartwright, Damien Ridge
2014 Psychology of men & masulinity  
However, it is recognized that men and masculinities are diverse, and that some men can positively self-manage their mental health, although this has received little attention in the literature.  ...  Participants undertook two cognitive neuroscience sessionsapproximately one year apartcomprising cognitive assessments of attention, in combination with EEG measurement during task performance and meditation  ...  The present paper explores the possibility that meditation may help men with emotional intelligencethus facilitating better mental healthby training attention and giving men more choice in how they approach  ... 
doi:10.1037/a0032191 fatcat:2h2vfwtig5a45py7ldp6u4djeq

Music Classification: Beyond Supervised Learning, Towards Real-world Applications [article]

Minz Won, Janne Spijkervet, Keunwoo Choi
2021 Zenodo  
This is a book written for a tutorial session of the 22nd International Society for Music Information Retrieval Conference, Nov 8-12, 2021 in an online format.  ...  In this book, we focus on the more modern history of music classification since the popularization of deep learning in mid 2010s.  ...  After finishing this chapter, you can understand the procedures and tasks researchers and engineers in industry spend time on. We're delighted that you have studied music classification with us.  ... 
doi:10.5281/zenodo.5703780 fatcat:vpjixx4nmfaqtipf3ytuu7srwa

Music Classification: Beyond Supervised Learning, Towards Real-world Applications [article]

Minz Won, Janne Spijkervet, Keunwoo Choi
2021 Zenodo  
This is a book written for a tutorial session of the 22nd International Society for Music Information Retrieval Conference, Nov 8-12, 2021 in an online format.  ...  In this book, we focus on the more modern history of music classification since the popularization of deep learning in mid 2010s.  ...  After finishing this chapter, you can understand the procedures and tasks researchers and engineers in industry spend time on. We're delighted that you have studied music classification with us.  ... 
doi:10.5281/zenodo.5703779 fatcat:ggefiongcnb5boahjsz4lgiuz4

Explainability-aided Domain Generalization for Image Classification [article]

Robin M. Schmidt
2021 arXiv   pre-print
In this work, we empirically demonstrate that applying methods and architectures from the explainability literature can, in fact, achieve state-of-the-art performance for the challenging task of domain  ...  generalization while offering a framework for more insights into the prediction and training process.  ...  Due to the prevalence of this challenge for the wide-spread deployment of machine learning systems in diverse environments, many researchers tried to tackle this task with different approaches.  ... 
arXiv:2104.01742v1 fatcat:ml7l2vhhxzdyfbf3d7ilrvxmzq
« Previous Showing results 1 — 15 out of 5,472 results