Orthogonality Constrained Multi-Head Attention For Keyword Spotting [article]

Mingu Lee, Jinkyu Lee, Hye Jin Jang, Byeonggeun Kim, Wonil Chang and Kyuwoong Hwang
2019 arXiv   pre-print
The multi-head attention mechanism can learn various representations from sequential data while attending to different subsequences, e.g., word-pieces or syllables in a spoken word. From the subsequences, it retrieves richer information than single-head attention, which only summarizes the whole sequence into one context vector. However, a naive use of multi-head attention does not guarantee such richness, as the attention heads may have positional and representational redundancy. In this paper, we propose a regularization technique for the multi-head attention mechanism in an end-to-end neural keyword spotting system. Adding regularization terms that penalize positional and contextual non-orthogonality between the attention heads encourages the heads to output different representations from separate subsequences, which in turn enables leveraging structured information without explicit sequence models such as hidden Markov models. In addition, intra-head contextual non-orthogonality regularization encourages each attention head to have similar representations across keyword examples, which helps classification by reducing feature variability. The experimental results demonstrate that the proposed regularization technique significantly improves the keyword spotting performance for the keyword "Hey Snapdragon".
arXiv:1910.04500v1 fatcat:qd3ph25cobggzgql7qwzu4w2au
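
The positional part of the penalty described in this abstract can be illustrated with a toy example. This is a minimal sketch under assumptions, not the paper's loss: the function name `inter_head_penalty` and the choice of summing squared dot products between the heads' attention-weight vectors are illustrative only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def inter_head_penalty(attn):
    """Sum of squared dot products between the attention-weight
    vectors of every pair of distinct heads (hypothetical penalty).

    attn is a list of heads; each head is a list of weights over frames.
    The value is zero exactly when all heads are mutually orthogonal.
    """
    penalty = 0.0
    for i in range(len(attn)):
        for j in range(i + 1, len(attn)):
            dot = sum(a * b for a, b in zip(attn[i], attn[j]))
            penalty += dot * dot
    return penalty


# Two heads over four frames: both focused on frame 0, versus
# one head on frame 0 and the other on frame 3.
overlapping = [softmax([5.0, 0.0, 0.0, 0.0]), softmax([5.0, 0.0, 0.0, 0.0])]
separated = [softmax([5.0, 0.0, 0.0, 0.0]), softmax([0.0, 0.0, 0.0, 5.0])]
```

Heads that attend to the same frames receive a larger penalty than heads that attend to separate frames, which is the behavior the abstract attributes to the inter-head regularizer.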

Value Function Gradient Learning for Large-Scale Multistage Stochastic Programming Problems [article]

Jinkyu Lee, Sanghyeon Bae, Woo Chang Kim, Yongjae Lee
2022 arXiv   pre-print
A stagewise decomposition algorithm called value function gradient learning (VFGL) is proposed for large-scale multistage stochastic convex programs. VFGL finds the parameter values that best fit the gradient of the value function within a given parametric family. Widely-used decomposition algorithms for multistage stochastic programming, such as stochastic dual dynamic programming (SDDP), approximate the value function by adding linear subgradient cuts at each iteration. Although this approach has been successful for linear problems, nonlinear problems may suffer from the increasing size of each subproblem as the iteration proceeds. On the other hand, VFGL has a fixed number of parameters; thus, the size of the subproblems remains constant throughout the iteration. Furthermore, VFGL can learn the parameters by means of stochastic gradient descent, which means that it can be easily parallelized and does not require a scenario tree approximation of the underlying uncertainties. VFGL was compared with a deterministic equivalent formulation of the multistage stochastic programming problem and SDDP approaches for three illustrative examples: production planning, hydrothermal generation, and the lifetime financial planning problem. Numerical examples show that VFGL generates high-quality solutions and is computationally efficient.
arXiv:2205.08934v1 fatcat:5odcfb7wojdanfmklgwu6aspw4

Extended framework of Hamilton's principle applied to Duffing oscillation [article]

Jinkyu Kim, Hyeonseok Lee, Jinwon Shin
2019 arXiv   pre-print
The paper begins with a novel variational formulation of the Duffing equation using the extended framework of Hamilton's principle (EHP). This formulation properly accounts for initial conditions, and it recovers all the governing differential equations as its Euler-Lagrange equations. Thus, it provides an elegant structure for the development of versatile temporal finite element methods. Herein, the simplest temporal finite element method is presented by adopting linear temporal shape functions. Numerical examples are included to verify and investigate the performance of the non-iterative algorithm in the developed method.
arXiv:1903.06524v1 fatcat:leleyv2x2bbldlyld6ffbhq6qu

Multiprocessor real-time scheduling considering concurrency and urgency

Jinkyu Lee, Arvind Easwaran, Insik Shin, Insup Lee
2010 ACM SIGBED Review  
It has been widely studied how to schedule real-time tasks on multiprocessor platforms. Several studies find optimal scheduling policies for implicit deadline task systems, but it is hard to understand how each policy utilizes the two important aspects of scheduling real-time tasks on multiprocessors: inter-job concurrency and job urgency. In this paper, we introduce a new scheduling policy that considers these two properties. We prove that the policy is optimal for the special case when the execution times of all tasks are equal to one and deadlines are implicit, and observe that the policy is a new concept in that it is not an instance of Pfair or ERfair. It remains open to find a schedulability condition for general task systems under our scheduling policy.  ...  During $[t, t + D_1^*(t)]$, $D_1^*(t) \cdot m$ jobs are serviced, and thus, using $U_{sys}^*(t)$, we calculate the system dynamic density at $t + D_1(t)$ of this set as follows.  ... 
doi:10.1145/1851166.1851173 fatcat:xgkmff567nfhznhtqigdkwbyhy

SelfReg: Self-supervised Contrastive Regularization for Domain Generalization [article]

Daehee Kim, Seunghyun Park, Jinkyu Kim, Jaekoo Lee
2021 arXiv   pre-print
In general, an experimental environment for deep learning assumes that the training and the test dataset are sampled from the same distribution. However, in real-world situations, a difference in the distribution between two datasets, domain shift, may occur, which becomes a major factor impeding the generalization performance of the model. The research field to solve this problem is called domain generalization, and it alleviates the domain shift problem by extracting domain-invariant features explicitly or implicitly. In recent studies, contrastive learning-based domain generalization approaches have been proposed and achieved high performance. These approaches require sampling of the negative data pair. However, the performance of contrastive learning fundamentally depends on the quality and quantity of negative data pairs. To address this issue, we propose a new regularization method for domain generalization based on contrastive learning, self-supervised contrastive regularization (SelfReg). The proposed approach uses only positive data pairs, and thus it resolves various problems caused by negative pair sampling. Moreover, we propose a class-specific domain perturbation layer (CDPL), which makes it possible to effectively apply mixup augmentation even when only positive data pairs are used. The experimental results show that the techniques incorporated by SelfReg contributed to the performance in a compatible manner. In the recent benchmark, DomainBed, the proposed method shows comparable performance to the conventional state-of-the-art alternatives. Codes are available at https://github.com/dnap512/SelfReg.
arXiv:2104.09841v1 fatcat:5hxntvaz6retrhbi6zk5jejnxa

DRAM architecture for efficient data lifetime management

Yongjun Lee, Yunkeuk Kim, Jinkyu Jeong, Jae W. Lee
2017 IEICE Electronics Express  
Lee et al. demonstrate that web pages can be reconstructed by acquisition of deallocated pages [7]. Kong et al. present a method to recover messages from a web mail client via memory dumps [8].  ... 
doi:10.1587/elex.14.20170309 fatcat:rxqvitzfbjebplzvuytbgifbq4

Adaptation Of Hmm Dynamic Parameters In Reverberant Environment

Hong-Goo Kang, Jinkyu Lee, Hyunson Seo
2013 Zenodo  
Publication in the conference proceedings of EUSIPCO, Marrakech, Morocco, 2013
doi:10.5281/zenodo.43690 fatcat:awkxxorlkrbfzmba6j6c4xr52u

Design of AT-DMB Baseband Receiver SoC

Joohyun Lee, Hyuk Kim, Jinkyu Kim, Bontae Koo, Nakwoong Eum, Hyuckjae Lee
2009 ETRI Journal  
Fig. 1. Hierarchical modulation procedure (LP: QPSK).  ... 
doi:10.4218/etrij.09.1209.0009 fatcat:ufoupekkrvbabh5rqmifj3reoq

Limited Non-Preemptive EDF Scheduling for a Real-Time System with Symmetry Multiprocessors

Hoyoun Lee, Jinkyu Lee
2020 Symmetry  
Also, Lee et al. proposed a new schedulability analysis [11, 12] using the well-known schedulability analysis technique for preemptive scheduling algorithms, called RTA (Response Time Analysis).  ... 
doi:10.3390/sym12010172 fatcat:4uescstrtjahhgrlxscxuqtkzm

Time-Reversibility of Schedulability Tests

Jinkyu Lee
2014 2014 IEEE Real-Time Systems Symposium  
For timing guarantees of a set of real-time tasks under a target scheduling algorithm, a number of schedulability tests have been studied. However, there still exist many task sets that are potentially schedulable by a target scheduling algorithm, but proven schedulable by none of the existing schedulability tests, especially on a multiprocessor platform. In this paper, we propose a new notion of time-reversibility of schedulability tests, which yields tighter schedulability guarantees by viewing real-time scheduling under a change in the sign of time. To this end, we first define the notion of a time-reversed scheduling algorithm against a target scheduling algorithm; for example, the time-reversed scheduling algorithm against EDF (Earliest Deadline First) is LCFS (Last-Come, First-Served), and the converse also holds. Then, a schedulability test for a scheduling algorithm is said to be time-reversible with respect to schedulability if all task sets deemed schedulable by the test are also schedulable by its time-reversed scheduling algorithm. To exploit the notion of time-reversibility for tighter schedulability guarantees, we not only prove time-reversibility of an existing schedulability test, but also develop a new time-reversible schedulability test, both of which cover additional schedulable task sets. Next, we generalize the time-reversibility theory towards partial execution. Utilizing the notion, we can assure the schedulability of a task under a target scheduling algorithm in a divide-and-conquer manner: (i) the first few units of execution are guaranteed by a schedulability test for the scheduling algorithm, and (ii) the remaining execution is guaranteed by a time-reversible (with respect to partial execution) schedulability test for its time-reversed scheduling algorithm. Such a divide-and-conquer approach has not been directly applied to existing schedulability tests in that they cannot address (ii) effectively. As a case study, this paper develops RTA (Response-Time Analysis) for LCFS, proves its time-reversibility, and applies the divide-and-conquer approach to the test along with an existing EDF schedulability test. Our simulation results show that the time-reversibility theory helps to find up to 13.1% additional EDF-schedulable task sets on a multiprocessor platform.
doi:10.1109/rtss.2014.18 dblp:conf/rtss/Lee14 fatcat:y7mgwq4vondjxpq3q5nt6bxf7u
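
The pairing of EDF with LCFS stated in this abstract can be made intuitive with a small sketch. It is an illustration only, not the paper's formal definition: the `Job` type, the `reverse_time` helper, and the idea of mirroring each job's window around a fixed horizon are assumptions introduced here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    name: str
    release: int   # release (arrival) time
    deadline: int  # absolute deadline


def pick_edf(ready):
    """Earliest Deadline First: the ready job with the smallest deadline."""
    return min(ready, key=lambda j: j.deadline)


def pick_lcfs(ready):
    """Last-Come, First-Served: the ready job with the latest release."""
    return max(ready, key=lambda j: j.release)


def reverse_time(jobs, horizon):
    """Mirror every window [release, deadline] around `horizon`
    (hypothetical helper): a deadline d becomes a release at horizon - d."""
    return [Job(j.name, horizon - j.deadline, horizon - j.release) for j in jobs]


jobs = [Job("a", 0, 10), Job("b", 2, 6), Job("c", 3, 12)]
mirrored = reverse_time(jobs, horizon=12)
```

Because mirroring turns the earliest deadline into the latest release, `pick_edf(jobs)` and `pick_lcfs(mirrored)` select the same job in this example.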

Multiplying decomposition of stress/strain, constitutive/compliance relations, and strain energy [article]

HyunSuk Lee, Jinkyu Kim
2012 arXiv   pre-print
To account for phenomenological theories and a set of invariants, stress and strain are usually decomposed into a pair of pressure and deviatoric stress and a pair of volumetric strain and deviatoric strain. However, the conventional decomposition method only focuses on individual stress and strain, so it cannot be directly applied to formulations in either the Finite Element Method (FEM) or the Boundary Element Method (BEM). In this paper, a simpler, more general, and widely applicable decomposition is suggested. The new decomposition method applies multiplying decomposition tensors or matrices not only to stress and strain but also to the constitutive and compliance relations. With this, we also show its practical usage in FEM and BEM in terms of tensors and matrices.
arXiv:1211.2693v1 fatcat:acngdducenawzkaz64jcr34prm
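
For reference, the conventional split that this abstract takes as its starting point can be written in a few lines. This sketch shows only the textbook decomposition of a 3x3 stress tensor into its mean (hydrostatic) part and its deviatoric part, not the multiplying decomposition the paper proposes; the function name `decompose_stress` is hypothetical, and pressure is commonly defined as the negative of the mean stress.

```python
def decompose_stress(sigma):
    """Split a 3x3 stress tensor (nested lists) into its mean stress
    and its deviatoric part: sigma = mean * I + deviatoric."""
    mean = (sigma[0][0] + sigma[1][1] + sigma[2][2]) / 3.0
    deviatoric = [
        [sigma[i][j] - (mean if i == j else 0.0) for j in range(3)]
        for i in range(3)
    ]
    return mean, deviatoric


sigma = [
    [3.0, 1.0, 0.0],
    [1.0, 6.0, 0.0],
    [0.0, 0.0, 9.0],
]
mean, dev = decompose_stress(sigma)
```

The deviatoric part is trace-free by construction, and the shear components pass through unchanged.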

Zero-laxity based real-time multiprocessor scheduling

Jinkyu Lee, Arvind Easwaran, Insik Shin, Insup Lee
2011 Journal of Systems and Software  
Please cite this article in press as: Lee, J., et al., Zero-laxity based real-time multiprocessor scheduling. J. Syst. Software (2011), doi:10.1016/j.jss.2011.07.002  ...  ., 2010; Lee et al., 2011; Stavrinides and Karatza, 2011).  ... 
doi:10.1016/j.jss.2011.07.002 fatcat:g3tv2bkdfzbrznjpss3skqi6ri

Sound-Guided Semantic Image Manipulation [article]

Seung Hyun Lee, Wonseok Roh, Wonmin Byeon, Sang Ho Yoon, Chan Young Kim, Jinkyu Kim, Sangpil Kim
2021 arXiv   pre-print
The recent success of generative models shows that leveraging the multi-modal embedding space can manipulate an image using text information. However, manipulating an image with sources other than text, such as sound, is not easy due to the dynamic characteristics of the sources. In particular, sound can convey vivid emotions and dynamic expressions of the real world. Here, we propose a framework that directly encodes sound into the multi-modal (image-text) embedding space and manipulates an image from the space. Our audio encoder is trained to produce a latent representation from an audio input, which is forced to be aligned with image and text representations in the multi-modal embedding space. We use a direct latent optimization method based on aligned embeddings for sound-guided image manipulation. We verify the effectiveness of our sound-guided image manipulation quantitatively and qualitatively. We also show that our method can mix different modalities, i.e., text and audio, which enriches the variety of the image modification. The experiments on zero-shot audio classification and semantic-level image classification show that our proposed model outperforms other text and sound-guided state-of-the-art methods.
arXiv:2112.00007v1 fatcat:rcv7bt5ppvfihc57kqfa4d2pau

Panda: Reinforcement Learning-based Priority Assignment for Multi-processor Real-Time Scheduling

Hyunsung Lee, Jinkyu Lee, Ikjun Yeom, Honguk Woo
2020 IEEE Access  
doi:10.1109/access.2020.3029040 fatcat:pmqey3rltfb7hnhir74ztesuae

A fully associative, tagless DRAM cache

Yongjun Lee, Jongwon Kim, Hakbeom Jang, Hyunggyun Yang, Jangwoo Kim, Jinkyu Jeong, Jae W. Lee
2015 SIGARCH Computer Architecture News  
This paper introduces a tagless cache architecture for large in-package DRAM caches. The conventional die-stacked DRAM cache has both a TLB and a cache tag array, which are responsible for virtual-to-physical and physical-to-cache address translation, respectively. We propose to align the granularity of caching with OS page size and take a unified approach to address translation and cache tag management. To this end, we introduce cache-map TLB (cTLB), which stores virtual-to-cache, instead of virtual-to-physical, address mappings. At a TLB miss, the TLB miss handler allocates the requested block into the cache if it is not cached yet, and updates both the page table and cTLB with the virtual-to-cache address mapping. Assuming the availability of large in-package DRAM caches, this ensures that an access to the memory region within the TLB reach always hits in the cache with low hit latency since a TLB access immediately returns the exact location of the requested block in the cache, hence saving a tag-checking operation. The remaining cache space is used as a victim cache for memory pages that are recently evicted from cTLB. By completely eliminating data structures for cache tag management, from either on-die SRAM or in-package DRAM, the proposed DRAM cache achieves best scalability and hit latency, while maintaining the high hit rate of a fully associative cache. Our evaluation with 3D Through-Silicon Via (TSV)-based in-package DRAM demonstrates that the proposed cache improves the IPC and energy efficiency by 30.9% and 39.5%, respectively, compared to the baseline with no DRAM cache. These numbers translate to 4.3% and 23.8% improvements over an impractical SRAM-tag cache requiring megabytes of on-die SRAM storage, due to low hit latency and zero energy waste for cache tags.
doi:10.1145/2872887.2750383 fatcat:td3aznb73zfanmsbh774wxndji
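
The miss-handling flow described in this abstract can be sketched as a toy software model. It rests on assumptions: the class `CacheMapTLB` and its fields are hypothetical, cTLB capacity, eviction, and the victim cache are left out, and real hardware and OS handlers are far more involved.

```python
class CacheMapTLB:
    """Toy model of a cTLB that maps virtual pages directly to cache frames."""

    def __init__(self, num_cache_frames):
        self.free_frames = list(range(num_cache_frames))
        self.page_table = {}  # virtual page -> cache frame (backing store)
        self.ctlb = {}        # virtual page -> cache frame (fast lookup)
        self.misses = 0

    def translate(self, vpage):
        # Hit: the cTLB entry already names the cache frame, so no
        # separate tag check is modeled.
        if vpage in self.ctlb:
            return self.ctlb[vpage]

        # Miss: allocate the page into the cache if it is not cached yet,
        # then record the virtual-to-cache mapping in both structures.
        self.misses += 1
        frame = self.page_table.get(vpage)
        if frame is None:
            if not self.free_frames:
                raise MemoryError("no free cache frame; eviction not modeled")
            frame = self.free_frames.pop(0)
            self.page_table[vpage] = frame
        self.ctlb[vpage] = frame
        return frame


tlb = CacheMapTLB(num_cache_frames=2)
first = tlb.translate(7)   # miss: allocates a frame for virtual page 7
second = tlb.translate(7)  # hit: served from the cTLB
```

The second lookup returns the same frame without another miss, which mirrors the claim that an access within the TLB reach always hits in the cache.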