Dima

Ji Sun, Zeyuan Shang, Guoliang Li, Dong Deng, Zhifeng Bao
2017 Proceedings of the VLDB Endowment  
It calls for effective query processing techniques to tolerate the errors and inconsistencies. In this paper, we develop a distributed in-memory similarity-based query processing system called Dima.  ...  To the best of our knowledge, this is the first full-fledged distributed in-memory system that can support similarity-based query processing.  ...  To address these limitations, we develop a distributed in-memory system Dima that can utilize SQL to support similarity-based query processing.  ... 
doi:10.14778/3137765.3137810 fatcat:sbj6yphv2ndjfa32grnszrhoaq

Semi-Stream Similarity Join Processing in a Distributed Environment

Hong-Ji Kim, Ki-Hoon Lee
2020 IEEE Access  
DSim-Join minimizes the data transmission, reduces database accesses using a cache in a distributed stream processing engine, parallelizes join processing, and balances the load between parallel join threads  ...  Although several studies have been conducted on the similarity join, little attention has been paid to a semi-stream similarity join, which is a similarity join between stream data and a large disk-based  ...  Recently, a distributed in-memory similarity join method, called Dima [18], has been proposed.  ... 
doi:10.1109/access.2020.3009414 fatcat:daez27nvlrfjtot4yekycz7cpm

Balance-aware distributed string similarity-based query processing system

Ji Sun, Zeyuan Shang, Guoliang Li, Dong Deng, Zhifeng Bao
2019 Proceedings of the VLDB Endowment  
In this paper, we develop a distributed in-memory similarity-based query processing system called Dima.  ...  To the best of our knowledge, this is the first full-fledged distributed in-memory system that can support complex similarity-based query processing on large-scale datasets.  ...  In summary, we make the following contributions. (1) We develop a full-fledged distributed in-memory similarity-based query processing system Dima, which provides SQL-based programming interface and DataFrame  ... 
doi:10.14778/3329772.3329774 fatcat:sorbhypaijg6rjoevbvdhxyjbm

Secure and Privacy-Preserving Querying of Personal Health Records in the Cloud [chapter]

Samira Barouti, Feras Aljumah, Dima Alhadidi, Mourad Debbabi
2014 Lecture Notes in Computer Science  
Many of the current PHR systems allow storage providers to access patients' data. Recently, architectures of storing PHRs in cloud have been proposed.  ...  In this paper, we propose a protocol that would allow health organizations to produce statistical information about encrypted PHRs stored in the cloud.  ...  According to a similar analysis, for small values of k there is a small change in the execution time but as k increases the query time becomes linear with k.  ... 
doi:10.1007/978-3-662-43936-4_6 fatcat:hwexs5uy3zfevfsaqxdfky3q6y

Fiber-based architecture for NFV cloud databases

Vaidas Gasiunas, Alexander Nozdrin, Weijie Ou, Nir Pachter, Dima Sivov, Eliezer Levy, David Dominguez-Sal, Ralph Acker, Aharon Avitzur, Ilan Bronshtein, Rushan Chen, Eli Ginot (+2 others)
2017 Proceedings of the VLDB Endowment  
This paper presents an industrial experience of applying shared-nothing techniques in order to achieve the scalability of a DB in an NFV setup.  ...  Therefore, we designed a special shared-nothing architecture that is based on cooperative multi-tasking using user-level threads (fibers).  ...  Some recent analysis concluded that the network bandwidth will not be the primary bottleneck of a distributed in-memory database [2].  ... 
doi:10.14778/3137765.3137774 fatcat:tlzqna6ukzdy7f7mfqz3qtsipu

Human-in-the-loop data integration

Guoliang Li
2017 Proceedings of the VLDB Endowment  
We build a distributed in-memory system DIMA to efficiently apply these rules. In the second step, we propose a selection-inference-refine framework that uses the crowd to verify the candidate pairs.  ...  We develop a crowd-powered database system CDB and deploy it on real crowdsourcing platforms. CDB allows users to utilize a SQL-like language for processing crowd-based queries.  ...  We build a distributed in-memory system DIMA to learn and apply these rules and seamlessly integrate our techniques into Spark SQL [60] .  ... 
doi:10.14778/3137765.3137833 fatcat:xbgfksxhtravdd5g3pze3cxoau

Load Balancing in the Presence of Services in Named-Data Networking

Dima Mansour, Haidar Osman, Christian Tschudin
2019 Journal of Network and Systems Management  
Structure of the paper The organization of this paper follows directly the process of building a new system; from understanding the problem and evaluating potential options, to designing and implementing  ...  Load balancing is a mechanism to distribute client requests among several service instances. It enables resource utilization, lowers response time, and increases user satisfaction.  ...  NDN is a pull-based system. In NDN, consumers send Interests to get the required data. For each Interest, there is at most one corresponding data packet.  ... 
doi:10.1007/s10922-019-09507-x fatcat:p2mg6cohxjaktfb5v4anbyzpqu

On Event/Time Triggered and Distributed Analysis of a WSN System for Event Detection, Using Fuzzy Logic

Sofia Maria Dima, Christos Antonopoulos, Stavros Koubias
2016 Journal of Sensors  
Although efforts related to the environmental applications have been presented in the current literature, there is a significant lack of investigation on the performance of such systems, when applied in  ...  Additionally to a straightforward centralized approach, a distributed implementation of the above FIS is also proposed, aiming towards network congestion reduction while optimally distributing the energy  ...  They describe a new cluster formation model that considers relevance of nodes to a specific query and organizes nodes within a cluster in a tree structure and showed using simulation based evaluation that  ... 
doi:10.1155/2016/9350928 fatcat:shnfym6ozramvk2leop2x7oqi4

Centroids Triplet Network and Temporally-Consistent Embeddings for In-Situ Object Recognition

Miguel Lagunes-Fortiz, Dima Damen, Walterio Mayol-Cuevas
2020 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)  
We aim for a scalable real-time recognition system that can process in-situ data, as we argue that is a more straightforward approach of data collection compared to using complex data-acquisition setups  ...  Additionally, for performing person identification, authors utilize cosine similarity to compare a query feature, against the features in the database.  ... 
doi:10.1109/iros45743.2020.9341050 fatcat:rlvf3ng7yrekpp7hcr4c2b2xdu

The ISTI Rapid Response on Exploring Cloud Computing 2018 [article]

Carleton Coffrin, James Arnold, Stephan Eidenbenz, Derek Aberle, John Ambrosiano, Zachary Baker, Sara Brambilla, Michael Brown, K. Nolan Carter, Pinghan Chu, Patrick Conry, Keeley Costigan, Ariane Eberhardt (+31 others)
2019 arXiv   pre-print
These demonstrations ranged from deploying proprietary software in a cloud environment to leveraging established cloud-based analytics workflows for processing scientific datasets.  ...  By and large, the projects were successful and collectively they suggest that cloud computing can be a valuable computational resource for scientific computation at national laboratories.  ...  Science & Technology Institute's "Exploring Cloud Computing 2018" would like to thank Terence Joyce and Brady Jones from the Associate Directorate for Business Innovation (ADBI) for their technical support in  ... 
arXiv:1901.01331v1 fatcat:cdkmje2agzfsdpyulbp4cxz22q

Ego4D: Around the World in 3,000 Hours of Egocentric Video [article]

Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan (+72 others)
2021 arXiv   pre-print
Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation  ...  We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite.  ...  Most objects in the living room are interesting to query about when the person is in the kitchen. To select videos based on these considerations, we use a two-step process.  ... 
arXiv:2110.07058v1 fatcat:txid2kijergxljy7pe7mdxiqvy

Autotuning GEMM Kernels for the Fermi GPU

Jakub Kurzak, Stanimire Tomov, Jack Dongarra
2012 IEEE Transactions on Parallel and Distributed Systems  
This paper presents a methodology for producing matrix multiplication kernels tuned for a specific architecture, through a canonical process of heuristic autotuning, based on generation of multiple code  ...  In recent years, the use of graphics chips has been recognized as a viable way of accelerating scientific and engineering applications, even more so since the introduction of the Fermi architecture by  ...  Architecture-aware Algorithms for Scalable Performance and Resilience on Heterogeneous Architectures," DOE grant #DE-SC0004983, "Matrix Algebra for GPU and Multicore Architectures (MAGMA) for Large Petascale Systems  ... 
doi:10.1109/tpds.2011.311 fatcat:nc7hsw2vhfgyvjio6vjo2mtpca

A Compositional Approach for Schedulability Analysis of Distributed Avionics Systems

Pujie Han, Zhengjun Zhai, Brian Nielsen, Ulrik Nyman
2018 Electronic Proceedings in Theoretical Computer Science  
This work presents a compositional approach for schedulability analysis of Distributed Integrated Modular Avionics (DIMA) systems that consist of spatially distributed ARINC-653 modules connected by a  ...  We model a DIMA system as a set of stopwatch automata in UPPAAL to verify its schedulability by model checking. However, direct model checking is infeasible due to the large state space.  ...  A DIMA system installs standardized computer modules in spatially distributed locations [19] that are connected by a unified bus system [3] such as an AFDX network.  ... 
doi:10.4204/eptcs.272.4 fatcat:dcj7eko64nc6bjms5uhe732pse

A Survey of Blocking and Filtering Techniques for Entity Resolution [article]

George Papadakis, Dimitrios Skoutas, Emmanouil Thanos, Themis Palpanas
2020 arXiv   pre-print
The former led to the introduction of schema-agnostic Blocking in conjunction with Block Processing techniques, while the latter led to more relaxed criteria of similarity.  ...  Our survey covers these new fields in detail, putting in context all relevant works.  ...  Finally, Dima [164, 165] is a distributed in-memory system built on top of Spark that supports threshold and top-k similarity search and join with both token-based and character-based similarities.  ... 
arXiv:1905.06167v4 fatcat:zoodv75tazg23cfnq4dwfgt6ge

Blocking and Filtering Techniques for Entity Resolution

George Papadakis, Dimitrios Skoutas, Emmanouil Thanos, Themis Palpanas
2020 ACM Computing Surveys  
For each framework we provide a comprehensive list of the relevant works, discussing them in the greater context. We conclude with the most promising directions for future work in the field.  ...  In this survey, we review a large number of relevant works under two different but related frameworks: Blocking and Filtering.  ...  Finally, Dima [160, 161] is a distributed in-memory system built on top of Spark that supports threshold and top-k similarity search and join with both token-based and character-based similarities.  ... 
doi:10.1145/3377455 fatcat:uuzuuxwwzrfg7cwfwzswdqvklm