
SUMMARIZED: Efficient Framework for Analyzing Multidimensional Process Traces under Edit-distance Constraint [article]

Phuong Nguyen and Vatche Ishakian and Vinod Muthusamy and Aleksander Slominski
2019 arXiv   pre-print
We introduce summarization schemes that provide tunable trade-offs between the quality and efficiency of analysis tasks and derive an error model for summary-based similarity under an edit-distance constraint  ...  In this work, we introduce Summarized, a framework for efficient analysis on sequence-based multi-dimensional data using intuitive and user-controlled summarizations.  ...  We define a set of summarization schemes that offer flexible trade-off between quality and efficiency of analysis tasks and derive an error model for summary-based similarity under an edit-distance constraint  ... 
arXiv:1905.00983v1 fatcat:dixw656yd5gsxda5gbi5usckli

Spatiotemporal Phenomena Summarization through Static Visual Narratives

Daniel Marques, Alexandre Valle de Carvalho, Rui Rodrigues, Edgar Carneiro
2020 2020 24th International Conference Information Visualisation (IV)  
Considering the assessment, the authors conclude that summarizing spatiotemporal phenomena through static visual narratives is a suitable alternative for understanding their evolution.  ...  To assess the effectiveness and efficiency of the conceptual visualization framework, a questionnaire was produced and conducted on college students.  ...  As a result, the goals of this work are to (1) study, conceptualize, develop and test a framework that can automate the process of building static and interactive visual narratives for the summarization  ... 
doi:10.1109/iv51561.2020.00081 fatcat:zlkv34rvwffj7mu6lneooezxcu

Visualization of anisotropic contact potentials within protein structures

Corinna Vehlow, Bernhard Preim, Michael Lappe
2011 2011 IEEE Symposium on Biological Data Visualization (BioVis).  
The Contact Geometry Analysis Plugin (CGAP) (for CMView) we developed allows incorporation of geometric orientation propensities into the process of interactive protein modeling and can be used for the  ...  A second visualization is overlaid onto this, showing similar local neighborhoods as abstract traces of residues contained within each individual neighborhood.  ...  A contact map defines the distance constraints for use in three-dimensional reconstruction. With CGAP these distance constraints are supplemented by orientation constraints.  ... 
doi:10.1109/biovis.2011.6094045 dblp:conf/biovis/VehlowPL11 fatcat:qsubl5lafbfnloco6gb64vgkaq

Machine Learning Techniques in RFID Datasets

2020 International journal of recent technology and engineering  
Our research is specific to the supply-chain process using RFID system specifically for the abnormality detection in the localization process in supply-chain process.  ...  This work mainly focuses on identifying various techniques used for outlier detection in RFID datasets in supply chain process.  ...  Under the constraints of computational/memory/power limitations, A framework based on probabilistic method for supervised learning has been proposed by Ghosh et al.  ... 
doi:10.35940/ijrte.f9052.038620 fatcat:opsqzsoczbeuvgfkwzlcs5aoby

Time-series data mining

Philippe Esling, Carlos Agon
2012 ACM Computing Surveys  
The study of the relevant literature has been categorized for each individual aspects. Four types of robustness could then be formalized and any kind of distance could then be classified.  ...  Considering that in most cases, time series task relies on the same components for implementation, we divide the literature depending on these common aspects, namely representation techniques, distance  ...  Jean Claude Lejosne, Professor of English for Special Purposes (ESP) for having improved the English wording of the manuscript.  ... 
doi:10.1145/2379776.2379788 fatcat:prjlpze5arefrkrnkrpsx3inke

Multiobjective Time Series Matching for Audio Classification and Retrieval

Philippe Esling, Carlos Agon
2013 IEEE Transactions on Audio, Speech, and Language Processing  
We formally state this problem and report an efficient implementation. This approach introduces a multidimensional assessment of similarity in audio matching.  ...  This allows to cope with the multidimensional nature of timbre perception and also to obtain a set of efficient propositions rather than a single best solution.  ...  This framework allows to provide a multidimensional assessment of similarity in audio matching.  ... 
doi:10.1109/tasl.2013.2265086 fatcat:rsfwp7fnwjduro2y3jlhqo6bpq

A General Framework for Density Based Time Series Clustering Exploiting a Novel Admissible Pruning Strategy [article]

Nurjahan Begum, Liudmila Ulanova, Hoang Anh Dau, Jun Wang, Eamonn Keogh
2016 arXiv   pre-print
In addition, we show the generality of our clustering framework to other domains by efficiently obtaining semantically significant clusters in protein sequences using the Edit Distance, the discrete data  ...  Time Series Clustering is an important subroutine in many higher-level data mining analyses, including data editing for classifiers, summarization, and outlier detection.  ...  We see TADPole as a potential general framework for efficiently clustering such biological data.  ... 
arXiv:1612.00637v1 fatcat:saz7gkro2nccnopm6aevj6m5ye

The Continuous Hint Factory - Providing Hints in Vast and Sparsely Populated Edit Distance Spaces

Benjamin Paassen, Barbara Hammer, Thomas W. Price, Tiffany Barnes, Sebastian Gross, Niels Pinkwart
2018 Zenodo  
In this contribution we provide a mathematical framework for edit-based hint policies and, based on this theory, propose a novel hint policy to provide edit hints in vast and sparsely populated state spaces  ...  Still, the Hint Factory relies on student data being available for any state a student might visit while solving the task, which is not the case for some learning tasks, such as open-ended programming  ...  We first show that, under our constraints on $\Delta$ and $C$, the resulting edit distance $d_{\Delta,C}$ fulfills the constraints of Theorem 1. $d_{\Delta,C}(x, y) \geq 0$: If $x$ and $y$ are connected in the legal move graph, $d_{\Delta,C}$  ... 
doi:10.5281/zenodo.3554697 fatcat:yrd4o7kemngshagb4nalpzaqry

Towards a Near Universal Time Series Data Mining Tool: Introducing the Matrix Profile [article]

Chin-Chia Michael Yeh
2020 arXiv   pre-print
The proposed algorithm is not only parameter-free, exact and scalable, but also applicable for both single and multidimensional time series.  ...  ., motif discovery, discord discovery, shapelet discovery, semantic segmentation, and clustering) can be efficiently solved.  ...  The distance profile can be computed efficiently by using a convolution-based method such as Mueen's Algorithm for Similarity Search (MASS) [96].  ... 
arXiv:1811.03064v2 fatcat:2p5o45bedjfyxei34kgrnliojm

Embedding-based subsequence matching in time-series databases

Panagiotis Papapetrou, Vassilis Athitsos, Michalis Potamias, George Kollios, Dimitrios Gunopulos
2011 ACM Transactions on Database Systems  
We propose an embedding-based framework for subsequence matching in time series databases that improves the efficiency of processing subsequence matching queries under the Dynamic Time Warping (DTW) distance  ...  We apply the proposed framework to define two specific methods. The first method focuses on time series subsequence matching under unconstrained Dynamic Time Warping.  ...  ACKNOWLEDGMENTS Panagiotis Papapetrou has been supported in part by the Finnish Centre of Excellence for Algorithmic Data Analysis Research (AlGODAN).  ... 
doi:10.1145/2000824.2000827 fatcat:t3ptj77iinfq7pdmgisoh2oofe

Kernel machine tests of association between brain networks and phenotypes

Alexandria M. Jensen, Jason R. Tregellas, Brianne Sutton, Fuyong Xing, Debashis Ghosh, Xiao Hu
2019 PLoS ONE  
Frequently, summary measures of these maps, such as global efficiency and clustering coefficients, collapse the changing structures of graph topology from many scales to one.  ...  Drawing from the electrical engineering field, the resistance perturbation distance is a quantification of similarity between graphs on the same vertex set that has been shown to identify changes in dynamic  ...  Acknowledgments We thank Michael Regner, M.D. for his insight into the use and troubleshooting of the CONN toolbox within MatLab as well as providing comments that greatly improved the manuscript.  ... 
doi:10.1371/journal.pone.0199340 pmid:30897094 pmcid:PMC6428401 fatcat:a6nqwijeqzanhhgjsobvvobbzm

Advanced and efficient execution trace management for executable domain-specific modeling languages

Erwan Bousse, Tanja Mayerhofer, Benoit Combemale, Benoit Baudry
2017 Journal of Software and Systems Modeling  
Our contribution is a novel generative approach that defines a multidimensional and domain-specific trace metamodel enabling the construction and manipulation of execution traces for models conforming  ...  Yet, regarding trace manipulations, generic trace metamodels lack efficiency in time because of their sequential structure, efficiency in memory because they capture superfluous data, and usability because  ...  Furthermore, the generated trace metamodel is multidimensional meaning that it provides alternative and combinable navigation paths to efficiently traverse and process traces. 2.  ... 
doi:10.1007/s10270-017-0598-5 fatcat:x3znf4sawbaormlvbkk6olhoda

Event Log Preprocessing for Process Mining: A Review

Heidy M. Marin-Castro, Edgar Tello-Leal
2021 Applied Sciences  
In this paper, we conduct a systematic literature review and provide, for the first time, a survey of relevant approaches of event data preprocessing for business process mining tasks.  ...  Thus, new techniques and algorithms for event data preprocessing have been of interest in the research community in business process.  ...  der Aalst hierarchical trace (b) k-gram model, (c) Levenshtein two sequences based on the similarity; clustering distance, and (d) generic (2) algorithm to generates the scores for the edit distance insertion  ... 
doi:10.3390/app112210556 fatcat:lls2qf6llnddxbdnego2okk2ya

Subsoil Reconstruction in Geostatistics beyond Kriging: A Case Study in Veneto (NE Italy)

Paolo Fabbri, Carlo Gaetan, Luca Sartore, Nico Dalla Libera
2020 Hydrology  
The study highlights some advantages of the presented approach in term of hydrogeological knowledge and computational efficiency.  ...  In this context, an application of the spMC package for the R software is presented by using a test site located within the Venetian alluvial plain (NE Italy).  ...  Acknowledgments: The authors would like to thank the three anonymous reviewers of Hydrology for their many useful comments. Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/hydrology7010015 fatcat:e6uyiymn6bh3vgdfbs5o3p2mve

nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data

Changsheng Zhang, Hongmin Cai, Jingying Huang, Yan Song
2016 BMC Bioinformatics  
The nbCNV method uses two constraints-sparsity and smoothness to fit the CNV patterns under the assumption that the read signals are negatively binomially distributed.  ...  However, the amplification process inevitably introduces amplification bias, resulting in an over-dispersing portion of the sequencing data.  ...  Consent for publication Not applicable. Ethics approval and consent to participate The authors declare that ethics approval and consent to participate are not applicable to this study.  ... 
doi:10.1186/s12859-016-1239-7 pmid:27639558 pmcid:PMC5027123 fatcat:pq3uw6nxvrebrc3pdeth7viw4u