Filters








47,674 Hits in 1.9 sec

Indexing correlated probabilistic databases

Bhargav Kanagal, Amol Deshpande
2009 Proceedings of the 35th SIGMOD international conference on Management of data - SIGMOD '09  
We represent the correlations in the probabilistic database using a junction tree over the tuple-existence or attribute-value random variables, and use tree partitioning techniques to build an index structure  ...  While there is an exhaustive body of literature on querying independent probabilistic data, supporting efficient queries over large-scale, correlated databases remains a challenge.  ...  We evaluate the performance of our index on the following two probabilistic databases. • General probabilistic database: We generate a probabilistic database on 2 relations that is representative of a  ... 
doi:10.1145/1559845.1559894 dblp:conf/sigmod/KanagalD09 fatcat:h3iqyajgmjfibcmkfdyebo7n3e

Handling Uncertainty in Database: An Introduction and Brief Survey

Nermin Abdel-Hakim Othman, Ahmed Sharaf Eldin, Doaa Saad El Zanfaly
2015 Computer and Information Science  
, database integration, indexing uncertain data, security and information leakage and representation formalisms.  ...  The second is surveying different data management issues in uncertain databases such as join and query processing, database integration, indexing uncertain data, security and information leakage and representation  ...  PrDB (Sen & Deshpande, 2007) focus on managing and exploiting rich correlations in probabilistic databases .Other group has also studied correlation in probabilistic database (Sen & Deshpande, 2007)  ... 
doi:10.5539/cis.v8n3p119 fatcat:z4yofk2dofeylgkzltl7qhxi3u

Orion 2.0

Sarvjeet Singh, Chris Mayfield, Sagar Mittal, Sunil Prabhakar, Susanne Hambrusch, Rahul Shah
2008 Proceedings of the 2008 ACM SIGMOD international conference on Management of data - SIGMOD '08  
In contrast to other uncertain databases, Orion supports both attribute and tuple uncertainty with arbitrary correlations.  ...  Orion is a state-of-the-art uncertain database management system with built-in support for probabilistic data as first class data types.  ...  INTRODUCTION Probabilistic and uncertain data management have recently received much attention in the database community (see [7] for related work).  ... 
doi:10.1145/1376616.1376744 dblp:conf/sigmod/SinghMMPHS08 fatcat:wtqanq7n5nfcxlynirpidutoly

Lineage processing over correlated probabilistic databases

Bhargav Kanagal, Amol Deshpande
2010 Proceedings of the 2010 international conference on Management of data - SIGMOD '10  
., those generated by hierarchical conjunctive queries), polynomially computable over tuple independent probabilistic databases, is #P-complete for lightly correlated probabilistic databases like Markov  ...  We scale our algorithms to very large correlated probabilistic databases using the previously proposed INDSEP data structure.  ...  In prior work [18] , we developed an index structure called INDSEP that enables scalable query processing over large correlated probabilistic databases.  ... 
doi:10.1145/1807167.1807241 dblp:conf/sigmod/KanagalD10 fatcat:aefwk6mrljdmzin24pqprivava

Probabilistic Databases with MarkoViews [article]

Abhay Jha, Dan Suciu
2012 arXiv   pre-print
We validate experimentally our techniques on a large probabilistic database with MarkoViews inferred from the DBLP data.  ...  Most of the work on query evaluation in probabilistic databases has focused on the simple tuple-independent data model, where tuples are independent random events.  ...  CONCLUSION We described a new approach to probabilistic databases, which allows complex correlations to be defined between the tuples in a database.  ... 
arXiv:1208.0079v1 fatcat:ewuktezyuzhdvdm2zcuv3jiawu

Probabilistic databases with MarkoViews

Abhay Jha, Dan Suciu
2012 Proceedings of the VLDB Endowment  
We validate experimentally our techniques on a large probabilistic database with MarkoViews inferred from the DBLP data.  ...  Most of the work on query evaluation in probabilistic databases has focused on the simple tuple-independent data model, where tuples are independent random events.  ...  CONCLUSION We described a new approach to probabilistic databases, which allows complex correlations to be defined between the tuples in a database.  ... 
doi:10.14778/2350229.2350236 fatcat:5pikzrio6za6polh4w66jcqrcy

A Survey on Efficient Clustering Methods with Effective Pruning Techniques for Probabilistic Graphs

M. Balaganesh, G.Bharathikannan G.Bharathikannan
2015 International Journal of Computer Applications  
However, very little analysis has been performed to develop efficient agglomeration algorithms for probabilistic graphs.  ...  To cluster a correlate probabilistic graph G, a possible world graph Gi of G can be sculptural as a settled internal representation sampled from the correlate probabilistic graph in step with the chance  ...  Association rule mining is done to extract interesting correlations, associations, patterns among items in the transaction database or other data repositories.  ... 
doi:10.5120/19979-0721 fatcat:htcebdqalba7xliy475o3hvuyi

Making Address-Correlated Prefetching Practical

Thomas F. Wenisch, Michael Ferdman, Anastasia Ailamaki, Babak Falsafi, Andreas Moshovos
2010 IEEE Micro  
Naïvely adding STMS to a baseline system without addressing index table updates can triple memory traffic. To reduce index-table-update traffic, we introduced probabilistic update.  ...  Her research interests include the broad area of database systems and applications, with emphasis on database system behavior on modern processor hardware and disks.  ... 
doi:10.1109/mm.2010.21 fatcat:4nnthxry2rbdvejylzswlmbcja

Efficient Ad-Hoc Graph Inference and Matching in Biological Databases

Xiang Lian, Dongchul Kim
2017 Proceedings of the 2017 ACM International Conference on Management of Data - SIGMOD '17  
We also present an effective indexing mechanism and an efficient IM-GRN query processing algorithm by the index traversal.  ...  Specifically, we propose a novel probabilistic score to measure the possible interaction between any two genes (inferred from gene feature vectors), and thus model GRNs by probabilistic graphs, containing  ...  Probabilistic graph databases.  ... 
doi:10.1145/3035918.3035929 dblp:conf/sigmod/LianK17 fatcat:e5eqlhp375bnrkc7skrphlksym

A Survey of Uncertain Data Algorithms and Applications

C.C. Aggarwal, P.S. Yu
2009 IEEE Transactions on Knowledge and Data Engineering  
Such databases are much more complex because of the additional challenges of representing the probabilistic information.  ...  Index Terms-Mining methods and algorithms, database applications, database management, information technology and systems.  ...  It is shown in [33] that such imprecisions can be modeled by a certain kind of probabilistic database with complex tuples correlations.  ... 
doi:10.1109/tkde.2008.190 fatcat:7htcj7pcqnholig7v2ledsdqom

Representing Tuple and Attribute Uncertainty in Probabilistic Databases

Prithviraj Sen, Amol Deshpande, Lise Getoor
2007 Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007)  
Building on existing probabilistic database work, we present a unifying framework which allows a flexible representation of correlated tuple and attribute level uncertainties.  ...  There has been a recent surge in work in probabilistic databases, propelled in large part by the huge increase in noisy data sources -sensor data, experimental data, data from uncurated sources, and many  ...  To this end, we propose a probabilistic database model that not only supports correlated tuple and attribute level uncertainty but also sharing of probabilistic factors.  ... 
doi:10.1109/icdmw.2007.11 dblp:conf/icdm/SenDG07 fatcat:dmxy2luohzdsrjo5ycsbeq6k4y

A DCT Statistics-Based Blind Image Quality Index

Michele A Saad, Alan C Bovik, Christophe Charrier
2010 IEEE Signal Processing Letters  
The method is shown to correlate highly with human perception of quality.  ...  Towards ameliorating this we introduce the BLIINDS index (BLind Image Integrity Notator using DCT Statistics) which is a no-reference approach to image quality assessment that does not assume a specific  ...  The probabilistic model is trained on a subset of the LIVE image database to determine the parameters of the probabilistic model by distribution fitting.  ... 
doi:10.1109/lsp.2010.2045550 fatcat:wfpyeksrb5czblwjxlqvrrfzmi

Uncertainty quantification and predictability of wind speed over the Iberian Peninsula

S. Fernández-González, M. L. Martín, A. Merino, J. L. Sánchez, F. Valero
2017 Journal of Geophysical Research - Atmospheres  
MAPE is estimated by comparing the ensemble mean with wind speed values from different databases. Later, correlation between MAPE and ES was evaluated.  ...  During recent decades, the use of probabilistic forecasting methods has increased markedly.  ...  Deutscher Wetterdienst (DWD) and European Centre for Medium-Range Weather Forecasts (ECMWF) for providing the gridded daily mean near-surface (10 m) wind speed for Europe (DWD), EPS, and ERA-Interim databases  ... 
doi:10.1002/2017jd026533 fatcat:vwvxs55lcrbnjhdqge6mtgfxmu

Similarities Between Human Structured Subject Indexing and Probabilistic Topic Models [chapter]

Günter Reiner, Philipp Adämmer
2020 Knowledge Organization at the Interface  
Then we investigate the similarities between the indexing terms and word clusters generated by unsupervised probabilistic topic models, namely latent Dirichlet allocation (LDA) and the correlated topic  ...  The prototype database, built as part of the project, consists of nearly 2,500 cases which have been manually indexed.  ...  Do facet indexing and topic keywords created by topic models correlate more strongly than unstructured indexing (e.g., SOQUIJ indexing) and topic modeling keywords?  ... 
doi:10.5771/9783956507762-374 fatcat:v4vmdzgzzncd5fxzzmdkrn3cr4

A probabilistic automated tagger to identify human-related publications

Aaron M Cohen, Zackary O Dunivin, Neil R Smalheiser
2018 Database: The Journal of Biological Databases and Curation  
Database URL: http://clingen.igib.res.in/sage curators who assign standardized indexing terms; in particular, Medical Subject Headings (MeSH) that represent the major topics discussed in the article (1  ...  A probabilistic automated tagger to identify human-related publications. Abstract The Medical Subject Heading 'Humans' is manually curated and indicates human-related studies within MEDLINE.  ...  However, databases such as CINAHL (Cumulative Index to Nursing and Allied Health Literature), Embase (Excerpta Medica dataBASE) and PsycInfo have a 'Humans type' indexing tag, but without consistent, transparent  ... 
doi:10.1093/database/bay079 pmid:30184195 pmcid:PMC6146117 fatcat:dfseysbpuvbtxakj4plhtkbwqm
« Previous Showing results 1 — 15 out of 47,674 results