A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2017; you can also visit the original URL.
The file type is application/pdf
.
Filters
HYDRA
2014
Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14
We study the problem of large-scale social identity linkage across different social media platforms, which is of critical importance to business intelligence by gaining from social data a deeper understanding ...
This paper proposes HYDRA, a solution framework which consists of three key steps: (I) modeling heterogeneous behavior by long-term behavior distribution analysis and multi-resolution temporal information ...
User Attribute Modeling Textual Attributes. Common textual attributes in a user profile include name, gender, age, nationality, profession, education, email account, etc. ...
doi:10.1145/2588555.2588559
dblp:conf/sigmod/LiuWZZK14
fatcat:osatik6fcfhbxcjugwvtvzvslq
Graph Summarization Methods and Applications: A Survey
[article]
2018
arXiv
pre-print
While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly. ...
This survey is a structured, comprehensive overview of the state-of-the-art methods for summarizing graph data. We first broach the motivation behind, and the challenges of, graph summarization. ...
-Influence-based: These approaches aim to discover a high-level description of the influence propagation in large-scale graphs. ...
arXiv:1612.04883v3
fatcat:fhg2g5eldfdgfkzoqdmbfl5er4
United States Geological Survey
1899
Nature
ACKNOWLEDGEMENTS Gratitude is expressed to Dave Seller (USGS) and Peter Davenport for improving this paper. Special thanks to John Broome for finding necessary funds to attend DMT2001. ...
Three Dimensional Representations of Aeromagnetic and Isostatic Residual Gravity Surfaces with Geology in Montana ...
Despite admonitions in readme files and metadata, enlarging from regional scales to sitespecific scales is a common practice. ...
doi:10.1038/060182a0
fatcat:5yixy3u2xzempgcwdbidvtq7wq
The United States Geological Survey
1903
Nature
ACKNOWLEDGEMENTS Gratitude is expressed to Dave Seller (USGS) and Peter Davenport for improving this paper. Special thanks to John Broome for finding necessary funds to attend DMT2001. ...
Three Dimensional Representations of Aeromagnetic and Isostatic Residual Gravity Surfaces with Geology in Montana ...
Despite admonitions in readme files and metadata, enlarging from regional scales to sitespecific scales is a common practice. ...
doi:10.1038/069115a0
fatcat:x3tsgfnyfzbpjdm2tgu2jenbze
Analyzing Non-Textual Content Elements to Detect Academic Plagiarism
2021
Zenodo
Identifying academic plagiarism is a pressing problem, among others, for research institutions, publishers, and funding organizations. ...
The thesis addresses this problem by proposing plagiarism detection approaches that implement a different concept—analyzing non-textual content in academic documents, such as citations, images, and mathematical ...
scenarios than in authorship attribution scenarios. ...
doi:10.5281/zenodo.4913344
fatcat:xmpaahvwuva53l5l5i2gaidvi4
Approaches for Enriching and Improving Textual Knowledge Bases
2018
SIGIR Forum
We propose a two-stage approach for this problem. First, we classify each statement whether it requires a news citation or citations from other categories (e.g. web, book, journal, etc.). ...
Wikipedia entities, with relevant information published on a daily basis in news articles, we propose a two-stage supervised approach for this problem. ...
This can be attributed to two factors: inherent popularity of the entity, and evolution of authorship of entity pages in Wikipedia. ...
doi:10.1145/3274784.3274806
fatcat:ul36jbmx7zgt7o5vzz42a44jai
Approaches for Enriching and Improving Textual Knowledge Bases
[article]
2018
arXiv
pre-print
Even in cases where citations are provided, there are no explicit indicators for the span of a citation for a given piece of text. ...
In this thesis, we address the aforementioned issues and propose automated approaches that enforce the verifiability principle in Wikipedia, and suggest relevant and missing news references for further ...
This can be attributed to two factors: inherent popularity of the entity, and evolution of authorship of entity pages in Wikipedia. ...
arXiv:1804.07583v2
fatcat:7hz535vsi5ftraefx7gnxbcowa
Outlier Detection for Temporal Data: A Survey
2014
IEEE Transactions on Knowledge and Data Engineering
In particular, advances in hardware technology have enabled the availability of various forms of temporal data collection mechanisms, and advances in software technology have enabled a variety of data ...
In the statistics community, outlier detection for time series data has been studied for decades. ...
Though community based outlier detection has been studied for a static heterogeneous graph recently [145] , there is no technique yet for temporal heterogeneous graphs. ...
doi:10.1109/tkde.2013.184
fatcat:b6nableuvvgthlw3xxj6axabgi
Semantic Networks: Structure and Dynamics
2010
Entropy
In the first years, network approach to language mostly focused on a very abstract and general overview of language complexity, and few of them studied how this complexity is actually embodied in humans ...
in time. ...
This feature collection is used to build up a vector of characteristics for each word, where each dimension represents a feature. ...
doi:10.3390/e12051264
fatcat:doxmjqfofbcnpoal2bzpqjkhxe
Towards a robust modeling of temporal interest change patterns for behavioral targeting
2013
Proceedings of the 22nd international conference on World Wide Web - WWW '13
A typical behavioral targeting system faces two main challenges: the web-scale amounts of user histories to process on a daily basis, and the relative sparsity of conversions (compared to clicks in a traditional ...
Modern web-scale behavioral targeting platforms leverage historical activity of billions of users to predict user interests and inclinations, and consequently future activities. ...
Actually deploying the work presented in this paper to production as part of the platform presented in [3] is a result of a large team effort across multiple organizations at Yahoo! ...
doi:10.1145/2488388.2488396
dblp:conf/www/AlyPJP13
fatcat:jaqxkgfeinhpnfyiffyvvagywe
IRRODL Volume 15, Number 6
2014
International Review of Research in Open and Distance Learning
The authors report on the development of the TEL concept, success Editorial : 15(6) Conrad Vol 15 | No 6 Creative Commons Attribution 4.0 International License Decs/14 iii indicators for TEL integration ...
And as always, there are the outliers, articles whose topics are so unique within a collection. In this issue, I would thus classify Cunningham's and Koole's articles. ...
Acknowledgements The authors appreciatively thank the National Commission for Science and Technology in Kenya for the financial support provided to facilitate data collection and analysis. ...
doi:10.19173/irrodl.v15i6.2063
fatcat:tspb3bvb45cajgko7vhrh736ca
The Fourth Paradigm – Data-Intensive Scientific Discovery
[chapter]
2012
Communications in Computer and Information Science
The F o u r T H Data-In t e n s I v e scIen t I f I c DIs c o v e r y P a r a d i g m MiCroSoF T reSe arCH REDMOND, WASHINGTON ...
very helpful discussion on a draft of this material. to all the contributors to this book for sharing their visions within the Fourth Paradigm. ...
Acknowledgments My thanks to the participants at the April 24, 2009, Buckland-Lynch-Larsen "Friday Seminar" on information access at the University of California, Berkeley, School of Information for a ...
doi:10.1007/978-3-642-33299-9_1
fatcat:etue636ubrandga5corvkx3d7u
Analyzing Social Book Reading Behavior on Goodreads and how it predicts Amazon Best Sellers
[article]
2018
arXiv
pre-print
A book's success/popularity depends on various parameters - extrinsic and intrinsic. In this paper, we study how the book reading characteristics might influence the popularity of a book. ...
We are able to achieve quite good results with very high average accuracy of 87.1% and as well a high ROC for ABS vs GCAN. For ABS vs HRHR, our model yields a high average accuracy of 86.22%. ...
[30] suggest that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage and thus can improve accuracy of authorship attribution. ...
arXiv:1809.07354v1
fatcat:kjkcuskwfrbjdderbxxglk73ta
The effects of habitat connectivity and regional heterogeneity on artificial pond metacommunities
2010
Oecologia
Despite the different processes expected to act in homogeneous and heterogeneous regions, it does not appear that connectivity and heterogeneity interact strongly. iii Co-Authorship This thesis conforms ...
scale. ii Invertebrate community composition was unaffected by either connectivity or heterogeneity, though there was a significant effect of heterogeneity on its variance. ...
To prevent back-dispersal, buckets containing water to be transferred were only added to destination ponds after all water for dispersal had been collected. ...
doi:10.1007/s00442-010-1814-y
pmid:20976605
fatcat:j3ioxw74kjdkvbjvlo2kul3ije
Software Development Analytics in Practice: A Systematic Literature Review
[article]
2022
arXiv
pre-print
Conclusions:There is a wide improvement margin for software development analytics in practice. ...
For instance, mining and analyzing the activities performed by software developers in their actual workbench, the IDE. ...
ChangeLocator was also proposed as a method to automatically locate crash-inducing changes for a given bucket of crash reports. ...
arXiv:2007.10213v2
fatcat:v3b4v3zocncu5fux27kdqz63om
« Previous
Showing results 1 — 15 out of 125 results