HYDRA

Siyuan Liu, Shuhui Wang, Feida Zhu, Jinbo Zhang, Ramayya Krishnan
2014 Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14  
We study the problem of large-scale social identity linkage across different social media platforms, which is of critical importance to business intelligence by gaining from social data a deeper understanding  ...  This paper proposes HYDRA, a solution framework which consists of three key steps: (I) modeling heterogeneous behavior by long-term behavior distribution analysis and multi-resolution temporal information  ...  User Attribute Modeling Textual Attributes. Common textual attributes in a user profile include name, gender, age, nationality, profession, education, email account, etc.  ... 
doi:10.1145/2588555.2588559 dblp:conf/sigmod/LiuWZZK14 fatcat:osatik6fcfhbxcjugwvtvzvslq

Graph Summarization Methods and Applications: A Survey [article]

Yike Liu, Tara Safavi, Abhilash Dighe, Danai Koutra
2018 arXiv   pre-print
While advances in computing resources have made processing enormous amounts of data possible, human ability to identify patterns in such data has not scaled accordingly.  ...  This survey is a structured, comprehensive overview of the state-of-the-art methods for summarizing graph data. We first broach the motivation behind, and the challenges of, graph summarization.  ...  -Influence-based: These approaches aim to discover a high-level description of the influence propagation in large-scale graphs.  ... 
arXiv:1612.04883v3 fatcat:fhg2g5eldfdgfkzoqdmbfl5er4

United States Geological Survey

1899 Nature  
ACKNOWLEDGEMENTS Gratitude is expressed to Dave Seller (USGS) and Peter Davenport for improving this paper. Special thanks to John Broome for finding necessary funds to attend DMT2001.  ...  Three Dimensional Representations of Aeromagnetic and Isostatic Residual Gravity Surfaces with Geology in Montana  ...  Despite admonitions in readme files and metadata, enlarging from regional scales to site-specific scales is a common practice.  ... 
doi:10.1038/060182a0 fatcat:5yixy3u2xzempgcwdbidvtq7wq

The United States Geological Survey

H. B. W.
1903 Nature  
ACKNOWLEDGEMENTS Gratitude is expressed to Dave Seller (USGS) and Peter Davenport for improving this paper. Special thanks to John Broome for finding necessary funds to attend DMT2001.  ...  Three Dimensional Representations of Aeromagnetic and Isostatic Residual Gravity Surfaces with Geology in Montana  ...  Despite admonitions in readme files and metadata, enlarging from regional scales to site-specific scales is a common practice.  ... 
doi:10.1038/069115a0 fatcat:x3tsgfnyfzbpjdm2tgu2jenbze

Analyzing Non-Textual Content Elements to Detect Academic Plagiarism

Norman Meuschke, Bela Gipp, Harald Reiterer, Michael L. Nelson
2021 Zenodo  
Identifying academic plagiarism is a pressing problem, among others, for research institutions, publishers, and funding organizations.  ...  The thesis addresses this problem by proposing plagiarism detection approaches that implement a different concept—analyzing non-textual content in academic documents, such as citations, images, and mathematical  ...  scenarios than in authorship attribution scenarios.  ... 
doi:10.5281/zenodo.4913344 fatcat:xmpaahvwuva53l5l5i2gaidvi4

Approaches for Enriching and Improving Textual Knowledge Bases

Besnik Fetahu
2018 SIGIR Forum  
We propose a two-stage approach for this problem. First, we classify each statement whether it requires a news citation or citations from other categories (e.g. web, book, journal, etc.).  ...  Wikipedia entities, with relevant information published on a daily basis in news articles, we propose a two-stage supervised approach for this problem.  ...  This can be attributed to two factors: inherent popularity of the entity, and evolution of authorship of entity pages in Wikipedia.  ... 
doi:10.1145/3274784.3274806 fatcat:ul36jbmx7zgt7o5vzz42a44jai

Approaches for Enriching and Improving Textual Knowledge Bases [article]

Besnik Fetahu
2018 arXiv   pre-print
Even in cases where citations are provided, there are no explicit indicators for the span of a citation for a given piece of text.  ...  In this thesis, we address the aforementioned issues and propose automated approaches that enforce the verifiability principle in Wikipedia, and suggest relevant and missing news references for further  ...  This can be attributed to two factors: inherent popularity of the entity, and evolution of authorship of entity pages in Wikipedia.  ... 
arXiv:1804.07583v2 fatcat:7hz535vsi5ftraefx7gnxbcowa

Outlier Detection for Temporal Data: A Survey

Manish Gupta, Jing Gao, Charu C. Aggarwal, Jiawei Han
2014 IEEE Transactions on Knowledge and Data Engineering  
In particular, advances in hardware technology have enabled the availability of various forms of temporal data collection mechanisms, and advances in software technology have enabled a variety of data  ...  In the statistics community, outlier detection for time series data has been studied for decades.  ...  Though community based outlier detection has been studied for a static heterogeneous graph recently [145], there is no technique yet for temporal heterogeneous graphs.  ... 
doi:10.1109/tkde.2013.184 fatcat:b6nableuvvgthlw3xxj6axabgi

Semantic Networks: Structure and Dynamics

Javier Borge-Holthoefer, Alex Arenas
2010 Entropy  
In the first years, network approach to language mostly focused on a very abstract and general overview of language complexity, and few of them studied how this complexity is actually embodied in humans  ...  in time.  ...  This feature collection is used to build up a vector of characteristics for each word, where each dimension represents a feature.  ... 
doi:10.3390/e12051264 fatcat:doxmjqfofbcnpoal2bzpqjkhxe

Towards a robust modeling of temporal interest change patterns for behavioral targeting

Mohamed Aly, Sandeep Pandey, Vanja Josifovski, Kunal Punera
2013 Proceedings of the 22nd international conference on World Wide Web - WWW '13  
A typical behavioral targeting system faces two main challenges: the web-scale amounts of user histories to process on a daily basis, and the relative sparsity of conversions (compared to clicks in a traditional  ...  Modern web-scale behavioral targeting platforms leverage historical activity of billions of users to predict user interests and inclinations, and consequently future activities.  ...  Actually deploying the work presented in this paper to production as part of the platform presented in [3] is a result of a large team effort across multiple organizations at Yahoo!  ... 
doi:10.1145/2488388.2488396 dblp:conf/www/AlyPJP13 fatcat:jaqxkgfeinhpnfyiffyvvagywe

IRRODL Volume 15, Number 6

Various Authors
2014 International Review of Research in Open and Distance Learning  
The authors report on the development of the TEL concept, success indicators for TEL integration  ...  And as always, there are the outliers, articles whose topics are so unique within a collection. In this issue, I would thus classify Cunningham's and Koole's articles.  ...  Acknowledgements The authors appreciatively thank the National Commission for Science and Technology in Kenya for the financial support provided to facilitate data collection and analysis.  ... 
doi:10.19173/irrodl.v15i6.2063 fatcat:tspb3bvb45cajgko7vhrh736ca

The Fourth Paradigm – Data-Intensive Scientific Discovery [chapter]

Tony Hey
2012 Communications in Computer and Information Science  
The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research, Redmond, Washington  ...  very helpful discussion on a draft of this material. to all the contributors to this book for sharing their visions within the Fourth Paradigm.  ...  Acknowledgments My thanks to the participants at the April 24, 2009, Buckland-Lynch-Larsen "Friday Seminar" on information access at the University of California, Berkeley, School of Information for a  ... 
doi:10.1007/978-3-642-33299-9_1 fatcat:etue636ubrandga5corvkx3d7u

Analyzing Social Book Reading Behavior on Goodreads and how it predicts Amazon Best Sellers [article]

Suman Kalyan Maity, Abhishek Panigrahi, Animesh Mukherjee
2018 arXiv   pre-print
A book's success/popularity depends on various parameters - extrinsic and intrinsic. In this paper, we study how the book reading characteristics might influence the popularity of a book.  ...  We are able to achieve quite good results with very high average accuracy of 87.1% and as well a high ROC for ABS vs GCAN. For ABS vs HRHR, our model yields a high average accuracy of 86.22%.  ...  [30] suggest that the frequencies with which syntactic rewrite rules are put to use provide a better clue to authorship than word usage and thus can improve accuracy of authorship attribution.  ... 
arXiv:1809.07354v1 fatcat:kjkcuskwfrbjdderbxxglk73ta

The effects of habitat connectivity and regional heterogeneity on artificial pond metacommunities

Michael T. Pedruski, Shelley E. Arnott
2010 Oecologia  
Despite the different processes expected to act in homogeneous and heterogeneous regions, it does not appear that connectivity and heterogeneity interact strongly. Co-Authorship This thesis conforms  ...  scale. Invertebrate community composition was unaffected by either connectivity or heterogeneity, though there was a significant effect of heterogeneity on its variance.  ...  To prevent back-dispersal, buckets containing water to be transferred were only added to destination ponds after all water for dispersal had been collected.  ... 
doi:10.1007/s00442-010-1814-y pmid:20976605 fatcat:j3ioxw74kjdkvbjvlo2kul3ije

Software Development Analytics in Practice: A Systematic Literature Review [article]

Joao Caldeira, Fernando Brito e Abreu, Jorge Cardoso, Rachel Simões, Toacy Oliveira, José Reis
2022 arXiv   pre-print
Conclusions: There is a wide improvement margin for software development analytics in practice.  ...  For instance, mining and analyzing the activities performed by software developers in their actual workbench, the IDE.  ...  ChangeLocator was also proposed as a method to automatically locate crash-inducing changes for a given bucket of crash reports.  ... 
arXiv:2007.10213v2 fatcat:v3b4v3zocncu5fux27kdqz63om