Filters








3,042 Hits in 4.8 sec

A phrase mining framework for recursive construction of a topical hierarchy

Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, Jiawei Han
2013 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '13  
In this paper we propose an algorithm for recursively constructing a hierarchy of topics from a collection of content-representative documents.  ...  Our mining framework is based on a phrase-centric view for clustering, extracting, and ranking topical phrases.  ...  In this work we present CATHY (Constructing A Topical HierarchY), a phrase-centric framework for topical hierarchy generation via recursive clustering and ranking.  ... 
doi:10.1145/2487575.2487631 dblp:conf/kdd/WangDDZNTH13 fatcat:hqdughl2ubgp5d6quhl32yn7ju

Scalable and Robust Construction of Topical Hierarchies [article]

Chi Wang, Xueqing Liu, Yanglei Song, Jiawei Han
2014 arXiv   pre-print
In this paper a scalable and robust algorithm is proposed for constructing a hierarchy of topics from a text collection.  ...  Automated generation of high-quality topical hierarchies for a text collection is a dream problem in knowledge engineering with many valuable applications.  ...  CATHY [29] is a recursive topical phrase mining framework, where the phrase mining and the topic discovery are also separated for efficiency purpose.  ... 
arXiv:1403.3460v1 fatcat:jzoozf4ifbghpg5y6cyy465t6e

Constructing Topical Hierarchies in Heterogeneous Information Networks

Chi Wang, Marina Danilevsky, Jialu Liu, Nihit Desai, Heng Ji, Jiawei Han
2013 2013 IEEE 13th International Conference on Data Mining  
In this work we present an algorithm for recursively constructing multi-typed topical hierarchies.  ...  Contrary to traditional text-based topic modeling, our approach handles both textual phrases and multiple types of entities by a newly designed clustering and ranking algorithm for heterogeneous network  ...  Our framework generates a heterogeneous topical hierarchy in a top-down, recursive way: Step 1. Construct the edge-weighted network G o . Set t = o. Step 2.  ... 
doi:10.1109/icdm.2013.53 dblp:conf/icdm/WangDLDJH13 fatcat:rfn4w3f7ufeg5fw7asip4exxfy

Constructing topical hierarchies in heterogeneous information networks

Chi Wang, Jialu Liu, Nihit Desai, Marina Danilevsky, Jiawei Han
2014 Knowledge and Information Systems  
In this work we present an algorithm for recursively constructing multi-typed topical hierarchies.  ...  Contrary to traditional text-based topic modeling, our approach handles both textual phrases and multiple types of entities by a newly designed clustering and ranking algorithm for heterogeneous network  ...  Our framework generates a heterogeneous topical hierarchy in a top-down, recursive way: Step 1. Construct the edge-weighted network G o . Set t = o. Step 2.  ... 
doi:10.1007/s10115-014-0777-4 fatcat:exhvtiyr7nd4nhshh5kf2uoiau

Towards Interactive Construction of Topical Hierarchy

Chi Wang, Xueqing Liu, Yanglei Song, Jiawei Han
2015 Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD '15  
Automatic construction of user-desired topical hierarchies over large volumes of text data is a highly desirable but challenging task.  ...  In this study, we propose a novel method, called STROD, that allows efficient and consistent modification of topic hierarchies, based on a recursive generative model and a scalable tensor decomposition  ...  DHS-IDS Center for Multimodal Information Access and Synthesis at UIUC.  ... 
doi:10.1145/2783258.2783288 pmid:26705505 pmcid:PMC4688012 dblp:conf/kdd/WangLSH15 fatcat:nbqkqm5qizddffuubye5jcjbh4

AMETHYST

Marina Danilevsky, Chi Wang, Fangbo Tao, Son Nguyen, Gong Chen, Nihit Desai, Lidan Wang, Jiawei Han
2013 Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '13  
In this demo we present AMETHYST, a system for exploring and analyzing a topical hierarchy constructed from a heterogeneous information network (HIN).  ...  The automatically constructed topical hierarchy reflects a domain-specific ontology, interacts with multiple types of linked entities, and can be tailored for both free text and OLAP queries.  ...  Topical Frequency The process of constructing the topical hierarchy results in every phrase in the hierarchy having a topical frequency value for every topic in the hierarchy.  ... 
doi:10.1145/2487575.2487716 dblp:conf/kdd/DanilevskyWTNCDWH13 fatcat:z3r2hwkcb5dorcfj77y2z2qyxi

NewsNetExplorer

Fangbo Tao, Yizhou Sun, George Brova, Jiawei Han, Heng Ji, Chi Wang, Brandon Norick, Ahmed El-Kishky, Jialu Liu, Xiang Ren
2014 Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14  
By further developing a set of information extraction, information network construction, and information network mining methods, we extract types, topical hierarchies and other semantic structures from  ...  news data, construct a semistructured news information network NewsNet.  ...  Moreover, we have recently developed a phrase mining framework for recursive construction of topical hierarchies from text data [10] and from heterogeneous information networks [11] .  ... 
doi:10.1145/2588555.2594537 dblp:conf/sigmod/TaoBHJWNELRS14 fatcat:iqzfctmbvvba7d3gxrirds4ahu

Mining latent entity structures from massive unstructured and interconnected data

Jiawei Han, Chi Wang
2014 Proceedings of the 2014 ACM SIGMOD international conference on Management of data - SIGMOD '14  
The framework enables recursive construction of phrase-represented and entity-enriched topic hierarchy from text-attached information networks.  ...  A mining framework is proposed, to solve and integrate a chain of tasks: hierarchical topic discovery, topical phrase mining, entity role analysis and entity relation mining.  ...  In sum, the main features of the framework are: • Recursive hierarchy construction: The hierarchy is constructed in a top-down order. One can recursively apply our method to expand the hierarchy.  ... 
doi:10.1145/2588555.2588890 dblp:conf/sigmod/HanW14 fatcat:js7d3r5yd5gbfgnhjgwsfmco2i

Mining Multi-aspect Reflection of News Events in Twitter: Discovery, Linking and Presentation

Jingjing Wang, Wenzhu Tong, Hongkun Yu, Min Li, Xiuli Ma, Haoyan Cai, Tim Hanratty, Jiawei Han
2015 2015 IEEE International Conference on Data Mining  
In this paper, we propose a unified framework to mine multi-aspect reflections of news events in Twitter.  ...  The aspects of an event are linked to their reflections in Twitter by a bootstrapped dataless classification scheme, which elegantly handles the challenges of selecting informative tweets under overwhelming  ...  We propose a unified generative model for recursive construction of the hierarchy in a top-down manner. Essentially, it is a top-down hierarchical clustering process. Step 1.  ... 
doi:10.1109/icdm.2015.112 pmid:27034625 pmcid:PMC4811610 dblp:conf/icdm/WangTYLMCHH15 fatcat:c46a7wfclbhc5g2nrbwzx6zfrm

TextCube

Yu Meng, Jiaxin Huang, Jingbo Shang, Jiawei Han
2019 Proceedings of the VLDB Endowment  
We overview a set of recently developed data-driven methods that facilitate automated construction of TextCubes from massive, domain-specific text corpora, and show that TextCubes so constructed will enhance  ...  text exploration and analysis for various applications.  ...  Taxonomy construction: Taxonomy construction clusters similar concepts and generates a hierarchy of "concept clusters" from massive corpus.  ... 
doi:10.14778/3352063.3352113 fatcat:lskuc2z45bg5fdjay37jpp62x4

Representing Information Structure in a Formal Grammar of Danish [chapter]

Patrizia Paggio
2006 Lecture Notes in Computer Science  
This paper presents a proposal for the integration of information structure in a unification-based grammar of Danish.  ...  Three information structure features -topic, focus and background -are defined, and it is shown how they are instantiated in a number of different grammar constructions.  ...  I propose a model in which information structure is formalised as a dimension in a hierarchy of constructions where it interacts with prosodic, syntactic and semantic properties of phrasal and clausal  ... 
doi:10.1007/11780496_11 fatcat:iitcw5lstzegxnm6papwan5c44

TaxoCom: Topic Taxonomy Completion with Hierarchical Discovery of Novel Topic Clusters [article]

Dongha Lee, Jiaming Shen, SeongKu Kang, Susik Yoon, Jiawei Han, Hwanjo Yu
2022 arXiv   pre-print
We propose a novel framework for topic taxonomy completion, named TaxoCom, which recursively expands the topic taxonomy by discovering novel sub-topic clusters of terms and documents.  ...  other baselines for a downstream task.  ...  TAXOCOM: PROPOSED FRAMEWORK 4.1 Overview The proposed TaxoCom framework recursively expands the given hierarchy in a top-down approach.  ... 
arXiv:2201.06771v1 fatcat:kqhnz4a2vnb3ncg3mv2rqlddz4

Hierarchical Viewpoint Discovery from Tweets Using Bayesian Modelling

Lixing Zhu, Yulan He, Deyu Zhou
2018 Expert systems with applications  
Hence, a hierarchical Pitman-Yor process is employed as a prior for modelling the generation of phrases with arbitrary length.  ...  Driven by the motivation that a viewpoint expressed in a tweet can be regarded as a path from the root to a leaf of a hierarchical viewpoint tree, the assignment of the relevant viewpoint topics is assumed  ...  Acknowledgements We would like to thank the reviewers for their valuable comments and helpful suggestions.  ... 
doi:10.1016/j.eswa.2018.09.028 fatcat:mzgcmjxuxjb3rhzc22ztowirrm

Additive Regularization for Hierarchical Multimodal Topic Modeling

N.A. Chirkova
2016 Machine Learning and Data Analysis  
Hence, the hierarchical ARTM (hARTM) can be easily adapted to a wide class of text mining problems, e. g., for learning topical hierarchies from multimodal and multilingual heterogeneous data of scientific  ...  The authors focus on topical hierarchies that allow a topic to have several parent topics which is important for multidisciplinary collections of scientific papers.  ...  Constructing A Topical Hier-archY (CATHY) approach [18] operates with phrases rather than with words and divides them between subtopics.  ... 
doi:10.21469/22233792.2.2.05 fatcat:6bvbzgd3zngsdphabpzbalqdhi

An Integrated Digital Library Server with OAI and Self-Organizing Capabilities [chapter]

Hyunki Kim, Chee-Yoong Choo, Su-Shing Chen
2003 Lecture Notes in Computer Science  
We also propose a multi-layered Self-Organizing Map (SOM) algorithm for building a subject-specific concept hierarchy using two input vector sets constructed by indexing the harvested metadata collection  ...  By using the concept hierarchy, we can also automatically classify the harvested metadata collection for the purpose of selective harvesting.  ...  Construction of a Concept Hierarchy We then constructed a concept hierarchy by extending the multilayered SOM algorithm [15] , permitting unlimited layers of SOM, with two input vector sets.  ... 
doi:10.1007/978-3-540-45175-4_16 fatcat:p2h5vktgszcbhkk2zspm7f3m3y
« Previous Showing results 1 — 15 out of 3,042 results