Filters








269 Hits in 6.1 sec

Data Mining, Management and Visualization in Large Scientific Corpuses [chapter]

Hui Wei, Shaopeng Wu, Youbing Zhao, Zhikun Deng, Nikolaos Ersotelos, Farzad Parvinzamir, Baoquan Liu, Enjie Liu, Feng Dong
2016 Lecture Notes in Computer Science  
In this paper, we experiment text mining and data management of scientific publications for collecting and presenting useful information to support research.  ...  For efficient data management and fast information retrieval, four data storages are employed: a semantic repository, an index and search repository, a document repository and a graph repository, taking  ...  Conclusion In this paper, we present our work on text mining and data management on a large number of scientific publications for collecting and presenting citation information and topic trends to facilitate  ... 
doi:10.1007/978-3-319-40259-8_32 fatcat:teizysh53rdrrbyd5efyuo3wsq

Management of Scientific Documents and Visualization of Citation Relationships using Weighted Key Scientific Terms

Hui Wei, Youbing Zhao, Shaopeng Wu, Zhikun Deng, Farzad Parvinzamir, Feng Dong, Enjie Liu, Gordon Clapworthy
2016 Proceedings of the 5th International Conference on Data Management Technologies and Applications  
This paper presents work on the management and visualization of large corpuses of scientific papers in order to help researchers explore their citation relationships.  ...  Effective management and visualization of scientific and research documents can greatly assist researchers by improving understanding of relationships (e.g. citations) between the documents.  ...  This paper presents our work on the management and visualization of large corpuses of scientific papers in order to help researchers explore their citation relationships.  ... 
doi:10.5220/0005981501350143 dblp:conf/data/WeiZWDPDLC16 fatcat:ghhhxrealvcg7n5e2vy4fzr6aa

Unstructured Text Documents Summarization with Multi-Stage Clustering

Muhammad Yahya Saeed, Muhammad Awais, Ramzan Talib, Muhammad Younas
2020 IEEE Access  
This step created a cluster-based text grouping of a large corpus into manageable sub-corpuses.  ...  Using this form of metadata, our DCC converted a broad set of documents into small manageable subgroups. We presented these subgroups/sub-corpuses as nodes and joined by weighted paths in AG.  ... 
doi:10.1109/access.2020.3040506 fatcat:bxzgs6ohenak7gc6nvtndwom2y

Is there a grand challenge or X-prize for data mining?

Gregory Piatetsky-Shapiro, Robert Grossman, Chabane Djeraba, Ronen Feldman, Lise Getoor, Mohammed Zaki
2006 Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining - KDD '06  
This panel will discuss possible exciting and motivating Grand Challenge problems for Data Mining, focusing on bioinformatics, multimedia mining, link mining, text mining, and web mining.  ...  and knowledge management.  ...  foundations for working with large, complex data. c) Pragmatic Grand Challenges: concerned with data preparation and integration, and developing, deploying and embedding statistical and data mining models  ... 
doi:10.1145/1150402.1150535 dblp:conf/kdd/Piatetsky-ShapiroGDFGZ06 fatcat:tdfjeonwajbpfpdxr72rub37pi

Causal Knowledge Extraction through Large-Scale Text Mining

Oktie Hassanzadeh, Debarun Bhattacharjya, Mark Feblowitz, Kavitha Srinivas, Michael Perrone, Shirin Sohrabi, Michael Katz
2020 PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE  
In this demonstration, we present a system for mining causal knowledge from large corpuses of text documents, such as millions of news articles.  ...  We show example use cases developed for a commercial application in enterprise risk management.  ...  Introduction Capturing and representing causal knowledge is a challenging problem in AI, with important applications in various domains such as healthcare, legal, and enterprise risk management.  ... 
doi:10.1609/aaai.v34i09.7092 fatcat:ns7l3zzkpvezjhjp7tfl67nrsq

The Big Data Analytics Regarding the Cadastral Resurvey News Articles

Yong-Jin Joo, Duck-Ho Kim
2014 Journal of the Korean Society of Surveying Geodesy Photogrammetry and Cartography  
That is, we searched the main keywords regarding cadastral resurvey, performing extraction of compound noun and data mining analysis. And visualization of the results was presented.  ...  In addition, new reports related to cadastral resurvey between 2012 and 2014 were searched in newspapers, and nouns were extracted from the searched data for the data mining analysis of cadastral information  ...  As mining techniques are applied to decision making, marketing, customer management, finance, medicine, education, and energy, data mining in a broad sense is a generic term for mining techniques based  ... 
doi:10.7848/ksgpc.2014.32.6.651 fatcat:73lwayqh5ff4tin3khe7g7reaq

CERC: an interactive content extraction, recognition, and construction tool for clinical and biomedical text

Eva K Lee, Karan Uppal
2020 BMC Medical Informatics and Decision Making  
to improve quality of care and assist in data-driven and evidence-based informed decision making for direct patient care.  ...  In this work, we develop an interactive content extraction, recognition, and construction system (CERC) that combines machine learning and visualization techniques with domain knowledge for highlighting  ...  , Chris Kwan, Eunho Kwon, Di Liu, Joe Malecki, Autumn Phillips, and Peijue Zhang, who helped with the initial usage and testing of the anonymized data generated from the customizable information extraction  ... 
doi:10.1186/s12911-020-01330-8 pmid:33323109 fatcat:neac7vl5fncibnx3kxcktyqmba

Skill Needs for Early Career Researchers—A Text Mining Approach

Monica Mihaela Maer-Matei, Cristina Mocanu, Ana-Maria Zamfir, Tiberiu Marian Georgescu
2019 Sustainability  
This article proposes a text mining approach applied to a large amount of data extracted from job vacancies advertisements, aiming to shed light on the main skills and demands that characterize first stage  ...  Management of time, risks, projects, and resources plays an important part in the job requirements included in the analyzed advertisements.  ...  Leadership, management and entrepreneurial skills are also addressed by the scientific literature, but are treated on a rather separate track.  ... 
doi:10.3390/su11102789 fatcat:gv7ibxlzhnfgfgtxhxmupvdzxe

Computer science for non-technological cyber programs

Amir Rubinstein
2014 2014 IEEE Frontiers in Education Conference (FIE) Proceedings  
The world of cyberspace revolves around the scientific and technological as well as other facets of the internet, data encryption, digital communication, signal processing and data mining.  ...  This course was offered for the first time in Fall 2013. We describe the considerations in the design of the course, its content and structure.  ...  ACKNOWLEDGMENT We thank Benny Chor for a critical review of this manuscript, for contributing material for the course, and for helpful discussions.  ... 
doi:10.1109/fie.2014.7044442 dblp:conf/fie/Rubinstein14 fatcat:dsbpii7o65futdg5rjh4tcm2ty

Visual Analytics Infrastructures: From Data Management to Exploration

Jean-Daniel Fekete
2013 Computer  
visualization, data analysis, and data management.  ...  We argue that addressing these requirements will benefit not only Visual Analytics tools, but also analytics and data management systems in general.  ...  ACKNOWLEDGMENTS Thanks to Wesley Willett for his help in improving the paper and Jeremy Boy for the illustrations.  ... 
doi:10.1109/mc.2013.120 fatcat:22qguaoiojf2xorvcxdieeq22e

Anything Else? Assessing the Needs of Researchers at the Library of Paris-Dauphine

André Lohisse
2019 Ticker: The Academic Business Librarianship Review  
In March 2017, the library of Université Paris-Dauphine launched a survey to assess researchers' level of familiarity with Open Access, research data, text mining and the new legal environment.  ...  The researchers who were surveyed also demonstrated proportionally limited knowledge of Open Access and research data management.  ...  Since 48.8% of respondents are already "searching large corpuses of texts and data," an opportunity exists for the library to help researchers and PhD students explore these large datasets. 60.8% of the  ... 
doi:10.3998/ticker.16481003.0003.205 fatcat:4m5gx4j255aerjdg7k536unw6a

A Seeded Cloud Approach to Health Cyberinfrastructure: Preliminary Architecture Design and Case Applications

Chaitan Baru, Nathan Botts, Thomas Horan, Kevin Patrick, Sue S. Feldman
2012 2012 45th Hawaii International Conference on System Sciences  
personal health information; and, from reference scientific datasets to observational data and sensor streams.  ...  Applications in public health and health services require access to a range of heterogeneous data, from environmental information in a region, to population-level data across regions, to more closely held  ...  San Diego, the San Diego Supercomputer Center and the Kay Center for E-Health Research for their support of these research efforts.  ... 
doi:10.1109/hicss.2012.82 dblp:conf/hicss/BaruBHPF12 fatcat:oxtrpgsx2vgapne3g4pmuoeshm

Methodologically grounded semantic analysis of large volume of chilean medical literature data applied to the analysis of medical research funding efficiency in Chile

Patricio Wolff, Sebastián Ríos, David Clavijo, Manuel Graña, Miguel Carrasco
2020 Journal of Biomedical Semantics  
encoded in the large volume of medical literature.  ...  In order to exploit this knowledge by automated systems, there is a growing interest in developing text mining methodologies to extract, structure, and analyze in the shortest time possible the knowledge  ...  Begoña Yarza, M.D. for the support and good suggestion to enhance this work.  ... 
doi:10.1186/s13326-020-00226-w pmid:32993795 pmcid:PMC7523397 fatcat:7d45lc3cdjbyfivitgrylamxp4

TellUsWho: Guided Social Network Data Collection

Stephen T. Ricken, Richard P. Schuler, Sukeshini A. Grandhi, Quentin Jones
2010 2010 43rd Hawaii International Conference on System Sciences  
TellUsWho supported the collection of rich social network data in a relatively short time period.  ...  One important reason for this has been that researchers have not been able to systematically probe individuals in sufficient detail about 'who' and 'how' they interact with in the social networks they  ...  The opinions expressed are those of the authors and may not reflect those of the NSF. References  ... 
doi:10.1109/hicss.2010.365 dblp:conf/hicss/RickenSGJ10 fatcat:kobn2wedxjaenepdsleajl7j7u

The state-of-the-art on Intellectual Property Analytics (IPA): A literature review on artificial intelligence, machine learning and deep learning methods for analysing intellectual property (IP) data

Leonidas Aristodemou, Frank Tietze
2018 World Patent Information  
and effective management of information.  ...  We define Intellectual Property Analytics (IPA) as the data science of analysing large amount of IP information, to discover relationships, trends and patterns for decision making.  ...  Acknowledgement The authors would like to acknowledge support of the Engineering and Physical Sciences Research Council (EPSRC).  ... 
doi:10.1016/j.wpi.2018.07.002 fatcat:cawnmevwcna2zep7z6ikixwzgu
« Previous Showing results 1 — 15 out of 269 results