Filters








5,925 Hits in 5.7 sec

Research on the Building Method of Domain Lexicon Combining Association Rules and Improved TF*IDF [chapter]

Shouning Qu, Simon Xu
2011 Lecture Notes in Electrical Engineering  
The improved TF*IDF algorithm considers the affection of text length, feature item length, feature item location and identification of compound words on the topic extraction.  ...  :To improve the efficiency and accuracy of topic words extraction in information extraction and topic words classification, a new topic lexicon building method is gradually updated and finally formed by  ...  In this paper, we use the compound word recognition method based on association rules [16] , treat the foreground document collections as a transaction database.  ... 
doi:10.1007/978-1-4614-0373-9_24 fatcat:pqzcqfti45dj5hpvqzgdqs6daq

A test-suite generator for database systems

Ariel Hamlin, Jonathan Herzog
2014 2014 IEEE High Performance Extreme Computing Conference (HPEC)  
Thus, database evaluators could use this tool to craft test suites for particular aspects of a specific database system.  ...  In this paper, we describe the SPAR Test Suite Generator (STSG), a new test-suite generator for SQL style database systems.  ...  ACKNOWLEDGMENT The authors would like to acknowledge Oliver Dain for his aid in the design and implementation of the test suite generator.  ... 
doi:10.1109/hpec.2014.7040957 dblp:conf/hpec/HamlinH14 fatcat:kgi7rrezmnffpf4zdwfiry42hy

Wavelet domain textual coding of Ottoman script images

Oemer N. Gerek, Enis A. Cetin, Ahmed H. Tewfik, Rashid Ansari, Mark J. T. Smith
1996 Visual Communications and Image Processing '96  
On the other hand, these coding methods neither take into account the special characteristics of the images in a database nor are they suitable for fast database search.  ...  Typically, one has to deal with compound structures consisting of a group of letters. Therefore, the matching criterion will be according to those compound structures.  ...  This method is suitable for fast database search if the properties of each extracted compound structure are supplied.  ... 
doi:10.1117/12.233272 fatcat:p3w22vlkpbdwncbcwk6cp7f2aq

Screening for type 2 diabetes

Ruth Sander
2017 Nursing Older People  
Evidence for the environment as a determinant of diabetes is also apparent in studies of recent immigrants from a developing country to Canada.  ...  [mp=title, abstract, full text, keywords, caption text] 6 4 and 5 Database: EBM Reviews -Database of Abstracts of Reviews of Effects Search Strategy: 1 ((fasting glucose or glucose tolerance) adj3  ... 
doi:10.7748/nop.29.2.13.s14 pmid:28244355 fatcat:maxgnekcavcxhhg67wxxwbzqwa

Prediction of Chinese Semantic Word-Building Patterns Based on Complex Network Features

Minfeng Wang, Xin Ning
2022 Wireless Communications and Mobile Computing  
In this paper, complex networks are introduced into the prediction of Chinese semantic word-formation patterns, and a new prediction method of Chinese semantic word-formation patterns based on complex  ...  And a solution that combines the semantic word-building rules of Chinese language with pattern recognition algorithm is put forward.  ...  Based on the co-occurrence relationship between words, this paper constructs a single text weighted complex network to represent Chinese text. is text representation method can not only contain the information  ... 
doi:10.1155/2022/4162998 fatcat:nt6chn2l4jh3bkggo2sptdbim4

Intelligent Search for Image Information on the Web through Text and Link Structure Analysis [chapter]

Euripides G.M. Petrakis
2008 Multimodal Processing and Interaction  
Searching for effective methods to retrieve information from the World Wide Web (WWW) has been in the center of many research efforts during the last few years.  ...  Image retrieval on the Web, in particular, is a very important problem in itself [8] . The relevant technology has also evolved significantly propelled by advances in image database research [20] .  ...  Relevance feedback attempts to guess the ideal query (or matching method) from answers that are initially obtained from the database.  ... 
doi:10.1007/978-0-387-76316-3_12 fatcat:qxlpqotdmndspb3oqnmq3iac2i

Using IR Techniques for Text Classification in Document Analysis [chapter]

Rainer Hoch
1994 SIGIR '94  
As output, the system evaluates a set of weighted hypotheses about the type of the actual letter.  ...  The system employs several knowledge sources including a letter database, word frequency statistics for German, lists of message type specific words, morphological knowledge as well as the underlying document  ...  Acknowledgements I would like to thank Stefan Dittrich who implemented most parts of the INFOCLAS system.  ... 
doi:10.1007/978-1-4471-2099-5_4 dblp:conf/sigir/Hoch94 fatcat:c7i46dd6wnexlibsb6o2ek2f74

Senti-COVID19: An Interactive Visual Analytics System for Detecting Public Sentiment and Insights regarding COVID-19 from Social Media

Xuemin Yu, Martha Ferreira, Fernando V. Paulovich.
2021 IEEE Access  
performance especially for social media text and is fast enough [8] .  ...  These numbers are stored in the database. B. KEYWORD EXTRACTION The system extracts keywords from daily tweets to detect the trigger of the sentiment.  ... 
doi:10.1109/access.2021.3111833 fatcat:vvmy2shvxffpdlxwyhkw2e5msi

A system for facilitating and enhancing web search [chapter]

Steffen Staab, Christian Braun, Ilvio Bruder, Antje Düsterhöft, Andreas Heuer, Meike Klettke, Günter Neumann, Bernd Prager, Jan Pretzel, Hans-Peter Schnurr, Rudi Studer, Hans Uszkoreit (+1 others)
1999 Lecture Notes in Computer Science  
We present a system that uses semantic methods and natural language processing capabilites in order to provide comprehensive and easy-to-use access to tourist information in the WWW.  ...  Thereby, the system is designed such that as background knowledge and linguistic coverage increase, the benefits of the system improve, while it guarantees state-of-the-art information and database retrieval  ...  based on a cascade of weighted finite state transducers.  ... 
doi:10.1007/bfb0100538 fatcat:xxdm2vvurjaznhhtrnbkqa7hgi

Indexing Text and Visual Features for WWW Images [chapter]

Heng Tao Shen, Xiaofang Zhou, Bin Cui
2005 Lecture Notes in Computer Science  
Based on the property that relevant images haves similar similarity values from the center of the same local partition in any feature space, certain number of irrelevant images can be fast pruned based  ...  Our LBS method outperforms sequential scan on high dimensional space by an order of magnitude.  ...  The following formula is used to compute the total weight for each word. ∑ = = 6 1 i i weight weight Word Word Where i weight Word is the weight of Word in type i lexical chain, and i ranges from 1 to  ... 
doi:10.1007/978-3-540-31849-1_85 fatcat:in2bkrfv4be6jlognxtmzzkdnu

Treatment of Semantic Heterogeneity in Information Retrieval [article]

Heiko Hellweg, Jürgen Krause, Thomas Mandl, Jutta Marx, Matthias N.O. Müller, Peter Mutschke, Robert Strötgen
2011 arXiv   pre-print
Section 2 describes a set of cascading deductive and heuristic extraction rules, which were developed in the project CARMEN for the domain of Social Sciences.  ...  Section 3 describes the creation, storage and handling of such transfers.  ...  We defined that manually assigned keywords were "pure" assignments and assumed a weight of 1 on a scale from 0 to 1.  ... 
arXiv:1102.3866v1 fatcat:l23fxw4fgrhj5prioywraig6eq

GETESS—Searching the Web Exploiting German Texts [chapter]

Steffen Staab, Christian Braun, Ilvio Bruder, Antje Düsterhöft, Andreas Heuer, Meike Klettke, Günter. Neumann, Bernd Prager, Jan Pretzel, Hans-Peter Schnurr, Rudi Studer, Hans Uszkoreit (+1 others)
1999 Lecture Notes in Computer Science  
We present an intelligent information agent that uses semantic methods and natural language processing capabilites in order to gather tourist information from the WWW and present it to the human user in  ...  Thereby, the information agent is designed such that as background knowledge and linguistic coverage increase, its benefits improve, while it guarantees state-of-the-art information and database retrieval  ...  based on a cascade of weighted finite state transducers.  ... 
doi:10.1007/3-540-48414-0_7 fatcat:zjwjobdpvnetdd4a4eckd7vlm4

DrugCombDB: a comprehensive database of drug combinations toward network medicine and combination therapy [article]

Lei Deng, Bo Zou, Wenhao Zhang, Hui Liu
2018 bioRxiv   pre-print
Thanks to the fast development of high-throughput screening (HTS) methods, the amount of available drug combination datasets has tremendously increased.  ...  In this paper, we present DrugCombDB, a comprehensive database dedicated to integrating drug combinations from various data sources.  ...  Thanks to the fast development of high-throughput screening (HTS) methods, it is possible to systematically evaluate the pairwise combinations from a large number of both approved and investigational chemical  ... 
doi:10.1101/477547 fatcat:gylwru3nvrcdbc4zvp7psiyc7y

Pruning the vocabulary for better context recognition

R.E. Madsen, S. Sigurdsson, L.K. Hansen, J. Larsen
2004 Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.  
We consider a new approach, using neural network based sensitivity maps and information gain for determination of term relevancy, when pruning the vocabularies.  ...  We also study the applicability of information gain and sensitivity maps for automated keyword generation.  ...  ACKNOWLEDGMENT The work is supported by the European Commission through the sixth framework IST Network of Excellence: Pattern Analysis, Statistical Modelling and Computational Learning (PASCAL), contract  ... 
doi:10.1109/icpr.2004.1334270 dblp:conf/icpr/MadsenSHL04 fatcat:lto3ltt6src43hcv7buqkamnbi

Robust text processing in automated information retrieval

Tomek Strzalkowski
1994 Proceedings of the fourth conference on Applied natural language processing -  
This paper outlines a prototype text retrieval system which uses relatively advanced natural language processing techniques in order to enhance the effectiveness of statistical document retrieval.  ...  We report on selected preliminary results of experiments with 500 MByte database of Wall Street Journal articles, as well as some earlier results with a smaller document collection.  ...  Jose Perez Carballo has contributed a number of valuable observations during the course of this work, and his assistance in processing the TREC data was critical.  ... 
doi:10.3115/974358.974396 dblp:conf/anlp/Strzalkowski94 fatcat:3irhkmnwuzgqpif7ld2fizgpeu
« Previous Showing results 1 — 15 out of 5,925 results