Research on the Building Method of Domain Lexicon Combining Association Rules and Improved TF*IDF
[chapter]
2011
Lecture Notes in Electrical Engineering
The improved TF*IDF algorithm considers the effect of text length, feature item length, feature item location and identification of compound words on the topic extraction. ...
To improve the efficiency and accuracy of topic words extraction in information extraction and topic words classification, a new topic lexicon building method is gradually updated and finally formed by ...
In this paper, we use the compound word recognition method based on association rules [16] and treat the foreground document collections as a transaction database. ...
doi:10.1007/978-1-4614-0373-9_24
fatcat:pqzcqfti45dj5hpvqzgdqs6daq
A test-suite generator for database systems
2014
2014 IEEE High Performance Extreme Computing Conference (HPEC)
Thus, database evaluators could use this tool to craft test suites for particular aspects of a specific database system. ...
In this paper, we describe the SPAR Test Suite Generator (STSG), a new test-suite generator for SQL style database systems. ...
ACKNOWLEDGMENT The authors would like to acknowledge Oliver Dain for his aid in the design and implementation of the test suite generator. ...
doi:10.1109/hpec.2014.7040957
dblp:conf/hpec/HamlinH14
fatcat:kgi7rrezmnffpf4zdwfiry42hy
Wavelet domain textual coding of Ottoman script images
1996
Visual Communications and Image Processing '96
On the other hand, these coding methods neither take into account the special characteristics of the images in a database nor are they suitable for fast database search. ...
Typically, one has to deal with compound structures consisting of a group of letters. Therefore, the matching criterion will be according to those compound structures. ...
This method is suitable for fast database search if the properties of each extracted compound structure are supplied. ...
doi:10.1117/12.233272
fatcat:p3w22vlkpbdwncbcwk6cp7f2aq
Screening for type 2 diabetes
2017
Nursing Older People
Evidence for the environment as a determinant of diabetes is also apparent in studies of recent immigrants from a developing country to Canada. ...
[mp=title, abstract, full text, keywords, caption text]
6 4 and 5
Database: EBM Reviews - Database of Abstracts of Reviews of Effects
Search Strategy:
1 ((fasting glucose or glucose tolerance) adj3 ...
doi:10.7748/nop.29.2.13.s14
pmid:28244355
fatcat:maxgnekcavcxhhg67wxxwbzqwa
Prediction of Chinese Semantic Word-Building Patterns Based on Complex Network Features
2022
Wireless Communications and Mobile Computing
In this paper, complex networks are introduced into the prediction of Chinese semantic word-formation patterns, and a new prediction method of Chinese semantic word-formation patterns based on complex ...
A solution that combines the semantic word-building rules of the Chinese language with a pattern recognition algorithm is put forward. ...
Based on the co-occurrence relationship between words, this paper constructs a single text weighted complex network to represent Chinese text. This text representation method can not only contain the information ...
doi:10.1155/2022/4162998
fatcat:nt6chn2l4jh3bkggo2sptdbim4
Intelligent Search for Image Information on the Web through Text and Link Structure Analysis
[chapter]
2008
Multimodal Processing and Interaction
Searching for effective methods to retrieve information from the World Wide Web (WWW) has been in the center of many research efforts during the last few years. ...
Image retrieval on the Web, in particular, is a very important problem in itself [8]. The relevant technology has also evolved significantly propelled by advances in image database research [20]. ...
Relevance feedback attempts to guess the ideal query (or matching method) from answers that are initially obtained from the database. ...
doi:10.1007/978-0-387-76316-3_12
fatcat:qxlpqotdmndspb3oqnmq3iac2i
Using IR Techniques for Text Classification in Document Analysis
[chapter]
1994
SIGIR '94
As output, the system evaluates a set of weighted hypotheses about the type of the actual letter. ...
The system employs several knowledge sources including a letter database, word frequency statistics for German, lists of message type specific words, morphological knowledge as well as the underlying document ...
Acknowledgements I would like to thank Stefan Dittrich who implemented most parts of the INFOCLAS system. ...
doi:10.1007/978-1-4471-2099-5_4
dblp:conf/sigir/Hoch94
fatcat:c7i46dd6wnexlibsb6o2ek2f74
Senti-COVID19: An Interactive Visual Analytics System for Detecting Public Sentiment and Insights regarding COVID-19 from Social Media
2021
IEEE Access
performance especially for social media text and is fast enough [8]. ...
These numbers are stored in the database.
B. KEYWORD EXTRACTION The system extracts keywords from daily tweets to detect the trigger of the sentiment. ...
doi:10.1109/access.2021.3111833
fatcat:vvmy2shvxffpdlxwyhkw2e5msi
A system for facilitating and enhancing web search
[chapter]
1999
Lecture Notes in Computer Science
We present a system that uses semantic methods and natural language processing capabilities in order to provide comprehensive and easy-to-use access to tourist information in the WWW. ...
Thereby, the system is designed such that as background knowledge and linguistic coverage increase, the benefits of the system improve, while it guarantees state-of-the-art information and database retrieval ...
based on a cascade of weighted finite state transducers. ...
doi:10.1007/bfb0100538
fatcat:xxdm2vvurjaznhhtrnbkqa7hgi
Indexing Text and Visual Features for WWW Images
[chapter]
2005
Lecture Notes in Computer Science
Based on the property that relevant images have similar similarity values from the center of the same local partition in any feature space, a certain number of irrelevant images can be quickly pruned based ...
Our LBS method outperforms sequential scan on high dimensional space by an order of magnitude. ...
The following formula is used to compute the total weight for each word: $Word_{weight} = \sum_{i=1}^{6} Word_{weight_i}$, where $Word_{weight_i}$ is the weight of Word in the type $i$ lexical chain, and $i$ ranges from 1 to ...
doi:10.1007/978-3-540-31849-1_85
fatcat:in2bkrfv4be6jlognxtmzzkdnu
Treatment of Semantic Heterogeneity in Information Retrieval
[article]
2011
arXiv
pre-print
Section 2 describes a set of cascading deductive and heuristic extraction rules, which were developed in the project CARMEN for the domain of Social Sciences. ...
Section 3 describes the creation, storage and handling of such transfers. ...
We defined that manually assigned keywords were "pure" assignments and assumed a weight of 1 on a scale from 0 to 1. ...
arXiv:1102.3866v1
fatcat:l23fxw4fgrhj5prioywraig6eq
GETESS—Searching the Web Exploiting German Texts
[chapter]
1999
Lecture Notes in Computer Science
We present an intelligent information agent that uses semantic methods and natural language processing capabilities in order to gather tourist information from the WWW and present it to the human user in ...
Thereby, the information agent is designed such that as background knowledge and linguistic coverage increase, its benefits improve, while it guarantees state-of-the-art information and database retrieval ...
based on a cascade of weighted finite state transducers. ...
doi:10.1007/3-540-48414-0_7
fatcat:zjwjobdpvnetdd4a4eckd7vlm4
DrugCombDB: a comprehensive database of drug combinations toward network medicine and combination therapy
[article]
2018
bioRxiv
pre-print
Thanks to the fast development of high-throughput screening (HTS) methods, the amount of available drug combination datasets has tremendously increased. ...
In this paper, we present DrugCombDB, a comprehensive database dedicated to integrating drug combinations from various data sources. ...
Thanks to the fast development of high-throughput screening (HTS) methods, it is possible to systematically evaluate the pairwise combinations from a large number of both approved and investigational chemical ...
doi:10.1101/477547
fatcat:gylwru3nvrcdbc4zvp7psiyc7y
Pruning the vocabulary for better context recognition
2004
Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004.
We consider a new approach, using neural network based sensitivity maps and information gain for determination of term relevancy, when pruning the vocabularies. ...
We also study the applicability of information gain and sensitivity maps for automated keyword generation. ...
ACKNOWLEDGMENT The work is supported by the European Commission through the sixth framework IST Network of Excellence: Pattern Analysis, Statistical Modelling and Computational Learning (PASCAL), contract ...
doi:10.1109/icpr.2004.1334270
dblp:conf/icpr/MadsenSHL04
fatcat:lto3ltt6src43hcv7buqkamnbi
Robust text processing in automated information retrieval
1994
Proceedings of the fourth conference on Applied natural language processing
This paper outlines a prototype text retrieval system which uses relatively advanced natural language processing techniques in order to enhance the effectiveness of statistical document retrieval. ...
We report on selected preliminary results of experiments with a 500 MByte database of Wall Street Journal articles, as well as some earlier results with a smaller document collection. ...
Jose Perez Carballo has contributed a number of valuable observations during the course of this work, and his assistance in processing the TREC data was critical. ...
doi:10.3115/974358.974396
dblp:conf/anlp/Strzalkowski94
fatcat:3irhkmnwuzgqpif7ld2fizgpeu