A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2019; you can also visit the original URL.
The file type is application/pdf
.
Filters
Research on Domain Ontology Generation Based on Semantic Web
[chapter]
2016
IFIP Advances in Information and Communication Technology
showes a football ontology constructed by protégé , and makes a prospect to semantic retrieval based on ontology. ...
semantic web, domain ontology and so on are proposed, next it makes a research in the plsa algorithm of extracting domain concepts and the k-means algorithm of clustering thoese concepts, finally, it ...
A theme is a concept or one aspect that shows as a series of the relevant words which can represent the theme. ...
doi:10.1007/978-3-319-48390-0_19
fatcat:hy72kfsksvgnvolgwqatkfapey
Improved Shark-Search Flash Theme Search Algorithm
2015
International Journal of Database Theory and Application
The keyword set is the general search algorithm of children's game dictionary (1672 words), and the result is shown in Table 1 . ...
The following hypotheses are proposed according to such features: (1) If a web page is the one with Flash related to the theme, the sub-link of web page may be the one with Flash related to the theme. ...
doi:10.14257/ijdta.2015.8.1.27
fatcat:453364xstfaxpmf26mynapttny
Organization of Information for the Web Using Hierarchical Fuzzy Clustering Algorithm Based on Co-occurrence Networks
2010
2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology
The algorithm is applied on a collection of web pages and the results are compared with existing algorithms in the literature. ...
In this paper, we present a Hierarchical Fuzzy Clustering algorithm which uses domain knowledge to automatically determine the number of clusters and their initial values. ...
These data sets are a collection of web pages found on Wikipedia encyclopedia. ...
doi:10.1109/wi-iat.2010.86
dblp:conf/webi/ZaidiM10
fatcat:dzyg6szf5vgn7iuixvkvktefhu
Internet Tourism Resource Retrieval Using PageRank Search Ranking Algorithm
2021
Complexity
The main work completed in the thesis proposes and constructs a topic collection algorithm and establishes a starting point, topic keywords, and a prediction mechanism. ...
Experimental results show that the algorithm can successfully extract the main content of the article from a wide variety of web pages. ...
asynchronous technology. e theme collection module uses the theme collection algorithm to establish a related database and collects the pages related to the theme based on the collection database. e early ...
doi:10.1155/2021/5114802
fatcat:mt5yyadjcjcptnvkehxqff4pyu
A Semantic Search Engine Based on SKOS Model Ontology in Agriculture
[chapter]
2011
IFIP Advances in Information and Communication Technology
A theme relevance algorithm based on terms' distances in ontology system was tested and applied in improving the Pagerank evaluating. ...
A simple agriculture ontology system was constructed under extended SKOS model in this paper. ...
It analyzes the posted strings with a Chinese word segment agent and generates a keyword set. ...
doi:10.1007/978-3-642-18333-1_15
fatcat:d7ibtrodareyra6mgmg6624ttu
Effective Keyword And Similarity Thresholds For The Discovery Of Themes From The User Web Access Patterns
2007
Zenodo
In this paper we focus on both keyword and similarity thresholds to generate themes with concentrated themes, and hence build a more sound model of the user behavior. ...
The purpose of this paper is two fold: use distance based clustering methods to recognize overall themes from the Proxy log file, and suggest an efficient cut off levels for the keyword and similarity ...
The impetus for the work reported in this paper came from our need for a complete user profile which would allow us to design a fully automatic Web navigation system and a theme based search engine. ...
doi:10.5281/zenodo.1060548
fatcat:b5gpgftetfch3fqojqybcnwkce
Semantic word cloud generation based on word embeddings
2016
2016 IEEE Pacific Visualization Symposium (PacificVis)
Distributed word representation is applied to accurately describe the semantic meaning of words, and a word similarity graph is constructed based on the semantic distance between words to lay out words ...
Word-related interactions are introduced to guide users fast read and understand the text. ...
In order to create a sematic word layout with a pleasing layout, we construct a word similarity graph, and then use the graph related algorithms to generate a semantic-preserving and aesthetic word layout ...
doi:10.1109/pacificvis.2016.7465278
dblp:conf/apvis/XuTL16
fatcat:eqfey3lgobhqnanvor3ms4bksu
Extracting key terms from noisy and multitheme documents
2009
Proceedings of the 18th international conference on World wide web - WWW '09
Additional experiments on web pages prove that our method appears to be substantially more effective on noisy and multi-theme documents than existing methods. ...
First, it allows effectively processing multi-theme documents. Second, it is good at filtering out noise information in the document, such as, for example, navigational bars or headers in web pages. ...
For example, in [15] the graph is constructed using a syntactic term relatedness (namely, co-occurrence relation) defined as follows: two terms are related if they co-occur within a window of maximum ...
doi:10.1145/1526709.1526798
dblp:conf/www/GrinevaGL09
fatcat:rjds4d7kxfe5dbnlycjekfnoke
Application on Web Page Filtering Technology
2014
International Journal of Multimedia and Ubiquitous Engineering
And in combination with the Vision-based Page Segmentation Algorithm, a DVPS Algorithm which considers both layout features and visual features was proposed to improve web page filtering efficiency. ...
Based on DIV tags dividing the content block of the page, this paper proposes a new data filtering scheme, DVPS algorithm. ...
The construction method of Web page classifier including Artificial Neural Networks, Machine Learning and Web classification based on statistics model [17] . ...
doi:10.14257/ijmue.2014.9.12.35
fatcat:tdapvuf5azgz7js2yifg42rxzi
Digitalization and Information Management Mechanism of Sports Events Based on Multisensor Node Cooperative Perception Model
2022
Journal of Sensors
Through experimental comparison, the effectiveness of content-based recommendation algorithm technology in the event network data set is verified, and an algorithm model suitable for marathon event recommendation ...
According to the recommendation target of the event and the characteristics of the event data type, we can choose a single or comprehensive recommendation algorithm to build a model to realize the event ...
After the above description, it can be seen that the Internet data sets of sports events have the characteristics of large number and many types, and a single content-based recommendation algorithm modeling ...
doi:10.1155/2022/6430191
fatcat:tw4ix3flz5hkfbj7iaq32ap3x4
Multilingual document mining and navigation using self-organizing maps
2011
Information Processing & Management
In this approach, a self-organizing map is constructed to train each set of monolingual Web pages and obtain two feature maps, which reveal the relationships among Web pages and thematic keywords respectively ...
Finally, a multilingual Web directory is constructed according to such associations. ...
The construction process consists of two major tasks. The first is topic detection which identifies the major themes existed in a set of close related Web pages. ...
doi:10.1016/j.ipm.2009.12.003
fatcat:lg26lu7menfkbl6qhsywpbpngy
Mining Generalized Associations of Semantic Relations from Textual Web Content
2007
IEEE Transactions on Knowledge and Data Engineering
In this paper, we present a two-step procedure to mine generalized associations of semantic relations conveyed by the textual content of Web documents. ...
Then, a novel generalized association pattern mining algorithm (GP-Close) is applied to discover the underlying relation association patterns on RDF metadata. ...
represents a meaning of a word and corresponds to a set of synonyms in WordNet. ...
doi:10.1109/tkde.2007.36
fatcat:jkxp3oiotbe2vmmegagu7ekpuu
A text mining approach for automatic construction of hypertexts
2005
Expert systems with applications
In this work, we will propose a new automatic hypertext construction method based on a text mining approach. ...
Our method had been tested on a set of at text documents collected from a newswire site. ...
The threshold is a real value near 1. By virtue of SOM algorithm, a neuron may be labeled by several words which often co-occurred in a set of documents. Thus a neuron forms a word cluster. ...
doi:10.1016/j.eswa.2005.05.003
fatcat:4a3p7cen75gyle2hxepwb7cbhy
Improved PageRank Algorithm Combined User Behavior with Topic Similarity
2017
DEStech Transactions on Computer Science and Engineering
Aiming at shortcomings of the drifting theme and the splitting page weight of traditional PageRank algorithm, an improved PageRank algorithm combined user behavior with topic similarity was proposed. ...
The simulation results show that compared with Micro-blog's common sorting algorithms, the improved PageRank algorithm has better sorting effect. ...
influence order of users), WPR algorithm (The theme correlation algorithm, by constructing a link analysis of micro-blog users / web content based on user influence ranking) and TURank algorithm (Combined ...
doi:10.12783/dtcse/aita2016/7578
fatcat:r466r2asnzcchdhkysmja5jo4m
Automatic Acquisition and Semantic Annotation of Web Tourism Information
2019
DEStech Transactions on Computer Science and Engineering
The crawler which collects data from web site automatically is introduced firstly. Then the Chinese word segmentation tool and a classic key word extraction algorithm TF/IDF are introduced. ...
A method which collects data from tourism web site and annotates these data with semantic tags automatically is promoted in this paper. ...
The second type of directional web crawler is also called the theme web crawler. It crawls according to the theme given in advance when crawling the web page and crawls selectively. ...
doi:10.12783/dtcse/cscbd2019/30026
fatcat:xtctwaugzbemfbegaugi5q5yny
« Previous
Showing results 1 — 15 out of 27,295 results