A copy of this work was available on the public web and has been preserved in the Wayback Machine. The capture dates from 2005; you can also visit the original URL.
The file type is application/pdf
.
Filters
XTRACT
2000
Proceedings of the 2000 ACM SIGMOD international conference on Management of data - SIGMOD '00
An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection. ...
In this paper, we propose XTRACT, a novel system for inferring a DTD schema for a database of XML documents. ...
A characteristic, however, that distinguishes XML from semistructured data models is the notion of a Document Type Descriptor (DTD) that may optionally accompany an XML document. ...
doi:10.1145/342009.335409
dblp:conf/sigmod/GarofalakisGRSS00
fatcat:7l6d3mjy4jgmtmv3exiykxf2zu
XTRACT
2000
SIGMOD record
An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection. ...
In this paper, we propose XTRACT, a novel system for inferring a DTD schema for a database of XML documents. ...
A characteristic, however, that distinguishes XML from semistructured data models is the notion of a Document Type Descriptor (DTD) that may optionally accompany an XML document. ...
doi:10.1145/335191.335409
fatcat:n6ml2x2efvbundih3fawjr2rbe
Advanced document description, a sequential approach
2006
SIGIR Forum
After a look at the state of the art of advanced document representations, we present a novel technique to efficiently extract frequent word sequences from document collections of any size. ...
We apply this new metric to the task of document retrieval and illustrate the multilingual-and domain-independence of our work by conducting experiments with scientific and general iii iv document collections ...
Its initial case document collection is a set of 12, 000 journal articles of the IEEE. The sample XML document given in Figure 2 .2 is an article from the INEX collection. ...
doi:10.1145/1147197.1147212
fatcat:k32ofbs5szd4hph4mgq6yq65am
Extraction of Template using Clustering from Heterogeneous Web Documents
2015
International Journal of Computer Applications
Data extraction from XML documents XTract: A system for extracting Document Type Descriptor from XML Documents [9] , provides a system for extracting Document Type Descriptor (DTD) schema from a database ...
DTD contains valuable information on the structure of document. XTract method solved the problem of DTD extraction from multiple XML documents. MDL cost is used to find good DTD. ...
doi:10.5120/21112-3906
fatcat:vksbqx55rjc5tohadufqt2uy34
Facility location problems: A parameterized view
2011
Discrete Applied Mathematics
Some applications of algorithms for these problems in the processing of semistructured documents and in computational biology are also described. ...
We introduce the study of these problems from the point of view of parameterized algorithms and complexity. ...
In his notes on this standard, Bray wrote, 3 commenting on the construction of document type descriptors (DTD), a context-free grammar used to specify syntactic characteristics of XML documents: ''Suppose ...
doi:10.1016/j.dam.2011.03.021
fatcat:zep44dg6lzhqla6cd2ghyukpvu
Information Extraction from Web Pages Using Presentation Regularities and Domain Knowledge
2007
World wide web (Bussum)
World Wide Web is transforming itself into the largest information resource making the process of information extraction (IE) from Web an important and challenging problem. ...
We demonstrate that such system can recover from ambiguities in the presentation and boost the overall accuracy of a base information extractor by up to 20%. ...
XTRACT [12] is such a system that can automatically extract Document Type Descriptors (DTDs) from a set of XML documents. ...
doi:10.1007/s11280-007-0021-1
fatcat:dnrbncmuvzd5rbsick7fismnqa
Discovering Knowledge from XML Documents
[chapter]
Encyclopedia of Data Warehousing and Mining
Simple Object Access Proto- extracting document type descriptors from XML docu-
col (SOAP) is a new technology that has enabled XML ments. ...
XML documents in this type of mining. ...
doi:10.4018/9781591405573.ch071
fatcat:k6girofj6fhnlgh23avgwn3g5a
Facility Location Problems: A Parameterized View
[chapter]
Lecture Notes in Computer Science
Some applications of algorithms for these problems in the processing of semistructured documents and in computational biology are also described. ...
We introduce the study of these problems from the point of view of parameterized algorithms and complexity. ...
In his notes on this standard, Bray wrote, 3 commenting on the construction of document type descriptors (DTD), a context-free grammar used to specify syntactic characteristics of XML documents: ''Suppose ...
doi:10.1007/978-3-540-68880-8_19
fatcat:e6os5a5jrfcbtgiica27kd6vuq
Facial expression recognition in the wild : from individual to group
[article]
2018
An image-only based facial expressions database Static Facial Expressions In The Wild (SFEW) extracted from AFEW is proposed. Furthermore, the thesis focuses on HPN for real-world images. ...
The central hypothesis of the thesis is that extracting close to real-world data from movies and performing facial expression analysis on movies is a stepping stone in the direction of moving the analysis ...
FER techniques can be segregated on the basis of type of descriptors used. Generally, facial descriptors can be broadly classified as geometric and appearance. ...
doi:10.25911/5d4ea922db07c
fatcat:g2yn7xzq5baxpfsuwdna574l2e
Disease-Symptom relation extraction from medical text corpora with BERT
2021
For example, effective disease- symptom relation extraction accelerates tasks such as reviewing large amounts of medical literature to learn new disease characteristics.In this work we present a relation ...
the problem of relation extraction as a named entity recognition problem, which simplifies the model and the annotation of the training dataset.We evaluate our model using the Disease Symptom Relation Collection ...
Teachm odel howt oe xtract relations from finetuning dataset 3. ...
doi:10.34726/hss.2021.77705
fatcat:slch6usppnecngnemf5verj53m
A framework for self-adaptive networked appliances
2017
XML applies meta-data to the internal structures of an XML document whilst and RDF document focuses on providing meta-data about the external information associated with a document such as `Author' and ...
A
Document Type Definition (DTD) schema[W3C 2005], however this was updated to the XML Schema specification[W3C 2005] by Dimitrov [Dimitrov 2000 ]. ...
doi:10.24377/ljmu.t.00005780
fatcat:cvyrun4dbngmdoce4eob2n3ize
Programme Committee Members
unpublished
The Association of Lithuanian Serials has received a grant from EIFL which helped introduce innovative ideas in Lithuania and abroad during 2011 and 2012. ...
Acknowledgements This paper presents results from the research project "Prevalence and attitudes towards plagiarism" supported by the Croatian Ministry of Science Education and Sports and the Committee ...
Data sample The sample of papers included 13,032 papers indexed with all three types of automatically generated descriptors. ...
fatcat:nkq2kysdtbgs7mbceldvovpcl4