12 Hits in 2.2 sec

XTRACT

Minos Garofalakis, Aristides Gionis, Rajeev Rastogi, S. Seshadri, Kyuseok Shim
2000 Proceedings of the 2000 ACM SIGMOD international conference on Management of data - SIGMOD '00  
An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection.  ...  In this paper, we propose XTRACT, a novel system for inferring a DTD schema for a database of XML documents.  ...  A characteristic, however, that distinguishes XML from semistructured data models is the notion of a Document Type Descriptor (DTD) that may optionally accompany an XML document.  ... 
doi:10.1145/342009.335409 dblp:conf/sigmod/GarofalakisGRSS00 fatcat:7l6d3mjy4jgmtmv3exiykxf2zu

XTRACT

Minos Garofalakis, Aristides Gionis, Rajeev Rastogi, S. Seshadri, Kyuseok Shim
2000 SIGMOD record  
An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection.  ...  In this paper, we propose XTRACT, a novel system for inferring a DTD schema for a database of XML documents.  ...  A characteristic, however, that distinguishes XML from semistructured data models is the notion of a Document Type Descriptor (DTD) that may optionally accompany an XML document.  ... 
doi:10.1145/335191.335409 fatcat:n6ml2x2efvbundih3fawjr2rbe

Advanced document description, a sequential approach

Antoine Doucet
2006 SIGIR Forum  
After a look at the state of the art of advanced document representations, we present a novel technique to efficiently extract frequent word sequences from document collections of any size.  ...  We apply this new metric to the task of document retrieval and illustrate the multilingual- and domain-independence of our work by conducting experiments with scientific and general document collections  ...  Its initial case document collection is a set of 12,000 journal articles of the IEEE. The sample XML document given in Figure 2.2 is an article from the INEX collection.  ... 
doi:10.1145/1147197.1147212 fatcat:k32ofbs5szd4hph4mgq6yq65am

Extraction of Template using Clustering from Heterogeneous Web Documents

Rashmi D Thakare, Manisha R Patil
2015 International Journal of Computer Applications  
Data extraction from XML documents. XTract: A system for extracting Document Type Descriptor from XML Documents [9] provides a system for extracting Document Type Descriptor (DTD) schema from a database  ...  A DTD contains valuable information on the structure of a document. The XTract method solved the problem of DTD extraction from multiple XML documents. MDL cost is used to find a good DTD.  ... 
doi:10.5120/21112-3906 fatcat:vksbqx55rjc5tohadufqt2uy34

Facility location problems: A parameterized view

Michael R. Fellows, Henning Fernau
2011 Discrete Applied Mathematics  
Some applications of algorithms for these problems in the processing of semistructured documents and in computational biology are also described.  ...  We introduce the study of these problems from the point of view of parameterized algorithms and complexity.  ...  In his notes on this standard, Bray wrote, commenting on the construction of document type descriptors (DTD), a context-free grammar used to specify syntactic characteristics of XML documents: "Suppose  ... 
doi:10.1016/j.dam.2011.03.021 fatcat:zep44dg6lzhqla6cd2ghyukpvu

Information Extraction from Web Pages Using Presentation Regularities and Domain Knowledge

Srinivas Vadrevu, Fatih Gelgi, Hasan Davulcu
2007 World wide web (Bussum)  
World Wide Web is transforming itself into the largest information resource making the process of information extraction (IE) from Web an important and challenging problem.  ...  We demonstrate that such system can recover from ambiguities in the presentation and boost the overall accuracy of a base information extractor by up to 20%.  ...  XTRACT [12] is such a system that can automatically extract Document Type Descriptors (DTDs) from a set of XML documents.  ... 
doi:10.1007/s11280-007-0021-1 fatcat:dnrbncmuvzd5rbsick7fismnqa

Discovering Knowledge from XML Documents [chapter]

Richi Nayak
Encyclopedia of Data Warehousing and Mining  
Simple Object Access Protocol (SOAP) is a new technology that has enabled XML  ...  extracting document type descriptors from XML documents.  ...  XML documents in this type of mining.  ... 
doi:10.4018/9781591405573.ch071 fatcat:k6girofj6fhnlgh23avgwn3g5a

Facility Location Problems: A Parameterized View [chapter]

Michael Fellows, Henning Fernau
Lecture Notes in Computer Science  
Some applications of algorithms for these problems in the processing of semistructured documents and in computational biology are also described.  ...  We introduce the study of these problems from the point of view of parameterized algorithms and complexity.  ...  In his notes on this standard, Bray wrote, commenting on the construction of document type descriptors (DTD), a context-free grammar used to specify syntactic characteristics of XML documents: "Suppose  ... 
doi:10.1007/978-3-540-68880-8_19 fatcat:e6os5a5jrfcbtgiica27kd6vuq

Facial expression recognition in the wild : from individual to group [article]

Abhinav Dhall, The Australian National University
2018
An image-only based facial expressions database Static Facial Expressions In The Wild (SFEW) extracted from AFEW is proposed. Furthermore, the thesis focuses on HPN for real-world images.  ...  The central hypothesis of the thesis is that extracting close to real-world data from movies and performing facial expression analysis on movies is a stepping stone in the direction of moving the analysis  ...  FER techniques can be segregated on the basis of type of descriptors used. Generally, facial descriptors can be broadly classified as geometric and appearance.  ... 
doi:10.25911/5d4ea922db07c fatcat:g2yn7xzq5baxpfsuwdna574l2e

Disease-Symptom relation extraction from medical text corpora with BERT

Adrian Schiegl, Allan Hanbury, Markus Zlabinger
2021
For example, effective disease-symptom relation extraction accelerates tasks such as reviewing large amounts of medical literature to learn new disease characteristics. In this work we present a relation  ...  the problem of relation extraction as a named entity recognition problem, which simplifies the model and the annotation of the training dataset. We evaluate our model using the Disease Symptom Relation Collection  ...  Teach model how to extract relations from finetuning dataset 3.  ... 
doi:10.34726/hss.2021.77705 fatcat:slch6usppnecngnemf5verj53m

A framework for self-adaptive networked appliances

P Fergus
2017
XML applies meta-data to the internal structures of an XML document whilst an RDF document focuses on providing meta-data about the external information associated with a document such as 'Author' and  ...  A Document Type Definition (DTD) schema [W3C 2005], however this was updated to the XML Schema specification [W3C 2005] by Dimitrov [Dimitrov 2000].  ... 
doi:10.24377/ljmu.t.00005780 fatcat:cvyrun4dbngmdoce4eob2n3ize

Programme Committee Members

Pero Šipka, Ed, Agathe Gebert, Bhaskar Mukherjee, Biljana Kosanović, Eleonora Dagiene, Giannis Tsakonas, Goran Nenadić, Irina Kuchma, Jovan Zubović, Marcin Kozak, Miloš Radovanović (+9 others)
unpublished
The Association of Lithuanian Serials has received a grant from EIFL which helped introduce innovative ideas in Lithuania and abroad during 2011 and 2012.  ...  Acknowledgements This paper presents results from the research project "Prevalence and attitudes towards plagiarism" supported by the Croatian Ministry of Science Education and Sports and the Committee  ...  Data sample The sample of papers included 13,032 papers indexed with all three types of automatically generated descriptors.  ... 
fatcat:nkq2kysdtbgs7mbceldvovpcl4