Filters








5,996 Hits in 5.8 sec

An XML-enabled data extraction toolkit for web sources

Ling Liu, Calton Pu, Wei Han
2001 Information Systems  
Hence, the web users or applications need a smart way of extracting data from these web sources.  ...  The second phase combines the information extraction rules generated at the first phase with the XWRAP component library to construct an executable wrapper program for the given web source. r  ...  Acknowledgements We would like to thank the XWRAP team at Georgia Tech for their implementation effort.  ... 
doi:10.1016/s0306-4379(01)00040-0 fatcat:5dxghttqgzgnjlu2dcy2bkprxa

Toolkits for Generating Wrappers [chapter]

Stefan Kuhlins, Ross Tredwell
2003 Lecture Notes in Computer Science  
This paper examines the suitability of software toolkits for the extraction of data from web sites.  ...  With the aim of providing improved ease-of-use and faster wrapper generation in mind, possible areas for further development of toolkits for automated web data extraction are discussed.  ...  providing the open source LAPIS toolkit.  ... 
doi:10.1007/3-540-36557-5_15 fatcat:pibsq32qkzc43dvhma6d3agyge

Metadata, Ontologies, and Information Models for Grid PSE Toolkits Based on Web Services

Carmela Comito, Carlo Mastoroianni, Domenico Talia
2006 International Journal of Web Services Research  
This paper presents a metadata model for Grid PSE toolkits based on Web services and the architecture of an information system that exploits the proposed metadata model.  ...  on Web services.  ...  An extract from the XML schema BioinformaticsSoftware.xsd CONCLUSIONS A Grid PSE toolkit based on Web services is a group of technologies that allows for building PSEs for different application domains  ... 
doi:10.4018/jwsr.2006100103 fatcat:qz4n36tfpzfwndxujv2uvavslu

Customized way of Resource Discovery in a Campus Grid [article]

Damandeep Kaur, Lokesh Shandi, Jyotsna Sengupta
2010 arXiv   pre-print
This paper pro ides the grid resource discovery solutions for Campus Grid using Globus Toolkit which will enable us to customize the resource information according to the requirements based on the jobs  ...  Campus Grid computing involves heterogeneous resources of an organization working in collaboration to sol e the problems that cannot be addressed by a single resource.  ...  [11] is an open source software toolkit used for building Grid systems and applications.  ... 
arXiv:1006.2695v1 fatcat:h4bdbqjs7jadhpu73ipfcaj6ai

The ProteoRed MIAPE web toolkit: A user-friendly framework to connect and share proteomics standards

J. A. Medina-Aunon, S. Martinez-Bartolome, M. A. Lopez-Garcia, E. Salazar, R. Navajas, A. R. Jones, A. Paradela, J. P. Albar
2011 Molecular & Cellular Proteomics  
MIAPE MS mainly gathers the meta-data derived from the mass spectrometer, as instrument components: ion source, analyzer, detector or The ProteoRed MIAPE web toolkit The ProteoRed MIAPE web toolkit  ...  the The ProteoRed MIAPE web toolkit 18 XML files) on the web interface.  ... 
doi:10.1074/mcp.o111.008334 fatcat:qmtjjrdkrzflldtczrvkv2tkla

The ProteoRed MIAPE web toolkit: A User-friendly Framework to Connect and Share Proteomics Standards

J. Alberto Medina-Aunon, Salvador Martínez-Bartolomé, Miguel A. López-García, Emilio Salazar, Rosana Navajas, Andrew R. Jones, Alberto Paradela, Juan P. Albar
2011 Molecular & Cellular Proteomics  
The toolkit is thus the first application capable of automatically linking the PSI's MIAPE modules with the corresponding XML data exchange standards, enabling bidirectional conversions.  ...  In this article, we describe a new web-based software suite (The ProteoRed MIAPE web toolkit) that performs several complementary roles related to proteomic data standards.  ...  PRIDE XML visualization and submission to the central repository (http://www.ebi.ac.uk/pride). 1) Data Retrieval-To use the MIAPE web toolkit, protein identification data should be formatted as an mzIdentML  ... 
doi:10.1074/mcp.m111.008334 pmid:21983993 pmcid:PMC3205864 fatcat:biykglzztrfgvmgdr44obfb6n4

Web Services Based on Prolog and Xml [chapter]

Bernd D. Heumesser, Andreas Ludwig, Dietmar Seipel
2005 Lecture Notes in Computer Science  
Since a lot of information available on the Internet is nowadays XML based and since Web service technologies use XML based encodings, it is both necessary and useful to be able to process XML documents  ...  To make this possible, a new package for SWI-PROLOG called X2P is introduced, making available to PROLOG many of the XML processing facilities of the Libxml2 library, which is a very up-to-date and efficient  ...  XML has been widely adopted as the foundation for data representation and formats on the Web.  ... 
doi:10.1007/11415763_16 fatcat:phcoy7s2j5dhpisvwyuswjox2e

Extending SDARTS

Panagiotis G. Ipeirotis, Tom Barry, Luis Gravano
2002 Proceedings of the second ACM/IEEE-CS joint conference on Digital libraries - JCDL '02  
The SDARTS toolkit, with all related documentation and source code, is publicly available at  ...  First, we have added a tool that automatically builds rich content summaries for remote web collections by probing the collections with appropriate queries.  ...  EXTRACTING CONTENT SUMMARIES FROM WEB DATABASES The SDARTS text and XML wrappers extract complete metadata from locally available text and XML collections.  ... 
doi:10.1145/544220.544254 dblp:conf/jcdl/IpeirotisBG02 fatcat:r3uam62vgnc2xhiz63arth4xsq

Semantic technology applications for homeland security

D. Avant, M. Baum, C. Bertram, M. Fisher, A. Sheth, Y. Warke
2002 Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02  
While BSBQ alone is already a helpful tool for analysts, an important role of the SCORE technology is to interface with and retrieve the best value from various software packages that all expect data in  ...  and relationships between them) from heterogeneous sources and formats (database tables, xml feeds, PDF files, streaming media, internal documents) • Co-relate extracted information to discover previously  ...  concepts for discovering unknown co-occurrenc relationships  ... 
doi:10.1145/584792.584893 dblp:conf/cikm/AvantBBFSW02 fatcat:evhrqyg3abemnl6n4l2arjtib4

Semantic technology applications for homeland security

D. Avant, M. Baum, C. Bertram, M. Fisher, A. Sheth, Y. Warke
2002 Proceedings of the eleventh international conference on Information and knowledge management - CIKM '02  
While BSBQ alone is already a helpful tool for analysts, an important role of the SCORE technology is to interface with and retrieve the best value from various software packages that all expect data in  ...  and relationships between them) from heterogeneous sources and formats (database tables, xml feeds, PDF files, streaming media, internal documents) • Co-relate extracted information to discover previously  ...  concepts for discovering unknown co-occurrenc relationships  ... 
doi:10.1145/584891.584893 fatcat:o5whk7ne3jeghdq45n6dwqckye

Aggregative Data Infrastructures for the Cultural Heritage [chapter]

Alessia Bardi, Paolo Manghi, Franco Zoppi
2012 Communications in Computer and Information Science  
In this paper, we present the D-NET Software Toolkit as an ideal candidate for the realization of sustainable, extensible, scalable and dynamic ADIs for CH.  ...  Besides, the realization of ADIs for CH can be particularly complex when compared to other disciplines due to the possibly high heterogeneity of data sources involved.  ...  Conclusion We highlighted the need for aggregative data infrastructures (ADIs) in the Cultural Heritage (CH) domain and described the important role of enabling software for ADIs.  ... 
doi:10.1007/978-3-642-35233-1_24 fatcat:zwdemnvvdzczhd7xqwgtcxqphi

An Information Food Chain for Advanced Applications on the WWW [chapter]

Stefan Decker, Jan Jannink, Sergey Melnik, Prasenjit Mitra, Steffen Staab, Rudi Studer, Gio Wiederhold
2000 Lecture Notes in Computer Science  
The Internet and especially the World Wide Web are growing at a tremendous rate. More and more information is becoming directly available for human consumption.  ...  The remedy this situation we look at different existing technologies and put them together to a new information food chain [Etzioni 1997] for agents, that enables advanced applications on the WWW.  ...  This facility is an Ontology Articulation Toolkit for information mediation.  ... 
doi:10.1007/3-540-45268-0_69 fatcat:4xnxywlqtbd4dkb5njn7scjtyq

Effective Web data extraction with standard XML technologies

Jussi Myllymaki
2001 Proceedings of the tenth international conference on World Wide Web - WWW '01  
We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping."  ...  Key aspects of ANDES are that it uses XML technologies for data extraction, including XHTML and XSLT, and provides access to the "deep Web."  ...  The WysiWyg Web Wrapper Factory (W4F) is a toolkit for generating Web wrappers [9] .  ... 
doi:10.1145/371920.372183 dblp:conf/www/Myllymaki01 fatcat:rcdwcekjpze47cldjr23amznsy

Effective Web data extraction with standard XML technologies

Jussi Myllymaki
2002 Computer Networks  
We discuss the problem of Web data extraction and describe an XML-based methodology whose goal extends far beyond simple "screen scraping."  ...  Key aspects of ANDES are that it uses XML technologies for data extraction, including XHTML and XSLT, and provides access to the "deep Web."  ...  would like to thank Jared Jackson and Stephen Dill of IBM Almaden Research Center, Yan Zhou of IBM China Development Laboratory, and Dorine Yelton, John Rees, and Douglas Griswold of IBM Global Services, for  ... 
doi:10.1016/s1389-1286(02)00214-1 fatcat:wb6x6erukbeqpkhbpsi6tsv6aq

Managing semantic content for the Web

A. Sheth, C. Bertram, D. Avant, B. Hammond, K. Kochut, Y. Warke
2002 IEEE Internet Computing  
An asset is the collection of all metadata for one piece of content. The extractor toolkit creates extractor agents for a particular information source, such as a NewsML feed or a Web site.  ...  processing Schema for asset storage Knowledge- base Content metabase Unstructured documents XML feeds XML feeds Open source Web Corporate databases A s s e t s Extractor management  ... 
doi:10.1109/mic.2002.1020330 fatcat:pyvzhalzgbbrbp3pamc6n3w3tm
« Previous Showing results 1 — 15 out of 5,996 results