
Wrapper generation for semi-structured Internet sources

Naveen Ashish, Craig A. Knoblock
1997 SIGMOD record  
From this structure the system generates a wrapper that facilitates querying of a source and possibly integrating it with other sources.  ...  The key idea is to exploit the formatting information in pages from the source to hypothesize the underlying structure of a page.  ...  Acknowledgements We would like to thank Steve Minton and the other members of the SIMS and Ariadne projects for their contributions to this work.  ... 
doi:10.1145/271074.271078 fatcat:3fek26paxfd4fbe6kph4ih2lhu

The NITE XML Toolkit: Flexible annotation for multimodal language data

Jean Carletta, Stefan Evert, Ulrich Heid, Jonathan Kilgour, Judy Robertson, Holger Voormann
2003 Behavior research methods, instruments & computers  
Current tools allow one either to apply sets of time-stamped codes to the data and consider their timing and sequencing or to describe some specific linguistic structure that is present in the data, built  ...  libraries, a tool for running queries, and an experimental engine that builds interfaces on the basis of declarative specifications.  ...  To make it easier to start using the toolkit, NXT Search includes a simple graphical user interface that will allow one to load a data set, type in a query, and display the results.  ... 
doi:10.3758/bf03195511 pmid:14587542 fatcat:7mu5yajdcrh6fp4dris4rgppaa

The Trip to The Enterprise Gourmet Data Product Marketplace through a Self-service Data Platform [article]

Michal Zasadzinski, Michael Theodoulou, Markus Thurner, Kshitij Ranganath
2021 arXiv   pre-print
We then show how the platform enables and operates the data marketplace, facilitating the exchange of stable data products across users and tenants.  ...  Data is ingested at a rate of well over 1000 individual messages per second and serves more than 100k analytical queries daily.  ...  All of them helped in shaping the data platform, influencing the spirit and culture of the gourmet marketplace setting it up for success.  ... 
arXiv:2107.13212v1 fatcat:uvjrk4foyzhprfxvez4sddui3y

Effective Web data extraction with standard XML technologies

Jussi Myllymaki
2002 Computer Networks  
Key aspects of ANDES are that it uses XML technologies for data extraction, including XHTML and XSLT, and provides access to the "deep Web."  ...  A comprehensive data extraction process needs to deal with roadblocks such as session identifiers, HTML forms, and client-side JavaScript, and data integration problems such as incompatible datasets  ...  of IBM Global Services, for their contributions to the ideas and software presented in this paper.  ... 
doi:10.1016/s1389-1286(02)00214-1 fatcat:wb6x6erukbeqpkhbpsi6tsv6aq

Effective Web data extraction with standard XML technologies

Jussi Myllymaki
2001 Proceedings of the tenth international conference on World Wide Web - WWW '01  
Key aspects of ANDES are that it uses XML technologies for data extraction, including XHTML and XSLT, and provides access to the "deep Web."  ...  A comprehensive data extraction process needs to deal with roadblocks such as session identifiers, HTML forms, and client-side JavaScript, and data integration problems such as incompatible datasets  ...  In each system, a modeling process produces an integrated view of the data contained in the sources and a query planning process decomposes queries on the integrated view into a set of subqueries on the  ... 
doi:10.1145/371920.372183 dblp:conf/www/Myllymaki01 fatcat:rcdwcekjpze47cldjr23amznsy

Personalizing Interactions with Information Systems [chapter]

Saverio Perugini, Naren Ramakrishnan
2003 Advances in Computers  
This helps bring out the role of the personalization system as a facilitator which reconciles the user's mental model with the underlying information system's organization.  ...  In this chapter, we study personalization from the viewpoint of personalizing interaction.  ...  Typically, the data to restructure is a subset of an information space and retrieved via the WHERE clause of a semistructured data query. The WHERE clause thus serves as a match operator.  ... 
doi:10.1016/s0065-2458(03)57007-3 fatcat:rdooy2c4gnfajgvu246kjzm2ja

A decision support system to improve e-learning environments

Marta Zorrilla, Diego García, Elena Álvarez
2010 Proceedings of the 1st International Workshop on Data Semantics - DataSem '10  
The primary aim of the development of eLAT is to process large data sets in microseconds with regard to individual data analysis interests of teachers and data privacy issues, in order to help them to  ...  In this paper, we present the theoretical background, design, implementation, and evaluation details of eLAT, a Learning Analytics Toolkit, which enables teachers to explore and correlate learning object  ...  Acknowledgements This project was partly funded by the Excellence Initiative of the Federal and State Governments.  ... 
doi:10.1145/1754239.1754252 dblp:conf/edbtw/ZorrillaGA10 fatcat:abw3z3wgqzajffkox2t6pzszvi

An Incremental Approach for Real-Time Big Data Visual Analytics

Ignacio Garcia, Ruben Casado, Abdelhamid Bouchachia
2016 2016 IEEE 4th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW)  
In the age of Big Data, real-time interactive visualization is a challenge due to the latency of executing calculations over terabyte (or even petabyte) datasets.  ...  To address such a requirement, this paper introduces a new approach for real-time visualization of extremely large data-at-rest as well as data-in-motion by showing intermediate results as soon as they  ...  Acknowledgment The research leading to these results has received funding from the European Union  ... 
doi:10.1109/w-ficloud.2016.46 dblp:conf/ficloud/GarciaCB16 fatcat:yxcga4gnpbaj5nz5wouciy2be4

A Survey on Visual Query Systems in the Web Era (extended version) [article]

Jorge Lloret-Gazo
2017 arXiv   pre-print
As more and more collections of data are becoming available on the web to everyone, non-expert users demand easy ways to retrieve data from these collections.  ...  We have also gathered basic features of VQSs such as the visual representation adopted to present the reality of interest or the visual representation adopted to express queries.  ...  In the left part, a diagram of an XML document is shown to represent source XML documents, let users explore the documents, and select parts of the documents to be used in queries.  ... 
arXiv:1708.00192v1 fatcat:bdg2tuubzbd6rp5hpwoxge2eey

Developing Library GIS Services for Humanities and Social Science: An Action Research Approach

Ningning Kong, Michael Fosmire, Benjamin Dewayne Branch
2017 College and Research Libraries  
to the stages of learning and research.  ...  Our results suggested that a library's GIS service can support humanities and social science from the research collaboration, learning support, and outreach perspectives, with different focuses according  ...  Analysis •QueryData query •Statistics • Integrated with learning support 5.  ... 
doi:10.5860/crl.78.4.413 fatcat:sbecklbnlfa6das2lwsrzwbdee

Enabling Agile Clinical and Translational Data Warehousing: Platform Development and Evaluation

Helmut Spengler, Claudia Lang, Tanmaya Mahapatra, Ingrid Gatz, Klaus A Kuhn, Fabian Prasser
2020 JMIR Medical Informatics  
Both the cloud-based hosting infrastructure and the data-loading pipeline are available to the community as open source software with comprehensive documentation.  ...  Moreover, it facilitates the iterative refinement of data representations in the target platforms, as the required configuration files are very compact.  ...  The work was, in parts, funded by the German Federal Ministry of Education and Research within the Medical Informatics Funding Scheme under reference number 01ZZ1804A (Data Integration for Future Medicine  ... 
doi:10.2196/15918 pmid:32706673 fatcat:245xrwsfxbdp3izb6qeyugwrea

Federated Semantic Data Management (Dagstuhl Seminar 17262)

Olaf Hartig, Maria-Esther Vidal, Johann-Christoph Freytag, Marc Herbstritt
2017 Dagstuhl Reports  
This report documents the program and the outcomes of Dagstuhl Seminar 17262 "Federated Semantic Data Management" (FSDM).  ...  The discussions were centered around the following four themes, each of which was the focus of a separate working group: i) graph data models, ii) federated query processing, iii) access control and privacy  ...  the sources; and a set of mappings from the databases to the ontology.  ... 
doi:10.4230/dagrep.7.6.135 dblp:journals/dagstuhl-reports/HartigVF17 fatcat:a7uvfkbt3bczldr54aikx3e274

Citation analysis of database publications

Erhard Rahm, Andreas Thor
2005 SIGMOD record  
His current research interests include intelligent search on semistructured data, combining DB technology with IR techniques, and "autonomic" peer-to-peer information management.  ...  Jayavel's research interests include Internet data management, IR, and query processing in emerging system architectures.  ...  Rather than computing the probability that a document matches a query, it computes the probability that a query is generated from the language model of a document [15] .  ... 
doi:10.1145/1107499.1107505 fatcat:c4evhhw6y5difggghxsvvdqzeu

METICOS Deliverable D6.2 Social data analysis and extracted perceptions

Sarang Shaikh, Sule Yildirim Yayilgan, Mohamed Abomhara, Erjon Zoto
2022 Zenodo  
from social media data as a part of T6.2.  ...  Section 5 and 6 discuss the perception extraction from the social media data and review the SoTA studies for perception extraction.  ...  JSON and XML are examples of semi-structured data.  ... 
doi:10.5281/zenodo.6684365 fatcat:mfb6qkj73zdbbgskruqetubczq

Analysis of Cognitive Work

Ann Bisantz, Emilie Roth
2007 Reviews of Human Factors and Ergonomics  
the task, technical system, social and organizational structure, and physical environment; and examination of the goals, knowledge, skills, and strategies that domain practitioners utilize in response  ...  A cognitive analysis requires consideration of two perspectives: examination of domain characteristics and constraints that impose cognitive demands on domain practitioners, which include components of  ...  Typical analysis process used by intelligence analysts to search a document database and synthesize results to formulate a response to an analysis query. Reprinted from Patterson, E. S., Roth, E.  ... 
doi:10.1518/155723408x299825 fatcat:3rkwofjcarcbbepgklnq67felu