Text Windows and Phrases Differing by Discipline, Location in Document, and Syntactic Structure [article]

Robert M. Losee
1996 arXiv   pre-print
Different syntactic structures in sublanguages are examined, and their use is considered for discriminating between specific academic disciplines and, more generally, between theory versus practice or  ...  Knowledge of window style, content, location and grammatical structure may be used to classify documents as originating within a particular discipline or may be used to place a document on a theory versus  ...  Introduction The nature of term groupings, phrases, and text windows in documents is not fully understood, yet the importance of term clusters is obvious to those in disciplines who study text.  ... 
arXiv:cmp-lg/9602003v1 fatcat:4xwkpxul4bfifnasctyxjkmyx4

Semantic indexing for a complete subject discipline

Yi-Ming Chung, Qin He, Kevin Powell, Bruce Schatz
1999 Proceedings of the fourth ACM conference on Digital libraries - DL '99  
In this experiment, we first partitioned the collection into smaller collections (repositories) by subject, extracted noun phrases from titles and abstracts, then performed semantic indexing on these subcollections  ...  These semantic indexes record the contextual correlation of noun phrases, and are computed generically, independent of subject domain.  ...  Richard Berlin, MD, at the HealthAlliance HMO in Urbana, and Jonathan Silverstein, MD, at the University of Illinois at Chicago, kindly served as early adopters for testing the prototypes.  ... 
doi:10.1145/313238.313253 dblp:conf/dl/ChungHPS99 fatcat:7uy7hnerqfcl5ocvcqpbz7hs2m

Content-based citation analysis: The next generation of citation analysis

Ying Ding, Guo Zhang, Tamy Chambers, Min Song, Xiaolong Wang, Chengxiang Zhai
2014 Journal of the Association for Information Science and Technology  
Content-based citation analysis (CCA) addresses a citation's value by interpreting each based on their contexts at both syntactic and semantic level.  ...  In addition, we highlight how increased computational capabilities and publicly available full-text resources have opened this area of research to vast possibilities, which enable deeper of citation analysis  ...  Syntactic CCA uses the structure or layout of an article to identify the location and number of citations to investigate the significance of influence.  ... 
doi:10.1002/asi.23256 fatcat:7gethjhuzva4nejqnd4uthpogi

Ontology Driven Extraction of Research Processes [chapter]

Vayianos Pertsas, Panos Constantopoulos, Ion Androutsopoulos
2018 Lecture Notes in Computer Science  
These representations are driven by the Scholarly Ontology, specifically conceived for documenting research processes.  ...  We developed and experimented with several sliding window classifiers using Logistic Regression, SVMs, and Random Forests, as well as a two-stage pipeline classifier.  ...  The classifiers were evaluated against a reference standard produced by human annotators, with three different test sets from three domains (Digital Humanities, Bioinformatics and Medicine) and very promising  ... 
doi:10.1007/978-3-030-00671-6_10 fatcat:ienvwgvsgrfazglksyjhtyp7im

Design and development of a concept-based multi-document summarization system for research abstracts

Shiyan Ou, Christopher Soo-Guan Khoo, Dion H. Goh
2008 Journal of information science  
and relationships across different abstracts; and (4) combining and organizing the different kinds of information using a variable-based framework, and presenting them in an interactive Web-based interface  ...  Dissertation abstracts in the field of sociology were selected as sample documents for this study.  ...  However, most of these studies identified similarities and differences using low-level text analysis, i.e. mainly based on lexical, syntactic and rhetorical relations between text units (e.g. words, phrases  ... 
doi:10.1177/0165551507084630 fatcat:oz44rflbcfcc3ihxnmnf3bsao4

A design rationale for a language-based editor

Jim Welsh, Brad Broom, Derek Kiong
1991 Software, Practice & Experience  
In practice, however, few language-based editors have yet gained widespread use, and many of the basic design issues noted by Marlin 1 have yet to be resolved.  ...  This paper reviews the design issues that arise in the construction of effective language-based editors for the preparation of syntactically and static semantically correct language sentences, typically  ...  ACKNOWLEDGEMENTS The work described here was carried out at the University of Queensland while Brad Broom was supported by an Australian Commonwealth Postgraduate Research Award, and Derek Kiong was on  ... 
doi:10.1002/spe.4380210904 fatcat:fgrv4wjwovay7fkfhiqtdewmai

The Importance of Fine-Grained Cue Phrases in Scientific Citations [chapter]

Robert E. Mercer, Chrysanne Di Marco
2003 Lecture Notes in Computer Science  
In previous work, Garzone and Mercer (2000) presented a system for citation classification that relied on characteristic syntactic structure to determine citation category.  ...  In this present work, we extend this idea to propose that fine-grained cue phrases within citation sentences may provide just such a stylistic basis for categorization.  ...  The cue phrase previously is three times more frequent in citation sentences than in the full text body and twice as frequent as in citation windows.  ... 
doi:10.1007/3-540-44886-1_49 fatcat:hlghg26zbncefkyyxsxj7zcxsi

A User Evaluation of Hierarchical Phrase Browsing [chapter]

Katrina D. Edgar, David M. Nichols, Gordon W. Paynter, Kirsten Thomson, Ian H. Witten
2003 Lecture Notes in Computer Science  
Phrase browsing interfaces based on hierarchies of phrases extracted automatically from document collections offer a useful compromise between automatic full-text searching and manually-created subject  ...  Users found the results located via phrase browsing to be relevant and useful but preferred keyword searching for certain types of queries.  ...  The text above the lower panel shows that the phrase sustainable forest appears in 36 larger phrases, and in 258 documents.  ... 
doi:10.1007/978-3-540-45175-4_29 fatcat:rky7i34eozc6tmzjuivivvtgnm

Natural language processing and advanced information management

James E. Hoard
1989 Telematics and informatics  
One can easily envision the need and desire to integrate the information in relational (and other kinds of structured) databases with the information in such diverse sources as CAD/CAM systems and text  ...  documents.  ...  Unlike databases and CAD/CAM systems, text documents contain, by and large, their own RWIs by virtue of being text documents.  ... 
doi:10.1016/s0736-5853(89)80024-3 fatcat:fy2om3xdj5fqthydrfxwk2fvuq

Computer assisted processing of large unstructured document sets

John McKechnie, Sameh Shaaban, Stephen Lockley
2001 Proceedings of the 2001 ACM Symposium on Document engineering - DocEng '01  
The authors are grateful to Marshall Crawford, Keith Wilson, Ruth O'Leary and John Cann for their helpful conversations, and to Susan Latif for cheerfully "OCRing" thousands of document images.  ...  This research was funded by NBS Services, a Royal Institute of British Architects (RIBA) company.  ...  A tiled window layout was adopted to increase the ease of cross-referencing between the different activities.  ... 
doi:10.1145/502187.502190 dblp:conf/doceng/McKechnieSL01 fatcat:ck3udfbyubbjjoeyt4rrpmqafi

Automatic keyword extraction for learning object repositories

Kino High Coursey, Rada Mihalcea, William E. Moen
2009 Proceedings of the American Society for Information Science and Technology  
The results suggest that automatic keyword extraction is a viable solution for suggesting terms and phrases for metadata annotation.  ...  The paper describes experiments in metadata generation for learning object repositories.  ...  ACKNOWLEDGMENTS The Texas Center for Digital Knowledge and the project team gratefully acknowledge the funding from the Texas Higher Education Coordinating Board through the Texas Course Redesign Project  ... 
doi:10.1002/meet.2008.1450450274 fatcat:xeil4htskvafjozwofaz7rtwea

The Searchbench - Combining Sentence-semantic, Full-text and Bibliographic Search in Digital Libraries

Ulrich Schäfer, Bernd Kiefer, Christian Spurk, Jörg Steffen, Rui Wang, Benjamin Weitz, Magdalena Wolska
2013 Liber Quarterly: The Journal of European Research Libraries  
Because the NLP technology is domain-independent, it could also be applied to newspaper texts, technical documentation, or scientific publications from other disciplines.  ...  By clicking on a citation edge, the original citation sentence is shown in context, and optionally also in the original PDF layout.  ...  Computing and Interaction (M2CI) -Robust, Efficient and Intelligent Processing of Text, Speech, Visual Data and High Dimensional Representations -Open Science Web, and has been conducted in the context  ... 
doi:10.18352/lq.8091 fatcat:rpzajleqlbdito3mwzu7dz7sga

Finding Parallel Passages in Cultural Heritage Archives

Martyn Harris, Mark Levene, Dell Zhang, Dan Levene
2018 ACM Journal on Computing and Cultural Heritage  
It is of great interest to researchers and scholars in many disciplines (particularly those working on cultural heritage projects) to study parallel passages (i.e., identical or similar pieces of text  ...  The system has already been used to support research on five large text corpora that span a number of different domains and languages.  ...  The technique involves comparing corresponding passages located across more than one text, by laying out the texts side-by-side.  ... 
doi:10.1145/3195727 fatcat:plgfe4irgnbgjizo5d3lg6hurq

Systematic acceleration of radical discovery and innovation in science and technology

Ronald N. Kostoff
2006 Technological forecasting & social change  
sources, gathering and maintaining the data needed, and completing and reviewing the collection of information.  ...  Report Documentation Page Form Public reporting burden for the collection of information is estimated to average 1 hour per response, including the time for reviewing instructions, searching existing data  ...  of the study performers in aggregate to understand the potentially different meanings and contexts that specific technical phrases could have when used in different technical areas and by different technical  ... 
doi:10.1016/j.techfore.2005.09.004 fatcat:u3n53gdl5vd3vndm7atrdgw5ta

Adaptive information extraction

Jordi Turmo, Alicia Ageno, Neus Català
2006 ACM Computing Surveys  
, and others related to text mining.  ...  This survey describes and compares the main approaches to IE and the different ML techniques used to achieve Adaptive IE technology.  ...  While in IR the answer to a query is simply a list of potentially relevant documents, in IE the relevant content of such documents has to be located and extracted from the text.  ... 
doi:10.1145/1132956.1132957 fatcat:usu2uodawzf5hh2ibl7ufo2y6y