Filters








1,206 Hits in 7.8 sec

An empirical study on retrieval models for different document genres

Makoto Iwayama, Atsushi Fujii, Noriko Kando, Yuzo Marukawa
2003 Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03  
The relative superiority among existing retrieval models did not significantly differ depending on the document genre, that is, patents and newspaper articles.  ...  However, most collections were intended for retrieving newspaper articles and technical abstracts.  ...  Comparison between patents and newspapers By looking at Table 5 , the relative superiority among different retrieval models did not significantly differ depending on the document genre (i.e., patents  ... 
doi:10.1145/860435.860482 dblp:conf/sigir/IwayamaFKM03 fatcat:i6gmtrihkjdy3hemmyqxdt3xfu

An empirical study on retrieval models for different document genres

Makoto Iwayama, Atsushi Fujii, Noriko Kando, Yuzo Marukawa
2003 Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval - SIGIR '03  
The relative superiority among existing retrieval models did not significantly differ depending on the document genre, that is, patents and newspaper articles.  ...  However, most collections were intended for retrieving newspaper articles and technical abstracts.  ...  Comparison between patents and newspapers By looking at Table 5 , the relative superiority among different retrieval models did not significantly differ depending on the document genre (i.e., patents  ... 
doi:10.1145/860480.860482 fatcat:275gtggfazd5lnjhqrqxk3fidm

Web Genre Benchmark Under Construction

Marina Santini, Serge Sharoff
2009 Journal for Language Technology and Computational Linguistics  
Hyppia (for English) -The Hyppia demo allows news articles to be filtered and searched based on genre information.  ...  We suggest focusing on the following key points: ) propose a characterisation of genre suitable for digital environments and empirical approaches shared by a number of genre experts working in automatic  ...  different computational models, and, last but not least, to assess the impact of the number of genres, the number of documents, the number of annotators and the criteria of annotation may have on genre  ... 
dblp:journals/ldvf/SantiniS09 fatcat:2ci2dyupn5emnhl7idm773abw4

Text as Algorithm and as Process [chapter]

Paul Eggert
2010 Text and Genre in Reconstruction  
Thanks are due to the Institute of English Studies (to its Director, for whom the dedication) and to the Centre for Computing in the Humanities, King's College London, and its Director, Professor Harold  ...  Short, for unstinting support of the Seminar from which most of the essays in this volume originated.  ...  well-documented model has developed for a printed edition, but no such thing exists for an eρectronic edition .  ... 
doi:10.2307/j.ctt5vjtd9.12 fatcat:kegc3rsth5eydj3r3m257xv67i

A Lawyer's Hidden Persuader: Genre Bias and How It Shapes Legal Texts by Constraining Writers' Choices and Influencing Readers' Perceptions

Bret Rappaport
2014 Social Science Research Network  
Shapes of knowledge are always ineluctably local, indivisible from their instruments and encasements. 1  ...  Sneddon's article should be a model for application to and analysis of other sub-genres, be they transactional documents and litigation documents.  ...  music are innate and adaptive traits. 104 This Article, instead, focuses on exposing the logical existence of genre bias by exploring genre literature and through detailing the empirical evidence.  ... 
doi:10.2139/ssrn.2394804 fatcat:a32otz3ge5giviounz75fk3nvm

Narrative and Identity Formation: An Analysis of Media Personal Accounts from Patients of Cosmetic Plastic Surgery [chapter]

D�bora de Carvalho Figueiredo
2009 Genre in a Changing World  
English language--Rhetoric--Study and teaching. 2. Report writing--Study and teaching. 3.  ...  The Perspectives on Writing series addresses writing studies in a broad sense.  ...  -M. (2001 Gérard Genette (and of his studies about the transtextuality), but putting the focus of the problem in the genres and not alone in the texts.  ... 
doi:10.37514/per-b.2009.2324.2.13 fatcat:zoc5rugwang5biqp5d53xtfd2a

Financial speculation in Victorian fiction: plotting money and the novel genre, 1815-1901

2010 ChoiceReviews  
Cover illustration: "Celebrated Comic Scene Between the Railway Clown (Hudson) and the Indignant Shareholders, " from Punch (artist unknown), 1849.  ...  It is vital for an understanding of the multiple function of India in the novel that the two-way export of colonial products (including banknotes) is much more than simply an empire founded on different  ...  For one, the stepmother is a dedicated consumer of the genre.  ... 
doi:10.5860/choice.48-1343 fatcat:6pqyxwdx5ndmpc42suw544k6g4

The design of a corpus of Contemporary Arabic

Latifa Al-Sulaiti, Eric Steven Atwell
2006 International Journal of Corpus Linguistics  
Corpora are an important resource for both teaching and research.  ...  Overall, our survey confirms our view that existing corpora are too narrowly limited in source-type and genre, and that there is a need for a freely-accessible corpus of contemporary Arabic covering a  ...  Acknowledgments We would like to thank all those who participated in our corpus user survey; and all source owners who generously donated texts for inclusion in the online Corpus of Contemporary Arabic  ... 
doi:10.1075/ijcl.11.2.02als fatcat:zt7psho2grdmzntsc7nwexld3u

My Approach = Your Apparatus?

Julian Risch, Ralf Krestel
2018 Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries - JCDL '18  
We evaluate our model on patents, scientific papers, newspaper articles, forum posts, and Wikipedia articles.  ...  Comparative text mining extends from genre analysis and political bias detection to the revelation of cultural and geographic differences, through to the search for prior art across patents and scientific  ...  Another eld of application is bias detection in newspapers. For this task, we consider each newspaper's articles as an individual collection.  ... 
doi:10.1145/3197026.3197038 dblp:conf/jcdl/RischK18 fatcat:sf6dahdpnrdq5kv2f2k72d6o2a

Leveraging Subjective Human Annotation for Clustering Historic Newspaper Articles [article]

Haimonti Dutta, William Chan, Deepak Shankargouda, Manoj Pooleery, Axinia Radeva, Kyle Rego, Boyi Xie, Rebecca Passonneau, Austin Lee and Barbara Taranto
2012 arXiv   pre-print
This paper studies techniques for automatic categorization of newspaper articles so as to enhance search and retrieval on the archive. We explore unsupervised (e.g.  ...  The "BODHI" system currently being developed is a step in that direction, allowing users to correct wrongly scanned OCR and providing keywords and tags for newspaper articles used frequently.  ...  Dragomir Radev for his generous and insightful comments on drafts of the paper, Sam Lee and Hatim Diab for help with infrastructure and system development.  ... 
arXiv:1208.3530v1 fatcat:zyrcsngcgngrlnwzehzdiqdj24

Tornar a ciência popular Figuier nos jornais e revistas do Brasil (1850-1870)

Kaori KODAMA
2018 Varia História  
From then on, the formula "for all" would become a label and model for communicating science to a wider public, used by a considerable number of writers in different countries -including female ones, although  ...  The method and its author were mentioned once again the following year in Diario de Pernambuco, based on an article originally published in French newspaper La Presse.  ...  From then on, we can see that the institutionalized sciences more emphatically defined the difference between the forms of scientific communication for peers and non-specialists.  ... 
doi:10.1590/0104-87752018000300003 fatcat:u7fh7s7e3jfbzgjnllrcpxnkpm

IText

Cheryl Geisler, Charles Bazerman, Stephen Doheny-Farina, Laura Gurak, Christina Haas, Johndan Johnson-Eilola, David S. Kaufer, Andrea Lunsford, Carolyn R. Miller, Dorothy Winsor, Joanne Yates
2001 Journal of business and technical communication  
Print and screen differences. Studies have noted that people interact with ITexts differently than with print ones (see Haas, Writing, for a review).  ...  These studies all presume traditional print text, however, and little theoretical or empirical work explores how IText might be different.  ... 
doi:10.1177/105065190101500302 fatcat:a5gpmp35ibf3hjggldcnfyjxta

Comparing citation contexts for information retrieval

Anna Ritchie, Stephen Robertson, Simone Teufel
2008 Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08  
Our experiments show that the citation-enhanced document representation increases retrieval effectiveness across a range of standard retrieval models and evaluation measures.  ...  This thesis investigates taking words from around citations to scientific papers in order to create an enhanced document representation for improved information retrieval.  ...  Documents: The genre of the Ad hoc document collections has always been news (i.e., newspaper and newswire articles), patents and documents from various government departments.  ... 
doi:10.1145/1458082.1458113 dblp:conf/cikm/RitchieRT08 fatcat:bd4azphahbhj3mrn3gfjvozkhy

The Rowling Case: A Proposed Standard Analytic Protocol for Authorship Questions

Patrick Juola
2015 Digital Scholarship in the Humanities  
We propose a possible solution to one of the major weaknesses in the application of authorship attribution-the absence of clear-cut standards for accurate analytic practice.  ...  This protocol (or close variants of it) has been used in at least four separate cases across a wide variety of documents and consumers.  ...  Funding This material is based in part upon work supported by the National Science Foundation [OCI-1032683]; and by the Defense Advanced Research Projects Agency [Active Authentication, Phases I and II  ... 
doi:10.1093/llc/fqv040 dblp:journals/lalc/Juola15 fatcat:uzcpmdav25chhor5tsrphtoox4

MLSUM: The Multilingual Summarization Corpus [article]

Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano
2020 arXiv   pre-print
Obtained from online newspapers, it contains 1.5M+ article/summary pairs in five different languages -- namely, French, German, Spanish, Russian, Turkish.  ...  Together with English newspapers from the popular CNN/Daily mail dataset, the collected data form a large scale multilingual dataset which can enable new research directions for the text summarization  ...  The accompanying code for parsing the articles allows to easily retrieve the titles and thus use them for News Title Generation.  ... 
arXiv:2004.14900v1 fatcat:vwiennfr4zbstprtrvc3rxf2rm
« Previous Showing results 1 — 15 out of 1,206 results