9 Hits in 4.7 sec

Simplified Data Wrangling with ir_datasets [article]

Sean MacAvaney, Andrew Yates, Sergey Feldman, Doug Downey, Arman Cohan, Nazli Goharian
2021 arXiv   pre-print
Integrations with popular IR indexing and experimentation toolkits demonstrate the tool's utility.  ...  Dataset documentation is scattered across the Internet and once one obtains a copy of the data, there are numerous different data formats to work with.  ...  Since experiments in neural IR frequently only work with a small subset of documents, this is very beneficial for these pipelines.  ... 
arXiv:2103.02280v1 fatcat:uxvenzrtk5cstejygnrb3cvldq

Experimaestro and Datamaestro

Benjamin Piwowarski
2020 Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval  
Experimaestro and Datamaestro: Experiment and Dataset Managers (for IR). In  ...  In this demo paper, we present two managers, Experimaestro and Datamaestro, and their add-ons for IR, designed to help to define and run experimental plans.  ...  CONCLUSION In this paper, we have presented Experimaestro and Datamaestro, and their extensions to IR (Experimaestro-IR, OpenNIR-XPM and Datamaestro-text), which aim at helping reproducibility, as well  ... 
doi:10.1145/3397271.3401410 dblp:conf/sigir/Piwowarski20 fatcat:ehdepyib5jbabfeyqtlmoib2bi

PyTerrier: Declarative Experimentation in Python from BM25 to Dense Retrieval

Craig Macdonald, Nicola Tonellotto, Sean MacAvaney, Iadh Ounis
2021 Proceedings of the 30th ACM International Conference on Information & Knowledge Management  
CONCLUSIONS This paper presented PyTerrier, a tool for building flexible retrieval pipelines.  ...  However, there is little compatibility between these toolkits and none offers flexible declarative pipelines.  ... 
doi:10.1145/3459637.3482013 fatcat:m7ukjbgcejg7jf47wp3qacmzm4

Introduction to Pregnancy in Waiting: Embryonic Diapause in Mammals: Proceedings of the Third International Symposium on Embryonic Diapause

BD Murphy, K Jewgenow, MB Renfree, SE Ulbrich
2020 Bioscientifica Proceedings  
The first known observation of this phenomenon was in a ruminant, the roe deer (Capreolus capreolus) in 1854, later confirmed in a number of studies in the last century [1].  ...  Raw sequence reads were analysed using a customised Galaxy pipeline.  ...  Dynamic transcriptome changes during embryonic diapause and reactivation in the embryo and endometrial epithelium of the European roe deer (Capreolus capreolus) Vera The European roe deer (Capreolus capreolus  ... 
doi:10.1530/biosciprocs.10.001 fatcat:55ytefod6fbbrpptji2bdf46gq

Pretrained Transformers for Text Ranking: BERT and Beyond [article]

Jimmy Lin, Rodrigo Nogueira, Andrew Yates
2021 arXiv   pre-print
This survey provides an overview of text ranking with neural network architectures known as transformers, of which BERT is the best-known example.  ...  The combination of transformers and self-supervised pretraining has been responsible for a paradigm shift in natural language processing (NLP), information retrieval (IR), and beyond.  ...  To better understand the reproduction difficulties with the CEDR codebase, we replicated some of the important model configurations using the Capreolus toolkit ] to obtain new results with the different  ... 
arXiv:2010.06467v3 fatcat:obla6reejzemvlqhvgvj77fgoy

Key note presentations, 10th Arctic Ungualate Conference, Tromsø, Norway, 1999

Editor in Chief
2000 Rangifer  
In an important break with normal practice, keynote speakers included scientists of international reputation who do not normally work with Arctic ungulates.  ...  A Rangifer Special Issue with workshop papers is planned for publication later in 2000.  ...  Acknowledgements Acknowledgements I would like to thank Nicholas Tyler for introducing me to the literature on arcric ungulate ecology and also, together with Carol Kerven and Cara Kerven, for suggesting  ... 
doi:10.7557/2.20.2-3.1478 fatcat:vmz6vqp4njcrnlhuip2oi4yvsm

4th International Reindeer/Caribou Symposium, 22-25 August 1985, Whitehorse, Canada

Sven Skjenneberg (ed. in chief)
1986 Rangifer  
Alyeska Pipeline Service Company and Northwest-Alaska Pipeline Company also provided logistical and financial support. We are grateful to J. R. Dau, P. Valkenburg, and D. D.  ...  The study was conducted while the first two authors were with the Canadian Wildlife Service, Whitehorse, and the third author was with the Department of Renewable Resources, Government of Yukon, Whitehorse  ...  capreolus) (Weiner, 1973) .  ... 
doi:10.7557/ fatcat:c7pi7lg7drhkvlywnojuzi345m


This model allows for the representation of sequences of activities (or labels), each annotated with a time period.  ...  Spatial trajectories can represent the movement of vehicles, people and animals, for example equipped with a GPS receiver.  ...  With these datasets, reasonable values for λ range between 20 and 40. Comparison with IR-tree and IF-R* The IR-tree and IF-R* indexing techniques.  ... 
doi:10.13130/issa-hamze_phd2017-02-27 fatcat:jqwa2t72jvhtdlonusax7sdtey

Development of Software Platforms for Annotation and Dereplication of Peptidic Natural Products

Emma Ricart Altimiras, Frédérique Lisacek
The annotated peaks are not always the same as the scored peaks because some peaks with unusual neutral losses or adducts are not used for the scoring.  ...  Secondly, working with relational databases requires defining schemes that strictly fit with the data, reducing the flexibility and not allowing the introduction of unstructured data, as needed in our  ...  For the deployment of my tools I used Apache Tomcat, which benefits from a light footprint and high flexibility.  ... 
doi:10.13097/archive-ouverte/unige:147481 fatcat:rxvi4jnxj5cmrfr2yji32msccy