Filters








3,918 Hits in 4.1 sec

Modeling word occurrences for the compression of concordances

A. Bookstein, S. T. Klein, T. Raita
<span title="1997-07-01">1997</span> <i title="Association for Computing Machinery (ACM)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/a2o77szzyngfjiyr6oftvf2ts4" style="color: black;">ACM Transactions on Information Systems</a> </i> &nbsp;
An earlier paper developed a procedure for compressing concordances, assuming that all elements occurred independently.  ...  The concordance is conceptualized as a set of bitmaps, in which the bit locations represent documents, and the one-bits represent the occurrence of given terms.  ...  Thus the transitional states are indeed transitional: Modeling Word Occurrences for the Compression of Concordances • -C4: Modeling Word Occurrences for the Compression of Concordances • current  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/256163.256166">doi:10.1145/256163.256166</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ujqtkflohrg57mbsqmengi2xtq">fatcat:ujqtkflohrg57mbsqmengi2xtq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809042621/http://www.cs.uml.edu/~haim/teaching/iws/tirsaa/sources/ACM_Transactions_on_Information_Systems/modeling_word_occurances.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/87/a0/87a05c1d5993f2c6b8564c33e2d2a715cf13721d.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/256163.256166"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Compression of concordances in full-text retrieval systems

Y. Choueka, A. S. Fraenkel, S. T. Klein
<span title="">1988</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 11th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;88</a> </i> &nbsp;
The concordance contains, for every word 'W' of the dictionary, the l.exicographically ordered list of all its coordinates in the text; it is accessed via the dictionary that contains for every word a  ...  Every occurrence of every word in the data base can be uniquely characterized by a sequence of numbers that give its exact position in the text.  ...  However, the hierarchical structure is lost, so that, for example, queries asking for the co-occurrence of scvcral words in the same sentence or paragraph are much harder to process.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/62437.62500">doi:10.1145/62437.62500</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/ChouekaFK88.html">dblp:conf/sigir/ChouekaFK88</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/4zt5iy4vmrestelf4h4lnkpir4">fatcat:4zt5iy4vmrestelf4h4lnkpir4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218123151/https://static.aminer.org/pdf/20170130/pdfs/sigir/q4zahgwzm6qsvvpl8erdch0ujmto3ber.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/2d/dd/2ddd3373358cf1be466d022832ab52a02bdcac94.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/62437.62500"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Improved techniques for processing queries in full-text systems

Y. Choueka, A. Fraenkel, S. Klein, E. Segal
<span title="">1987</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 10th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;87</a> </i> &nbsp;
Alternatively, in a system with e documents, the concordance can be replaced by a set of bit-maps of fixed length ~e, which are constructed for every different word of the database and serve as occurrence  ...  We propose to combine the concordance and bit-map approaches~ and show how this can speed up the processing of queries: fast ANDing and ORing of the maps in a preprocessing stage, lead to large I/O savings  ...  The concordance contains, for each distinct word of the data base, the ordered list of the coordinates of its occurrences.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/42005.42039">doi:10.1145/42005.42039</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/ChouekaFKS87.html">dblp:conf/sigir/ChouekaFKS87</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/bjqdmc4bs5a33fudbpzzpb6sx4">fatcat:bjqdmc4bs5a33fudbpzzpb6sx4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190217145154/https://static.aminer.org/pdf/20170130/pdfs/sigir/l5uy7z9zdcvdto46nrlp3tv8igpsmjbn.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/cb/f5/cbf5440fa1c22e7289cceee788c1457baece03f5.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/42005.42039"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Compression of indexes with full positional information in very large text databases

Gordon Linoff, Craig Stanfill
<span title="">1993</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;93</a> </i> &nbsp;
This paper describes a combination of compression methods which maybe used to reduce the size of inverted indexes for very large text databases.  ...  Using these compression methods on two different text sources (the King James Version of the Bible and a sample of Wall Street Journal Stories), the compressed index occupies less than 40% of the size  ...  The 50 most frequent words account for about 3570 of term occurrences.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/160688.160699">doi:10.1145/160688.160699</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/LinoffS93.html">dblp:conf/sigir/LinoffS93</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/2ytxbwyqwncabdv7v3y7bcsvfe">fatcat:2ytxbwyqwncabdv7v3y7bcsvfe</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190218070429/https://static.aminer.org/pdf/20170130/pdfs/sigir/t4vz8jorfepiwga65knur7mqiwhzlvtl.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/76/d3/76d33c8e688317ae636b735b5d9c247cbcb28e46.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/160688.160699"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

LDC online

Zhibiao Wu, Mark Liberman
<span title="">1997</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/pq6bwkrhtjc6dhh7k7xw3zmaqy" style="color: black;">Proceedings of the second ACM international conference on Digital libraries - DL &#39;97</a> </i> &nbsp;
The volume of LDC data roughly doubles every year. Few organizations have been able to afford to store and index all LDC data, or to develop the software needed for efficient search and retrieval.  ...  The Linguistic Data Consortium (LDC), an open consortium of universities, companies and government research laboratories, creates, collects and distributes speech and text databases, lexicons, and other  ...  some 58 CD-ROMs of compressed speech data, as well as four CD-ROMs of compressed text from similar sources for language modeling purposes.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/263690.263810">doi:10.1145/263690.263810</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/dl/WuL97.html">dblp:conf/dl/WuL97</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/q63chx7rffgw3ge5iwic6n54o4">fatcat:q63chx7rffgw3ge5iwic6n54o4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190217075507/https://static.aminer.org/pdf/PDF/000/157/377/ldc_online_a_digital_library_for_linguistic_research_and_development.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b2/81/b281cbd8f6ec4c8e7d38875ba06262eeddebd025.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/263690.263810"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Compression and the origins of Zipf's law of abbreviation [article]

R. Ferrer-i-Cancho, C. Bentz, C. Seguin
<span title="2016-05-04">2016</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Languages across the world exhibit Zipf's law of abbreviation, namely more frequent words tend to be shorter.  ...  The generalized version of the law - an inverse relationship between the frequency of a unit and its magnitude - holds also for the behaviours of other species and the genetic code.  ...  At a later stage, CB was also supported by the EVOLAEMP project and the DFG Center for Advanced Studies Words, Bones, Genes, Tools at the University of Tübingen.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1504.04884v3">arXiv:1504.04884v3</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/wka6sdzk3zdlxaqrdthkjhi3gy">fatcat:wka6sdzk3zdlxaqrdthkjhi3gy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200825083436/https://arxiv.org/pdf/1504.04884v3.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d3/08/d308263443cfc9ffe2e221a0a2a4977192536a1a.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1504.04884v3" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

Parameterised compression for sparse bitmaps

Alistair Moffat, Justin Zobel
<span title="">1992</span> <i title="ACM Press"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/ibcfmixrofb3piydwg5wvir3t4" style="color: black;">Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR &#39;92</a> </i> &nbsp;
Acknowledgement This work was supported by the Australian Research Council.  ...  The decrease in size for GNUbib is dramatic-it is very rare for any citation to con- tain two occurrences of the same word.  ...  Klein, IEEE Data Compression Conference, Snow- Compression of concordances in full-text re- trieval systems.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/133160.133210">doi:10.1145/133160.133210</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/sigir/MoffatZ92.html">dblp:conf/sigir/MoffatZ92</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/qwd2cw2o7bfhrchg4dbp6klmoy">fatcat:qwd2cw2o7bfhrchg4dbp6klmoy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20070917103701/http://widit.slis.indiana.edu/irpub/SIGIR/1992/pdf26.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/3d/6b/3d6ba9d0b5cf872ab09870e96a7a6bd2ebca54e3.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1145/133160.133210"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> acm.org </button> </a>

Moving Text Analysis Tools to the Cloud

Himanshu Vashishtha, Michael Smit, Eleni Stroulia
<span title="">2010</span> <i title="IEEE"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/xxehgqixcnevja5kfad6uir6wi" style="color: black;">2010 6th World Congress on Services</a> </i> &nbsp;
To that end, we have started migrating existing text analysis tools to the cloud, beginning with TAPoR, the Text Analysis Portal for Research.  ...  In our collaboration with Digital Humanists, we have started to examine the opportunities that the cloud offers to improving the response times of text-analysis tools so that users can comparatively analyze  ...  ACKNOWLEDGMENTS We thank Geoffrey Rockwell and the rest of the TAPoR team for introducing us to TAPoRware.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/services.2010.91">doi:10.1109/services.2010.91</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/services/VashishthaSS10.html">dblp:conf/services/VashishthaSS10</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ir5gdskeznggbhu2izfnmv2zcy">fatcat:ir5gdskeznggbhu2izfnmv2zcy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170922005418/http://www.mikesmit.com/wp-content/papercite-data/pdf/services2010.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/ae/08/ae08c3f54f6b63f36bc0d0677c4e3bc2ad921f18.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/services.2010.91"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> ieee.com </button> </a>

Sketching Word Vectors Through Hashing [article]

Behrang QasemiZadeh, Laura Kallmeyer
<span title="2018-08-30">2018</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
We propose a new fast word embedding technique using hash functions.  ...  models after their construction (and at a reduced dimensionality) imparts a high discriminatory power to the resulting embeddings.  ...  Instead of counting co-occurrences of a word with other context words in a corpus, we keep track of the count of the co-occurrences of words and the buckets that context words are assigned to.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1705.04253v2">arXiv:1705.04253v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/oy7alfjatneqtdwcgy7fceqb5m">fatcat:oy7alfjatneqtdwcgy7fceqb5m</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200826173159/https://arxiv.org/pdf/1705.04253v2.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/e0/42/e042acca58b7215929451bde302be160c71778aa.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1705.04253v2" title="arxiv.org access"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> arxiv.org </button> </a>

DocuBurst: Visualizing Document Content using Language Structure

Christopher Collins, Sheelagh Carpendale, Gerald Penn
<span title="">2009</span> <i title="Wiley"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/p2lpq6bugfcqxk44anrm6yki4m" style="color: black;">Computer graphics forum (Print)</a> </i> &nbsp;
DocuBurst is a radial, space-filling layout of hyponymy (the IS-A relation), overlaid with occurrence counts of words in a document of interest to provide visual summaries at varying levels of granularity  ...  Textual data is at the forefront of information management problems today. One response has been the development of visualizations of text data.  ...  Acknowledgements Thanks to Ravin Balakrishnan for advice and guidance. Funding for this research was provided by NSERC, iCore, SMART Technologies, and NECTAR.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1111/j.1467-8659.2009.01439.x">doi:10.1111/j.1467-8659.2009.01439.x</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/esq3653dgnayhbnqs62sl2vsz4">fatcat:esq3653dgnayhbnqs62sl2vsz4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20160401172520/http://vialab.science.uoit.ca/wp-content/papercite-data/pdf/col2009a.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/87/ae/87ae93c8e80a3e1d360899e3f60bda1db81c2f08.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1111/j.1467-8659.2009.01439.x"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>

Automatic Text Summarization Using a Machine Learning Approach [chapter]

Joel Larocca Neto, Alex A. Freitas, Celso A. A. Kaestner
<span title="">2002</span> <i title="Springer Berlin Heidelberg"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/2w3awgokqne6te4nvlofavy5a4" style="color: black;">Lecture Notes in Computer Science</a> </i> &nbsp;
These features are of two kinds: statistical -based on the frequency of some elements in the text; and linguistic -extracted from a simplified argumentative structure of the text.  ...  We will present a summarization procedure based on the application of trainable Machine Learning algorithms which employs a set of features extracted directly from the original text.  ...  Similar to the previous experiment, results for 20% of compression were superior to the results produced with 10% of compression.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/3-540-36127-8_20">doi:10.1007/3-540-36127-8_20</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/6hmitu6qljg3nkrae2vxhcm5uq">fatcat:6hmitu6qljg3nkrae2vxhcm5uq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20170809000602/https://www.cs.kent.ac.uk/people/staff/aaf/pub_papers.dir/SBIA-2002-Joel.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/30/e1/30e128568200e6777dc629bc6fb2fb95833aa98c.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/3-540-36127-8_20"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Restoring Arabic vowels through omission-tolerant dictionary lookup

Alexis Amid Neme, Sébastien Paumier
<span title="2019-04-25">2019</span> <i title="Springer Nature"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/qiptgj2ubngu3hrrsrkbdvpchi" style="color: black;">Language Resources and Evaluation</a> </i> &nbsp;
Our program performs the analysis of 5000 words/second for running text (20 pages/second).  ...  For restoring vowels, our resources are capable of identifying words in which the vowels are not shown, as well as words in which the vowels are partially or fully included.  ...  calculated on the basis of word types in texts not word occurrences.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10579-019-09464-6">doi:10.1007/s10579-019-09464-6</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/chdbye2d55fhxdvvbp4so2wloi">fatcat:chdbye2d55fhxdvvbp4so2wloi</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190513145453/https://hal.archives-ouvertes.fr/hal-02113751/document" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/23/e9/23e90752241b7f0461cb8d7ad285d04f56f09efb.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1007/s10579-019-09464-6"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> springer.com </button> </a>

Automatic Correction of Real-Word Errors in Spanish Clinical Texts

Daniel Bravo-Candel, Jésica López-Hernández, José Antonio García-Díaz, Fernando Molina-Molina, Francisco García-Sánchez
<span title="2021-04-21">2021</span> <i title="MDPI AG"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/taedaf6aozg7vitz5dpgkojane" style="color: black;">Sensors</a> </i> &nbsp;
In this work, a deep learning model were implemented for correcting real-word errors in clinical text.  ...  Then, the probability of a word being a real-word error is computed.  ...  For instance, by splitting texts in short sequences and counting their number of occurrences in a corpus, the probability of a word being a real-word error is obtained.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/s21092893">doi:10.3390/s21092893</a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pubmed/33919018">pmid:33919018</a> <a target="_blank" rel="external noopener" href="https://pubmed.ncbi.nlm.nih.gov/PMC8122440/">pmcid:PMC8122440</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/sdac7vet3ja5pmtphp76qa4gqu">fatcat:sdac7vet3ja5pmtphp76qa4gqu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210425111346/https://res.mdpi.com/d_attachment/sensors/sensors-21-02893/article_deploy/sensors-21-02893.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/b0/78/b078a405ad4b4bb5a7ee0d555bd409b905b80588.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/s21092893"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> mdpi.com </button> </a> <a target="_blank" rel="external noopener" href="https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8122440" title="pubmed link"> <button class="ui compact blue labeled icon button serp-button"> <i class="file alternate outline icon"></i> pubmed.gov </button> </a>

On the Randomness of Compressed Data

Shmuel T. Klein, Dana Shapira
<span title="2020-04-07">2020</span> <i title="MDPI AG"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/dmr4kpn2yreovpdxpdiqtjcrnu" style="color: black;">Information</a> </i> &nbsp;
We investigate this premise for a variety of known lossless compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method.  ...  that of Huffman.  ...  Another line of investigation deals with compression methods that are not for general purpose, but custom tailored for files with known structure, such as dictionaries, concordances, lists, B-trees and  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/info11040196">doi:10.3390/info11040196</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/hhisxjvmunalxojzreaib6kfym">fatcat:hhisxjvmunalxojzreaib6kfym</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200410233920/https://res.mdpi.com/d_attachment/information/information-11-00196/article_deploy/information-11-00196.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/7b/79/7b79104a94928fdaf356f586b494d7728c156c14.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/info11040196"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="unlock alternate icon" style="background-color: #fb971f;"></i> mdpi.com </button> </a>

Outilex, a linguistic platform for text processing

Olivier Blanc, Matthieu Constant
<span title="">2006</span> <i title="Association for Computational Linguistics"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/5n6volmnonf5tn6xputi5f2t3e" style="color: black;">Proceedings of the COLING/ACL on Interactive presentation sessions -</a> </i> &nbsp;
The platform includes several modules implementing the main operations for text processing and is designed to use large-coverage Language Resources.  ...  We present Outilex, a generalist linguistic platform for text processing.  ...  The platform includes a concordancer that allows for listing in their occurring context different occurrences of the patterns described in the grammar.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3115/1225403.1225422">doi:10.3115/1225403.1225422</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/conf/acl/BlancC06.html">dblp:conf/acl/BlancC06</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/ezsnnqiq6jbxvcffxpv3d3fpcy">fatcat:ezsnnqiq6jbxvcffxpv3d3fpcy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20110629121518/http://acl.ldc.upenn.edu/P/P06/P06-4019.pdf" title="fulltext PDF download" data-goatcounter-click="serp-fulltext" data-goatcounter-title="serp-fulltext"> <button class="ui simple right pointing dropdown compact black labeled icon button serp-button"> <i class="icon ia-icon"></i> Web Archive [PDF] <div class="menu fulltext-thumbnail"> <img src="https://blobs.fatcat.wiki/thumbnail/pdf/d5/c1/d5c12748d7a127badff28c876e115d1cf21f5a30.180px.jpg" alt="fulltext thumbnail" loading="lazy"> </div> </button> </a> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3115/1225403.1225422"> <button class="ui left aligned compact blue labeled icon button serp-button"> <i class="external alternate icon"></i> Publisher / doi.org </button> </a>
&laquo; Previous Showing results 1 &mdash; 15 out of 3,918 results