Estimating the value of automatic disambiguation
2007
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '07
A common motivation for personalised search systems is the ability to disambiguate queries based on some knowledge of a user's interests. ...
An analysis of log files from three search providers, covering a range of scenarios, suggests that this sort of disambiguation would be of marginal use for more specialised providers but may be of use ...
We would also like to thank the (anonymous) providers of log files for their support. ...
doi:10.1145/1277741.1277875
dblp:conf/sigir/ThomasR07
fatcat:i3lxdf7qijdrxfimxny2g6p2yu
Combination of an automatic and an interactive disambiguation method
1998
Proceedings of the 36th annual meeting on Association for Computational Linguistics
the interactive disambiguation and automatic one. ...
In this paper, we propose a technique to combine a method of interactive disambiguation and automatic one for ambiguous words. ...
To reduce the number of interaction, the automatic disambiguation is executed instead of executing the interactive disambiguation, estimating the loss of the accuracy L(i) in node i. ...
doi:10.3115/980691.980801
dblp:conf/acl/YamaguchiKIKN98
fatcat:7o677msvjngjti4a5ucxlrtv7m
Clustering Words with the MDL Principle
[article]
1996
arXiv
pre-print
We view the problem of clustering words as that of estimating a joint distribution over the Cartesian product of a partition of a set of nouns and a partition of a set of verbs, and propose an estimation ...
We also evaluated the method by conducting pp-attachment disambiguation experiments using an automatically constructed thesaurus. ...
Kobayashi of NEC C&C Res. Labs. for their constant encouragement. We thank Dr. K. Yamanishi of C&C Res. Labs. for his valuable comments. We thank Ms. Y. Yamaguchi of NIS for her programming effort. ...
arXiv:cmp-lg/9605014v2
fatcat:jktdqivfizbmdm5a5yagbv2m7q
Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations
2012
AMIA Annual Symposium Proceedings
Furthermore, we developed a strategy to combine sense frequency information estimated from a clustering analysis with the profile-based method. ...
Our results showed that the combined approach largely improved the performance and achieved a highest precision of 0.875 on the same test set, indicating that integrating sense frequency information with ...
Acknowledgement This study was supported by grants from the US NIH: NLM R01LM010681 (HX), R01LM8635 (CF), and R01LM010016 (CF). ...
pmid:23304376
pmcid:PMC3540457
fatcat:qvkthyylwzgo5a3ylhnhfdahsq
Clustering Words with the MDL Principle
1997
Journal of Natural Language Processing
We address the problem of automatically constructing a thesaurus (hierarchically clustering words) based on corpus data. We view the problem of clustering words as that of estimating a joint distribution ...
the latter. We also evaluated the method by conducting pp-attachment disambiguation experiments using an automatically constructed thesaurus. Our experimental results indicate that we can improve accuracy ...
doi:10.5715/jnlp.4.2_71
fatcat:dr26b2usdnad3o26q64turiwju
Uyghur-Chinese Translation Disambiguation Method Research Based on Knowledge Automatic-Acquisition
2014
Open Cybernetics and Systemics Journal
This thesis studies the disambiguation method in Uyghur-Chinese translation, and proposes the design philosophy of automatic-acquisition in translation label library aiming at the deficiency of disambiguation ...
From the experiment result, it can prove that the Uyghur-Chinese translation disambiguation framework based on automatically acquired corpus is effective, and increases with the increasing of the scale ...
ACKNOWLEDGEMENTS Sponsor acknowledgment: (1) Ministry of Education, Humanities and social science projects (No: 12XJJC740006). ...
doi:10.2174/1874110x01408010739
fatcat:klq5jr3io5h6xiwzfdxr4seuay
Page 205 of Computational Linguistics Vol. 28, Issue 2
[page]
2002
Computational Linguistics
The approach described in Clark and Weir (1999) is shown in Clark (2001) to have some impact on the pseudo-disambiguation task, but only with certain values of the α parameter, and ultimately does not ...
Finally, an issue that has not been much addressed in the literature (except by Li and Abe [1996]) is how the accuracy of class-based estimation techniques compare when automatically acquired classes, ...
Word Clustering and Disambiguation Based on Co-occurrence Data
[article]
1998
arXiv
pre-print
We then combined this clustering method with the disambiguation method of (Li & Abe 95) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made thesaurus ...
The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill & Resnik 94). ...
Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts. ...
arXiv:cmp-lg/9807004v1
fatcat:jmmtfqipnvbdbhp7aamute353e
Word clustering and disambiguation based on co-occurrence data
2002
Natural Language Engineering
We then combined this clustering method with the disambiguation method of (Li and Abe, 1995) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made ...
The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill and Resnik, 1994 ...
Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts. ...
doi:10.1017/s1351324902002838
fatcat:kxusagqqqjasnlfr7cgsta5sge
Word clustering and disambiguation based on co-occurrence data
1998
Proceedings of the 36th annual meeting on Association for Computational Linguistics
We then combined this clustering method with the disambiguation method of (Li and Abe, 1995) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made ...
The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill and Resnik, 1994 ...
Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts. ...
doi:10.3115/980691.980693
dblp:conf/acl/LiA98
fatcat:i2y4mebuk5h5rjjfzrlfbogmz4
Improving efficiency and accuracy in multilingual entity extraction
2013
Proceedings of the 9th International Conference on Semantic Systems - I-SEMANTICS '13
Finally, we present challenges and experiences to foment the discussion with other developers interested in recognition and disambiguation of entities in natural language text. ...
We compare our solution to the previous system, considering time performance, space requirements and accuracy in the context of the Dutch and English languages. ...
ACKNOWLEDGMENTS Parts of this work were funded by Google Summer of Code 2012 and by the FP7 grant Dicode (GA no. 257184). ...
doi:10.1145/2506182.2506198
dblp:conf/i-semantics/DaiberJHM13
fatcat:q6d6gyse6be4viho3lte5ppk3i
Automatic WordNet Construction Using Markov Chain Monte Carlo
2013
POLIBITS Research Journal on Computer Science and Computer Engineering With Applications
By applying MCMC techniques in estimating these probabilities, we integrate prior knowledge in the estimation and use the expected value of generated samples to give the final estimates. ...
We model the problem of constructing a Persian WordNet by estimating the probability of assigning senses (synsets) to Persian words. ...
For example maximize a posteriori estimation gives an estimate of θ as follows: This estimation provides the possibility that our expectation of what θ could be, affect the final estimated value for θ. ...
doi:10.17562/pb-47-2
fatcat:lymqfkcvfbhhflqosv57oixydu
Do we mean the same?
2009
Proceedings of the First International Workshop on Keyword Search on Structured Data - KEYS '09
In this paper we analyze the impact of selected document and database statistics on the effectiveness of keyword disambiguation for manually created as well as automatically extracted keyword queries. ...
Our evaluation is performed using a set of user queries from the AOL query log and a set of queries automatically extracted from Wikipedia articles both executed against the Internet Movie Database (IMDB ...
(A) is the number of non-zero values of the attribute A in an entity and DF(A) is the total number of entities containing a non-zero value of the attribute A. ...
doi:10.1145/1557670.1557682
dblp:conf/sigmod/DemidovaOF09
fatcat:kbstuhqjoree5monts6uswroz4
Dealing with Uncertainty in Lexical Annotation
2010
Revista de Informática Teórica e Aplicada
ALA performs automatic lexical annotation through the use of probabilistic annotations, i.e. an annotation is associated to a probability value. ...
We present ALA, a tool for the automatic lexical annotation (i.e. annotation w.r.t. a thesaurus/lexical resource) of structured and semi-structured data sources and the discovery of probabilistic lexical ...
As shown in the demo, through the GUI the user may have an estimation of the quality of the obtained annotations in terms of the number of annotated terms, the average probability of the annotations and ...
doi:10.22456/2175-2745.12580
fatcat:74twu7tluvcf7g3jczv5k7ugra
Page 283 of Computational Linguistics Vol. 25, Issue 2
[page]
1999
Computational Linguistics
We see that airline company can be the value of the arg1 (subject) slot, when the value of the arg2 (direct object) slot is airplane but not when it is airline company. ...
But, with that interpretation, word senses would have to be automatically disambiguated given the corpus data, and we would find ourselves left with the same problem. ...