Filters








22,266 Hits in 3.6 sec

Estimating the value of automatic disambiguation

Paul Thomas, Tom Rowlands
2007 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '07  
A common motivation for personalised search systems is the ability to disambiguate queries based on some knowledge of a user's interests.  ...  An analysis of log files from three search providers, covering a range of scenarios, suggests that this sort of disambiguation would be of marginal use for more specialised providers but may be of use  ...  We would also like to thank the (anonymous) providers of log files for their support.  ... 
doi:10.1145/1277741.1277875 dblp:conf/sigir/ThomasR07 fatcat:i3lxdf7qijdrxfimxny2g6p2yu

Combination of an automatic and an interactive disambiguation method

Masaya Yamaguchi, Takeyuki Kojima, Nobuo Inui, Yoshiyuki Kotani, Hirohiko Nisimura
1998 Proceedings of the 36th annual meeting on Association for Computational Linguistics -  
the interactive disambiguation and automatic one.  ...  In this paper, we propose a technique to combine a method of interactive disambiguation and automatic one for alnbiguous words.  ...  To reduce the number of interaction, the automatte disambiguation is executed instead of executing tile interactive disambiguation, estimating the loss of the accuracy L(i) ill node i.  ... 
doi:10.3115/980691.980801 dblp:conf/acl/YamaguchiKIKN98 fatcat:7o677msvjngjti4a5ucxlrtv7m

Clustering Words with the MDL Principle [article]

Hang Li, Naoki Abe
1996 arXiv   pre-print
We view the problem of clustering words as that of estimating a joint distribution over the Cartesian product of a partition of a set of nouns and a partition of a set of verbs, and propose an estimation  ...  We also evaluated the method by conducting pp-attachment disambiguation experiments using an automatically constructed thesaurus.  ...  Kobayashi of NEC C&C Res. Labs. for their constant encouragement. We thank Dr. K. Yamanishi of C&C Res. Labs. for his valuable comments. We thank Ms. Y. Yamaguchi of NIS for her programming effort.  ... 
arXiv:cmp-lg/9605014v2 fatcat:jktdqivfizbmdm5a5yagbv2m7q

Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations

Hua Xu, Peter D Stetson, Carol Friedman
2012 AMIA Annual Symposium Proceedings  
Furthermore, we developed a strategy to combine sense frequency information estimated from a clustering analysis with the profile-based method.  ...  Our results showed that the combined approach largely improved the performance and achieved a highest precision of 0.875 on the same test set, indicating that integrating sense frequency information with  ...  Acknowledgement This study was supported by grants from the US NIH: NLM R01LM010681 (HX), R01LM8635 (CF), and R01LM010016 (CF).  ... 
pmid:23304376 pmcid:PMC3540457 fatcat:qvkthyylwzgo5a3ylhnhfdahsq

Clustering Words with the MDL Principle

Hang Li, Naoki Abe
1997 Journal of Natural Language Processing  
We address the problem of automatically constructing a thesaurus(hierarchically clustering words)based on corpus data.We view the problem of clustering words as that of estimating a joint distribution  ...  the latter.We also evaluated the method by conducting pp-attachment disambiguation experiments using an automatically constructed thesaurus.Our experimental results indicate that we can improve accuracy  ...  We address the problem of automatically constructing a thesaurus(hierarchically clustering words)based on corpus data.We view the problem of clustering words as that of estimating a joint distribution  ... 
doi:10.5715/jnlp.4.2_71 fatcat:dr26b2usdnad3o26q64turiwju

Uyghur-Chinese Translation Disambiguation Method Research Based on Knowledge Automatic-Acquisition

Ren Ge, Yang Yong, Xu Chun
2014 Open Cybernetics and Systemics Journal  
This thesis studies the disambiguation method in Uyghur-Chinese translation, and proposes the design philosophy of automatic-acquisition in translation label library aiming at the deficiency of disambiguation  ...  From the experiment result, it can prove that the Uyghur-Chinese translation disambiguation framework based on automatically acquired corpus is effective, and increases with the increasing of the scale  ...  ACKNOWLEDGEMENTS Sponsor acknowledgment: (1) Ministry of Education, Humanities and social science projects (No: 12XJJC740006).  ... 
doi:10.2174/1874110x01408010739 fatcat:klq5jr3io5h6xiwzfdxr4seuay

Page 205 of Computational Linguistics Vol. 28, Issue 2 [page]

2002 Computational Linguistics  
The approach described in Clark and Weir (1999) is shown in Clark (2001) to have some impact on the pseudo-disambiguation task, but only with certain values of the a parameter, and ultimately does not  ...  Finally, an issue that has not been much addressed in the literature (except by Li and Abe [1996]) is how the accuracy of class-based estimation techniques compare when automatically acquired classes,  ... 

Word Clustering and Disambiguation Based on Co-occurrence Data [article]

Hang Li, Naoki Abe
1998 arXiv   pre-print
We then combined this clustering method with the disambiguation method of (Li & Abe 95) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made thesaurus  ...  The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill & Resnik 94).  ...  Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts.  ... 
arXiv:cmp-lg/9807004v1 fatcat:jmmtfqipnvbdbhp7aamute353e

Word clustering and disambiguation based on co-occurrence data

HANG LI
2002 Natural Language Engineering  
We then combined this clustering method with the disambiguation method of (Li and Abe, 1995) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made  ...  The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill and Resnik, 1994  ...  Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts.  ... 
doi:10.1017/s1351324902002838 fatcat:kxusagqqqjasnlfr7cgsta5sge

Word clustering and disambiguation based on co-occurrence data

Hang Li, Naoki Abe
1998 Proceedings of the 36th annual meeting on Association for Computational Linguistics -  
We then combined this clustering method with the disambiguation method of (Li and Abe, 1995) to derive a disambiguation method that makes use of both automatically constructed thesauruses and a hand-made  ...  The overall disambiguation accuracy achieved by our method is 85.2%, which compares favorably against the accuracy (82.4%) obtained by the state-of-the-art disambiguation method of (Brill and Resnik, 1994  ...  Doi of NEC C&C Media Res. Labs. for his encouragement. We thank Ms. Y. Yamaguchi of NIS for her programming efforts.  ... 
doi:10.3115/980691.980693 dblp:conf/acl/LiA98 fatcat:i2y4mebuk5h5rjjfzrlfbogmz4

Improving efficiency and accuracy in multilingual entity extraction

Joachim Daiber, Max Jakob, Chris Hokamp, Pablo N. Mendes
2013 Proceedings of the 9th International Conference on Semantic Systems - I-SEMANTICS '13  
Finally, we present challenges and experiences to foment the discussion with other developers interested in recognition and disambiguation of entities in natural language text.  ...  We compare our solution to the previous system, considering time performance, space requirements and accuracy in the context of the Dutch and English languages.  ...  ACKNOWLEDGMENTS Parts of this work were funded by Google Summer of Code 2012 and by the FP7 grant Dicode (GA no. 257184).  ... 
doi:10.1145/2506182.2506198 dblp:conf/i-semantics/DaiberJHM13 fatcat:q6d6gyse6be4viho3lte5ppk3i

Automatic WordNet Construction Using Markov Chain Monte Carlo

Marzieh Fadaee, Hamidreza Ghader, Heshaam Faili, Azadeh Shakery
2013 POLIBITS Research Journal on Computer Science and Computer Engineering With Applications  
By applying MCMC techniques in estimating these probabilities, we integrate prior knowledge in the estimation and use the expected value of generated samples to give the final estimates.  ...  We model the problem of constructing a Persian WordNet by estimating the probability of assigning senses (synsets) to Persian words.  ...  For example maximize a posteriori estimation gives an estimate of 6 as follows: This estimation provides the possibility that our expectation of what 6 could be, affect the final estimated value for 6.  ... 
doi:10.17562/pb-47-2 fatcat:lymqfkcvfbhhflqosv57oixydu

Do we mean the same?

Elena Demidova, Irina Oelze, Peter Fankhauser
2009 Proceedings of the First International Workshop on Keyword Search on Structured Data - KEYS '09  
In this paper we analyze the impact of selected document and database statistics on the effectiveness of keyword disambiguation for manually created as well as automatically extracted keyword queries.  ...  Our evaluation is performed using a set of user queries from the AOL query log and a set of queries automatically extracted from Wikipedia articles both executed against the Internet Movie Database (IMDB  ...  (A) is the number of non-zero values of the attribute A in an entity and DF(A) is the total number of entities containing a non-zero value of the attribute A.  ... 
doi:10.1145/1557670.1557682 dblp:conf/sigmod/DemidovaOF09 fatcat:kbstuhqjoree5monts6uswroz4

Dealing with Uncertainty in Lexical Annotation

Sonia Bergamaschi, Laura Po, Serena Sorrentino, Alberto Corni
2010 Revista de Informática Teórica e Aplicada  
ALA performs automatic lexical annotation through the use of probabilistic annotations, i.e. an annotation is associated to a probability value.  ...  We present ALA, a tool for the automatic lexical annotation (i.e. annotation w.r.t. a thesaurus/lexical resource) of structured and semi-structured data sources and the discovery of probabilistic lexical  ...  As shown in the demo, through the GUI the user may have an estimation of the quality of the obtained annotations in terms of the number of annotated terms, the average probability of the annotations and  ... 
doi:10.22456/2175-2745.12580 fatcat:74twu7tluvcf7g3jczv5k7ugra

Page 283 of Computational Linguistics Vol. 25, Issue 2 [page]

1999 Computational Linguistics  
We see that airline company can be the value of the arg1 (subject) slot, when the value of the arg2 (direct object) slot is airplane but not when it is airline company.  ...  But, with that interpretation, word senses would have to be automatically disambiguated given the corpus data, and we would find ourselves left with the same problem.  ... 
« Previous Showing results 1 — 15 out of 22,266 results