Filters








156 Hits in 9.7 sec

Detecting Duplicate Posts in Programming QA Communities via Latent Semantics and Association Rules

Wei Emma Zhang, Quan Z. Sheng, Jey Han Lau, Ermyas Abebe
2017 Proceedings of the 26th International Conference on World Wide Web - WWW '17  
These features capture semantic similarities between questions and produce a strong performance for duplicate detection.  ...  Existing duplicate detection methodologies from traditional community based question-answering (CQA) websites are difficult to be adopted directly to PCQA, as PCQA posts often contain source code which  ...  RELATED WORK Our work is related to previous studies in two fields: 1) question retrieval from QA communities; and 2) Mining PCQA websites. Question retrieval from QA communities.  ... 
doi:10.1145/3038912.3052701 dblp:conf/www/ZhangSLA17 fatcat:7pxytsaa2ncjtipqlfgcuabgdm

Knowledge-based question answering using the semantic embedding space

Min-Chul Yang, Do-Gil Lee, So-Young Park, Hae-Chang Rim
2015 Expert systems with applications  
In the latent space, the semantic associations between existing features can be exploited based on their embeddings without using a manually produced lexicon and rules.  ...  In this study, our goal is to answer questions in any domains by using the semantic embedding space in which the embeddings encode the semantics of words and logical properties.  ...  Acknowledgments This research was supported by Next-Generation Information Computing Development Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science, ICT and  ... 
doi:10.1016/j.eswa.2015.07.009 fatcat:6wdmjjacwzevhnnzo7z32rl75m

SemEval-2017 Task 3: Community Question Answering

Preslav Nakov, Doris Hoogeveen, Lluís Màrquez, Alessandro Moschitti, Hamdy Mubarak, Timothy Baldwin, Karin Verspoor
2017 Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)  
Additionally, we added a new subtask E in order to enable experimentation with Multi-domain Question Duplicate Detection in a larger-scale scenario, using StackExchange subforums.  ...  We describe SemEval2017 Task 3 on Community Question Answering.  ...  It is part of the Interactive sYstems for Answer Search (IYAS) project, which is developed in collaboration with MIT-CSAIL. This research received funding in part from the Australian Research Council.  ... 
doi:10.18653/v1/s17-2003 dblp:conf/semeval/NakovHMMMBV17 fatcat:jv67mxsfpbc4tbuu6hffxi4exa

SemEval-2017 Task 3: Community Question Answering [article]

Preslav Nakov, Doris Hoogeveen, Lluís Màrquez, Alessandro Moschitti, Hamdy Mubarak, Timothy Baldwin, Karin Verspoor
2019 arXiv   pre-print
Additionally, we added a new subtask E in order to enable experimentation with Multi-domain Question Duplicate Detection in a larger-scale scenario, using StackExchange subforums.  ...  We describe SemEval-2017 Task 3 on Community Question Answering.  ...  It is part of the Interactive sYstems for Answer Search (IYAS) project, which is developed in collaboration with MIT-CSAIL. This research received funding in part from the Australian Research Council.  ... 
arXiv:1912.00730v1 fatcat:e7kuv74k7naxnmhgasp77qqrjq

Message from the general chair

Benjamin C. Lee
2015 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)  
Learning-based Multi-Sieve Co-reference Resolution with Knowledge Lev Ratinov and Dan Roth Saturday 11:00am-11:30am -202 A (ICC) We explore the interplay of knowledge and structure in co-reference resolution  ...  Compared with the best system from CoNLL-2011, which employs a rule-based method, our system shows competitive performance.  ...  Genre Independent Sub- group Detection in Online Discussion Threads: A Study of Implicit Atti- tude using Textual Latent Semantics P.  ... 
doi:10.1109/ispass.2015.7095776 dblp:conf/ispass/Lee15 fatcat:ehbed6nl6barfgs6pzwcvwxria

Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey [article]

Julian Wörmann, Daniel Bogdoll, Etienne Bührle, Han Chen, Evaristus Fuh Chuo, Kostadin Cvejoski, Ludger van Elst, Tobias Gleißner, Philip Gottschall, Stefan Griesche, Christian Hellert, Christian Hesels (+34 others)
2022 arXiv   pre-print
This work provides an overview of existing techniques and methods in the literature that combine data-based models with existing knowledge.  ...  Furthermore, predictions that conform with knowledge are crucial for making trustworthy and safe decisions even in underrepresented scenarios.  ...  However, the search space needs to be constrained via handcrafted rule templates and all logic programs are restricted to definite Datalog clauses, disallowing function symbols.  ... 
arXiv:2205.04712v1 fatcat:u2bgxr2ctnfdjcdbruzrtjwot4

A Birds Eye View on Knowledge Graph Embeddings, Software Libraries, Applications and Challenges [article]

Satvik Garg, Dwaipayan Roy
2022 arXiv   pre-print
In recent years, Knowledge Graph (KG) development has attracted significant researches considering the applications in web search, relation prediction, natural language processing, information retrieval  ...  This study intends to provide an overview of knowledge bases combined with different challenges and their impacts.  ...  In a semantic-based QA framework, the semantics of the query can be communicated by turning standard language-based questions into logic structures.  ... 
arXiv:2205.09088v1 fatcat:c4gfzg4ldras3axpf5wvbldstm

Machine Knowledge: Creation and Curation of Comprehensive Knowledge Bases [article]

Gerhard Weikum, Luna Dong, Simon Razniewski, Fabian Suchanek
2021 arXiv   pre-print
This machine knowledge can be harnessed to semantically interpret textual phrases in news, social media and web tables, and contributes to question answering, natural language processing and data analytics  ...  It covers models and methods for discovering and canonicalizing entities and their semantic types and organizing them into clean taxonomies.  ...  It is a great pleasure and honor to have such wonderful colleagues in our research community.  ... 
arXiv:2009.11564v2 fatcat:vh2lqfmhhbcwpf6dcsej3hhvgy

A Survey on Mining Software Repositories

Woosung JUNG, Eunjoo LEE, Chisu WU
2012 IEICE transactions on information and systems  
The data sources such as source control systems, bug tracking systems or archived communications, data types and techniques used for general MSR problems are also presented.  ...  Finally, evaluation approaches, opportunities and challenge issues are given.  ...  In [65] , SVM has been applied for the bug triage and in [147] , association rule mining has [155] duplicate bug detection [28] - [30] prediction [15] , [18] , [31] - [33] , [62] - [64]  ... 
doi:10.1587/transinf.e95.d.1384 fatcat:kfje3mzcufchzdj7qyt5smaaum

Filtering and Classifying Relevant Short Text with a Few Seed Words

Chenliang Li, Shiqian Chen, Yan Qi
2019 Data and Information Management  
The dominating topic of a short text is identified through post inference and then used for filtering and classification.  ...  SSCF infers two kinds of topics on pseudo-documents: category-topics and general-topics. Each category-topic is associated with one category of interest, covering the meaning of the latter.  ...  and filtering via a post inference process.  ... 
doi:10.2478/dim-2019-0011 fatcat:ekrqy7yumraqxcas5zg4wnhwjq

A Roadmap for Big Model [article]

Sha Yuan, Hanyu Zhao, Shuai Zhao, Jiahong Leng, Yangxiao Liang, Xiaozhi Wang, Jifan Yu, Xin Lv, Zhou Shao, Jiaao He, Yankai Lin, Xu Han (+88 others)
2022 arXiv   pre-print
Researchers have achieved various outcomes in the construction of BMs and the BM application in many fields.  ...  In each topic, we summarize clearly the current studies and propose some future research directions. At the end of this paper, we conclude the further development of BMs in a more general view.  ...  It associates each entity with a vector to capture its latent semantics. Each relation is represented as a matrix, which models pairwise interactions between latent factors.  ... 
arXiv:2203.14101v4 fatcat:rdikzudoezak5b36cf6hhne5u4

User-Generated Content in Social Media (Dagstuhl Seminar 17301)

Tat-Seng Chua, Norbert Fuhr, Gregory Grefenstette, Kalervo Järvelin, Jaakko Paltonen, Marc Herbstritt
2018 Dagstuhl Reports  
This report documents the program and the outcomes of Dagstuhl Seminar 17301 "User-Generated Content in Social Media". Social media have a profound impact on individuals, businesses, and society.  ...  As users post vast amounts of text and multimedia content every minute, the analysis of this user generated content (UGC) can offer insights to individual and societal concerns and could be beneficial  ...  In this talk, I illustrate, using historical Wikipedia associations, how community use and abuse changes the semantics and meaning of the images we use.  ... 
doi:10.4230/dagrep.7.7.110 dblp:journals/dagstuhl-reports/ChuaFGJP17 fatcat:bman5u6q5zdg7a6csnzwpba7sm

Interdisciplinary Perspectives on Place – Proceedings of the Second International Symposium on Platial Information Science (PLATIAL'19)

Franz-Benjamin Mocnik, Rene Westerholt
2020 Zenodo  
In contrast to abstract space, the way people experience places includes a range of aspects like physical setting, meaning, and emotional attachment.  ...  The formal representation of place – a major goal in GIScience related to place – is no exception and can only be successfully addressed if we consider geographical, psychological, anthropological, sociological  ...  Professor Kalina Bontcheva and colleagues at the University of Sheffield provided invaluable assistance in the use of GATE and GATEcloud natural language processing software.  ... 
doi:10.5281/zenodo.3628833 fatcat:idzsy3sbqzgntgyntz4wmmgrhe

EMBnet.journal 18 Suppl. B

EMBnet Journal
2012 EMBnet journal  
Acknowledgements The Onco-i2b2 project is funded by the "Regione Lombardia" in Italy. We gratefully acknowledge Prof. Carlo Bernasconi and the Collegio Ghislieri in Pavia for their active support.  ...  per la Biodiversità Molecolare" and "PON01 _ 02589 -MicroMap project "Caratterizzazione su larga scala del profilo metatrascrittomico e metagenomico di campioni animali in diverse condizioni fisiopatologiche  ...  Web browsers communicate with the interface using Javascript via Apache Tomcat.  ... 
doi:10.14806/ej.18.b.592 fatcat:wlwsmbdlfzbjtk7vyhiabdov6q

A taxonomy for software change impact analysis

Steffen Lehnert
2011 Proceedings of the 12th international workshop and the 7th annual ERCIM workshop on Principles on software evolution and software evolution - IWPSE-EVOL '11  
However, there has not been an extensive attempt made to summarize and review published approaches as a base for further research in the area.  ...  They are further classified according to the criteria of the taxonomy to enable the comparison and evaluation of approaches proposed in literature.  ...  The proposed IR approach uses latent semantic indexing (LSI) in combination with a term-by-document matrix to associate terms with documents.  ... 
doi:10.1145/2024445.2024454 dblp:conf/iwpse/Lehnert11 fatcat:s5ucqjzsr5c2pdoi3oeodnocpm
« Previous Showing results 1 — 15 out of 156 results