Filters








7,201 Hits in 7.4 sec

Advertising Keywords Recommendation for Short-Text Web Pages Using Wikipedia

Weinan Zhang, Dingquan Wang, Gui-Rong Xue, Hongyuan Zha
2012 ACM Transactions on Intelligent Systems and Technology  
Given a target Web page, we propose to use a content-biased PageRank on the Wikipedia graph to rank the related entities.  ...  With these two biases, advertising keywords that are both relevant to a target Web page and valuable for advertising are recommended.  ...  Hu et al. map the target to a Wikipedia thesaurus and use the entity content and links to enhance the query intent identification [Hu et al. 2009 ] and text clustering [Hu et al. 2008 ].  ... 
doi:10.1145/2089094.2089112 fatcat:b5dskprcejc2nmswnehtlnr374

A Survey on Mining Aspects for Queries

Haritha Padmanabhan, Derroll David
2017 IJARCCE  
Third, query facets may also be used to improve the diversity of the ten blue links.  ...  Third, query facets may also be used to improve the diversity of the ten blue links. There is a problem of finding query facets.  ...  Third, query facets may also be used to improve the diversity of the ten blue links.  ... 
doi:10.17148/ijarcce.2017.6529 fatcat:igypfbiyyfd4hlupxykfadq6xa

Context browsing with mobiles - when less is more

Yevgen Borodin, Jalal Mahmud, I.V. Ramakrishnan
2007 Proceedings of the 5th international conference on Mobile systems, applications and services - MobiSys '07  
Our prototype system, CMo, reduces information overload by allowing its users to see and navigate between fragments of a Web page.  ...  Our experiments show that the use of context can potentially save browsing time and improve mobile browsing experience.  ...  And, finally, we would like to extend our appreciation to our evaluators for their time and patience, and to our developers: Amogh Ranadive, Anish Jayavant, Rakesh Jawale, Abhijit Aparadh, and Dhiraj Chawla  ... 
doi:10.1145/1247660.1247665 dblp:conf/mobisys/BorodinMR07 fatcat:hegdow7lc5fmxjxj6k3eelvdt4

Business Intelligence and Analytics

Ee-Peng Lim, Hsinchun Chen, Guoqing Chen
2013 ACM Transactions on Management Information Systems  
The article aims to review the state-of-the-art techniques and models and to summarize their use in BIA applications.  ...  The new insights can be used for improving products and services, achieving better operational efficiency, and fostering customer relationships.  ...  Advanced information extraction, topic identification, opinion mining, and time-series analysis techniques can be applied to traditional business information and the new BI 2.0 contents for various accounting  ... 
doi:10.1145/2407740.2407741 fatcat:enjlhfzqfnfqvlv63axun5dvee

Topic Modeling for Wikipedia Link Disambiguation

Bradley Skaggs, Lise Getoor
2014 ACM Transactions on Information Systems  
We propose a novel statistical topic model, which we refer to as the Link Text Topic Model (lttm), that can suggest new link targets for existing ambiguous links in Wikipedia articles.  ...  We evaluate lttm on this ground truth, and demonstrate its superiority over existing link-and content-based approaches.  ...  run in web browers, and is accessible on most PCs JavaScript, a web scripting language with no direct relationship to the Java platform Consumables Java (cigarette), a brand of Russian cigarettes Java  ... 
doi:10.1145/2633044 fatcat:bk3vno2hzbc55klqryexmx4q4m

A Scalable Approach to Harvest Modern Weblogs

Vangelis Banos, Olivier Blanvillain, Nikos Kasioumis, Yannis Manolopoulos
2015 International journal on artificial intelligence tools  
More precisely, our work concentrates on techniques to automatically extract content such as articles, authors, dates and comments from blog posts.  ...  To achieve this goal, we introduce a simple yet robust and scalable algorithm to generate extraction rules based on string matching using the blog's web feed in conjunction with blog hypertext.  ...  Acknowledgments Acknowledgments to G. Gkotsis from the University of Warwick for generously sharing his research material, time, and ideas with us.  ... 
doi:10.1142/s0218213015400059 fatcat:mggtuhzpzzfz5gsd3gydj4dpoi

Identifying, Collecting, and Presenting Hacker Community Data: Forums, IRC, Carding Shops, and DNMs

Po-Yi Du, Ning Zhang, Mohammedreza Ebrahimi, Sagar Samtani, Ben Lazarine, Nolan Arnold, Rachael Dunn, Sandeep Suntwal, Guadalupe Angeles, Robert Schweitzer, Hsinchun Chen
2018 2018 IEEE International Conference on Intelligence and Security Informatics (ISI)  
In this paper, we summarize our efforts in systematically identifying and automatically collecting a large-scale of hacker forums, carding shops, Internet-Relay-Chat, and Dark Net Marketplaces.  ...  To combat this issue, researchers and practitioners put enormous efforts into developing Cyber Threat Intelligence, or the process of identifying emerging threats and key hackers.  ...  We followed these links and identified if they contained valuable cybersecurity content. The newly identified platforms were used as new seeds to identify additional platforms. B.  ... 
doi:10.1109/isi.2018.8587327 dblp:conf/isi/DuZESLADSASC18 fatcat:ndazzm4f4reondspeccsdd5724

Augmented EHR: Enrichment of EHR with Contents from Semantic Web Sources

Alejandro Mañas-García, José Alberto Maldonado, Mar Marcos, Diego Boscá, Montserrat Robles
2021 Applied Sciences  
The results are converted into a standardized EHR extract according to an archetype. This work sets the foundations to transform Semantic Web contents into normalized EHR extracts.  ...  Finally, to exemplify the approach, the work includes a practical use case in which the summarized EHR is augmented with drug–drug interactions and disease-related treatment information.  ...  For each EHR augmentation, this repository holds: The XQuery script to generate augmented EHR extracts starting from initial EHR extracts and normalized augmentation contents • (I) To set up new EHR augmentations  ... 
doi:10.3390/app11093978 doaj:8172b04fc6334b999d0b4dd399244190 fatcat:2ycx3aw6vrg7xfoxh4dxp5672m

Wikimantic: Disambiguation for Short Queries [chapter]

Christopher Boston, Sandra Carberry, Hui Fang
2012 Lecture Notes in Computer Science  
By exploiting Wikipedia articles and their reference relations, our method is able to disambiguate terms in particularly short queries with few context words.  ...  This work is part of a larger project to retrieve information graphics in response to user queries.  ...  Acknowledgments This work uses Microsoft Web N-gram Services and was supported by the National Science Foundation under Grants III-1016916 and IIS-1017026.  ... 
doi:10.1007/978-3-642-31178-9_13 fatcat:2ytmbb5pprdzncqodvnpqodjry

WebSets: Extracting Sets of Entities from the Web Using Unsupervised Information Extraction [article]

Bhavana Dalvi, William W. Cohen, Jamie Callan
2013 arXiv   pre-print
The method can be efficiently applied to a large corpus, and experimental results on several datasets show that our method can accurately extract large numbers of concept-instance pairs.  ...  Most earlier approaches to this problem rely on combining clusters of distributionally similar terms and concept-instance pairs obtained with Hearst patterns.  ...  Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon.  ... 
arXiv:1307.0261v1 fatcat:ufqyye2nhjh5vot5afzuglaklm

WebSets

Bhavana Bharat Dalvi, William W. Cohen, Jamie Callan
2012 Proceedings of the fifth ACM international conference on Web search and data mining - WSDM '12  
The method can be efficiently applied to a large corpus, and experimental results on several datasets show that our method can accurately extract large numbers of concept-instance pairs.  ...  Most earlier approaches to this problem rely on combining clusters of distributionally similar terms and conceptinstance pairs obtained with Hearst patterns.  ...  Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon.  ... 
doi:10.1145/2124295.2124327 dblp:conf/wsdm/DalviCC12 fatcat:ozcdmq4t75ax5afmdkg35qqtbm

Of mice and terms

Nicola Raffaele Di Matteo, Silvio Peroni, Fabio Tamburini, Fabio Vitali
2010 Proceedings of the 2010 ACM Symposium on Applied Computing - SAC '10  
Web technologies.  ...  In this paper we extend our previous studies using FolksEngine and offer a new query expansion algorithms based on Natural Language Processing techniques, and a new view for the results based on Semantic  ...  , for example by linking them to the Linked Data graph.  ... 
doi:10.1145/1774088.1774262 dblp:conf/sac/MatteoPTV10 fatcat:4n44xk5dnzchrce3dkc3763fku

Applications of Advanced Analysis Technologies in Precise Governance of Social Media Rumors

Xinyu Du, Limei Ou, Ye Zhao, Qi Zhang, Zongmin Li
2021 Applied Sciences  
Social media rumor precise governance is conducive to better coping with the difficulties of rumor monitoring within massive information and improving rumor governance effectiveness.  ...  This paper is beneficial to clarify and promote the promising thought of social media rumor precise governance and create impacts on the technologies' applications in this area.  ...  information, it will help to make targeted improvements.  ... 
doi:10.3390/app11156726 fatcat:a2wmcjgqzraktfo75hjlcu4jfe

Accessibility of Tables in PDF Documents

Nosheen Fayyaz, Shah Khusro, Shakir Ullah
2021 Information Technology and Libraries  
Among these elements, tables are particularly important because they can add value to the resource description, discovery, and accessibility of documents not only on the web but also in libraries if they  ...  People access and share information over the web and in other digital environments, including digital libraries, in the form of documents such as books, articles, technical reports, etc.  ...  The researchers claimed improvement in table schema identification and quality of relation. 54 Similarly, ontologies are used to identify the semantic relations among the text, table contents, and table  ... 
doi:10.6017/ital.v40i3.12325 fatcat:6bzai4e2crd7bkqvmads52v5cy

Automated text summarization and the SUMMARIST system

Eduard Hovy, Chin-Yew Lin
1996 Proceedings of a workshop on held at Baltimore, Maryland October 13-15, 1998 -  
The scale may vary from book-length to paragraphlength. Different summarization techniques may apply to some genres and scales and not others.  ...  Output: characteristics of the summary as a text Derivation: Extract vs. abstract: An extract is a collection of passages (ranging from single words to whole paragraphs) extracted from the input text(s  ...  , Th6r~se Firmin Hand, Sara Shelton, and Beth Sundheim for discussions about evaluation, and especially Sara Shelton for continued encouragement.  ... 
doi:10.3115/1119089.1119121 dblp:conf/tipster/HovyL98 fatcat:af4ybwnp5fek3koqeregpf4l7a
« Previous Showing results 1 — 15 out of 7,201 results