
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester

Biplav Srivastava, Yuan-Chi Chang
2009 International Conference on Management of Data  
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text documents.  ...  CH is part of a bigger effort to collect and reuse material generated in business service engagements.  ...  Business Insight with CH We now illustrate the kind of business insight possible with CH. Recall the original document shown in Figure 1.  ... 
dblp:conf/comad/SrivastavaC09 fatcat:plu6vkulvnejvoelh4ypmxauzi

Machine Learning and Cloud Computing: Survey of Distributed and SaaS Solutions [article]

Daniel Pop
2016 arXiv   pre-print
A second line of products is augmenting existing tools with plugins that allow users to create a Hadoop cluster in the cloud and run jobs on it.  ...  Next on the list are libraries of distributed implementations for ML algorithms, and on-premise deployments of complex systems for data analytics and data mining.  ...  Analytics tools allow end-users to harvest the meaningful patterns buried in large volumes of structured and unstructured data.  ... 
arXiv:1603.08767v1 fatcat:vuzeggijyfbb7bmlcqdnt3xdjy

Blending Big Data Analytics: Review on Challenges and a Recent Study

Fairuz Amalina, Ibrahim Abaker Targio Hashem, Zati Hakim Azizul, Ang Tan Fong, Ahmad Firdaus, Muhammad Imran, Nor Badrul Anuar
2019 IEEE Access  
Variety is related to the interdisciplinary type of data, which are typically collected from a different source, format, and type.  ...  With unstructured data, modern businesses require new methods to analyze various big data [4].  ... 
doi:10.1109/access.2019.2923270 fatcat:dmtpplybtncdho5cvaiwb6mgy4

Social media analytics: a survey of techniques, tools and platforms

Bogdan Batrinca, Philip C. Treleaven
2014 AI & Society: The Journal of Human-Centred Systems and Machine Intelligence  
Analyzing social media, in particular Twitter feeds for sentiment analysis, has become a major research and business activity due to the availability of web-based application programming interfaces (APIs  ...  The principal contribution of this paper is to provide an overview (including code fragments) for scientists seeking to utilize social media scraping and analytics either in their research or business.  ...  Acknowledgments The authors would like to acknowledge Michal Galas who led the design and implementation of the UCL Social-STORM platform with the assistance of Ilya Zheludev, Kacper Chwialkowski and Dan  ... 
doi:10.1007/s00146-014-0549-4 fatcat:eaezr6dis5fsvjv3o52zjuwhde

Analysis of Human Behavior by Mining Textual Data: Current Research Topics and Analytical Techniques

Edgar Gutierrez, Waldemar Karwowski, Krzysztof Fiok, Mohammad Reza Davahli, Tameika Liciaga, Tareq Ahram
2021 Symmetry  
data with a focus on enabling classification of psychological behaviors regarding emotion, cognition, and social empathy.  ...  Our findings show that, despite recent advancements in predicting human behaviors based on unstructured textual data, significant developments in data analytics systems for identification, determination  ...  [64, 109] collected insightful information from customers by analyzing textual data from various documents to improve business operations and performance.  ... 
doi:10.3390/sym13071276 fatcat:hi5x22zfjfav3oorqm77rtaaaq

Architectural thinking and modeling with the Architects' Workbench

S. Abrams, B. Bloom, P. Keyser, D. Kimelman, E. Nelson, W. Neuberger, T. Roth, I. Simmonds, S. Tang, J. Vlissides
2006 IBM Systems Journal  
This paper presents key AWB innovations and discusses how their design was motivated by insights into architectural work and feedback from IT architects.  ...  Collecting and organizing all of the architectural information for a system is a challenge faced by information technology (IT) architects.  ...  IBM Research managing the Business Application Modeling group.  ... 
doi:10.1147/sj.453.0481 fatcat:klbdoktzufbmjcj76t7jfnzuuq

Knowledge encapsulation framework for technosocial predictive modeling

Michael C Madison, Andrew J Cowell, R Scott Butner, Keith Fligg, Andrew W Piatt, Liam R McGrath, Peter C Ellis
2012 Security Informatics  
Commonly, this evidence is distilled from large data sets with significant amount of culling and searching through a variety of sources including traditional and social media.  ...  , and content analysis within a collaborative environment, with a functional interface to models and simulations.  ...  KEF project, number of comments, and whether this document is used to seed future harvests from the ADM) the original content history of the article within KEF (to preserve provenance of harvest, edits  ... 
doi:10.1186/2190-8532-1-10 fatcat:q5t52u5gfrfztfsyquxp4fekkq

An Introductory Guide to Data Science: The Terminological Landscape

Abhinav Yedla, Shawn Dorius
2017 Social Science Research Network  
First, we report results of a literature review that identifies and defines the essential content domain of data science, with special focus on the classification of data collection techniques.  ...  Digital reports can be PDFs, word processor formatted documents, images, or any other human readable documents. Each report reading will require distinctive data collection methods.  ... 
doi:10.2139/ssrn.2920842 fatcat:72hmezczrbdq3argv6hterzpw4

Evaluating FPGA-acceleration for real-time unstructured search

SaiRahul Chalamalasetti, Martin Margala, Wim Vanderbauwhede, Mitch Wright, Parthasarathy Ranganathan
2012 2012 IEEE International Symposium on Performance Analysis of Systems & Software  
Emerging data-centric workloads that operate on and harvest useful insights from large amounts of unstructured data require corresponding new data-centric system architecture optimizations.  ...  We focus on an important class of data-centric workloads, realtime unstructured search, or information filtering, where large collections of documents are scored against specific topic profiles, and present  ...  Representative of these operations, in this paper, we focus on real-time unstructured search or information filtering where given a collection of unstructured data sources (e.g., documents), we identify  ... 
doi:10.1109/ispass.2012.6189226 dblp:conf/ispass/ChalamalasettiMVWR12 fatcat:twafxckf5ze23cstop4xvgxanq

Data-Driven Participation: Algorithms, Cities, Citizens, and Corporate Control

Matthew Tenney, Renee Sieber
2016 Urban Planning  
We ground theory and praxis with a report on the uneven impacts of algorithmic civic participation underway in the Canadian city of Toronto.  ...  We move to a praxis level and examine the motivations of local planners to adopt and increasingly automate forms of VGI as a form of citizen engagement.  ...  Acknowledgments We are grateful for the support from the following funders: SSHRC grant 895-2012-1023 "How the geospatial web 2.0 is reshaping government-citizen interactions" and Mitacs Accelerate PhD  ... 
doi:10.17645/up.v1i2.645 fatcat:vivi4uvfbbeshkxjqeuyng3wjm

Big Data Implementation in Malaysian Public Sector: A Review

Mohd Amiruddin Hamzah, Saiful Farik Mat Yatin, Maisahara Yusof, Tunku Sofiah Larasih T. Zainol Rashid, Hasnah Shuhaimi, Abu Bakar Suleiman, Ahmad Nazri Mansor, Khairul Mizan Taib
2020 International Journal of Academic Research in Business and Social Sciences  
Big Data is a new world phenomenon for information and knowledge management where the huge chunk of data set been collected and analyzed for further use in many sectors including security, business, investment  ...  The explosion of information sparked from mobile and internet technologies such as via social media and government agencies data gives the Big Data management an ultimate challenge for its characteristics  ...  Faculty of Information Management, UiTM Selangor, Malaysia 2. Advanced Analytics Engineering Center (AAEC), UiTM Malaysia  ... 
doi:10.6007/ijarbss/v10-i11/9072 fatcat:s7osojy5rreonhtfrh7p2ejqeq

An Architectural Approach to Cognitive Information Systems

Dóra Mattyasovszky-Philipp, Bálint Molnár
2020 Acta Polytechnica Hungarica  
The most significant components for modeling are: semi-structured documents, business processes, constituents of knowledge management, the enterprise and the information architecture, including self-directing  ...  The fast changes in information technology and business needs have led to the evolution and development of Cognitive Information Systems (CIS).  ...  The data are stored in an unstructured format in Data Lakes, in a structured format in Data Warehouses.  ... 
doi:10.12700/aph.17.2.2020.2.13 fatcat:rsfe6fihjnckjkj5q463kyb6ae

Changing the corporate IT development model: Tapping the power of grassroots computing

L. Cherbakov, A. Bravery, B. D. Goodman, A. Pandya, J. Baggett
2007 IBM Systems Journal  
We also describe the experience at IBM in building, deploying, and managing the IBM Situational Applications Environment that enables employees to take responsibility for some of their own solutions.  ...  are created rapidly by teams or individuals who best understand the business need, but without the overhead and formality of traditional information technology (IT) methods.  ...  We also thank many IBM colleagues (too numerous to individually name) who either contributed to the development of SAE or shared their experiences and ''lessons learned'' with the situational-applications  ... 
doi:10.1147/sj.464.0743 fatcat:q5obbkthobdgppxniyz3k7fibe

Analyzing Analytics

Rajesh Bordawekar, Bob Blainey, Ruchir Puri
2015 Synthesis Lectures on Computer Architecture  
Many organizations today are faced with the challenge of processing and distilling information from huge and growing collections of data.  ...  Such organizations are increasingly deploying sophisticated mathematical algorithms to model the behavior of their business processes to discover correlations in the data, to predict trends and ultimately  ...  The input data can be scalar, structured with one or more dimensions, or unstructured, and is usually read from files, streams or relational tables in the binary or text format.  ... 
doi:10.2200/s00678ed1v01y201511cac035 fatcat:jkjywe5rzzaupjwq5rjyavqxi4

The State of Digital Preservation: An International Perspective

Susan Hamburger
2003 Library collections, acquisitions & technical services  
In partnership with other organizations, CLIR helps create services that expand the concept of "library" and supports the providers and preservers of information.  ...  The Council on Library and Information Resources is an independent, nonprofit organization dedicated to improving the management of information for research, teaching, and learning.  ...  Acknowledgments The assistance of the Council on Library and Information Resources (CLIR), the Digital Library Federation, and Documentation Abstracts, Inc., in supporting my participation in this symposium  ... 
doi:10.1016/s1464-9055(03)00076-9 fatcat:soydekgwhndw5hkvq7ileukegu
Showing results 1 to 15 of 517 results