Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
2009
International Conference on Management of Data
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text documents. ...
CH is part of a bigger effort to collect and reuse material generated in business service engagements. ...
Business Insight with CH. We now illustrate the kind of business insight possible with CH. Recall the original document shown in Figure 1. ...
dblp:conf/comad/SrivastavaC09
fatcat:plu6vkulvnejvoelh4ypmxauzi
Machine Learning and Cloud Computing: Survey of Distributed and SaaS Solutions
[article]
2016
arXiv
pre-print
A second line of products is augmenting existing tools with plugins that allow users to create a Hadoop cluster in the cloud and run jobs on it. ...
Next on the list are libraries of distributed implementations for ML algorithms, and on-premise deployments of complex systems for data analytics and data mining. ...
Analytics tools allow end-users to harvest the meaningful patterns buried in large volumes of structured and unstructured data. ...
arXiv:1603.08767v1
fatcat:vuzeggijyfbb7bmlcqdnt3xdjy
Blending Big Data Analytics: Review on Challenges and a Recent Study
2019
IEEE Access
Variety is related to the interdisciplinary type of data, which are typically collected from a different source, format, and type. ...
With unstructured data, modern businesses require new methods to analyze various big data [4]. ...
doi:10.1109/access.2019.2923270
fatcat:dmtpplybtncdho5cvaiwb6mgy4
Social media analytics: a survey of techniques, tools and platforms
2014
AI & Society: The Journal of Human-Centred Systems and Machine Intelligence
Analyzing social media, in particular Twitter feeds for sentiment analysis, has become a major research and business activity due to the availability of web-based application programming interfaces (APIs ...
The principal contribution of this paper is to provide an overview (including code fragments) for scientists seeking to utilize social media scraping and analytics either in their research or business. ...
Acknowledgments. The authors would like to acknowledge Michal Galas who led the design and implementation of the UCL Social-STORM platform with the assistance of Ilya Zheludev, Kacper Chwialkowski and Dan ...
doi:10.1007/s00146-014-0549-4
fatcat:eaezr6dis5fsvjv3o52zjuwhde
Analysis of Human Behavior by Mining Textual Data: Current Research Topics and Analytical Techniques
2021
Symmetry
data with a focus on enabling classification of psychological behaviors regarding emotion, cognition, and social empathy. ...
Our findings show that, despite recent advancements in predicting human behaviors based on unstructured textual data, significant developments in data analytics systems for identification, determination ...
[64, 109] collected insightful information from customers by analyzing textual data from various documents to improve business operations and performance. ...
doi:10.3390/sym13071276
fatcat:hi5x22zfjfav3oorqm77rtaaaq
Architectural thinking and modeling with the Architects' Workbench
2006
IBM Systems Journal
This paper presents key AWB innovations and discusses how their design was motivated by insights into architectural work and feedback from IT architects. ...
Collecting and organizing all of the architectural information for a system is a challenge faced by information technology (IT) architects. ...
IBM Research managing the Business Application Modeling group. ...
doi:10.1147/sj.453.0481
fatcat:klbdoktzufbmjcj76t7jfnzuuq
Knowledge encapsulation framework for technosocial predictive modeling
2012
Security Informatics
Commonly, this evidence is distilled from large data sets with significant amount of culling and searching through a variety of sources including traditional and social media. ...
, and content analysis within a collaborative environment, with a functional interface to models and simulations. ...
KEF project, number of comments, and whether this document is used to seed future harvests from the ADM) the original content history of the article within KEF (to preserve provenance of harvest, edits ...
doi:10.1186/2190-8532-1-10
fatcat:q5t52u5gfrfztfsyquxp4fekkq
An Introductory Guide to Data Science: The Terminological Landscape
2017
Social Science Research Network
First, we report results of a literature review that identifies and defines the essential content domain of data science, with special focus on the classification of data collection techniques. ...
Digital reports can be PDFs, word processor formatted documents, images, or any other human readable documents. Each report reading will require distinctive data collection methods. ...
doi:10.2139/ssrn.2920842
fatcat:72hmezczrbdq3argv6hterzpw4
Evaluating FPGA-acceleration for real-time unstructured search
2012
2012 IEEE International Symposium on Performance Analysis of Systems & Software
Emerging data-centric workloads that operate on and harvest useful insights from large amounts of unstructured data require corresponding new data-centric system architecture optimizations. ...
We focus on an important class of data-centric workloads, realtime unstructured search, or information filtering, where large collections of documents are scored against specific topic profiles, and present ...
Representative of these operations, in this paper, we focus on real-time unstructured search or information filtering where given a collection of unstructured data sources (e.g., documents), we identify ...
doi:10.1109/ispass.2012.6189226
dblp:conf/ispass/ChalamalasettiMVWR12
fatcat:twafxckf5ze23cstop4xvgxanq
Data-Driven Participation: Algorithms, Cities, Citizens, and Corporate Control
2016
Urban Planning
We ground theory and praxis with a report on the uneven impacts of algorithmic civic participation underway in the Canadian city of Toronto. ...
We move to a praxis level and examine the motivations of local planners to adopt and increasingly automate forms of VGI as a form of citizen engagement. ...
Acknowledgments. We are grateful for the support from the following funders: SSHRC grant 895-2012-1023 "How the geospatial web 2.0 is reshaping government-citizen interactions" and Mitacs Accelerate PhD ...
doi:10.17645/up.v1i2.645
fatcat:vivi4uvfbbeshkxjqeuyng3wjm
Big Data Implementation in Malaysian Public Sector: A Review
2020
International Journal of Academic Research in Business and Social Sciences
Big Data is a new worldwide phenomenon in information and knowledge management, in which huge data sets are collected and analyzed for further use in many sectors including security, business, investment ...
The explosion of information sparked from mobile and internet technologies such as via social media and government agencies data gives the Big Data management an ultimate challenge for its characteristics ...
Faculty of Information Management, UiTM Selangor, Malaysia 2. Advanced Analytics Engineering Center (AAEC), UiTM Malaysia ...
doi:10.6007/ijarbss/v10-i11/9072
fatcat:s7osojy5rreonhtfrh7p2ejqeq
An Architectural Approach to Cognitive Information Systems
2020
Acta Polytechnica Hungarica
The most significant components for modeling are: semi-structured documents, business processes, constituents of knowledge management, the enterprise and the information architecture, including self-directing ...
The fast changes in information technology and business needs have led to the evolution and development of Cognitive Information Systems (CIS). ...
The data are stored in an unstructured format in Data Lakes, in a structured format in Data Warehouses. ...
doi:10.12700/aph.17.2.2020.2.13
fatcat:rsfe6fihjnckjkj5q463kyb6ae
Changing the corporate IT development model: Tapping the power of grassroots computing
2007
IBM Systems Journal
We also describe the experience at IBM in building, deploying, and managing the IBM Situational Applications Environment that enables employees to take responsibility for some of their own solutions. ...
are created rapidly by teams or individuals who best understand the business need, but without the overhead and formality of traditional information technology (IT) methods. ...
We also thank many IBM colleagues (too numerous to individually name) who either contributed to the development of SAE or shared their experiences and "lessons learned" with the situational-applications ...
doi:10.1147/sj.464.0743
fatcat:q5obbkthobdgppxniyz3k7fibe
Analyzing Analytics
2015
Synthesis Lectures on Computer Architecture
Many organizations today are faced with the challenge of processing and distilling information from huge and growing collections of data. ...
Such organizations are increasingly deploying sophisticated mathematical algorithms to model the behavior of their business processes to discover correlations in the data, to predict trends and ultimately ...
The input data can be scalar, structured with one or more dimensions, or unstructured, and is usually read from files, streams or relational tables in the binary or text format. ...
doi:10.2200/s00678ed1v01y201511cac035
fatcat:jkjywe5rzzaupjwq5rjyavqxi4
The State of Digital Preservation: An International Perspective
2003
Library collections, acquisitions & technical services
In partnership with other organizations, CLIR helps create services that expand the concept of "library" and supports the providers and preservers of information. ...
The Council on Library and Information Resources is an independent, nonprofit organization dedicated to improving the management of information for research, teaching, and learning. ...
Acknowledgments. The assistance of the Council on Library and Information Resources (CLIR), the Digital Library Federation, and Documentation Abstracts, Inc., in supporting my participation in this symposium ...
doi:10.1016/s1464-9055(03)00076-9
fatcat:soydekgwhndw5hkvq7ileukegu