121,538 Hits in 11.3 sec

Building Large-Scale Software Organizations [chapter]

1988 Expert Systems for Engineering Design  
This paper describes DPSK, an environment for building organizations of distributed, collaborating programs.  ...  This paper traces the evolution of DPSK, describes its principal features and illustrates its use with some simple examples.  ...  For instance, an important problem may turn out to be unsolvable, an algorithm may fail to converge or a vital piece of data may prove to be unobtainable. • Large-Scale Organization: an organization with  ... 
doi:10.1016/b978-0-12-605110-0.50016-1 fatcat:4ntefepdhfdivbapjammfougsq

A Survey of Data Mining Activities in Distributed Systems

Waleed A. Mohammad, Hajar Maseeh Yasin, Azar Abid Salih, Adel AL-Zebari, Naaman Omar, Karwan Jameel Merceedi, Abdulraheem Jamil Ahmed, Nareen O. M. Salim, Sheren Sadiq Hasan, Shakir Fattah Kak, Ibrahim Mahmood Ibrahim
2021 Asian Journal of Research in Computer Science  
Managing massive amounts of data and processing them with limited resources is difficult. Large volumes of data, for instance, are swiftly generated and stored in many locations.  ...  Distributed systems, which may be utilized to do computations, are being developed as a result of the fast growth of sharing resources.  ...  Distributed computing also refers to the use of distributed systems to solve computational problems.  ... 
doi:10.9734/ajrcos/2021/v11i430267 fatcat:t4wcvxu2czd45imagjenzsvcqi

A Survey of Human Computation Systems

Man-Ching Yuen, Ling-Jyh Chen, Irwin King
2009 2009 International Conference on Computational Science and Engineering  
The human computation problems are the problems those computers are not good at solving but are trivial for humans.  ...  Human computation is a technique that makes use of human abilities for computation to solve problems.  ...  This work is also affiliated with the Microsoft-CUHK Joint Laboratory for Human-centric Computing and Interface Technologies.  ... 
doi:10.1109/cse.2009.395 dblp:conf/cse/YuenCK09 fatcat:jdoa7kar4fclrbxezqc3brtzny

Scaling up data mining algorithms: review and taxonomy

Nicolás García-Pedrajas, Aida de Haro-García
2012 Progress in Artificial Intelligence  
Thus, for many problems, especially when dealing with very large datasets, the only way to deal with the aforementioned problems is to scale up the data mining algorithm.  ...  In many cases, the demands of the algorithm in terms of the running time are very large, and mining methods cannot be applied when the problem grows.  ...  Acknowledgments This work was supported in part by the Grant TIN2008-03151 of the Spanish "Comisión Interministerial de Ciencia y Tecnología" and the Grant P09-TIC-4623 of the Regional Government of Andalucía  ... 
doi:10.1007/s13748-011-0004-4 fatcat:o53sri33rbf5dmjazaexfwvxeq

Analysis of problems of storage and processing of data in serverless technologies

Tetiana Naumenko, Anatolii Petrenko
2021 Technology Audit and Production Reserves  
The problem of interacting with data is an integral part of the work of almost all computer systems, as they lay the basis for the goal of creating such systems.  ...  storing and processing data requires modifications and new approaches to solving classical problems of working with data.  ...  This data will allow to calculate risks even at the stage of system design. As recommended above, further research should focus on solving the problems that remain in the industry.  ... 
doi:10.15587/2706-5448.2021.230174 doaj:15580f07cf0640c78fa5d5ddc2d94168 fatcat:rsutc4wg5rftpa57sky7gobsse

Distributed computing and data storage in proteomics: Many hands make light work, and a stronger memory

Kenneth Verheggen, Harald Barsnes, Lennart Martens
2013 Proteomics  
In this review, we provide an overview of the current techniques for distributed computing, along with examples of how the techniques are currently being employed in the field of proteomics.  ...  We thus underline the benefits of distributed computing in proteomics, while also pointing out the potential issues and pitfalls involved.  ...  Tandem version designed to handle both very large amounts of spectra, as well as very large databases.  ... 
doi:10.1002/pmic.201300288 pmid:24285552 fatcat:k3spgcffrfg6zak5ivledihiuq

Detecting performance anti-patterns for applications developed using object-relational mapping

Tse-Hsun Chen, Weiyi Shang, Zhen Ming Jiang, Ahmed E. Hassan, Mohamed Nasser, Parminder Flora
2014 Proceedings of the 36th International Conference on Software Engineering - ICSE 2014  
However, developers often write ORM code without considering the impact of such code on database performance, leading to cause transactions with timeouts or hangs in large-scale systems.  ...  We have successfully evaluated our framework on two open source and one large-scale industrial systems.  ...  The findings and opinions in this paper belong solely to the authors, and are not necessarily those of BlackBerry.  ... 
doi:10.1145/2568225.2568259 dblp:conf/icse/ChenSJHNF14 fatcat:hsgw24j6fnefvgjmf3n3xpa7iq

Large-Scale Parallel Computing on Grids

Henri Bal, Kees Verstoep
2008 Electronical Notes in Theoretical Computer Science  
We illustrate large-scale parallel computing on grids with three example applications that search large state spaces: transposition-driven search, retrograde analysis, and model checking.  ...  This paper argues that computational grids can be used for far more types of applications than just trivially parallel ones.  ...  The various DAS systems have been co-funded by the Netherlands Organization for Scientific Research (N.W.O.), the Netherlands National Computing Facilities foundation (N.C.F.), the Virtual Laboratory for  ... 
doi:10.1016/j.entcs.2008.11.010 fatcat:esb4axh6zberfdlf2m5y6w3rpe

Data Management in Large-Scale P2P Systems [chapter]

Patrick Valduriez, Esther Pacitti
2005 Lecture Notes in Computer Science  
Most techniques designed for distributed database systems which statically exploit schema and network information no longer apply.  ...  For instance, medical doctors in a hospital may want to share some patient data for an epidemiological study.  ...  Martins for their many inputs and fruitful discussions in the context of the APPA project, and S. Abiteboul and I. Manolescu for fruitful discussions in the context of the MDP2P project.  ... 
doi:10.1007/11403937_9 fatcat:nwjttulmdffx7l5s3lw25qd4ta

Evaluating accumulo performance for a scalable cyber data processing pipeline

Scott M. Sawyer, B. David O'Gwynn
2014 2014 IEEE High Performance Extreme Computing Conference (HPEC)  
For many applications, distributed NoSQL databases are effective alternatives to traditional relational database management systems.  ...  This paper considers a cyber situational awareness system that uses the Apache Accumulo database to provide scalable data warehousing, realtime data ingest, and responsive querying for human users and  ...  The authors would like to think the many people whose hard work helped make these experiments possible.  ... 
doi:10.1109/hpec.2014.7040978 dblp:conf/hpec/SawyerO14 fatcat:sxtl2xkpnfh25b3hlwfwzwhzvi

Evaluating Accumulo Performance for a Scalable Cyber Data Processing Pipeline [article]

Scott M. Sawyer, B. David O'Gwynn
2014 arXiv   pre-print
For many applications, distributed NoSQL databases are effective alternatives to traditional relational database management systems.  ...  This paper considers a cyber situational awareness system that uses the Apache Accumulo database to provide scalable data warehousing, real-time data ingest, and responsive querying for human users and  ...  The authors would like to think the many people whose hard work helped make these experiments possible. The LL-CySA ingest and query pipeline builds upon the work of our fellow members of the  ... 
arXiv:1407.5661v1 fatcat:wlyerumjajffbnq66qfur362wa

Lightweight component-based scalability

Hussein Suleman, Christopher Parker, Muammar Omar
2008 International Journal on Digital Libraries  
It can be argued that such distributed components also can be an enabler for scalability of service provision in medium to large systems.  ...  Finally, specific problems areas have been identified as future avenues for exploration at the crucial intersection of digital libraries and high performance computing.  ...  Content delivery networks (CDNs) such as Akamai [28] , for example, have solved this problem by making use of distributed networks of servers to deliver content to consumers.  ... 
doi:10.1007/s00799-008-0047-8 fatcat:hltg4yxrc5hv7frt4p3uyv27wq

DTR: Distributed Transaction Routing in a Large Scale Network [chapter]

Idrissa Sarr, Hubert Naacke, Stéphane Gançarski
2008 Lecture Notes in Computer Science  
In this article, we propose a novel solution for the distributed control of transaction routing in a large scale network.  ...  We leverage a cluster-oriented routing solution with a fully distributed approach that uses a large scale distributed directory to handle routing metadata.  ...  Conclusion This paper presents an ongoing work towards the design and implementation of a grid-based large scale data management system.  ... 
doi:10.1007/978-3-540-92859-1_46 fatcat:blvnk3rxcnfvnbjomwipbufvfa

Comparison of a Deductive Database with a Semantic Web reasoning engine

Simone A. Ludwig
2010 Knowledge-Based Systems  
The objective of this paper is the comparison and evaluation of a Deductive Database system (ConceptBase) with a Semantic Web reasoning engine (Racer).  ...  Issues such as documentation, feasibility, expressiveness, complexity, distribution, performance and scalability are investigated in order to explore the advantages and shortcomings of each system.  ...  subclasses and instances of instances Open world assumption; restriction must be explicitly stated Does not allow properties to be defined outside of a class No support for a numerical data type for the  ... 
doi:10.1016/j.knosys.2010.04.005 fatcat:f5snnhes2zdq5hlfyhhwvxihxq

Automatic Generation of Integration and Preprocessing Ontologies for Biomedical Sources in a Distributed Scenario

Alberto Anguita, David Pérez-Rey, José Crespo, Víctor Maojo
2008 2008 21st IEEE International Symposium on Computer-Based Medical Systems  
In this scenario, the authors have developed various methods and tools in the area of database integration, using an ontological approach.  ...  of the approach.  ...  Due to the special characteristics of biomedical information systems, a distributed approach has been adopted for the KDD model.  ... 
doi:10.1109/cbms.2008.71 dblp:conf/cbms/AnguitaPCM08 fatcat:74m6eocqgremzl2ohfhuozwjpi
« Previous Showing results 1 — 15 out of 121,538 results