6 Hits in 1.9 sec

E2EProf: Automated End-to-End Performance Management for Enterprise Systems

Sandip Agarwala, Fernando Alegre, Karsten Schwan, Jegannathan Mehalingham
2007 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07)  
The E2EProf toolkit enables the efficient and nonintrusive capture and analysis of end-to-end program behavior for complex enterprise applications.  ...  E2EProf permits an enterprise to recognize and analyze performance problems when they occur -online, to take corrective actions as soon as possible and whereever necessary along the paths currently taken  ...  We show these results simply to demonstrate E2EProf's utility for online and automated system management, in addition to its already proven use by system administrators to diagnose performance problems  ... 
doi:10.1109/dsn.2007.38 dblp:conf/dsn/AgarwalaASM07 fatcat:btgcautobvhtlkulowbkofygp4

Performance troubleshooting in data centers

Chengwel Wang, Soila P. Kavulya, Jiaqi Tan, Liting Hu, Mahendra Kutare, Mike Kasick, Karsten Schwan, Priya Narasimhan, Rajeev Gandhi
2013 ACM SIGOPS Operating Systems Review  
E2EProf: Automated End-to-End Performance Management for Enterprise Systems.  ...  the whole data center and/or the end-to-end performance of the applications.  ... 
doi:10.1145/2553070.2553079 fatcat:musiuzk4hnd4xdjhcagqkxbday


Liting Hu, Karsten Schwan, Ajay Gulati, Junjie Zhang, Chengwei Wang
2012 Proceedings of the 9th international conference on Autonomic computing - ICAC '12  
To address this problem, we present 'Net-Cohort', which offers lightweight system-level techniques to (1) discover VM ensembles and (2) collect information about intra-ensemble VM interactions.  ...  Placements based on ensemble information provided by Net-Cohort can result in an up to 385% improvement in application throughput for a RUBiS instance, a 56.4% improvement in application throughput for  ...  Compared to Net-Cohort, E2EProf has higher runtime overheads due to capturing the end-to end latencies of all requests in multi-tier systems and applying cross correlation analysis to all network flows  ... 
doi:10.1145/2371536.2371540 dblp:conf/icac/HuSGZW12 fatcat:zga2ss4hqzanjnjunepttdlgce

VScope: Middleware for Troubleshooting Time-Sensitive Data Center Applications [chapter]

Chengwei Wang, Infantdani Abel Rayan, Greg Eisenhauer, Karsten Schwan, Vanish Talwar, Matthew Wolf, Chad Huneycutt
2012 Lecture Notes in Computer Science  
Data-Intensive infrastructures are increasingly used for online processing of live data to guide operations and decision making.  ...  VScope is a flexible monitoring and analysis middleware for troubleshooting such large-scale, time-sensitive, multi-tier applications.  ...  However, VMs' resource contention on I/O devices can degrade the end-to-end performance of the application.  ... 
doi:10.1007/978-3-642-35170-9_7 fatcat:2aevtezkyjdktdujs7d7nhhq3m

A flexible architecture integrating monitoring and analytics for managing large-scale data centers

Chengwei Wang, Karsten Schwan, Vanish Talwar, Greg Eisenhauer, Liting Hu, Matthew Wolf
2011 Proceedings of the 8th ACM international conference on Autonomic computing - ICAC '11  
To effectively manage large-scale data centers and utility clouds, operators must understand current system and application behaviors.  ...  Results show that the approach provides the flexibility to meet the demands of autonomic management at large scale with considerably better performance/cost than traditional and brute force solutions.  ...  A resulting challenge is: how to design a cost-effective system minimizing management cost while yielding the best possible performance?.  ... 
doi:10.1145/1998582.1998605 dblp:conf/icac/WangSTEHW11 fatcat:ebelm7tgjvenjhx4aqdr7yabo4

Automatic performance diagnosis and recovery in cloud microservices

Li Wu, Technische Universität Berlin, Odej Kao
Therefore, there is an urgent need for an automatic performance problem management system that can not only detect anomalous behaviors (performance anomalies) but also uncover the root causes and recommend  ...  To this end, this thesis contributes: (1) a method for locating the faulty service from which a performance anomaly originates, including a graphical model for capturing the propagation of the anomaly  ...  Figure 9 An overall architecture of performance problem management system for cloud microservices. . . . . . . . . .  ... 
doi:10.14279/depositonce-14959 fatcat:byvzjzcizvar5bpwqhwcs4krcu