Large-scale collaborative analysis and extraction of web data

Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Manuel Calimlim
2008 Proceedings of the VLDB Endowment  
Archived web data is a great resource for scientific research, but poses serious challenges in data processing and management. We demonstrate the Web Lab Collaboration Server, a platform and service for large-scale collaborative web data analysis in a distributed computing environment, and show how it seamlessly supports non-technical users during search, data extraction and analysis.
doi:10.14778/1454159.1454205 fatcat:2uiln2tqsrhxbjrcer6fekdbbe