10 Hits in 5.2 sec

Batch is back: CasJobs, serving multi-TB data on the Web

W. O'Mullane, N. Li, M. Nieto-Santisteban, A. Szalay, A. Thakar, J. Gray
2005 IEEE International Conference on Web Services (ICWS'05)  
To ameliorate this problem, we developed a multi-server multi-queue batch job submission, execution, and tracking system for the CAS called CasJobs.  ...  CasJobs is built using SOAP XML Web services and has been in operation since May 2004.  ...  We successfully put the system on a server at another institute and had it serving up data from one of their databases in about one hour.  ... 
doi:10.1109/icws.2005.29 dblp:conf/icws/OMullaneLNST05 fatcat:6rovdfyygrc65pvhodw5okre3e

Extending the SDSS Batch Query System to the National Virtual Observatory Grid [article]

Maria A. Nieto-Santisteban, William O'Mullane, Jim Gray, Nolan Li, Tamas Budavari, Alexander S. Szalay, Aniruddha R. Thakar
2004 arXiv   pre-print
This implies development, in a distributed manner, of several features, which have been demonstrated for a single node in the SDSS Batch Query System (CasJobs).  ...  In response to this, we added a multi-queue job submission and tracking system. The transfer of very large result sets from queries over the network is another serious problem.  ...  The SkyServer front end is coded in ASP on a Microsoft.Net server and backed by a SQL Server database.  ... 
arXiv:cs/0403017v1 fatcat:cfv4x7opufacdcqypvjlprjdwm

Graywulf: A platform for federated scientific databases and services [article]

László Dobos and Alexander S. Szalay and Tamás Budavári and István Csabai and Nolan Li
2013 arXiv   pre-print
Uniform user access to the data is provided through a web based query interface and a data surface for software clients.  ...  Many fields of science rely on relational database management systems to analyze, publish and share data.  ...  Acknowledgments This research is partly funded by the Gordon and Betty Moore Foundation through Grant GBMF#554.02 to the Johns Hopkins University.  ... 
arXiv:1308.1440v1 fatcat:35akjhyx3japfo4lw7mqot73g4

Towards an Astronomical Science Platform: Experiences and Lessons Learned from Chinese Virtual Observatory [article]

Chenzhou Cui, Yihan Tao, Changhua Li, Dongwei Fan, Jian Xiao, Boliang He, Shanshan Li, Ce Yu, Linying Mi, Yunfei Xu, Jun Han, Sisi Yang (+7 others)
2020 arXiv   pre-print
The Chinese Virtual Observatory (China-VO) is one of the member projects in the International Virtual Observatory Alliance and it is dedicated to providing a research and education environment where globally  ...  In the era of big data astronomy, next generation telescopes and large sky surveys produce data sets at the TB or even PB level.  ...  Acknowledgments This work is supported by National Natural Science Foundation of China (NSFC)(11803055), the Joint Research Fund in Astronomy (U1731125, U1731243, U1931132) under cooperative agreement  ... 
arXiv:2005.10501v1 fatcat:52b75s5fcfauzissevyfeticyu

Large Science Databases – Are Cloud Services Ready for Them?

Ani Thakar, Alex Szalay, Ken Church, Andreas Terzis
2011 Scientific Programming  
Certainly it is impossible to migrate a large database in excess of a TB, but even with (much) smaller databases, the limitations of cloud services make it very difficult to migrate the data to the cloud  ...  We report on attempts to put an astronomical database – the Sloan Digital Sky Survey science archive – in the cloud.  ...  The Data-Scope project is supported by an MRI grant from the National Science Foundation (NSF). A. Thakar et al. / Large science databases -are cloud services ready for them?  ... 
doi:10.1155/2011/591536 fatcat:ruu3xrke7jcypfhya4ljczqp6y

Scientific data management in the coming decade

Jim Gray, David T. Liu, Maria Nieto-Santisteban, Alex Szalay, David J. DeWitt, Gerd Heber
2005 SIGMOD record  
., CIDR, 2005, 9 "Batch is back: CasJobs serving multi-TB data on the Web," W.  ...  The largest data analysis gap is in this man-machine interface. How can we put the scientist back in control of his data?  ... 
doi:10.1145/1107499.1107503 fatcat:u5h2iovwjjd2teg3vtrfs7lvdm

Scalable community-driven data sharing in e-science grids

Tobias Scholl, Bernhard Bauer, Benjamin Gufler, Richard Kuntschke, Angelika Reiser, Alfons Kemper
2009 Future generations computer systems  
HiSbase is an approach to data management in scientific federated Data Grids that addresses the scalability issue by combining established techniques of database research in the field of spatial data structures  ...  Considering the involved enormous and exponentially growing data volumes, centralized data management reaches its limits.  ...  Batch systems (such as CasJobs [20] ) offer less restrictive access to the data sources and sometimes even a private database for later processing or sharing the results with colleagues.  ... 
doi:10.1016/j.future.2008.05.006 fatcat:lyrd662cwvgotlriwnociwnzzi

Sloan Digital Sky Survey IV: Mapping the Milky Way, Nearby Galaxies, and the Distant Universe

Michael R. Blanton, Matthew A. Bershady, Bela Abolfathi, Franco D. Albareti, Carlos Allende Prieto, Andres Almeida, Javier Alonso-García, Friedrich Anders, Scott F. Anderson, Brett Andrews, Erik Aquino-Ortíz, Alfonso Aragón-Salamanca (+351 others)
2017 Astronomical Journal  
In keeping with previous SDSS policy, SDSS-IV provides regularly scheduled public data releases; the first one, Data Release 13, was made available in July 2016.  ...  The Apache Point Observatory Galactic Evolution Experiment 2 (APOGEE-2) is observing hundreds of thousands of Milky Way stars at high resolution and high signal-to-noise ratio in the near-infrared.  ...  Sloan Foundation, the U.S.  ... 
doi:10.3847/1538-3881/aa7567 fatcat:5qahp2fcdrd4bd6mgxacvw5foq

Citation analysis of database publications

Erhard Rahm, Andreas Thor
2005 SIGMOD record  
Gerhard serves on the editorial boards of ACM TODS and IEEE CS TKDE, and he was the program committee chair for the 2004 SIGMOD conference in Paris.  ...  He is an invited expert to the W3C Full-Text Task Force, and is also the recipient of the NSF CAREER Award and an IBM Faculty Award.  ...  "When Database Systems Meet the Grid," M. Nieto Santisteban et. al., CIDR, 2005, 9 "Batch is back: CasJobs serving multi-TB data on the Web," W.  ... 
doi:10.1145/1107499.1107505 fatcat:c4evhhw6y5difggghxsvvdqzeu

Efficient Evaluation of HAVING Queries on a Probabilistic Database [chapter]

Christopher Ré, Dan Suciu
Database Programming Languages  
The datasets produced by these simulations are in the TB and even PB ranges.  ...  The integrated method for the evaluation of threshold queries that we have developed achieves scalability through data-parallel execution of the computations on the nodes of an analysis database cluster  ...  This work is supported in part by the National Science Foundation under Grants CMMI-0941530, ACI-1261715, OCI-1244820 and AST-0939767 and Johns Hopkins University's Institute for Data Intensive Engineering  ... 
doi:10.1007/978-3-540-75987-4_13 dblp:conf/dbpl/ReS07 fatcat:k5uba4wocjfrhettqn3kccewoe