TY - GEN
T1 - The open connectome project data cluster
T2 - 25th International Conference on Scientific and Statistical Database Management, SSDBM 2013
AU - Burns, Randal
AU - Roncal, William Gray
AU - Kleissas, Dean
AU - Lillaney, Kunal
AU - Manavalan, Priya
AU - Perlman, Eric
AU - Berger, Daniel R.
AU - Bock, Davi D.
AU - Chung, Kwanghun
AU - Grosenick, Logan
AU - Kasthuri, Narayanan
AU - Weiler, Nicholas C.
AU - Deisseroth, Karl
AU - Kazhdan, Michael
AU - Lichtman, Jeff
AU - Reid, R. Clay
AU - Smith, Stephen J.
AU - Szalay, Alexander S.
AU - Vogelstein, Joshua T.
AU - Vogelstein, R. Jacob
PY - 2013
Y1 - 2013
N2 - We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes- neural connectivity maps of the brain-using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems-reads to parallel disk arrays and writes to solid-state storage-to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effectiveness of spatial data organization.
AB - We describe a scalable database cluster for the spatial analysis and annotation of high-throughput brain imaging data, initially for 3-d electron microscopy image stacks, but for time-series and multi-channel data as well. The system was designed primarily for workloads that build connectomes- neural connectivity maps of the brain-using the parallel execution of computer vision algorithms on high-performance compute clusters. These services and open-science data sets are publicly available at openconnecto.me. The system design inherits much from NoSQL scale-out and data-intensive computing architectures. We distribute data to cluster nodes by partitioning a spatial index. We direct I/O to different systems-reads to parallel disk arrays and writes to solid-state storage-to avoid I/O interference and maximize throughput. All programming interfaces are RESTful Web services, which are simple and stateless, improving scalability and usability. We include a performance evaluation of the production system, highlighting the effectiveness of spatial data organization.
UR - http://www.scopus.com/inward/record.url?scp=84882936998&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84882936998&partnerID=8YFLogxK
U2 - 10.1145/2484838.2484870
DO - 10.1145/2484838.2484870
M3 - Conference contribution
AN - SCOPUS:84882936998
SN - 9781450319218
T3 - ACM International Conference Proceeding Series
BT - SSDBM 2013 - Proceedings of the 25th International Conference on Scientific and Statistical Database Management
Y2 - 29 July 2013 through 31 July 2013
ER -