Serialization

Topic Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1109/ICCV.2009.5459148•

Building Rome in a day

[...]

Sameer Agarwal¹, Noah Snavely², Ian Simon¹, Steven M. Seitz¹, Richard Szeliski³ - Show less +1 more•Institutions (3)

University of Washington¹, Cornell University², Microsoft³

1 Sep 2009

TL;DR: A system that can match and reconstruct 3D scenes from extremely large collections of photographs such as those found by searching for a given city on Internet photo sharing sites and is designed to scale gracefully with both the size of the problem and the amount of available computation.

...read moreread less

Abstract: We present a system that can match and reconstruct 3D scenes from extremely large collections of photographs such as those found by searching for a given city (e.g., Rome) on Internet photo sharing sites. Our system uses a collection of novel parallel distributed matching and reconstruction algorithms, designed to maximize parallelism at each stage in the pipeline and minimize serialization bottlenecks. It is designed to scale gracefully with both the size of the problem and the amount of available computation. We have experimented with a variety of alternative algorithms at each stage of the pipeline and report on which ones work best in a parallel computing environment. Our experimental results demonstrate that it is now possible to reconstruct cities consisting of 150K images in less than a day on a cluster with 500 compute cores.

...read moreread less

1,999 citations

Journal Article•10.5334/JORS.148•

xarray: N-D labeled arrays and datasets in Python

[...]

Stephan Hoyer¹, Joseph Hamman²•Institutions (2)

Google¹, University of Washington²

05 Apr 2017-Journal of open research software

TL;DR: This approach combines an application programing interface (API) inspired by pandas with the Common Data Model for self-described scientific data to provide a toolkit and data structures for N-dimensional labeled arrays.

...read moreread less

Abstract: xarray is an open source project and Python package that provides a toolkit and data structures for N-dimensional labeled arrays. Our approach combines an application programing interface (API) inspired by pandas with the Common Data Model for self-described scientific data. Key features of the xarray package include label-based indexing and arithmetic, interoperability with the core scientific Python packages (e.g., pandas, NumPy, Matplotlib), out-of-core computation on datasets that don’t fit into memory, a wide range of serialization and input/output (I/O) options, and advanced multi-dimensional data manipulation tools such as group-by and resampling. xarray, as a data model and analytics toolkit, has been widely adopted in the geoscience community but is also used more broadly for multi-dimensional data analysis in physics, machine learning and finance.

...read moreread less

1,313 citations

Journal Article•10.1145/1071610.1071615•

Making snapshot isolation serializable

[...]

Alan Fekete¹, Dimitrios Liarokapis², Elizabeth O'Neil², Patrick O'Neil², Dennis Shasha³ - Show less +1 more•Institutions (3)

University of Sydney¹, University of Massachusetts Amherst², Courant Institute of Mathematical Sciences³

01 Jun 2005-ACM Transactions on Database Systems

TL;DR: A theory is developed that characterizes when nonserializable executions of applications can occur under Snapshot Isolation, and it is applied to demonstrate that the TPC-C benchmark application has no serialization anomalies under SI, and how this demonstration can be generalized to other applications.

...read moreread less

Abstract: Snapshot Isolation (SI) is a multiversion concurrency control algorithm, first described in Berenson et al. [1995]. SI is attractive because it provides an isolation level that avoids many of the common concurrency anomalies, and has been implemented by Oracle and Microsoft SQL Server (with certain minor variations). SI does not guarantee serializability in all cases, but the TPC-C benchmark application [TPC-C], for example, executes under SI without serialization anomalies. All major database system products are delivered with default nonserializable isolation levels, often ones that encounter serialization anomalies more commonly than SI, and we suspect that numerous isolation errors occur each day at many large sites because of this, leading to corrupt data sometimes noted in data warehouse applications. The classical justification for lower isolation levels is that applications can be run under such levels to improve efficiency when they can be shown not to result in serious errors, but little or no guidance has been offered to application programmers and DBAs by vendors as to how to avoid such errors. This article develops a theory that characterizes when nonserializable executions of applications can occur under SI. Near the end of the article, we apply this theory to demonstrate that the TPC-C benchmark application has no serialization anomalies under SI, and then discuss how this demonstration can be generalized to other applications. We also present a discussion on how to modify the program logic of applications that are nonserializable under SI so that serializability will be guaranteed.

...read moreread less

391 citations

Journal Article•10.3233/ISU-2010-0613•

The anatomy of a nanopublication

[...]

Paul Groth¹, Andrew P. Gibson¹, Jan Velterop•Institutions (1)

University of Amsterdam¹

01 Jan 2010-Information services & use

TL;DR: This document presents a model of nanopublications along with a Named Graph/RDF serialization of the model and discusses the importance of aggregating nanopublication and the role that the Concept Wiki plays in facilitating it.

...read moreread less

Abstract: As the amount of scholarly communication increases, it is increasingly difficult for specific core scientific statements to be found, connected and curated. Additionally, the redundancy of these statements in multiple fora makes it difficult to determine attribution, quality and provenance. To tackle these challenges, the Concept Web Alliance has promoted the notion of nanopublications (core scientific statements with associated context). In this document, we present a model of nanopublications along with a Named Graph/RDF serialization of the model. Importantly, the serialization is defined completely using already existing community-developed technologies. Finally, we discuss the importance of aggregating nanopublications and the role that the Concept Wiki plays in facilitating it.

...read moreread less

378 citations

Patent•

Modeling and using computer resources over a heterogeneous distributed network using semantic ontologies

[...]

Eric T. Hillerbrand, Girish Padmalyan, Michelle Pauly, Jhong-Hee Park, Sujith Kumar, William S. Glasgow, Sambasiva Murakonda, Tim D. Sebel, Vivekanand Velmuran, Chitta Basu - Show less +6 more

10 Mar 2003

TL;DR: In this article, a method of computing to address a predetermined computing requirement involving access to and use of computer resources, where more than one resource is capable of addressing the computing requirement, is presented.

...read moreread less

Abstract: A method of computing to address a predetermined computing requirement involving access to and use of computer resources, where more than one resource is capable of addressing the computing requirement. The method includes steps of describing plural computer resources using a description language, thereby obtaining descriptions of the resources; arranging the descriptions in one or more semantic ontologies; accessing one or more of the ontologies to select a particular one of the plural resources as available and/or qualified for addressing the computing requirement; and executing a computing process that utilizes the selected one of the plural resources to satisfy the computing requirement. The disclosed method involves use of description language and the computer resources comprise web services. The describing step involves identifying attributes of the computer-accessible resources. The attributes of the computer resources are selected from the group comprising but not limited to: message formatting for the computer resources, transport mechanisms associated with the computer resources, protocols associated with the computer resources, type serialization associated with the computer resources, and invocation requirements of the computer resources. In further accordance with the method, the invocation requirements of the computer resources may be selected from the group comprising: HTTP, SOAP, CORBA, JAVA RMI, and equivalent computer invocation specifications.

...read moreread less

364 citations

...

Expand

Year	Papers
2026	2
2025	48
2024	47
2023	83
2022	178
2021	74

Topic Tools

Papers published on a yearly basis

Papers

Building Rome in a day

xarray: N-D labeled arrays and datasets in Python

Making snapshot isolation serializable

The anatomy of a nanopublication

Modeling and using computer resources over a heterogeneous distributed network using semantic ontologies

Related Topics (5)

Performance Metrics