Scispace (Formerly Typeset)
  1. Home
  2. Conferences
  3. Algorithm Engineering and Experimentation
  4. 2007
  1. Home
  2. Conferences
  3. Algorithm Engineering and Experimentation
  4. 2007
Showing papers presented at "Algorithm Engineering and Experimentation in 2007"
Proceedings Article•
Practical entropy-compressed rank/select dictionary

[...]

Daisuke Okanohara1, Kunihiko Sadakane2•
University of Tokyo1, Kyushu University2
6 Jan 2007
TL;DR: Four novel Rank/Select dictionaries are proposed: esp, recrank, vcode and sdarray, each of which is small if the number of elements in S is small, and indeed close to nH0(S) (H0 (S) ≤ 1 is the zero-th order empirical entropy of S) in practice.
Abstract: Rank/Select dictionaries are data structures for an ordered set S ⊂ {0,1, . . ., n − 1} to compute rank(x, S) (the number of elements in S that are no greater than x), and select(i, S) (the i-th smallest element in S), which are the fundamental components of succinct data structures of strings, trees, graphs, etc.. In these data structures, however, only asymptotic behavior has been considered and their performance for real data is not satisfactory. In this paper, we propose four novel Rank/Select dictionaries: esp, recrank, vcode and sdarray, each of which is small if the number of elements in S is small, and indeed close to nH0(S) (H0(S) ≤ 1 is the zero-th order empirical entropy of S) in practice. Furthermore, their query times are superior to those of existing structures. Experimental results reveal the characteristics of our data structures and also show that these data structures are superior to existing implementations, both in terms of size and query time.

298 citations

Proceedings Article•
Computing many-to-many shortest paths using highway hierarchies

[...]

Sebastian Knopp1, Peter Sanders2, Dominik Schultes2, Frank Schulz1, Dorothea Wagner2 •
PTV Planung Transport Verkehr1, Karlsruhe Institute of Technology2
6 Jan 2007
TL;DR: A fast algorithm for computing all shortest paths between source nodes s ∈ S and target nodes t ∈ T, based on highway hierarchies, which is also used for the currently fastest speedup techniques for shortest path queries in road networks.
Abstract: We present a fast algorithm for computing all shortest paths between source nodes s ∈ S and target nodes t ∈ T. This problem is important as an initial step for many operations research problems (e.g., the vehicle routing problem), which require the distances between S and T as input. Our approach is based on highway hierarchies, which are also used for the currently fastest speedup techniques for shortest path queries in road networks. We show how to use highway hierarchies so that for example, a 10 000 × 10 000 distance table in the European road network can be computed in about one minute. These results are based on a simple basic idea, several refinements, and careful engineering of the approach. We also explain how the approach can be parallelized and how the computation can be restricted to computing only the k closest connections.

115 citations

Book Chapter•10.1137/1.9781611972870.3•
An experimental study of a parallel shortest path algorithm for solving large-scale graph instances

[...]

Kamesh Madduri1, David A. Bader1, Jonathan W. Berry2, Joseph R. Crobak3•
Georgia Institute of Technology1, Sandia National Laboratories2, Rutgers University3
6 Jan 2007
TL;DR: These are the first performance results of a shortest path problem on realistic graph instances in the order of billions of vertices and edges, and the implementation exhibits remarkable parallel speedup when compared with competitive sequential algorithms, for low-diameter sparse graphs.
Abstract: We present an experimental study of the single source shortest path problem with non-negative edge weights (NSSP) on large-scale graphs using the Δ-stepping parallel algorithm. We report performance results on the Cray MTA-2, a multithreaded parallel computer. The MTA-2 is a high-end shared memory system offering two unique features that aid the efficient parallel implementation of irregular algorithms: the ability to exploit fine-grained parallelism, and low-overhead synchronization primitives. Our implementation exhibits remarkable parallel speedup when compared with competitive sequential algorithms, for low-diameter sparse graphs. For instance, Δ-stepping on a directed scale-free graph of 100 million vertices and 1 billion edges takes less than ten seconds on 40 processors of the MTA-2, with a relative speedup of close to 30. To our knowledge, these are the first performance results of a shortest path problem on realistic graph instances in the order of billions of vertices and edges.

112 citations

Proceedings Article•
Locating guards for visibility coverage of polygons

[...]

Yoav Amit1, Joseph S. B. Mitchell1, Eli Packer1•
Stony Brook University1
6 Jan 2007
TL;DR: Experimental evidence is given that the heuristics proposed give a provably optimal result, while in other cases there is only a small gap between the computed upper and lower bounds on the optimal guard number.
Abstract: We propose heuristics for visibility coverage of a polygon with the fewest point guards. This optimal coverage problem, often called the "art gallery problem", is known to be NP-hard, so most recent research has focused on heuristics and approximation methods. We evaluate our heuristics through experimentation, comparing the upper bounds on the optimal guard number given by our methods with computed lower bounds based on heuristics for placing a large number of visibility-independent "witness points". We give experimental evidence that our heuristics perform well in practice, on a large suite of input data; often the heuristics give a provably optimal result, while in other cases there is only a small gap between the computed upper and lower bounds on the optimal guard number.

64 citations

Book Chapter•10.1137/1.9781611972870.1•
Improved external memory BFS implementations

[...]

Deepak Ajwani1, Ulrich Meyer1, Vitaly Osipov1•
Max Planck Society1
6 Jan 2007
TL;DR: The implementation of the deterministic variant of MM_BFS is presented and it is shown that in most cases, it outperforms the randomized variant, and the running time for BFS traversal is improved with a heuristic that preserves the worst case guarantees of MM-BFS.
Abstract: Breadth first search (BFS) traversal on massive graphs in external memory was considered non-viable until recently, because of the large number of I/Os it incurs. Ajwani et al. [3] showed that the randomized variant of the o(n) I/O algorithm of Mehlhorn and Meyer [24] (MM_BFS) can compute the BFS level decomposition for large graphs (around a billion edges) in a few hours for small diameter graphs and a few days for large diameter graphs. We improve upon their implementation of this algorithm by reducing the overhead associated with each BFS level, thereby improving the results for large diameter graphs which are more difficult for BFS traversal in external memory. Also, we present the implementation of the deterministic variant of MM_BFS and show that in most cases, it outperforms the randomized variant. The running time for BFS traversal is further improved with a heuristic that preserves the worst case guarantees of MM_BFS. Together, they reduce the time for BFS on large diameter graphs from days shown in [3] to hours. In particular, on line graphs with random layout on disks, our implementation of the deterministic variant of MM_BFS with the proposed heuristic is more than 75 times faster than the previous best result for the randomized variant of MM_BFS in [3].

48 citations

Proceedings Article•
Computing visibility on terrains in external memory

[...]

Herman Haverkort1, Laura Toma2, Yi Zhuang2•
Eindhoven University of Technology1, Bowdoin College2
6 Jan 2007
TL;DR: In this article, the authors describe a novel application of the distribution sweeping technique to computing visibility on terrains, where the basic problem is computing the visibility map or viewshed of v, which is the set of points in the terrain that are visible from v.
Abstract: We describe a novel application of the distribution sweeping technique to computing visibility on terrains. Given an arbitrary viewpoint v, the basic problem we address is computing the visibility map or viewshed of v, which is the set of points in the terrain that are visible from v. We give the first I/O-efficient algorithm to compute the viewshed of v on a grid terrain in external memory. Our algorithm is based on Van Kreveld's O(n lg n) time algorithm for the same problem in internal memory. It uses O(sort(n)) I/Os, where sort(n) is the complexity of sorting n items of data in the I/O-model. We present an implementation and experimental evaluation of the algorithm. Our implementation clearly outperforms the previous (in-memory) algorithms and can compute visibility for terrains of up to 4GB in a few hours on a low-cost machine.

46 citations

Proceedings Article•
Faster filters for approximate string matching

[...]

Juha Kärkkäinen1, Joong Chae Na1•
University of Helsinki1
6 Jan 2007
TL;DR: This work introduces a new filtering method for approximate string matching called the suffix filter, which has some similarity with well-known filtration algorithms, which it is demonstrated experimentally that suffix filters are faster in practice than factor filters.
Abstract: We introduce a new filtering method for approximate string matching called the suffix filter. It has some similarity with well-known filtration algorithms, which we call factor filters, and which are among the best practical algorithms for approximate string matching using a text index. Suffix filters are stronger, i.e., produce fewer false matches than factor filters. We demonstrate experimentally that suffix filters are faster in practice, too.

37 citations

Proceedings Article•
Algorithms to take advantage of hardware prefetching

[...]

Shen Pan1, Cary Cherng2, Kevin Dick3, Richard E. Ladner4•
Amazon.com1, Google2, California Institute of Technology3, University of Washington4
6 Jan 2007
TL;DR: It is shown that hardwarePrefetching permits the standard Floyd-Warshall algorithm for all-pairs shortest paths to outperform cache-oblivious and cache-aware algorithms and variants of standard FFT algorithms exhibit good prefetching performance.
Abstract: Cache-oblivious and cache-aware algorithms have been developed to minimize cache misses. Some of the newest processors have hardware prefetching where cache misses are avoided by predicting ahead of time what memory will be needed in the future and bringing that memory into the cache before it is used. It is shown that hardware prefetching permits the standard Floyd-Warshall algorithm for all-pairs shortest paths to outperform cache-oblivious and cache-aware algorithms. A simple improvement to the standard simple dynamic programming algorithm yields an algorithm that takes advantage of prefetching, and outperforms cache-oblivious and cache-aware algorithms. Finally, it is shown that variants of standard FFT algorithms exhibit good prefetching performance.

11 citations

Proceedings Article•
ScrewBox: a randomized certifying graph- non -isomorphism algorithm

[...]

Martin Kutz1, Pascal Schweitzer1•
Max Planck Society1
6 Jan 2007
TL;DR: A novel randomized approach to the graph isomorphism problem that aims at solving difficult instances by producing randomized certificates for non-isomorphism, and introduces a new concept of regularity for graphs which is meant to capture the computational hardness of isomorphicism problems on graphs.
Abstract: We present a novel randomized approach to the graph isomorphism problem. Our algorithm aims at solving difficult instances by producing randomized certificates for non-isomorphism. We compare our implementation to the de facto standard nauty. On many of the hardest known instances, the incidence graphs of finite projective planes, our program is considerably faster than nauty. However, it is inherent to our approach that it performs better on pairs of non-isomorphic graphs than on isomorphic instances. Our algorithm randomly samples substructures in the given graphs in order to detect dissimilarities between them. The choice of the sought-after structures as well as the tuning of the search process is dynamically adapted during the sampling. Eventually, a randomized certificate is produced by which the user can verify the non-isomorphism of the input graphs. As a byproduct of our approach, we introduce a new concept of regularity for graphs which is meant to capture the computational hardness of isomorphism problems on graphs.

5 citations

Proceedings Article•
Maximizing throughput in minimum rounds in an application-level relay service

[...]

Fred S. Annexstein1, Kenneth A. Berman1, Svetlana Strunjaš1, Chad Yoshikawa1•
University of Cincinnati1
6 Jan 2007
TL;DR: This paper presents an algorithm, called Aggressive Increase, AAI which improves its competitive ratio in each time round that the network conditions persists, and typically reaches close to peak bandwidth levels within only a small constant (< 10) number of rounds.
Abstract: Application-level network relays possess many desirable properties, including support for communication between disconnected clients, increasing bandwidth between distant clients, and enabling routing around Internet failures. One problem not considered by existing systems is how to assign client load to relay servers in order to maximize throughput of the relay-system. In this paper, we are interested in the particular case where network conditions change frequently so that the ability of clients to adapt flow is restricted and each round of activity is critical. To this end, we present an algorithm, called Aggressive Increase, AAI which improves its competitive ratio in each time round that the network conditions persists. Given a relay network where a client connects to at most N servers, if network conditions persist for log(N) rounds then the algorithm's throughput becomes constant competitive. Our results improve upon the competitive ratio of previous work (of Awerbuch, Hajiaghayi, Kleinberg, and Leighton [2]). In addition we show that the AAI algorithm performs well in simulation studies as compared with the algorithm of [2] and an adaptation of the multiplicative increase algorithm of [8]. On a variety of input graphs, we show that the AAI algorithm typically reaches close to peak bandwidth levels within only a small constant (< 10) number of rounds.

2 citations

Book Chapter•10.1137/1.9781611972870.10•
Linearization: locally self-stabilizing sorting in graphs

[...]

Melih Onus1, Andréa W. Richa1, Christian Scheideler2•
Arizona State University1, Technische Universität München2
6 Jan 2007
TL;DR: This paper considers the problem of designing a distributed algorithm that, given an arbitrary connected graph G of nodes with unique labels, converts G into a sorted list of nodes and proposes several extensions of the linearization technique.
Abstract: We consider the problem of designing a distributed algorithm that, given an arbitrary connected graph G of nodes with unique labels, converts G into a sorted list of nodes. This algorithm should be as simple as possible and, for scalability, should guarantee a polylogarithmic runtime as well as at most a polylogarithmic increase in the degree of each node during its execution. Furthermore, it should be self-stabilizing, that is, it should be able to eventually construct a sorted list from any state in which the graph is connected. It turns out that satisfying all of these demands at the same time is not easy. Our basic approach towards this goal is the so-called linearization technique: each node v repeatedly does the following with its neighbors: • for its left (i.e., smaller) neighbors u1, . . ., uk in the order of decreasing labels, v replaces {v, u1}, . . ., {v, uk} by {v, u1}, {u1, u2}, . . ., {uk−1, uk}, and • for its right (i.e., larger) neighbors w1, . . ., we in the order of increasing labels, v replaces {v, w1}, . . ., {v, we} by {v, w1}, {w1, w2}, . . ., {we−1, we}. As shown in this paper, this technique transforms any connected graph into a sorted list, but there are graphs for which this can take a long time. Hence, we propose several extensions of the linearization technique and experimentally evaluate their performance. Our results indicate that some of these have a polylogarithmic performance, so there is hope that there are distributed algorithms that can achieve all of our goals above.
Proceedings Article•
0/1 vertex and facet enumeration with BDDs

[...]

Markus Behle1, Friedrich Eisenbrand2•
Max Planck Society1, University of Paderborn2
6 Jan 2007
TL;DR: The novelty of the algorithms is the incorporation of binary decision diagrams (BDDs), a datastructure which has become very popular and effective in hardware verification and computational logic.
Abstract: In polyhedral studies of 0/1 polytopes two prominent problems exist. One is the vertex enumeration problem: Given a system of inequalities, enumerate its feasible 0/1 points. Another one is the convex hull problem: Given a set of 0/1 points in dimension d, enumerate the facets of the corresponding polytope. We present two new approaches for both problems. The novelty of our algorithms is the incorporation of binary decision diagrams (BDDs), a datastructure which has become very popular and effective in hardware verification and computational logic. Our computational results show the strength of our methods. We introduce our new tool azove which is currently the fastest software for counting and enumerating 0/1 points in a polytope.
Book Chapter•10.1137/1.9781611972870.5•
In transit to constant time shortest-path queries in road networks

[...]

Holger Bast1, Stefan Funke1, Domagoj Matijevic1, Peter Sanders2, Dominik Schultes2 •
Max Planck Society1, Karlsruhe Institute of Technology2
6 Jan 2007
TL;DR: In this article, the authors develop an algorithmic approach to reduce quickest-path queries in road networks to a small number of table lookups, based on a simple grid data structure and highway hierarchies.
Abstract: When you drive to somewhere 'far away', you will leave your current location via one of only a few 'important' traffic junctions. Starting from this informal observation, we develop an algorithmic approach---transit node routing---that allows us to reduce quickest-path queries in road networks to a small number of table lookups. We present two implementations of this idea, one based on a simple grid data structure and one based on highway hierarchies. For the road map of the United States, our best query times improve over the best previously published figures by two orders of magnitude. Our results exhibit various trade-offs between average query time (5 μs to 63 μs), preprocessing time (59 min to 1200 min), and storage overhead (21 bytes/node to 244 bytes/node).
Proceedings Article•
Engineering an efficient canonical labeling tool for large and sparse graphs

[...]

Tommi Junttila1, Petteri Kaski2•
Helsinki University of Technology1, University of Helsinki2
6 Jan 2007
TL;DR: Within the general framework of backtracking algorithms based on individualization and refinement, data structures, subroutines, and pruning heuristics especially for fast handling of large and sparse graphs are developed.
Abstract: The problem of canonically labeling a graph is studied. Within the general framework of backtracking algorithms based on individualization and refinement, data structures, subroutines, and pruning heuristics especially for fast handling of large and sparse graphs are developed. Experiments indicate that the algorithm implementation in most cases clearly outperforms existing state-of-the-art tools.
Proceedings Article•
Intersection in integer inverted indices

[...]

Peter Sanders1, Frederik Transier•
Karlsruhe Institute of Technology1
6 Jan 2007
TL;DR: The previous theoretical approaches, methods used in practice, and one new algorithm which exploits that the intersection uses small integer keys are compared, which is the only algorithm that performs well over the entire spectrum of relative list length ratios.
Abstract: Inverted index data structures are the key to fast search engines. The predominant operation on inverted indices asks for intersecting two sorted lists of document IDs which might have vastly varying lengths. We compare previous theoretical approaches, methods used in practice, and one new algorithm which exploits that the intersection uses small integer keys. We also take different data compression techniques into account. The new algorithm is very fast, simple, has good space efficiency, and is the only algorithm that performs well over the entire spectrum of relative list length ratios.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve