Applied Parallel Computing: State of the Art in Scientific Computing. 8th International Workshop, PARA 2006, Umea, Sweden, June 18-21, 2006, Revised Selected Papers

Pirmais vāks
Bo Kagström, Erik Elmroth, Jack Dongarra, Jerzy Wasniewski
Springer, 2007. gada 22. sept. - 1192 lappuses

This book constitutes the thoroughly refereed post-proceedings of the 8th International Workshop on Applied Parallel Computing, PARA 2006. It covers partial differential equations, parallel scientific computing algorithms, linear algebra, simulation environments, algorithms and applications for blue gene/L, scientific computing tools and applications, parallel search algorithms, peer-to-peer computing, mobility and security, algorithms for single-chip multiprocessors.

No grāmatas satura

Saturs

Using Noncanonical Array Layouts in Dense Matrix Operations
580
New Data Distribution for Solving Triangular Systems on Distributed Memory Machines
589
The Design of a New OutofCore Multifrontal Solver
598
Cholesky Factorization of Band Matrices Using Multithreaded BLAS
608
A MasterWorker Type Eigensolver for Molecular Orbital Computations
617
Minisymposium Abstract
626
Using Parallel Computing and Grid Systems for Genetic Mapping of Quantitative Traits
627
Partial Approximation of the Master Equation by the FokkerPlanck Equation
637

Coarse Graining Biomolecular Systems
73
Molecular Dynamics Simulation Studies on the Modulation of Vitamin D Receptor Activity by Agonists and Antagonists
82
Sparse Matrix Algebra for Quantum Modeling of Large Systems
90
A Highly Efficient Ab Initio TightBindingLike Approximate DensityFunctional Quantum Mechanical Method
100
Protein Folding Properties from Molecular Dynamics Simulations
109
Minisymposium Abstract
116
Parallel Variants of the Multishift QZ Algorithm with Advanced Deflation Techniques
117
Parallel Algorithms and Condition Estimators for Standard and Generalized Triangular SylvesterType Matrix Equations
127
LAPACKStyle Codes for Pivoted Choleskyand QR Updating
137
Implementing Linear Algebra Routines on Multicore Processors with Pipelining and a Look Ahead
147
Specialized Spectral Division Algorithms for Generalized Eigenproblems Via the InverseFree Iteration
157
Minisymposium Abstract
167
Some Experiences on the Accuracy and Parallel Performance of OpenFOAM for CFD in Water Turbines
168
Minisymposium Abstract
177
Trusting Floating Point Benchmarks Are Your Benchmarks Really Data Independent?
178
CoMPI Configuration of Collective Operations in LAMMPI Using the Scheme Programming Language
189
A Load Balancing Strategy for Computations on Large ReadOnly Data Sets
198
Automatic and Transparent Optimizations of an Applications MPI Communication
208
Parallel Methods for RealTime Visualization of Snow
218
Support for Collaboration Visualization and Monitoring of Parallel Applications Using Shared Windows
228
Minisymposium Abstract
239
Multilevel muFinite Element Analysis for Human Bone Structures
240
HighLevel User Interfaces for the DOE ACTS Collection
251
HighPerformance Graph Algorithms from Parallel Sparse Matrices
260
A Python Module for PDEBased Numerical Modelling
270
Architecture for a ComponentBased Scientific Computing System
280
Workload Characterization Using the TAU Performance System
289
Minisymposium Abstract
297
Supporting SLA Negotiation for QoSEnabled Simulation Services in a Medical Grid Environment
298
A Transparent Grid Filesystem
309
Grid Data Integration Based on Schema Mapping
319
Minisymposium Abstract
329
Parallelization of a Public Image Restoration Algorithm
330
Visualizing Katrina Merging Computer Simulations with Observations
340
Generation of Microlensing Magnification Patterns with High Performance Computing Techniques
351
Phase Space Modulations in Magnetised Plasmas by a Mildly Relativistic TwoStream Instability
361
Implementing a ParticleFluid Model of Auroral Electrons
371
Minisymposium Abstract
380
Automatic Tuning in Computational Grids
381
Automated Performance Analysis Using ASL Performance Properties
390
A Parallel TraceData Interface for Scalable Performance Analysis
398
Search of Performance Inefficiencies in Message Passing Applications with KappaPI 2 Tool
409
Automatic Monitoring of Memory Hierarchies in Threaded Applications with AMEBA
420
Visualization of Repetitive Patterns in Event Traces
430
Optimization of Instrumentation in Parallel Performance Evaluation Tools
440
GASP A Standardized Performance Analysis Tool Interface for Global Address Space Programming Models
450
Integrated Runtime Measurement Summarisation and Selective Event Tracing for Scalable Parallel Execution Performance Diagnosis
460
Minisymposium Abstract
470
Roadmap for the ARC Grid Middleware
471
Data Management for the Worlds Largest Machine
480
Metacomputations on the CLUSTERIX Grid
489
Minisymposium Abstract
501
A Benchmarking Study
502
Mechanical Properties of Random Alloys from Quantum Mechanical Simulations
510
Minisymposium Abstract
520
Cache Oblivious Matrix Operations Using Peano Curves
521
Recursive Blocked Algorithms for Solving Periodic Triangular SylvesterType Matrix Equations
531
Minimal Data Copy for Dense Linear Algebra Factorization
540
Three Algorithms for Cholesky Factorization on Distributed Memory Using Packed Storage
550
InPlace Transposition of Rectangular Matrices
560
Rectangular Full Packed Format for LAPACK Algorithms Timings on Several Computers
570
Design Construction and Use of the FISH Server
647
Minisymposium Abstract
658
Interactive Volume Visualization of Fluid Flow Simulation Data
659
Minisymposium Abstract
665
The Iterative Solver Template Library
666
A Parallel CFD Code for the Simulation of Euler and NavierStokes Problems on Unstructured Grids
676
Making Hybrid Tsunami Simulators in a Parallel Software Framework
686
A Problem Solving Environment for Developing Parallel FEM Applications
694
SyFi An Element Matrix Factory
703
Overview of a Unified C++ Implementation of the Finite and Spectral Element Methods in 1D 2D and 3D
712
Parallel Program Complex for 3D Unsteady Flow Simulation
722
Minisymposium Abstract
732
Simulation of Multiphysics Problems Using Adaptive Finite Elements
733
A New Domain Decomposition Approach Suited for Grid Computing
744
Parallelization of the Mesh Refinement Algorithm of the FDEM Program Package
754
Load Balancing for the Numerical Solution of the NavierStokes Equations
764
A Method of Adaptive Coarsening for Compressing Scientific Datasets
774
A Computational Framework for Topological Operations
781
Applications of Grid Computing in Genetics and Proteomics
791
Grid Enabling Your Data Resources with OGSADAI
799
Enabling Streaming for the New Generation of Grids
809
Dynamic Clusters Available Under Clusterix Grid
819
AgentBased Societies for the Sharing Brokerage and Allocation of Grid Resources
830
OpusIB Grid Enabled Opteron Cluster with InfiniBand Interconnect
840
Extending the HPCICTM Geographical Categorization Model for Grid Computing
850
An EasytoUse Interface for MPIBased Parallel Matrix Computation Libraries
860
A Study of Real World IO Performance in Parallel Scientific Computing
871
Epitaxial Surface Growth with Local Interaction Parallel and Nonparallel Simulations
882
Data Dependence Analysis for the Parallelization of Numerical Tree Codes
890
Efficient Assembly of Sparse Matrices Using Hashing
900
A WebSiteBased Partitioning Technique for Reducing Preprocessing Overhead of Parallel PageRank Computation
908
Is CacheOblivious DGEMM Viable?
919
Partitioning and Blocking Issues for a Parallel Incomplete Factorization
929
Automatic Performance Tuning for the Multisection with Multiple Eigenvalues Method for Symmetric Tridiagonal Eigenproblems
938
Optimizing a Parallel Selfverified Method for Solving Linear Systems
949
A Parallel Block Iterative Method for Interactive Contacting Rigid Multibody Simulations on Multicore PCs
956
HighPerformanceDistributedMemory Solvers for Python
966
A Set of General Interfaces to Sparse Direct Solver Libraries
976
dSpline Based Incremental ParameterEstimation in Automatic Performance Tuning
986
A High Performance Generic Scientific Simulation Environment
996
NestStepModelica Mathematical Modeling and BulkSynchronous Parallel Simulation
1006
Flecs a Flexible Coupling Shell Application to FluidStructure Interaction
1016
New Scalability Frontiers in Ab Initio Electronic Structure Calculations Using the BGL Supercomputer
1026
A PAPI Implementation for BlueGene
1036
Numerical Simulation of 3D Acoustic Logging
1045
An Improved Interval Global Optimization Method and Its Application to Price Management Problem
1055
Optimizing Neural Network Classifiers with ROOT on a Rocks Linux Cluster
1065
A Model of SmallScale Turbulence for Use in the PPM Gas Dynamics Scheme
1074
Mapping in Heterogeneous Systems with Heuristic Methods
1084
Scalable Fine Grained Development for Large Coarse Grained Platforms
1094
Performance Analysis of Two Parallel GameTree Search Applications
1105
Implementation of a BlackBox Global Optimization Algorithm with a Parallel Branch and Bound Template
1115
Parallelization Techniques for Tabu Search
1126
A Selfreconfigurable Topology for Unstructured P2P Systems
1136
A Middleware for Job Distribution in PeertoPeer Networks
1147
Mobile Agents Selfoptimization with MAWeS
1158
Performance Impact of Resource Conflicts on Chip Multiprocessor Servers
1168
An Implementation of Parallel 1D FFT Using SSE3 Instructions on DualCore Processors
1178
Author Index
1189
Autortiesības

Citi izdevumi - Skatīt visu

Bieži izmantoti vārdi un frāzes

Populāri fragmenti

548. lappuse - Anderson, E., Bai, Z., Bischof, C., Demmel, J., Dongarra, J., Du Croz, J.. Greenbaum, A., Hammarling, S., McKenney, A., Ostrouchov, S., and Sorensen, D.
511. lappuse - The width of the stacking fault ribbon is of importance in many aspects of plasticity, as in the case of dislocation intersection or cross-slip. In both cases, the two partial dislocations have to be brought together to form an unextended dislocation before intersection or cross-slip can occur [4].
624. lappuse - These intervals include the energy levels of the highest occupied molecular orbitals (HOMO) and the lowest unoccupied molecular orbitals (LUMO), which are key factors in the amount of energy needed to add or remove electrons in a molecule.
586. lappuse - To copy or not to copy: A compile-time technique for assessing when data copying should be used to eliminate cache conflicts.
237. lappuse - Humphreys, G., Houston, M., Ng, R., Frank, R., Ahern, S., Kirchner, PD, Klosowski, JT: Chromium: A stream-processing framework for interactive rendering on clusters. In: SIGGRAPH '02: Proceedings of the 29th annual conference on Computer graphics and interactive techniques, pp.
299. lappuse - The remainder of this paper is structured as follows: Section 2 provides...
258. lappuse - Research, Division of Mathematical, Information, and Computational Sciences of the US Department of Energy under contract...

Bibliogrāfiskā informācija