Publikationen

stilisiertes Bild vom Hochhaus
1 2 3

Die nachfolgende Liste der Publikationen ist dem Univis entnommen.

Hammer, Julian ; Hager, Georg ; Eitzinger, Jan ; Wellein, Gerhard:
Automatic Loop Kernel Analysis and Performance Modeling With Kerncraft.
In: - (Hrsg.) : Proceedings of the 6th International Workshop on Performance Modeling, Benchmarking, and Simulation of High Performance Computing Systems
(SC15 The International Conference for High Performance Computing, Networking, Storage and Analysis, Austin, TX, USA, November 15-20, 2015).
New York, NY, USA : ACM, 2015, S. 1-11. (Lecture Notes in Computer Science) - ISBN 978-1-4503-4009-0
[doi>10.1145/2832087.2832092] (BibTeX)

Shahzad, Faisal ; Kreutzer, Moritz ; Zeiser, Thomas ; Machado, R. ; Pieper, A. ; Hager, Georg ; Wellein, Gerhard:
Building a Fault Tolerant Application Using the GASPI Communication Layer.
In: - (Hrsg.) : Proceedings of FTS 2015
(the 1st International Workshop on Fault-Tolerant Systems, Chicago, IL, September 8, 2015).
in conjunction with IEEE Cluster 2015 : IEEE, 2015, S. 580-587.
[doi>10.1109/CLUSTER.2015.106] (BibTeX)

Wittmann, Markus ; Hager, Georg ; Zeiser, Thomas ; Treibig, Jan ; Wellein, Gerhard:
Chip-level and multi-node analysis of energy-optimized lattice Boltzmann CFD simulations.
In: Concurrency and Computation: Practice and Experience (2015), S. 1-5
[doi>10.1002/cpe.3489] (BibTeX)

Klawonn, A. ; Lanser, M. ; Rheinbach, O. ; Stengel, Holger ; Wellein, Gerhard:
Hybrid MPI/OpenMP Parallelization in FETI-DP Methods.
In: Mehl, M. et al. (Hrsg.) : Recent Trends in Computational Engineering - CE2014.
Bd. 105. - : Springer Link, 2015, (Lecture Notes in Computational Science and Engineering), S. 67-84. - ISBN 978-3-319-22997-3
[doi>10.1007/978-3-319-22997-3_4] (BibTeX)

Röhrig-Zöllner, M. ; Thies, J. ; Kreutzer, Moritz ; Alvermann, A. ; Pieper, A. ; Basermann, A. ; Hager, Georg ; Wellein, Gerhard ; Fehske, H.:
Increasing the performance of the Jacobi-Davidson method by blocking.
In: Siam Journal on Scientific Computing. SIAM Society for Industrial and Applied Mathematics. DLR Portal ISSN 1064-8275 (2015), S. 1-27 (BibTeX)

Malas, T. ; Hager, Georg ; Ltaief, H. ; Stengel, Holger ; Wellein, Gerhard ; Keyes, D.:
Multicore-optimized wavefront diamond blocking for optimizing stencil updates.
In: SIAM Journal on Scientific Computing 37 (2015), Nr. 4, S. C439-C464
Stichwörter: Software and High-Performance Computing
[doi>10.1137/140991133] (BibTeX)

Malas, T. ; Hager, Georg ; Ltaief, H. ; Stengel, Holger ; Wellein, Gerhard ; Keyes, D.:
Multicore-optimized wavefront diamond blocking for optimizing stencil updates.
In: SIAM Journal on Scientific Computing 37 (2015), Nr. 4, S. C439-C464
Stichwörter: Software and High-Performance Computing
[doi>10.1137/140991133] (BibTeX)

Hofmann, Johannes ; Fey, Dietmar ; Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Performance analysis of the Kahan-enhanced scalar product on current multicore processors.
In: - (Hrsg.) : Accepted for PPAM 2015
(the 11th International Conference on Parallel Processing and Applied Mathematics, Krakow, Poland, Sept. 3-6, 2015).
2015, S. 1-10. (BibTeX)

Kreutzer, Moritz ; Pieper, Andreas ; Hager, Georg ; Wellein, Gerhard ; Alvermann, Andreas ; Fehske, Holger:
Performance Engineering of the Kernel Polynomal Method on Large-Scale CPU-GPU Systems.
In: IEEE (Veranst.):
Proceedings of the 2015 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
(Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International, Hyderabad, May 25-29, 2015).
2015, S. 417-426.
[doi>10.1109/IPDPS.2015.76] (BibTeX)

Stengel, Holger ; Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Quantifying performance bottlenecks of stencil computations using the Execution-Cache-Memory model.
In: Proceedings of ICS15, the 29th International Conference on Supercomputing June 8-11 (2015), Nr. Newport Beach, CA, S. 1-10
[doi>10.1145/2751205.2751240] (BibTeX)

Stengel, Holger ; Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Quantifying performance bottlenecks of stencil computations using the Execution-Cache-Memory model.
In: Proceedings of ICS15, the 29th International Conference on Supercomputing June 8-11 (2015), Nr. Newport Beach, CA, S. 1-10
[doi>10.1145/2751205.2751240] (BibTeX)

Kreutzer, Moritz ; Hager, Georg ; Wellein, Gerhard ; Fehske, Holger ; Bishop, Alan R.:
A unified sparse matrix data format for efficient general sparse matrix-vector multiplication on modern processors with wide SIMD units.
In: SIAM Journal on Scientific Computing 36 (2014), Nr. 5, S. C401–C423
[doi>10.1137/130930352] (BibTeX)

Hofmann, Johannes ; Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Comparing the Performance of Different x86 SIMD Instruction Sets for a Medical Imaging Application on Modern Multi- and Manycore Chips.
In: - (Hrsg.) : Proceedings of the 2014 Workshop on Programming models for SIMD/Vector processing
(-, Orlando, USA, Feb 16, 2014).
New York, NY, USA : ACM, 2014, S. 57-64. - ISBN 978-1-4503-2653-7 (BibTeX)

Kronawitter, Stefan ; Stengel, Holger ; Hager, Georg ; Lengauer, Christian:
Domain-Specific Optimization of Two Jacobi Smoother Kernels and Their Evaluation in the ECM Performance Model.
In: Parallel Processing Letters 24 (2014), Nr. 3, S. 1-18
[doi>10.1142/S0129626414410047] (BibTeX)

Pieper, A. ; Heinisch, R. L. ; Wellein, Gerhard ; Fehske, H.:
Dot-bound and dispersive states in graphene quantum dot superlattices.
In: Phys. Rev. B 89 (2014), S. 165121
[doi>10.1103/PhysRevB.89.165121] (BibTeX)

Alvermann, Andreas ; Basermann, Achim ; Fehske, Holger ; Galgon, Martin ; Hager, Georg ; Kreutzer, Moritz ; Krämer, Lukas ; Lang, Bruno ; Pieper, Andreas ; Röhrig-Zöllner, Melven ; Shahzad, Faisal ; Jonas, Thies ; Wellein, Gerhard:
ESSEX: Equipping Sparse Solvers for Exascale.
In: Lopes, Luis (Hrsg.) : Euro-Par 2014: Parallel Processing Workshops.
Bd. 8806. Lecture Notes in Computer Science : SpringerLink, 2014, (Lecture Notes in Computer Science), S. 577-588. (BibTeX)

Hager, Georg ; Eitzinger, Jan ; Habich, Johannes ; Wellein, Gerhard:
Exploring performance and power properties of modern multicore chips via simple machine models.
In: Concurrency and Computation: Practice and Experience (2014),
[doi>10.1002/cpe.3180] (BibTeX)

Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Modeling and analyzing performance for highly optimized propagation steps of the lattice Boltzmann method on sparse lattices.
In: Submitted. Preprint (Computer & Fluids) (2014), S. 1
Stichwörter: Distributed, Parallel, and Cluster Computing (cs.DC) (BibTeX)

Malas, Tareq ; Hager, Georg ; Ltaief, Hatem ; Stengel, Holger ; Wellein, Gerhard ; Keyes, David:
Multicore-optimized wavefront diamond blocking for optimizing stencil updates.
In: Submitted. Preprint (Computer Science > Distributed, Parallel, and Cluster Computing) (2014), S. 1 (BibTeX)

Hofmann, Johannes ; Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Performance Engineering for a Medical Imaging Application on the Intel Xeon Phi Accelerator.
In: ARCS Workshops'14.
Lübeck, Germany
2014, S. 1-8. - ISBN 978-3-8007-3579-2 (BibTeX)

Malas, Tareq ; Hager, Georg ; Ltaief, Hatem ; Keyes, David:
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking.
In: Submitted. Preprint. Computer Science: arXiv:1410.5561 Oct (2014), S. 1-2 (BibTeX)

Shahzad, Faisal ; Wittmann, Markus ; Kreutzer, Moritz ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
A survey of checkpoint/restart techniques on distributed memory systems.
In: Parallel Processing Letters 23 (2013), Nr. 04, S. 1340011-1340030
[doi>10.1142/S0129626413400112] (BibTeX)

Shahzad, Faisal ; Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
An Evaluation of Different I/O Techniques for Checkpoint/Restart.
In: IEEE (Veranst.):
Parallel and Distributed Processing Symposium Workshops PhD Forum (IPDPSW), 2013 IEEE 27th International
(2013 IEEE 27th International Parallel and Distributed Processing Symposium Workshops & PhD Forum, Boston, MA, USA, May 20-24, 2013).
n.a. : IEEE Digital Library, 2013, S. 1708-1716.
Stichwörter: fault tolerance, asynchronous checkpointing, multi-stage checkpointing, checkpoint/restart, MPI
[doi>10.1109/IPDPSW.2013.145] (BibTeX)

Pieper, Andreas ; Schubert, Gerald ; Wellein, Gerhard ; Fehske, Holger:
Effects of disorder and contacts on transport through graphene nanoribbons.
In: Phys. Rev. B 88 (2013), S. 195409
[doi>10.1103/PhysRevB.88.195409] (BibTeX)

Shahzad, Faisal ; Wittmann, Markus ; Kreutzer, Moritz ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
PGAS implementation of SpMVM and LBM with GPI.
In: Weiland, M. ; Jackson, A. ; Johnson, N. (Hrsg.) : Proceedings of the 7th International Conference on PGAS Programming Models
(The 7th International Conference on PGAS Programming Models, Edinburgh, Scotland, UK, Oct. 3-4, 2013).
Edinburgh : The University of Edinburgh, 2013, S. 172-184. - ISBN 978-0-9926615-0-2 (BibTeX)

Shahzad, Faisal ; Wittmann, Markus ; Zeiser, Thomas ; Wellein, Gerhard:
Asynchronous Checkpointing by Dedicated Checkpoint Threads.
In: Träff, Jesper Larsson ; Benkner, Siegfried ; Dongarra, Jack J. (Hrsg.) : Recent Advances in the Message Passing Interface.
- : Springer, 2012, (Lecture Notes in Computer Science Bd. 7490), S. 289-290. - ISBN 978-3-642-33517-4
[doi>10.1007/978-3-642-33518-1_36] (BibTeX)

Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Comparison of Different Propagation Steps for Lattice Boltzmann Methods.
In: Computers & Mathematics with Applications (2012), S. 17
Stichwörter: Lattice Boltzmann method; Propagation step; A–A pattern; Performance; Implementation
[doi>10.1016/j.camwa.2012.05.002] (BibTeX)

Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Comparison of Different Propagation Steps for Lattice Boltzmann Methods.
In: Computers & Mathematics with Applications (2012), S. 17
Stichwörter: Lattice Boltzmann method; Propagation step; A–A pattern; Performance; Implementation
[doi>10.1016/j.camwa.2012.05.002] (BibTeX)

Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Domain Decomposition and Locality Optimization for Large-Scale Lattice Boltzmann Simulations.
In: Computers & Fluids (2012),
Stichwörter: Parallelization; Computational fluid dynamics; Lattice Boltzmann method; Domain decomposition
[doi>10.1016/j.compfluid.2012.02.007] (BibTeX)

Wittmann, Markus ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Domain Decomposition and Locality Optimization for Large-Scale Lattice Boltzmann Simulations.
In: Computers & Fluids (2012),
Stichwörter: Parallelization; Computational fluid dynamics; Lattice Boltzmann method; Domain decomposition
[doi>10.1016/j.compfluid.2012.02.007] (BibTeX)

Sembritzki, Klaus ; Hager, Georg ; Krammer, Bettina ; Eitzinger, Jan ; Wellein, Gerhard:
Evaluation of the Coarray Fortran Programming Model on the Example of a Lattice Boltzmann Code.
In: - (Hrsg.) : PGAS12
(The 6th Conference on Partitioned Global Address Space Programming Models, Santa Barbara, CA, USA, 10.-12.Oktober, 2012).
In Press : ACM Digital Library, 2012. (BibTeX)

Sembritzki, Klaus ; Hager, Georg ; Krammer, Bettina ; Eitzinger, Jan ; Wellein, Gerhard:
Evaluation of the Coarray Fortran Programming Model on the Example of a Lattice Boltzmann Code.
In: - (Hrsg.) : PGAS12
(The 6th Conference on Partitioned Global Address Space Programming Models, Santa Barbara, CA, USA, 10.-12.Oktober, 2012).
In Press : ACM Digital Library, 2012. (BibTeX)

Hager, Georg ; Eitzinger, Jan ; Habich, Johannes ; Wellein, Gerhard:
Exploring performance and power properties of modern multicore chips via simple machine models.
In: Concurrency and Computation: Practice and Experience Submitted (2012), S. 22 (BibTeX)

Hager, Georg ; Eitzinger, Jan ; Habich, Johannes ; Wellein, Gerhard:
Exploring performance and power properties of modern multicore chips via simple machine models.
In: Concurrency and Computation: Practice and Experience Submitted (2012), S. 22 (BibTeX)

Fehske, Holger ; Ejima, Satoshi ; Wellein, Gerhard ; Bishop, Alan R.:
Metal-insulator Transition in the Edwards Model.
In: Sebastian, S. E. ; Sutherland, M. L. ; Pugh, E. ; Grosche, F. M. ; Keeling, J. ; Panagopoulos, C. ; Rowley, S. ; Saxena, S. S. (Hrsg.) : International Conference on Strongly Correlated Electron Systems (SCES 2011).
Bd. 391, 012152. Aufl. Published online : IOPscience, 2012, (Journal of Physics: Conference Series), S. 4.
[doi>10.1088/1742-6596/391/1/012152] (BibTeX)

Habich, Johannes ; Feichtinger, Christian ; Köstler, Harald ; Hager, Georg ; Wellein, Gerhard:
Performance Engineering for the Lattice Boltzmann Method on GPGPUs: Architectural Requirements and Performance Results.
In: Computers & Fluids (2012), S. 10
Stichwörter: Parallelization; GPGPU; HPC; CUDA; OpenCL; Computational fluid dynamics; Lattice Boltzmann method; Performance modeling and engineering
[doi>10.1016/j.compfluid.2012.02.013.] (BibTeX)

Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Performance Patterns and Hardware Metrics on Modern Multicore Processors: Best Practices for Performance Engineering.
In: In Press (Hrsg.) : Euro-Par 2012
(5th Workshop on Productivity and Performance (PROPER 2012), Rhodes Island, Greece, 27.-31. August, 20121).
- : -, 2012. (BibTeX)

Eitzinger, Jan ; Hager, Georg ; Hofmann, Hannes ; Hornegger, Joachim ; Wellein, Gerhard:
Pushing the Limits for Medical Image Reconstruction on Recent Standard Multicore Processors.
In: The International Journal of High Performance Computing Applications (2012), S. 13
[doi>10.1177/1094342012442424] (BibTeX)

Fehske, Holger ; Wellein, Gerhard ; Bishop, A. R.:
Spatiotemporal evolution of polaronic states in finite quantum systems.
In: Physical Review B 83 (2011), Nr. 7, S. 12
[doi>10.1103/PhysRevB.83.075104] (BibTeX)

Feichtinger, Christian ; Habich, Johannes ; Köstler, Harald ; Hager, Georg ; Rüde, Ulrich ; Wellein, Gerhard:
A flexible Patch-based lattice Boltzmann parallelization approach for heterogeneous GPU-CPU clusters.
In: Parallel Computing 37 (2011), Nr. 9, S. 536-549 (BibTeX)

Eitzinger, Jan ; Wellein, Gerhard ; Hager, Georg:
Efficient multicore-aware parallelization strategies for iterative stencil computations.
In: Journal of Computational Science 2 (2011), Nr. 2, S. 130–137
[doi>10.1016/j.jocs.2011.01.010] (BibTeX)

Schubert, Gerald ; Fehske, Holger ; Hager, Georg ; Wellein, Gerhard:
Hybrid-parallel sparse matrix-vector multiplication with explicit communication overlap on current multicore-based systems..
In: Parallel Processing Letters 21 (2011), Nr. 3, S. 339-358
[doi>10.1142/S0129626411000254] (BibTeX)

Schubert, Gerald ; Hager, Georg ; Fehske, Holger ; Wellein, Gerhard:
Parallel sparse matrix-vector multiplication as a test case for hybrid MPI+OpenMP programming.
In: Cornell University Library arXiv:1101.0091 (2011), S. 12 (BibTeX)

Habich, Johannes ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Performance analysis and optimization strategies for a D3Q19 lattice Boltzmann kernel on nVIDIA GPUs using CUDA.
In: - (Hrsg.) : Advances in Engineering Software
(PARENG 2009). Bd. 42, 5. Aufl.
ScienceDirect : Elsevier, 2011, S. 266-272.
Stichwörter: CFD; GPU; Lattice Boltzmann; HPC; STREAM benchmarks; CUDA
[doi>10.1016/j.advengsoft.2010.10.007] (BibTeX)

Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
Complexities of Performance Prediction for Bandwidth-Limited Loop Kernels on Multi-Core Architectures.
In: Wagner, Siegfried ; Steinmetz, Matthias ; Bode, Arndt ; Müller, Markus Michael (Hrsg.) : High Performance Computing in Science and Engineering, Garching/Munich 2009
(Transactions of the Fourth Joint HLRB and KONWIHR Review and Results Workshop, Leibniz Supercomputing Centre, Garching/Munich, Germany, Dec. 8-9, 2009). Bd. 1.
Berlin Heidelberg : Springer-Verlag, 2010. - ISBN 978-3-642-13871-3
[doi>10.1007/978-3-642-13872-0_1] (BibTeX)

Eitzinger, Jan ; Hager, Georg:
Introducing a Performance Model for Bandwidth-Limited Loop Kernels.
In: Wyrzykowski, Roman ; Dongarra, Jack ; Karczewski, Konrad ; Wasniewski, Jerzy (Hrsg.) : Parallel Processing and Applied Mathematics
(8th International Conference, PPAM 2009 , Revised Selected Papers, Part I, Wroclaw, Poland, September 13-16, 2009). Bd. 6067.
Berlin Heidelberg : Springer-Verlag, 2010, S. 615-624. (Lecture Notes in Computer Science)
[doi>10.1007/978-3-642-14390-8_64] (BibTeX)

Hager, Georg ; Wellein, Gerhard:
Introduction to High Performance Computing for Scientists and Engineers.
Chapman & Hall/ : CRC Press, 2010.
- 356 Seiten. ISBN 978-1-4398-1192-4 (BibTeX)

Wittmann, Markus ; Hager, Georg ; Eitzinger, Jan ; Wellein, Gerhard:
Leveraging shared caches for parallel temporal blocking of stencil codes on multicore processors and clusters.
In: Parallel Processing Letters (PPL) 20 (2010), Nr. 4, S. 359-376
Stichwörter: temporal blocking; multi-halo exchange; multicore; shared caches; stencil algorithm
[doi>10.1142/S0129626410000296] (BibTeX)

Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard ; Meier, Michael:
LIKWID performance tools.
In: Innovatives Supercomputing in Deutschland (InSiDE) 8 (2010), Nr. 1, S. 50-53 (BibTeX)

Eitzinger, Jan ; Hager, Georg ; Wellein, Gerhard:
LIKWID: A Lightweight Performance-Oriented Tool Suite for x86 Multicore Environments.
In: - (Hrsg.) : Proceedings of PSTI2010, the First International Workshop on Parallel Software Tools and Tool Infrastructures
(39th International Conference on Parallel Processing Workshops, San Diego, CA, USA, 13-16 September).
IEEE : icppw, 2010, S. 207-216. - ISBN 978-0-7695-4157-0
[doi>10.1109/ICPPW.2010.38] (BibTeX)

Fehske, Holger ; Hager, Georg:
Luttinger, Peierls or Mott? Quantum Phase Transitions in Strongly Correlated 1D Electron–Phonon Systems.
In: Redmer, Ronald ; Hensel, Friedrich ; Holst, Bastian (Hrsg.) : Metal-to-Nonmetal Transitions.
Bd. 132. Berlin Heidelberg : Springer-Verlag, 2010, (Springer Series in Materials Science), S. 1-21.
[doi>10.1007/978-3-642-03953-9_1] (BibTeX)

Wittmann, Markus ; Hager, Georg ; Wellein, Gerhard:
Multicore-aware parallel temporal blocking of stencil codes for shared and distributed memory.
In: - (Hrsg.) : IPDPSW
(IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW), Atlanta, GA, USA, 19-23 April 2010).
2010, S. 1-7. - ISBN 978-1-4244-6533-0
Stichwörter: multi-core , multi-halo exchange , shared caches , temporal blocking
[doi>10.1109/IPDPSW.2010.5470813] (BibTeX)

Wittmann, Markus ; Hager, Georg:
Optimizing ccNUMA locality for task-parallel execution under OpenMP and TBB on multicore-based systems.
In: arXiv:1101.0093v1.
Cornell University Library : Computer Science > Distributed, Parallel, and Cluster Computing, 2010, S. 9. (BibTeX)

Schubert, Gerald ; Hager, Georg ; Fehske, Holger:
Performance Limitations for Sparse Matrix-Vector Multiplications on Current Multi-Core Environments.
In: Wagner, Siegfried ; Steinmetz, Matthias ; Bode, Arndt ; Müller, Markus Michael (Hrsg.) : High Performance Computing in Science and Engineering, Garching/Munich 2009
(Transactions of the Fourth Joint HLRB and KONWIHR Review and ResultsWorkshop, Leibniz Supercomputing Centre, Garching/Munich, Germany, Dec. 8–9, 2009). Bd. 1.
Berlin Heidelberg : Springer-Verlag, 2010, S. 13-26. - ISBN 978-3-642-13871-3
[doi>10.1007/978-3-642-13872-0_2] (BibTeX)

Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Benchmark analysis and application results for lattice Boltzmann simulations on NEC SX vector and Intel Nehalem systems.
In: Parallel Processing Letters 19 (2009), Nr. 4, S. 491-511
Stichwörter: NEC SX-9; Intel Nehalem; sustained performance; lattice Boltzmann flow solver; hybrid parallel
[doi>10.1142/S0129626409000389] (BibTeX)

Wellein, Gerhard ; Hager, Georg ; Zeiser, Thomas ; Wittmann, Markus ; Fehske, Holger:
Efficient temporal blocking for stencil computations by multicore-aware wavefront parallelization.
In: - (Hrsg.) : Proceedings of 2009 33rd Annual IEEE International Computer Software and Applications Conference
(COMPSAC 2009, Seattle, USA, Juli 20 -24, 2009).
IEEE Computer Society : IPSJ/IEEE SAINT Conference, DOI 10.1109/COMPSAC.2009.82, 2009, S. 579-586. (BibTeX)

Rabenseifner, Rolf ; Hager, Georg ; Jost, Gabriele:
Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes.
In: - (Hrsg.) : Proceedings of PDP 2009
(17th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Weimar, 18-20 February 2009).
Weimar : Bauhaus-University Weimar, 2009, S. 1-10. (BibTeX)

Fehske, Holger ; Schleede, Jens ; Schubert, Gerald ; Wellein, Gerhard ; Filinov, Wladimir S. ; Bishop, Alan. R.:
Numerical approaches to time evolution of complex quantum systems.
In: Physics Letters A 373 (2009), Nr. 25, S. 2182-2188
[doi>10.1016/j.physleta.2009.04.022] (BibTeX)

Ejima, S. ; Hager, Georg ; Fehske, Holger:
Quantum phase transition in a 1D transport model with boson affected hopping: Luttinger liquid versus charge-density-wave behavior.
In: Physical Review Letters 102 (2009), Nr. 10, S. 106404-106407
[doi>10.1103/PhysRevLett.102.106404] (BibTeX)

Fehske, Holger ; Alvermann, Andreas ; Wellein, Gerhard:
Quantum Transport within a Background Medium: Fluctuations versus Correlations.
In: Wagner, Siegfried ; Steinmetz, Matthias ; Bode, Arndt ; Brehm, Matthias (Hrsg.) : High Performance Computing in Science and Engineering, Garching/Munich 2007: Transactions of the Third Joint HLRB and KONWIHR Status and Result Workshop, Dec. 3-4, 2007, Leibniz Supercomputing Centre, Garching/Munich, Germany.
Berlin, Heidelberg : Springer, 2009, (Mathematics and Statistics), S. 649-668. - ISBN 978-3-540-69181-5
[doi>10.1007/978-3-540-69182-2_50] (BibTeX)

Hager, Georg ; Stengel, Holger ; Zeiser, Thomas ; Wellein, Gerhard:
RZBENCH: performance evaluation of current HPC architechtures using low-level and application benchmarks.
In: Wagner, Siegfried ; Steinmetz, Matthias ; Bode Arndt ; Brehm Matthias (Hrsg.) : High Performance Computing in Science and Engineering, Garching/Munich 2007: Transactions of the Third Joint HLRB and KONWIHR Status and Result Workshop, Dec. 3-4, 2007, Leibniz Supercomputing Centre, Garching/Munich, Germany.
Bd. V. Berlin, Heidelberg : Springer, 2009, (Mathematics and Statistics), S. 485-501. - ISBN 978-3-540-69181-5
[doi>10.1007/978-3-540-69182-2_39] (BibTeX)

Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard ; Inayat, Amer ; Schwieger, Wilhelm ; Heidig, Tobias ; Freund, Hannsjörg:
Selecting an Appropriate Computational Platform for Supporting the Development of New Catalyst Carriers.
In: Innovatives Supercomputing in Deutschland (inSiDE) 7 Spring (2009), Nr. 2, S. 12-16 (BibTeX)

Habich, Johannes ; Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Speeding up a Lattice Boltzmann Kernel on nVIDIA GPUs.
In: Topping, B.H.V. ; Iványi, P (Hrsg.) : Proceedings of the First International Conference on Parallel, Distributed and Grid Computing for Engineering
(PARENG2009, Pécs, Hungary, 6-8.April.2009).
Kippen, Stirlingshire, United Kingdom : Civil-Comp Press, 2009, S. 17. - ISBN 978-1-905088-29-4 (BibTeX)

Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
The world's fastest CPU and SMP node: Some performance results from the NEC SX-9.
In: - (Hrsg.) : Proceedings of the IEEE International Symposium on Parallel&Distributed Processing 2009
(23rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), Roma, 25-29 Mai).
IEEE Computer Society : ipdps, 2009, S. 1-8. - ISBN 978-1-4244-3751-1
[doi>10.1109/IPDPS.2009.5161089] (BibTeX)

Zeiser, Thomas ; Hager, Georg ; Wellein, Gerhard:
Vector Computers in a World of Commodity Clusters, Massively Parallel Systems and Many-Core Many-Threaded CPUs: Recent Experience Based on an Advanced Lattice Boltzmann Flow Solver.
In: Nagel, Wolfgang E. ; Kröner, Dietmar B. ; Resch, Michael M. (Hrsg.) : High Performance Computing in Science and Engineering '08: Transactions of the High Performance Computing Center, Stuttgart (HLRS) 2008.
Bd. 5. Berlin Heidelberg : Springer, 2009, (Mathematics and Statistics), S. 333-347. - ISBN 978-3-540-88301-2
[doi>10.1007/978-3-540-88303-6_24] (BibTeX)