2H 2015
intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3 intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3
Comparative Performance NAMD* 2.10 Pre-Release STMV CLUSTER BENCHMARK 32 NODES APPROVED FOR PUBLIC PRESENTATION 30 25 20 15 10 5 0 1 NAMD* 2.10 (pre-release) Cluster Performance Increase STMV (~1M atoms) 1.2X 2X 2.1X 6.8X 7.9X 12.2X 13.1X 1 Node 8 Nodes 32 Nodes Intel® Xeon® processor E5-2697 v2 (Baseline: 1 node, 23 or 47 PPN) Intel® Xeon® processor E5-2697 v3 (27 or 55 PPN) Xeon E5-2697 v2 (23 or 47 PPN) + 1 Intel® Xeon Phi coprocessor 7120A (240T) Xeon E5-2697 v3 (27 or 55 PPN) + 1 Intel® Xeon Phi coprocessor 7110A (240T) 20X 24.2X “Xeon E5-2697 v2/v3” = Intel® Xeon® processor E5-2697 v2/v3 27.2X 32X Application: NAMD 2.10 pre-release; STMV Description: A parallel, object-oriented molecular dynamics code designed for high-performance simulation of large biomolecular systems. More at http://www.ks.uiuc.edu/Research/namd/ Availability: • Code: Intel® Xeon Phi coprocessor support is available as a pre-release. Use the nightly build. • Recipe: Available here. Usage Model: Single rank on host with 47 threads. Various computations are offloaded to Intel® Xeon Phi coprocessor from each thread. Highlights: Intel® Xeon Phi coprocessor support is now in the development branch of NAMD 2.10 prerelease. Results: For the STMV workload, the Intel® Xeon® processor E5-2697 v3 and the Intel® Xeon Phi coprocessor (32 nodes, 55 PPN) improved performance by up to 32X compared to the baseline processor (1 node, 47 PPN). For configuration details, go here. SOURCE: INTEL MEASURED RESULTS AS OF SEPTEMBER, 2014 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others 20
Comparative Performance NAMD* 2.10 Pre-Release ApoA1 CLUSTER BENCHMARK 2 NODES APPROVED FOR PUBLIC PRESENTATION 2 1 0 NAMD* 2.10 (pre-release) Cluster Performance Increase ApoA1 (~92K atoms); 55 PPN 2.61X 1 1.52X 1.94X 1 Node 2 Nodes Intel® Xeon® processor E5-2697 v3 (Baseline: (Baseline; 1 node, node) 55PPN) Intel® Xeon® processor E5-2697 v3 + Intel® Xeon Phi coprocessor B1- 7110A (240T) Application: NAMD* 2.10 pre-release; ApoA1 Description: A parallel, object-oriented molecular dynamics code designed for high-performance simulation of large bio molecular systems. More at http://www.ks.uiuc.edu/Research/namd/ Availability: • Code: Intel® Xeon Phi coprocessor support is available as a prerelease. Use the nightly build. • Recipe: Available here. Usage Model: Single rank on host with 55 threads. Various computations are offloaded to Intel® Xeon Phi coprocessor from each thread. Highlights: Intel® Xeon Phi coprocessor support is now in the development branch of NAMD 2.10 pre-release. Results: For the ApoA1 workload, 2-node performance can be accelerated by up to 2.61X using a single Intel® Xeon Phi coprocessor. For configuration details, go here. SOURCE: INTEL MEASURED RESULTS AS OF SEPTEMBER, 2014 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others 21
- Page 1 and 2: 2H 2015
- Page 3 and 4: Intel® Modern Code Developer Chall
- Page 5 and 6: New or Updated Proof Points NEW pro
- Page 7 and 8: Intel® Xeon Phi Coprocessors Softw
- Page 9 and 10: Intel® Xeon® Processor E5-2697 v2
- Page 11 and 12: Memory Capacity (GB) Memory Compari
- Page 13 and 14: A Growing Ecosystem: The Intel® Xe
- Page 15 and 16: Comparative Performance LAMMPS* Sti
- Page 17 and 18: Comparative Performance Johns Hopki
- Page 19: Comparative Performance 1 0 BLAST*
- Page 23 and 24: Comparative Performance LAMMPS* Liq
- Page 25 and 26: Comparative Performance LAMMPS* Rho
- Page 27 and 28: Comparative Performance LAMMPS* Liq
- Page 29 and 30: Comparative Performance LAMMPS* Rho
- Page 31 and 32: Comparative Performance LAMMPS* Pro
- Page 33 and 34: Comparative Performance AMBER* 14 P
- Page 35 and 36: Comparative Performance AMBER* 14 P
- Page 37 and 38: Comparative Performance Burrows-Whe
- Page 39 and 40: Comparative Performance NWChem* CCS
- Page 41 and 42: Discover and design like never befo
- Page 43 and 44: Comparative Performance miniGhost*
- Page 45 and 46: Comparative Performance Quantum ESP
- Page 47 and 48: Comparative Performance ANSYS Mecha
- Page 49 and 50: Comparative Performance ANSYS Mecha
- Page 51 and 52: Comparative Performance ANSYS Mecha
- Page 53 and 54: Comparative Performance Sandia Mant
- Page 55 and 56: Comparative Increase Autodesk Maya*
- Page 57 and 58: Comparative Performance OpenLB* Cyl
- Page 59 and 60: CLUSTER BENCHMARKS New Data Center
- Page 61 and 62: Comparative Performance Monte Carlo
- Page 63 and 64: Comparative Performance QuantLib* S
- Page 65 and 66: Comparative Performance Monte Carlo
- Page 67 and 68: Comparative Performance Monte Carlo
- Page 69 and 70: Comparative Performance Monte Carlo
Comparative Performance<br />
NAMD* 2.10 Pre-Release<br />
ApoA1<br />
CLUSTER BENCHMARK<br />
2 NODES<br />
APPROVED FOR PUBLIC PRESENTATION<br />
2<br />
1<br />
0<br />
NAMD* 2.10 (pre-release) Cluster Performance Increase<br />
ApoA1 (~92K atoms); 55 PPN<br />
2.61X<br />
1<br />
1.52X<br />
1.94X<br />
1 Node 2 Nodes<br />
Intel® Xeon® processor E5-2697 v3 (Baseline: (Baseline; 1 node, node) 55PPN)<br />
Intel® Xeon® processor E5-2697 v3 + Intel® Xeon Phi coprocessor B1-<br />
7110A (240T)<br />
Application: NAMD* 2.10 pre-release; ApoA1<br />
Description: A parallel, object-oriented molecular dynamics code<br />
designed for high-performance simulation of large bio molecular<br />
systems. More at http://www.ks.uiuc.edu/Research/namd/<br />
Availability:<br />
• Code: Intel® Xeon Phi coprocessor support is available as a prerelease.<br />
Use the nightly build.<br />
• Recipe: Available here.<br />
Usage Model: Single rank on host with 55 threads. Various<br />
computations are offloaded to Intel® Xeon Phi coprocessor from<br />
each thread.<br />
Highlights: Intel® Xeon Phi coprocessor support is now in the<br />
development branch of NAMD 2.10 pre-release.<br />
Results: For the ApoA1 workload, 2-node performance can be<br />
accelerated by up to 2.61X using a single Intel® Xeon Phi<br />
coprocessor.<br />
For configuration details, go here.<br />
SOURCE: INTEL MEASURED RESULTS AS OF SEPTEMBER, 2014<br />
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems,<br />
components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated<br />
purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others<br />
21