2H 2015
intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3 intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3
Comparative Performance 2 1 0 MSC Software* Nastran* 2016 Alpha 1 For configuration details, go here. MSC Software Nastran* 2016 Alpha Speed Up 1.97X 2X 1 1 2 4 8 Cores 2S Intel® Xeon® processor E5-2697 v2 1.61X 1 1 2S Xeon E5-2697 v2 + 1 Intel® Xeon Phi coprocessor 7120P 1.3X SOURCE: INTEL MEASURED RESULTS AS OF APRIL, 2014 1 NODE Application: MSC Software Nastran* 2016 Alpha. Description: MSC Software Nastran sparse direct solver optimized for Intel® Xeon Phi coprocessors. Intel Xeon Phi coprocessors are especially well suited for large models that consist primarily of three-dimensional elements and have tightly-coupled multiphysics, such as noise, vibration, and harshness (NHV) studies. Availability: • Code: Contact an MSC Software representative. • Recipe: Use hStreams* with MPSS 3.5 or later. APPROVED FOR PUBLIC PRESENTATION NEW Usage Model: When large matrices are split into submatrices for processing, the solver offloads the largest submatrices from the Intel® Xeon® processor to the Intel® Xeon Phi coprocessor. Performance is maximized through asynchronous compute capabilities, which allow matrix computations on the coprocessor to start almost instantly once the data offload begins. Highlight: This strategy has demonstrated performance gains across a range of solution sequences for MSC Nastran 2016 Alpha, including structural statics and modal decomposition, which account for the majority of customer workloads. Results: Up to 2X speed up for the Intel® Xeon® processor E5-2697 v2 and the Intel® Xeon Phi coprocessor 7120A in offload mode over the host. Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others 54
Comparative Increase Autodesk Maya* 2014 Viewport Plugin Embree-based workloads 1 NODE APPROVED FOR PUBLIC PRESENTATION NEW 2 1 0 Embree-Based Viewport Plugin for Autodesk Maya* 2014 Speed Up 1.68X 1.23X Chinese Dragon with translucent material 1.67X 1.2X Full-screen Crytek Sponza simple materials + textures Power Plant 2S Intel® Xeon® processor E5-2697 v2 2S Xeon E5-2697 v2 + Xeon Phi 7120A 2S Xeon E5-2697 v3 + Xeon Phi 7120A 1.8X 1.77X 1.58X “Xeon E5-2697 v2” = Intel® Xeon® processor E5-2697 v2 Rungholt 2.8X Application: Embree-Based Viewport Plugin for Autodesk Maya* 2014 Description: An open-source, proof-of-concept Autodesk Maya 2014 viewport plugin which renders the scene being interactively edited using the Embree high-performance ray-tracing kernels and a modified version of the Embree sample path-tracer. Availability: • Code: Available here (branch “mayarender_v2.3.2”). • Recipe: Available here. Usage Model: This plugin can do computation using only the host, only the coprocessor via offload, and on both the host and coprocessor via offload. It can also make use of multiple coprocessors. The plugin is invoked every time the screen updates during interactive content authoring. Highlights: • Workstation application for both Linux* and Windows* • Makes use of the high-performance Embree ray-tracing kernels, which are optimized for both Intel® Xeon® processors and the Intel® Xeon Phi coprocessor Results: Dual Intel® Xeon® processor E5-2697 v2 + 1 Intel Xeon Phi coprocessor 7120A hybrid mode improves performance by up to 2.8X over the dual Intel® Xeon® processor E5-2697 v2 host alone. “Xeon Phi 7120A” = Intel® Xeon Phi coprocessor 7120A For configuration details, go here. SOURCE: INTEL MEASURED RESULTS AS OF FEBRUARY, 2015 Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others 55
- Page 3 and 4: Intel® Modern Code Developer Chall
- Page 5 and 6: New or Updated Proof Points NEW pro
- Page 7 and 8: Intel® Xeon Phi Coprocessors Softw
- Page 9 and 10: Intel® Xeon® Processor E5-2697 v2
- Page 11 and 12: Memory Capacity (GB) Memory Compari
- Page 13 and 14: A Growing Ecosystem: The Intel® Xe
- Page 15 and 16: Comparative Performance LAMMPS* Sti
- Page 17 and 18: Comparative Performance Johns Hopki
- Page 19 and 20: Comparative Performance 1 0 BLAST*
- Page 21 and 22: Comparative Performance NAMD* 2.10
- Page 23 and 24: Comparative Performance LAMMPS* Liq
- Page 25 and 26: Comparative Performance LAMMPS* Rho
- Page 27 and 28: Comparative Performance LAMMPS* Liq
- Page 29 and 30: Comparative Performance LAMMPS* Rho
- Page 31 and 32: Comparative Performance LAMMPS* Pro
- Page 33 and 34: Comparative Performance AMBER* 14 P
- Page 35 and 36: Comparative Performance AMBER* 14 P
- Page 37 and 38: Comparative Performance Burrows-Whe
- Page 39 and 40: Comparative Performance NWChem* CCS
- Page 41 and 42: Discover and design like never befo
- Page 43 and 44: Comparative Performance miniGhost*
- Page 45 and 46: Comparative Performance Quantum ESP
- Page 47 and 48: Comparative Performance ANSYS Mecha
- Page 49 and 50: Comparative Performance ANSYS Mecha
- Page 51 and 52: Comparative Performance ANSYS Mecha
- Page 53: Comparative Performance Sandia Mant
- Page 57 and 58: Comparative Performance OpenLB* Cyl
- Page 59 and 60: CLUSTER BENCHMARKS New Data Center
- Page 61 and 62: Comparative Performance Monte Carlo
- Page 63 and 64: Comparative Performance QuantLib* S
- Page 65 and 66: Comparative Performance Monte Carlo
- Page 67 and 68: Comparative Performance Monte Carlo
- Page 69 and 70: Comparative Performance Monte Carlo
- Page 71 and 72: Comparative Performance Xcelerit* L
- Page 73 and 74: Comparative Increase 1 0 Iso3DFD* 1
- Page 75 and 76: Comparative Performance Petrobras*
- Page 77 and 78: CLUSTER BENCHMARK Data Center Serve
- Page 79 and 80: Comparative Performance BerkeleyGW*
- Page 81 and 82: Comparative Performance ASKAP* tHog
- Page 83 and 84: Comparative Increase specfem3D 300K
- Page 85 and 86: CLUSTER BENCHMARK 6,400 NODES APPRO
- Page 87 and 88: Comparative Performance Gyrokinetic
- Page 89 and 90: Comparative Increase ROMS* Idealize
- Page 91 and 92: Comparative Performance NASA* OVERF
- Page 93 and 94: Improving speed and quality through
- Page 95 and 96: Comparative Performance Embree 2.2
- Page 97 and 98: Intel® Software Development Tools
- Page 99 and 100: Features and Configurations Intel®
- Page 101 and 102: Speedup Turn Big Data Into Informat
- Page 103 and 104: Scalable Profiling for MPI and Hybr
Comparative Performance<br />
2<br />
1<br />
0<br />
MSC Software*<br />
Nastran* 2016 Alpha<br />
1<br />
For configuration details, go here.<br />
MSC Software Nastran* 2016 Alpha Speed Up<br />
1.97X 2X<br />
1<br />
1 2 4 8<br />
Cores<br />
2S Intel® Xeon® processor E5-2697 v2<br />
1.61X<br />
1 1<br />
2S Xeon E5-2697 v2 + 1 Intel® Xeon Phi coprocessor 7120P<br />
1.3X<br />
SOURCE: INTEL MEASURED RESULTS AS OF APRIL, 2014<br />
1 NODE<br />
Application: MSC Software Nastran* 2016 Alpha.<br />
Description: MSC Software Nastran sparse direct solver optimized<br />
for Intel® Xeon Phi coprocessors. Intel Xeon Phi coprocessors are<br />
especially well suited for large models that consist primarily of<br />
three-dimensional elements and have tightly-coupled multiphysics,<br />
such as noise, vibration, and harshness (NHV) studies.<br />
Availability:<br />
• Code: Contact an MSC Software representative.<br />
• Recipe: Use hStreams* with MPSS 3.5 or later.<br />
APPROVED FOR PUBLIC PRESENTATION<br />
NEW<br />
Usage Model: When large matrices are split into submatrices for<br />
processing, the solver offloads the largest submatrices from the<br />
Intel® Xeon® processor to the Intel® Xeon Phi coprocessor.<br />
Performance is maximized through asynchronous compute<br />
capabilities, which allow matrix computations on the coprocessor to<br />
start almost instantly once the data offload begins.<br />
Highlight: This strategy has demonstrated performance gains<br />
across a range of solution sequences for MSC Nastran 2016 Alpha,<br />
including structural statics and modal decomposition, which<br />
account for the majority of customer workloads.<br />
Results: Up to 2X speed up for the Intel® Xeon® processor E5-2697<br />
v2 and the Intel® Xeon Phi coprocessor 7120A in offload mode<br />
over the host.<br />
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems,<br />
components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated<br />
purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance *Other names and brands may be claimed as the property of others<br />
54