2H 2015
intel-xeon-phi-sw-ecosystem-guide-2h-2015-public3
Comparative Performance: Weather Research and Forecasting (WRF*) Model
WRF V3.6 CONUS2.5KM, CLUSTER BENCHMARK, 4 NODES
APPROVED FOR PUBLIC PRESENTATION

[Chart: WRF V3.6 CONUS2.5KM speed-up on 4-node clusters. Configurations shown: 2S Intel® Xeon® processor E5-2697 v2 (baseline); 2S Xeon E5-2697 v2 + Xeon Phi 7120P; 2S Intel® Xeon® processor E5-2697 v3; 2S Xeon E5-2697 v3 + Xeon Phi 7120P. Speed-up labels shown: 1.27X, 1.3X, 1.94X.]
“Xeon E5-2697 v2” = Intel® Xeon® processor E5-2697 v2
“Xeon E5-2697 v3” = Intel® Xeon® processor E5-2697 v3
“Xeon Phi 7120P” = Intel® Xeon Phi coprocessor 7120P

Application: Weather Research and Forecasting (WRF*) Model V3.6
Description: The WRF Model is a numerical weather prediction system designed to serve atmospheric research and operational forecasting needs. More here.
Availability:
• Code: Available here.
• Recipe: Available here.
Usage Model: The baseline is the Intel® Xeon® processor E5-2697 v2, host only. Speed-up is shown for runs on both the host and the Intel® Xeon Phi coprocessor 7120P, and for the Intel® Xeon® processor E5-2697 v3 + Intel® Xeon Phi coprocessor 7120P. Performance shown is for released code.
Highlights: The code has been optimized since 2011, delivered to the community, and is available for download from the community site.
Results: Symmetric processing demonstrated up to 1.94X improved performance over the baseline Intel® Xeon® processor E5-2697 v2.
For configuration details, go here.
SOURCE: INTEL MEASURED RESULTS AS OF SEPTEMBER 2014

Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products. For more information go to http://www.intel.com/performance
*Other names and brands may be claimed as the property of others.
Comparative Performance: NASA* OVERFLOW 2.2g
1 NODE
APPROVED FOR PUBLIC PRESENTATION

[Chart: NASA OVERFLOW* 2.2g speed-up. 2S Intel® Xeon® E5-2697 v2 (baseline): 1; 2S Intel® Xeon® E5-2697 v2 + Intel® Xeon Phi Coprocessor 7120P: 1.2X.]

Application: NASA OVERFLOW* 2.2g; fluid flow using overset grids.
Description: OVERFLOW 2.2g is a three-dimensional, time-marching, implicit Navier-Stokes code that can also operate in two-dimensional or axisymmetric mode. The code uses structured overset grid systems. More at http://people.nas.nasa.gov/~pulliam/Overflow/Overflow_Manuals.html
Availability:
• Code: Available here. Xeon Phi optimization (but not build setup) is included in the standard source code available from NASA.
• Recipe: Not available.
Usage Model: MPI/OpenMP* symmetric processing distributed across an Intel® Xeon® processor and an Intel® Xeon Phi coprocessor.
Results: Symmetric processing demonstrated up to 1.2X improved performance over the baseline Intel® Xeon® processor E5-2697 v2.
For configuration details, go here.
SOURCE: INTEL MEASURED RESULTS AS OF DECEMBER 2013
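The speed-up factors quoted above (for example, 1.2X over the host-only baseline) are relative figures. The slides do not state the underlying metric, so the following is only a minimal sketch, assuming a factor is the ratio of baseline wall-clock time to measured wall-clock time. The `speedup` helper and the timings are hypothetical illustrations, not Intel's measurements.

```python
def speedup(baseline_seconds: float, measured_seconds: float) -> float:
    """Return how many times faster the measured run is than the baseline.

    Assumption: speed-up = baseline time / measured time.
    """
    if baseline_seconds <= 0 or measured_seconds <= 0:
        raise ValueError("timings must be positive")
    return baseline_seconds / measured_seconds


# Hypothetical wall-clock timings in seconds (NOT measured results).
timings = {
    "host only (baseline)": 120.0,
    "host + coprocessor": 100.0,
}

baseline = timings["host only (baseline)"]

# Normalize every configuration against the baseline run.
relative = {name: round(speedup(baseline, t), 2) for name, t in timings.items()}

for name, factor in relative.items():
    print(f"{name}: {factor}X")
```

Under this assumption the baseline always normalizes to 1, which matches how the charts label the host-only configuration.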