Institute of Technology
  Technical Support
      Knowledge Base
      Product Updates
      Product Feedback
      User Forums
      My Requests
  Download Center
  Order Manuals








MSC.Nastran V2001 Distributed Memory Parallel Test Results

This page contains the results of several jobs running on different hardware platforms. The times listed in each table are elapsed time. The plots show scaling of the Slowest serial time / Each machine. Each job was run by hardware partners. For best price performance results based on Linux please contact Brad Kindorf

Below is a summary of the test problems.
Name Ndof Description SOL MEM SCR Disk Used Total I/O Comments
LG0QDF0 31,125 Cube w/ interior 108 100 Mb 0.6 Gb 700 Gb 76 Frequency Increments
XL0EMF0 654,560 Car Body 111 400 Mb 11 Gb 328 Gb 448 + 34 Roots
XL0RST0 739,815 Engine 101 320 Mb 4 Gb 18 Gb
XL0TDF1 525,027 Car Body 108 450 Mb 5 Gb 209 Gb 32 Frequency Increments
XX0CMD0 1,584,622 Car Body 103 800 Mb 43 Gb 2400 Gb 1073 Roots
XX0CMD1 1,584,622 Car Body 103 800 Mb 43 Gb 2400 Gb 1073 Roots (FDMODES)
XX0CMD2 1,584,622 Car Body 103 800 Mb 43 Gb 2400 Gb 1073 Roots (ACMS)
XX0DMD0 1,920,855 Engine 103 800 Mb 25 Gb 555 Gb 17 Roots
XX0DMD1 1,920,855 Engine 103 800 Mb 25 Gb 555 Gb 17 Roots (w/ FDMODES)

Below is a description of the hardware used. Detailed computer, disk, network and OS configuration is listed the bottom of this page.
Vendor Hardware O/S
Compaq AlphaServer GS160 EV68 1 GHz UNIX 5.1
Compaq AlphaServer SC ES40 833 MHz UNIX 5.1
Cray SV1ex 500MHz UNICOS 10.0.0lf
HP HP SuperDome 750 MHz PA8700 HP-UX B.11.11
IBM IBM p690 Turbo 1300 MHz AIX 5.1C
Intel Linux IA64 NEC AzusA 800MHz Red Hat Linux 2.4.7-nec1.1p1
Intel Linux IA32 HP X4000 P4 Xeon 1700 MHz MSC.Linux 2.4.6-1msc-smp
NEC SX-6 SuperUX 12.1
SGI Origin 3000 / 600 MHz IRIX64 6.5
Sun UltraSPARC 750 MHz Solaris 8


Vendor:Model Serial dmp=2 dmp=4 dmp=8 dmp=16
Cray:SV1ex 7348 3831 2039
Compaq:AlphaServer_GS160 6999 3694 1970 1169 737
Compaq:AlphaServer_SC 7226 3771 1933 1054 690
HP:Superdome 5859 3203 1693 908 524
IBM:p690_Turbo_1300MHz 3582 2229 1095 590 419
Intel_Linux:NEC_express5800_1160Xa 7104 3843 2203 1659
Intel_Linux:P4_1700MHz_Xeon 8825 4634 2355
NEC:sx6 2836 1555 842 516
SGI:origin_3000_600MHz 8570 4578 2367 1306
Sun:UltraSPARC_750MHz 13363 7274 3753 2098 1353

Vendor:Model Serial dmp=2 dmp=4 dmp=8
Compaq:AlphaServer_GS160 5809 4751 3953 3701
Compaq:AlphaServer_SC 5746 4397 3441 3082
HP:Superdome 5919 4575 3458 3097
IBM:p690_Turbo_1300MHz 3456 2417 1820 1633
Intel_Linux:NEC_express5800_1160Xa 7679 6274 6138
Intel_Linux:P4_1700MHz_Xeon 8922 7055 5383
NEC:sx6 2026 1621 1355 1390
SGI:origin_3000_600MHz 6913 5149 4102 3604
Sun:UltraSPARC_750MHz 23250 17151 11204 10123

Vendor:Model Serial dmp=2 dmp=4 dmp=8
Compaq:AlphaServer_GS160 474 316 260 213
Compaq:AlphaServer_SC 450 328 260 197
HP:Superdome 602 449 330 260
IBM:p690_Turbo_1300MHz 279 226 177 156
Intel_Linux:NEC_express5800_1160Xa 610 486
Intel_Linux:P4_1700MHz_Xeon 532 393
NEC:sx6 682 670 548 541
SGI:origin_3000_600MHz 663
Sun:UltraSPARC_750MHz 806 606 504 376

Vendor:Model Serial dmp=2 dmp=4 dmp=8 dmp=16
Compaq:AlphaServer_GS160 14169 7214 3772 2231 1824
Compaq:AlphaServer_SC 15752 8017 4294 2205 1441
HP:Superdome 10785 5586 3083 1787 1196
IBM:p690_Turbo_1300MHz 7246 3530 2001 1243 928
Intel_Linux:NEC_express5800_1160Xa 13413 7055 4797
Intel_Linux:P4_1700MHz_Xeon 18335 9308 4807
NEC:sx6 4470 2563 1655 1337
SGI:origin_3000_600MHz 16517 8320 4495 2470
Sun:UltraSPARC_750MHz 36036 12900 6900 3869 2622

Vendor:Model Serial dmp=2 dmp=4
Cray:SV1ex 17794
Compaq:AlphaServer_GS160 29418 17313 9750
Compaq:AlphaServer_SC 26998 15625 9095
HP:Superdome 35825 17643 9471
IBM:p690_Turbo_1300MHz 19269 9605 6025
Intel_Linux:NEC_express5800_1160Xa 37446 25614 18339
Intel_Linux:P4_1700MHz_Xeon 44283 25440 14648
NEC:sx6 6181 4710
SGI:origin_3000_600MHz 36334 22498 12131
Sun:UltraSPARC_750MHz 126798 64432 33861

Vendor:Model Serial dmp=2 dmp=4
Cray:SV1ex 17794 12699
Compaq:AlphaServer_GS160 29418 16342 9473
Compaq:AlphaServer_SC 26998 15355 10421
HP:Superdome 35825 19110 11020
IBM:p690_Turbo_1300MHz 19269 11413 6758
Intel_Linux:NEC_express5800_1160Xa 37446 21397 15119
Intel_Linux:P4_1700MHz_Xeon 44283 24777 15017
NEC:sx6 6181 3887
SGI:origin_3000_600MHz 36334 23840 13398
Sun:UltraSPARC_750MHz 126798 76302 42053

Vendor:Model Serial dmp=2 dmp=4
Cray:SV1ex 17794 14844 14303
Compaq:AlphaServer_GS160 29418 14860 12562
Compaq:AlphaServer_SC 26998 12841 9728
HP:Superdome 35825 14532 10718
IBM:p690_Turbo_1300MHz 19269 9864 6943
Intel_Linux:NEC_express5800_1160Xa 37446 15971 13139
Intel_Linux:P4_1700MHz_Xeon 44283 15282 11050
NEC:sx6 6181 3515 2311
SGI:origin_3000_600MHz 36334
Sun:UltraSPARC_750MHz 126798 40632 30524

Vendor:Model Serial dmp=2 dmp=4
Compaq:AlphaServer_GS160 10234 10009 6145
Compaq:AlphaServer_SC 10064 10439 6390
HP:Superdome 11292 8819 5490
IBM:p690_Turbo_1300MHz 8825 10570 7583
Intel_Linux:NEC_express5800_1160Xa 14098 11458 7370
Intel_Linux:P4_1700MHz_Xeon 39932 13987 9774
NEC:sx6 5813 4460 3263
SGI:origin_3000_600MHz 12790 12737 7573
Sun:UltraSPARC_750MHz 26646 22979 17387

Vendor:Model Serial dmp=2 dmp=4
Compaq:AlphaServer_GS160 10234 16356 13455
Compaq:AlphaServer_SC 10064 18809 16542
HP:Superdome 11292 13317 12353
IBM:p690_Turbo_1300MHz 8825
Intel_Linux:NEC_express5800_1160Xa 14098 16915
Intel_Linux:P4_1700MHz_Xeon 39932 24103 22402
NEC:sx6 5813
SGI:origin_3000_600MHz 12790
Sun:UltraSPARC_750MHz 26646 32815 32652

Additional Comments:
  • DMP is only recognized in SOL 101, 103, 108, and 111.
  • Each task requires enough memory for the size of executable, the amount specified by the MEM parameter, and the amount needed for system I/O buffers. On true distributed systems, this amount is on each node. On SMP systems, all the tasks are on the same node.
  • ACMS is very good for analyses with LOTs of modes (xx0cmd2), but is is very slow on analyses with few modes (xx0dmd0 with ACMS and DMP=2 took too long on most systems to even run and so results were not shown).
  • In SOL 108 analyses, parallel efficiency increases as the number of frequencies (power of 2 is ideal) increase.

Hardware Details:

  • Compaq
        Model:            Compaq AlphaServer GS160 (CC-NUMA)
        CPU:              16 X 1 GHz Alpha EV68
        OS Level:         UNIX 5.1 
        Memory:           64Gb
        Disk:             4 disk stripe set: Each disk stripe set had 10 18GB disks on
                          2 Ultra3 SCSI controller.
        
  • Compaq
        Model:            Compaq AlphaServer SC (8 Nodes)
        CPU:              4 833MHz Alpha EV68 processors 
        OS Level:         UNIX 5.1 
        Memory:           4 nodes had 8Gb, 4 had 4 Gb
        Disk:             10 18.2GB disks on 2 Ultra3 SCSI controller striped using
                          LSM software.
        Interconnect Fabric: Quadrics
        
  • Cray
        Model:            SV1ex (32 cpus)
        CPU:              SV1ex 500MHz
        OS Level:         UNICOS 10.0.0lf
        Memory:           2 Gbyte (4 GW) memory
        Disk:             96 Gbyte (12 GW) SSDI, configured with:
                          /ssd   4 Gbytes (0.5 GW)
                          swap  32 Gbytes (4.0 GW)
                          sds   60 GBytes (7.5 GW)
        
  • HP
        Model:            Superdome 64-way 9000/800 (16 cells, 64 cpus)
        CPU:              PA-8700 750MHz
                          1.5MB data cache and 0.75MB I-cache
        OS Level:         HP-UX B.11.11
        Memory:           256 GB Real memory
                          238 GB Swap space
                          16 TB virtual address space
        Disk:             16 fc10s with 32 fiberchannel adapters and 159 disks
                          in single 2TB vxfs filesystem with dynamic buffer cache
                          set to 5% min, 60% max
        
  • IBM
        Model:            p690 Turbo (32 cpus)
        CPU:              POWER 4 1300 MHz
        OS Level:         AIX 5.1C
        Memory:           96 GB Real memory
        Disk:             Sngle scratch JFS file system was striped over 96 (9GB)
                          SSA disks on 12 adapters.
        
  • intel Linux IA64
        Model:            NEC Express5800/1160Xa (AzusA)
        CPU:              Itanium 800Mhz 
        OS Level:         Red Hat 2.4.7-nec1.1p1
        Memory:           16 Gb
        Bus:              33 MHz 64-bit PCI BUS
        Misc:             Processor Affinity control was used for more information regarding
                          Processor Affinity control please contact Cormac Garvey
                          (cgarvey@atcc.necsys.com).
        
  • intel Linux IA32
        Model:            HP 4000
        CPU:              P4 Xeon (1700 MHz)
        OS Level:         MSC.Linux 2.4.6-1.msc-smp 
        Memory:           3 Gb
        Disk:             3X36Gb Software striped IBM Ultrastar DDYS-T36950
        
  • NEC
        Model:            SX-6
        CPU:              SX-6 Vector processor (Max 8 GFLOPs per processor) Clock 2Ns
        OS Level:         SuperUX 12.1
        Memory:           64 GB (Real)
        Disk:             NEC Polestar Disk four-way stripping
        
  • SGI
        Model:            Origin 3000 (16 cpus)
        CPU:              R1400A 600MHz
        OS Level:         IRIX64 6.5.14f
        Memory:           32 Gb
        Disk:             8 lun TP9400 RAID file system.
        
  • SUN
        Model:            UltraSPARC 750 MHz
        OS Level:         Solaris 8