I have some Fortran code from a benchmark suite that uses OpenMP 4.0 features. Among them is the new "omp simd" directive, used to vectorize parallelized loop nests.
If I omit the "omp simd" directives, the code actually runs faster: around 15% less runtime on a large dataset with roughly 25 minutes of total runtime, where the modified loops account for approximately 25-40% of that total. This was tested on an Intel MIC via "omp target".
I checked with -vec-report1 and compared the outputs of both builds. Every one of the loops reports "OpenMP SIMD LOOP WAS VECTORIZED" (with the directive) or "LOOP WAS VECTORIZED" (without it), so there should not be such a big difference in runtime.
I suppose this is most likely a compiler bug. Can you explain the behavior?
Typical usage looks like the following:
!$omp do
DO k=y_min,y_max+1
   !$omp simd
   DO j=x_min-1,x_max+2
      someArray(j,k)=someOtherArray(j,k)-foo(j-1,k)+bar(j,k)
   ENDDO
ENDDO

!$omp do
DO k=y_min,y_max+1
   !$omp simd PRIVATE(xFoo) ! When removing the simd here, place the private clause on the "omp do"
   DO j=x_min-1,x_max+1
      IF(someCondition)THEN
         xFoo=1
      ELSE
         xFoo=j
      ENDIF
      ! Some more code
      someArray(j,k)=foo(xFoo,k)*bar(j,k)
   ENDDO
ENDDO
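Not from the benchmark itself, but one experiment that might help narrow this down: OpenMP 4.0 lets you add safelen and aligned clauses to the simd construct, and comparing such a hinted build against the bare "omp simd" can reveal whether the slowdown comes from the compiler generating peel/remainder loops or conservative unaligned accesses. A minimal sketch, where the 64-byte alignment value and the array names are assumptions modeled on the snippet above, not the real code:

```fortran
!$omp do
DO k=y_min,y_max+1
   ! Assumed: arrays are 64-byte aligned (MIC vector width).
   ! Drop or adjust the clause if the real allocations are not aligned,
   ! since a wrong aligned() hint produces incorrect code.
   !$omp simd aligned(someArray,someOtherArray:64)
   DO j=x_min-1,x_max+2
      someArray(j,k)=someOtherArray(j,k)-foo(j-1,k)+bar(j,k)
   ENDDO
ENDDO
```

If the hinted version closes the gap, the regression is in the generated prologue/remainder code rather than the vector body; -vec-report output alone ("LOOP WAS VECTORIZED") does not distinguish these cases.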
Please note that I cannot show the real code here in public, but rest assured it does pretty much exactly that. I may send the actual code to Intel for investigation purposes, though.