OMP 并行区域内英特尔 MKL 函数的线程数

本文介绍了OMP 并行区域内英特尔 MKL 函数的线程数的处理方法，对大家解决问题具有一定的参考价值，需要的朋友们下面随着小编来一起学习吧！

问题描述

我在 C 中有一个多线程代码，使用 OpenMP 和英特尔 MKL 函数.我有以下代码:

I have a multithreaded code in C, using OpenMP and Intel MKL functions. I have the following code:

    omp_set_num_threads(nth);
#pragma omp parallel for private(l,s) schedule(static)
for(l=0;l<lines;l++)
{
    for(s=0;s<samples;s++)
    {
        out[l*samples+s]=mkl_ddot(&bands, &hi[s*bands+l], &inc_one, &hi_[s*bands+l], &inc_one);
    }
}//fin for l

我想在这个 pramga 中使用多核处理器的所有内核(nth 的值).但我希望每个核心独立计算一个 mkl_ddot 函数(每个 mkl_ddot 函数 1 个线程).

I want to use all the cores of the multicore processor (the value of nth) in this pramga.But I want that each core computes a single mkl_ddot function independently (1 thread per mkl_ddot function).

我想知道在这种情况下 mkl_ddot 函数使用了多少线程.我在一些论坛上读到，默认情况下，mkl 函数在 pragma 并行运行中仅使用 1 个内核(这就是我想要的).但我不确定这种行为，我无法在手册中找到解释这种情况的特定部分.

I want to know how many threads are used by the mkl_ddot function in this case. I read in some forums, that by default mkl functions inside a pragma parallel run using only 1 cores (thats what i want).But I am not sure about this behaviour and I can not find the specific section in the manual explaining this situation.

提前致谢.

mkl

OMP 并行区域内英特尔 MKL 函数的线程数

问题描述

推荐答案