gpu 相关问题

“图形处理单元”的缩写。有关编程传统图形应用程序的信息，请参阅“图形编程”的标记条目。对于使用GPU的通用编程，请参阅“gpgpu”的标记条目。对于特定的GPU编程技术，请参阅“opencl”，“cuda”和“thrust”的热门标签条目。

启用 GPU 的 Kubernetes 容器未调度

我已经安装了 Nvidia 的 GPU 运算符，并自动标记了支持 GPU 的节点（我认为重要的，还有一长串其他标签）： nvidia.com/gpu.count=1 节点是

kubernetes gpu nvidia

回答 1 投票 0

VLLM 对象在函数中创建时会导致内存错误，即使显式清除 GPU 缓存，仅共享 ref 也不会导致代码崩溃

我在 Python 中使用 VLLM 库时遇到问题。具体来说，当我在函数内创建 VLLM 模型对象时，我遇到了内存问题，无法清除 GPU 内存效应...

python machine-learning gpu vllm

回答 1 投票 0

无法在 WSL 上安装启用了 CUDA 的 PyTorch

在过去的几个月里，我间歇性地遇到一个问题，即无法在 Windows 11 的 WSL 上安装启用了 CUDA 的 PyTorch。我使用带有 RTX 4090 GPU 的 Windows 11 台式电脑。

python pytorch gpu nvidia

回答 2 投票 0

i3 状态栏中的 GPU 温度

我有 Ubuntu，我正在使用 i3 窗口管理器，并且想要在 i3 状态栏中显示 GPU 温度。我该怎么办？

gpu temperature i3

回答 1 投票 0

是否有可能克服thrust::zip_iterator中迭代器的最大数量？

我在工作中使用 Thrust 来完成一些任务，并发现在构造 zip_iterator 时似乎存在最大数量的迭代器。例如 #包括我在工作中使用 Thrust 来完成一些任务，并发现在构建 zip_iterator 时似乎存在最大数量的迭代器。例如 #include <thrust/iterator/zip_iterator.h> #include <thrust/device_vector.h> int main() { thrust::device_vector<int> A(10),B(10),C(10); auto zitor = thrust::make_zip_iterator(A.begin(),A.begin(), B.begin(),B.begin(), B.begin(),B.begin(), B.begin(),B.begin(), C.begin(),C.begin()); } 这段代码编译成功。但是如果我在函数参数中再添加一个迭代器，就会出现错误： multizip.cu(8): error: no instance of overloaded function "thrust::make_zip_iterator" matches the argument list argument types are: (thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>) 1 error detected in the compilation of "multizip.cu". 它最多接受十个迭代器，还是我误解了什么？如果是这样，有什么解决办法吗？ using Itor = thrust::device_vector<int>::iterator; using Zitor = thrust::zip_iterator<thrust::tuple< Itor,Itor,Itor,Itor,Itor,Itor, Itor,Itor,Itor,Itor,Itor>>; 这段代码也不起作用，除非我删除一个Itor，错误信息是： multizip.cu(17): error: too many arguments for class template "thrust::tuple" 所以我相信thrust::tuple中的迭代器数量确实有最大限制为10。可以克服这个限制吗？您遇到的最接近的问题是，直到最近，Thrust 的模板限制为 thrust::zip_iterator 中的 10 个项目（迭代器）。修复/解决该问题有以下三种选项：最近（似乎从 CUDA 12.3 到 CUDA 12.4），Thrust zip_iterator 设计已更改，允许超过 10 个迭代器构建 zip_iterator。因此，一种选择是将您的 CUDA 工具包版本更新到 12.4.1 或更高版本。另一个相关选项是将更新版本的 CCCL* 与您当前的 CUDA 工具包结合使用。 CCCL 的最新版本为此提供了“定义的兼容性路径”。我不会在这里给出完整的秘诀，但简而言之，假设您位于定义的兼容性路径内，您将克隆计算机上当前的 CCCL 存储库，然后使用 -I* 将您的 CUDA 编译器指向该存储库*。 Thrust 是一个仅包含模板/标头的库，因此不需要单独的编译步骤或任何其他安装步骤。根据您的需求，保持在旧 Thrust 版本的 10 个迭代器限制之内，您可以创建 zip_iterator 的 zip_iterators。这里就是一个例子。这可用于获得超过 10 个迭代器，尽管是嵌套排列。 *：CUDA C++ Core L库于 2023 年引入，由于它们的交互/重叠，将 Thrust、CUB 和 libcu++/libcudacxx 包含在单个存储库中。原始的 Thrust 存储库停止更新，并且不包含现代化的 zip_iterator。**：请勿使用 -isystem，因为这会获得比 CUDA 工具包打包的 CCCL 标头更低的优先级。

c++ cuda gpu thrust

回答 1 投票 0

requestAnimationFrame 超过屏幕刷新率

我在js webgl项目中使用requestAnimationFrame，其中我通过访问包含最后一帧内容的纹理将每个帧上的像素颜色从黑色更改为白色）但该帧...

javascript gpu webgl requestanimationframe vsync

回答 1 投票 0

如何使用 GPU 和 PyTorch 计算二阶导数

我有一个与深度强化学习算法相关的Python代码段，它使用Hessian矩阵和fisher信息矩阵计算二阶优化和二阶导数。通常我跑步...

python pytorch gpu reinforcement-learning cudnn

回答 1 投票 0

如何对 INTEL GPU 进行编程

我对 GPU 计算领域还是个新手。所以我真的很希望有人向我解释最基本的知识。我必须使用具有以下 GPU 的 Intel 芯片组： GMA4500 高清显卡我有兴趣

opencl gpu intel

回答 2 投票 0

是否有可能克服 `thrust::zip_iterator` 中迭代器的最大数量？

我在工作中使用 Thrust 来完成一些任务，并发现在构造 zip_iterator 时似乎存在最大数量的迭代器。例如 #包括我在工作中使用 Thrust 来完成一些任务，并发现在构造 zip_iterator 时似乎存在最大数量的迭代器。例如 #include <thrust/iterator/zip_iterator.h> #include <thrust/device_vector.h> int main() { thrust::device_vector<int> A(10),B(10),C(10); auto zitor = thrust::make_zip_iterator(A.begin(),A.begin(), B.begin(),B.begin(), B.begin(),B.begin(), B.begin(),B.begin(), C.begin(),C.begin()); } 这段代码编译成功。但是如果我在初始化列表中再添加一个迭代器，就会出现错误： multizip.cu(8): error: no instance of overloaded function "thrust::make_zip_iterator" matches the argument list argument types are: (thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>, thrust::detail::normal_iterator<thrust::device_ptr<int>>) 1 error detected in the compilation of "multizip.cu". 它最多接受十个迭代器，还是我误解了什么？如果是这样，有什么解决方法吗？ using Itor = thrust::device_vector<int>::iterator; using Zitor = thrust::zip_iterator<thrust::tuple< Itor,Itor,Itor,Itor,Itor,Itor, Itor,Itor,Itor,Itor,Itor>>; 这段代码也不起作用，除非我删除一个Itor，错误信息是： multizip.cu(17): error: too many arguments for class template "thrust::tuple" 所以我相信thrust::tuple中的迭代器数量确实有最大限制为10。我可以克服这个限制吗？迭代器的数量在编译时由模板类型决定。这就是您收到最后一个错误的原因。我认为编译器对您创建的迭代器数量存在问题。这称为重载函数（因此第一个错误说“没有重载函数的实例”）。我可能建议您首先尝试使用两个函数，将迭代器创建分成两个独特的函数，看看这是否是问题所在。请参阅此处：https://forums.developer.nvidia.com/t/thrust-zip-iterator-with-任意-number-of-iterators/292942。 https://forums.developer.nvidia.com/t/combining-thrust-zip-iterator-transform-iterator-counting-iterator-for-modified-summed-area-table/56154.

c++ cuda gpu thrust

回答 1 投票 0

GCloud：使用 docker 容器和 GPU 创建实例模板

使用 gcloud，我想创建一个类型为 n1-standard-1 的实例模板，并附加 GPU 和 docker 容器。这可以通过控制台完成，但我想从命令行完成......

docker google-cloud-platform gpu google-compute-engine gcloud

回答 2 投票 0

CUDA：在调整内核块大小时，SM 线程数重要吗

有无数的文章和SO问题解释内核的网格和块大小应该设置为什么以及如何优化这些值，但这些文章似乎从未提到SM限制...

cuda gpu

回答 1 投票 0

OS和GPU中类似术语的区别

我在操作系统中遇到了抽象术语“线程”，它意味着一段可以独立执行的代码和执行操作系统关键任务的内核。但最近，我遇到了...