In OpenCL I could use __local whenever I wanted to manipulate subgroup memory. Analogically CUDA has __shared__ keyword.
Does Vulkan have something equivalent? I cannot see anything in the subgroup tutorial
https://www.khronos.org/blog/vulkan-subgroup-tutorial although I do see that they mention shared memory, but they never actually explain how to initialize it.
Does Vulkan support local subgroup memory sharing and manipulation?
1.6k views Asked by alagris At
1
There are 1 answers
Related Questions in MEMORY
- 9 Digit Addresses in Hexadecimal System in MacOS
- Memory location changing from 0 to 1 consistently on Mac
- Would event listeners prevent garbage collecting objects referenced in outer function scopes?
- tensorrt inference problem: CPU memory leak
- How to estimate the memory size of a binary voxelized geometry?
- Java Memory UTF-16 Vs UTF-8
- Spring Boot application container memory footprint (Java 21)
- Low memory Windows CE
- How to throw an error when a program acesses a block of memory created by you that has been deallocated by a call of free?
- Golang bufio.Scanner: token too long
- Get the address and size of a loaded shared object on memory from C
- In Redis Databases how do we need to calculate the table size
- ClickHouse Materialized View consuming a lot of Memory and CPU
- How to reduce memory usage for large matrix calculations?
- How to use memray with Gunicorn or flask dev server?
Related Questions in GPU
- A deterministic GPU implementation of fused batch-norm backprop, when training is disabled, is not currently available
- What is the parameter for CLI YOLOv8 predict to use Intel GPU?
- Windows 10 TensorFlow cannot detect Nvidia GPU
- Is there a way to profile a CUDA kernel from another CUDA kernel
- Does Unity render invisible material?
- Quantization 4 bit and 8 bit - error in 'quantization_config'
- Pyarrow: ImportError: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.28' not found
- How to setup SLI on two GTX 560Ti's
- How can I delete a process in CUDA?
- No GPU EC2 instances associated with AWS Batch
- access fan and it's speed, in linux mint on acer predator helios 300
- Why can CPU memory be specified and allocated during instance creation but not GPU memory on the cloud?
- Why do CUDA asynchronous errors occur? (occur on the linux OS)
- Pytorch how to use num_worker>0 for Dataloader when using multiple gpus
- Running PyTorch MPS acceleration on Apple M1, get "Placeholder storage has not been allocated on MPS device!" error, but all seems to be on device
Related Questions in VULKAN
- How to generate vulkan project so it is cross-platform? (Using CMake)
- Vulkan specialization constants in WGSL
- Sync buffer between queue on Vulkan
- Using Silk.NET in WinForms
- Vulkan ‘VK_INSTANCE_CREATE_ENUMERATE_PORTABILITY_BIT_KHR’ was not declared in this scope
- Members of struct lose value when created by factory method
- VK_ERROR_DEVICE_LOST on create acceleration structure and possible ways to debug it
- Disabling implicit Vulkan Validation Layer via environment variable
- How to VkCmdCopyBuffer without a Single Time Command Buffer
- Compiling vertex shader from vulkan tutorial with glslc gives error: linking multiple files in not supported yet
- Create info crushes the instancing
- panic: runtime error: cgo argument has Go pointer to unpinned Go pointer (in Go Vulkan While Creating a Swapchain)
- ExtensionNotFound Error while all extensions are supported in go-vulkan
- Vulkan: What exactly is the limit of 4 bound descriptor sets associated with in the hardware?
- Vulkan Validation Error from FFMPEG creating an image: VUID-VkImageCreateInfo-pNext-06811
Related Questions in GPU-LOCAL-MEMORY
- OpenCL local memory exists on Mali/Adreno GPU
- Does Vulkan support local subgroup memory sharing and manipulation?
- Force all threads in a work group to execute the same if/else branch
- Local Memory: cuda presentation
- Amount of local memory per CUDA thread
- Why a simple CUDA function needs so much local memory?
- CUDA Local memory register spilling overhead
- CUDA efficient usage of shared/local memory?
- Local, global, constant & shared memory
- In a CUDA kernel, how do I store an array in "local thread memory"?
- Is local memory access coalesced?
Popular Questions
- How do I undo the most recent local commits in Git?
- How can I remove a specific item from an array in JavaScript?
- How do I delete a Git branch locally and remotely?
- Find all files containing a specific text (string) on Linux?
- How do I revert a Git repository to a previous commit?
- How do I create an HTML button that acts like a link?
- How do I check out a remote Git branch?
- How do I force "git pull" to overwrite local files?
- How do I list all files of a directory?
- How to check whether a string contains a substring in JavaScript?
- How do I redirect to another webpage?
- How can I iterate over rows in a Pandas DataFrame?
- How do I convert a String to an int in Java?
- Does Python have a string 'contains' substring method?
- How do I check if a string contains a specific word?
Trending Questions
- UIImageView Frame Doesn't Reflect Constraints
- Is it possible to use adb commands to click on a view by finding its ID?
- How to create a new web character symbol recognizable by html/javascript?
- Why isn't my CSS3 animation smooth in Google Chrome (but very smooth on other browsers)?
- Heap Gives Page Fault
- Connect ffmpeg to Visual Studio 2008
- Both Object- and ValueAnimator jumps when Duration is set above API LvL 24
- How to avoid default initialization of objects in std::vector?
- second argument of the command line arguments in a format other than char** argv or char* argv[]
- How to improve efficiency of algorithm which generates next lexicographic permutation?
- Navigating to the another actvity app getting crash in android
- How to read the particular message format in android and store in sqlite database?
- Resetting inventory status after order is cancelled
- Efficiently compute powers of X in SSE/AVX
- Insert into an external database using ajax and php : POST 500 (Internal Server Error)
Vulkan supports
sharedbuffers. They are better described herehttps://www.khronos.org/opengl/wiki/Compute_Shader#Shared_variables
An example of usage might look as follows