Is this a duplicate?
Area
General CCCL
Is your feature request related to a problem? Please describe.
Porting an algorithm to cccl.c.parallel involves a lot of boilerplate code in the build step. Depending on whether an iterator has state, we have to generate a string containing C++ wrappers. The same applies to operators. We were able to abstract this in cuda.cooperative, so it should be possible to abstract the build step in cccl.c.parallel as well:
cccl/python/cuda_cooperative/cuda/cooperative/experimental/warp/_warp_merge_sort.py
Lines 46 to 61 in e149e86
Describe the solution you'd like
We should try abstracting the build step for cccl.c.parallel algorithms. If we can recognize that certain kernel parameters are iterators or operators, we could generate the C++ wrappers for them automatically.
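To make the idea concrete, here is a rough sketch of what such an abstraction might look like on the Python side. It does not reproduce the existing cccl.c.parallel or cuda.cooperative code: `IteratorParam`, `OperatorParam`, `generate_wrappers`, the symbol names, and the shape of the emitted C++ are all hypothetical and only illustrate dispatching on the parameter kind and on whether the parameter carries state.

```python
from dataclasses import dataclass
from typing import Tuple


# Hypothetical descriptors: not part of any existing CCCL API.
@dataclass(frozen=True)
class IteratorParam:
    name: str
    value_type: str
    advance_symbol: str
    deref_symbol: str
    state_size: int = 0  # bytes of state; 0 means stateless


@dataclass(frozen=True)
class OperatorParam:
    name: str
    return_type: str
    arg_types: Tuple[str, ...]
    symbol: str
    state_size: int = 0  # bytes of state; 0 means stateless


def _state(size: int) -> Tuple[str, str]:
    """Return (member declaration, expression passed as the state pointer)."""
    if size > 0:
        return f"  alignas(8) char state[{size}];\n", "state"
    return "", "nullptr"


def _iterator_wrapper(p: IteratorParam) -> str:
    member, ptr = _state(p.state_size)
    return (
        f'extern "C" __device__ void {p.advance_symbol}(void* state, unsigned long long n);\n'
        f'extern "C" __device__ {p.value_type} {p.deref_symbol}(void* state);\n'
        f"struct {p.name}_t {{\n"
        f"{member}"
        f"  __device__ {p.name}_t& operator+=(unsigned long long n) "
        f"{{ {p.advance_symbol}({ptr}, n); return *this; }}\n"
        f"  __device__ {p.value_type} operator*() {{ return {p.deref_symbol}({ptr}); }}\n"
        f"}};\n"
    )


def _operator_wrapper(p: OperatorParam) -> str:
    member, ptr = _state(p.state_size)
    names = [f"a{i}" for i in range(len(p.arg_types))]
    typed = [f"{t} {n}" for t, n in zip(p.arg_types, names)]
    return (
        f'extern "C" __device__ {p.return_type} {p.symbol}({", ".join(["void* state"] + typed)});\n'
        f"struct {p.name}_t {{\n"
        f"{member}"
        f"  __device__ {p.return_type} operator()({', '.join(typed)}) "
        f"{{ return {p.symbol}({', '.join([ptr] + names)}); }}\n"
        f"}};\n"
    )


def generate_wrappers(params) -> str:
    """Emit one C++ wrapper per recognized parameter; reject anything else."""
    chunks = []
    for p in params:
        if isinstance(p, IteratorParam):
            chunks.append(_iterator_wrapper(p))
        elif isinstance(p, OperatorParam):
            chunks.append(_operator_wrapper(p))
        else:
            raise TypeError(f"unsupported kernel parameter: {p!r}")
    return "\n".join(chunks)


# Example: one stateful input iterator and one stateless binary operator.
source = generate_wrappers([
    IteratorParam("in_it", "int", "in_it_advance", "in_it_deref", state_size=16),
    OperatorParam("add_op", "int", ("int", "int"), "add_op_impl"),
])
```

With something along these lines, an algorithm's build step would only describe its parameters and splice `source` into the translation unit it compiles, instead of hand-writing the stateful and stateless variants for each algorithm.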
Describe alternatives you've considered
No response
Additional context
No response